Jump to content
Chinese-Forums
  • Sign Up

Possible software for children to study Chinese.


thechamp

Recommended Posts

I just found out I've got onto a start-up incubator, so am leaving my current job to take up the funding and design/build a product for a year. If my future team mates think it's a good idea, I'm quite interested in a Chinese studying app for children, with perhaps a gaming element to it. Does anyone have any experience of studying Chinese that they think would be particularly fun/engaging for children? 

 

Has anyone taught Chinese to Western children and noticed something novel about how children approach tones/characters? 

I have a few ideas but I thought it might be an interesting discussion, and a year and a team is quite a decent amount of time to make something nice, if things go to plan.

 

  • Like 1
Link to comment
Share on other sites

Boring. Use one of the new AI as a service offerings (Google's new one looks fun) to teach computers how to give meaningful feedback on learner pronunciation. *

 

* you owe me $$$$$$$$$$$$$$$$$$$$

  • Like 1
Link to comment
Share on other sites

The technical test for it actually involved hooking up to Microsoft Azure or AWS's image recognition services. I chose AWS and am generally quite interested in these services but unless they expose a lot of the code (which from what I've seen, they don't) then I don't see how you can give them these kinds of specific training sets. 

 

Also all my exposure to that kind of statistical analysis is in excel rather than doing it in code so would be a big ask, but we have a year so maybe it can be done!

Sort of like a phone app where you talk into it and it assigns you a score, based on how close you are to 'putonghua' pronunciation, based on your natural pitch?

Link to comment
Share on other sites

Not just a score. What I'm imagining (and I'm so far out of my depth...)

 

Say you give it umpteen thousand examples of 'correct' speech. 

Then you give it umpteen thousand examples of 'incorrect' speech, but coded by actual humans*. Initially maybe for 'problem with initial / final / tone' but perhaps getting more specific with common errors. 'Pronounced pinyin x as English sh' for example. 

Then magic happens. 

Then the AI can take an example of learner speech and say 'you're getting q and ch mixed up and your first tone is dipping. Try again... ok, that's better...'

 

Maybe that's impossible, or trivial. But it'd be great. 

 

PS. GPUs, Imron? Old.

 

*You have funding, yeah?

Link to comment
Share on other sites

Imron based on the test I am assuming that the stacks we'll be using we won't actually be building out specific AI but relying on these cloud services. There are a couple of AWS ones that look like they could be re-purposed for this kind of product but that's the thing - I think they actually provide only extremely specific services and you're not given access to any of the underlying software. You basically hook up to the API and pass values for processing.

 

Also, I have friends who experimented with Tensorflow on my MSc and I'm a little bit scared tbh.

Link to comment
Share on other sites

Yeah, you're going to struggle find an xkcd cartoon I haven't already seen. 

 

What do you reckon, Imron? Plausible? Speech recognition has obviously come on in leaps and bounds. How long till feedback goes beyond a tone graph?

Link to comment
Share on other sites

Anyway, in general, I agree with Roddy.  The market is saturated with the 'learning apps for children'.  The pronunciation feedback App is a much better idea.

 

1 minute ago, roddy said:

What do you reckon, Imron? Plausible?

I reckon both plausible and a good idea.

 

Building the training set would likely be the most difficult part.

Link to comment
Share on other sites

Thanks Imron, I needed a new signature.

 

Building the training set is difficult, but feasible, no? Coding schema, several dozen BLCU CSL students on their summer break, lots of Chinese students around the world recording phrases in exchange for the coded feedback...

  • Like 1
Link to comment
Share on other sites

Hahaha,

 

I like those XKCD comics too. I have done enough stats in the past to have be aware of linear regression and classifications etc. Most of Machine Learning seems to pretty much be statistical analysis and classification anyway, and then coding up the application. Hmmm, maybe it is doable?! I've never had a year to program anything end to end, and we're in teams of 4 (although 1 person will be a business person) so I actually think perhaps it is plausible?

Link to comment
Share on other sites

I actually wonder if it's a viable business model *WITHOUT* the AI. Think how many students would like reliable, actionable feedback on their pronunciation. Now think what someone in the US will pay for that, and how much you'd have to pay someone in China for something they can do on the bus with headphones in. Either way your first step, a comprehensive review of Chinese pronunciation and what goes wrong with it, is the same. Much smaller moat without the AI though. 

 

And *surely* - think of how many phonemes Alexa and Siri and Google whateveritis are processing every day. I find it hard to believe they cant be taught to tell you that you just said zh instead of ch. In fact this'd work great as an Alexa Skill. 

Link to comment
Share on other sites

That's what I mean though - you don't have access to the code when you use these cloud services. Hooking up to them involves a lot of command line fiddling and then you pass files (in the case of the test it was JPEGs....and the image recognition on for AWS is the only one I've really used (except a Microsoft cloud service we use in our finance system)). You can't really 'teach' eg Amazon Lex....you have to essentially rebuild Amazon Lex, to do what you want it to do. It's quite a big ask. I've been told though that they're not expecting the product to be market ready after a year, just MVP as something to show to investors. Maybe it is plausible but it's really ambitious. It would basically involve taking one of the tensorflow templates and re-purposing it.

Link to comment
Share on other sites

Wow, a lot to think about! I didn't expect the discussion to go in this direction(!)

 

Yes building the training set would be very time intensive, but with this kind of thing I'm expecting there'll be a lot of time when the business person isn't marketing etc....and I think that would probably be a good time for them to do this.

Link to comment
Share on other sites

I've realised you could actually also cheat with this, a lot, because you could use the speech recognition services as they are (they return text of the input audio) and then do your pronunciation checks on how accurate the text was at writing the input audio. I really don't know how that would solve the tones issue though.

 

I'm not a machine learning researcher I'm a semi-competent software engineer trying to get a product ready in 1 year!

 

Link to comment
Share on other sites

Join the conversation

You can post now and select your username and password later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Click here to reply. Select text to quote.

×   Pasted as rich text.   Paste as plain text instead

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

×
×
  • Create New...