Chinese-Forums

Most common SPOKEN words


Ed Log


Does anyone have a comprehensive list of the most common spoken words in Chinese? There are loads of lists of the most common words, but as far as I can tell these are all for the written language. Again, I'm looking for a ranked list of the most common SPOKEN words.

Thanks


I've been looking for similar data recently and haven't found anything yet. Such a list would need to be based on a corpus of spoken Chinese, but I haven't found one. Most spoken corpora are based on news transcripts, unscripted interview shows, and occasionally recorded family conversations and business meetings.

I don't know if such data has been compiled in a systematic way for the Chinese language. I believe LDC has some Chinese corpora based on news show transcripts, but that's not a cheap solution. If you find something, let me know.


  • 9 years later...

My serious answer would be to search for existing word corpora, and, if no satisfactory ones exist, find someone with up-to-date basic data analysis skills, gather a bunch of transcripts, and make your own. I myself will do this at some point, but right now, I'm studying Mandarin with a funky method, rather than thinking seriously about an ideal learning approach (which might involve word and character frequency analysis).
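For anyone who wants to try the "make your own" route, here is a minimal sketch of the counting step in Python. It assumes the transcripts have already been segmented into words separated by spaces (Chinese text has no spaces, so a real project would need a word segmenter first, which is not shown here). The function name `rank_words` and the three sample lines are invented purely for illustration and are not real corpus data.

```python
from collections import Counter


def rank_words(transcripts):
    """Return (word, count) pairs sorted from most to least frequent.

    Assumes every line is already segmented into space-separated words.
    """
    counts = Counter()
    for line in transcripts:
        counts.update(line.split())
    return counts.most_common()


# Toy, made-up sample lines standing in for real transcripts.
sample = [
    "我 很 好",
    "你 好 吗",
    "我们 很 好",
]

ranking = rank_words(sample)
for rank, (word, count) in enumerate(ranking[:3], start=1):
    print(rank, word, count)
```

With a large enough pile of real transcripts, the same loop would give a ranked spoken-word list; the quality of the result depends mostly on the segmentation and on how conversational the source material is.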

 

My tongue-in-cheek answer, however, is:

The Power of Hao 好

Over appetizers of hummus and baba ghanoush in Riyadh, I asked two Chinese colleagues, “What’s the most frequently used word in Mandarin?” They were uncertain but ventured a couple of guesses. “No,” I disagreed, “but I know what it is.”

I’ve never looked at a Mandarin word corpus, nor have I ever researched the question at all. I’ve never even done a Google search. Nonetheless, I brashly affirm: The most frequently used word in Mandarin is:

HAO 好

 

Read more . . .


19 hours ago, victorhart said:

My serious answer would be to search for existing word corpora, and, if no satisfactory ones exist, find someone with up-to-date basic data analysis skills, gather a bunch of transcripts, and make your own.

 

Researchers did exactly this about a year after this thread started, by analyzing movie and TV subtitles.

http://crr.ugent.be/programs-data/subtitle-frequencies/subtlex-ch


On 9/23/2018 at 4:17 AM, victorhart said:

The most frequently used word in Mandarin is:

HAO 好

 

13 hours ago, Shelley said:

Having time to think about it I would say 很 is up there near the top.

 

Well, according to both the character and the word frequencies in Subtlex, it's neither of these.

 

好 just scrapes into the top 10 by word and the top 15 by character, and 很 makes it into the top 25 both by word and by character.
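A word rank and a character rank can differ because a character also picks up counts from every multi-character word it appears in. As a rough sketch of that relationship (not how Subtlex itself was computed, which I haven't checked), character counts can be derived from word counts like this. The function name and all the numbers below are invented for illustration only.

```python
from collections import Counter


def char_counts_from_words(word_counts):
    """Add each word's count to every character that the word contains."""
    chars = Counter()
    for word, n in word_counts.items():
        for ch in word:
            chars[ch] += n
    return chars


# Made-up word counts, NOT taken from any real frequency list.
word_counts = {"我": 5, "我们": 3, "好": 4, "你好": 2}

chars = char_counts_from_words(word_counts)
print(chars.most_common())
```

In this toy data 我 counts 5 as a word but 8 as a character, because the 3 occurrences of 我们 are added in.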

 

Here are the top 10 rankings first by word:

 









我们

 

And now by character:

 










 

