Chinese-forums.com

Also sharing: Pleco character-frequency dictionary

jannesan

Hi everyone,

The other day I found the thread with the word-frequency Pleco user dictionary (https://www.chinese-forums.com/forums/topic/56816-sharing-a-pleco-word-frequency-user-dictionary/) and started using it. As a complement, I made a character-frequency dictionary with entries for single characters, each giving the character's position in a character-frequency ranking (as opposed to its frequency among all words, as in the word-frequency dictionary).

 

The sources I used were SUBTLEX (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2880003/) and the list by Jun Da (http://lingua.mtsu.edu/chinese-computing/statistics/char/list.php?Which=MO). The entries show up both in the list of search results when you search for a single character and in the DICT tab of a single-character entry.

 

To add the character-frequency dictionary to your Pleco, download cfreq.pqb, open Pleco - Settings - Manage Dictionaries - Add, select EXISTING, and select the cfreq.pqb file on your phone.

 

cheers,

Jannes

 

cfreq.pqb

js6426

This is an excellent resource!  Thank you!

Weyland

This is very much appreciated! Question though; would there be a quick way to create a flashcard system out of these? I don't like admitting it, but I have been neglecting my traditional characters. It would be nice if there was a way to learn the traditional version of all the characters based on their frequency through flashcards. 

jannesan
On 9/14/2019 at 5:53 AM, Weyland said:

This is very much appreciated! Question though; would there be a quick way to create a flashcard system out of these?

 

Yea, that should be pretty easy, depending on what flashcard system you use.

In Pleco you can just import a list of words and automatically create flashcards (http://iphone.pleco.com/manual/30200/flash.html#import).

I'm appending the text files of the frequency lists if you'd like to try.

 

 

junda2004.txt subtlex2010.txt
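For anyone who wants to script the import step, here is a minimal sketch in Python. It rests on two assumptions that are not confirmed in this thread: that each line of the frequency list contains the character somewhere among its fields (the exact layout of the attached files is not described here), and that Pleco's flashcard import accepts a plain-text file with one headword per line and `//` lines as category headers (please check the manual linked above). The function names are made up for illustration, and conversion to traditional characters is not handled.

```python
def first_han(line):
    """Return the first CJK ideograph found in a line, or None."""
    for ch in line:
        if "\u4e00" <= ch <= "\u9fff":
            return ch
    return None


def extract_chars(text, limit=None):
    """Collect characters in file order, skipping repeats.

    Assumes the file is already sorted from most to least frequent.
    """
    chars = []
    seen = set()
    for line in text.splitlines():
        ch = first_han(line)
        if ch is not None and ch not in seen:
            seen.add(ch)
            chars.append(ch)
            if limit is not None and len(chars) >= limit:
                break
    return chars


def to_pleco_import(chars, category):
    """Build import text: a //category header, then one headword per line."""
    return "".join(["//" + category + "\n"] + [ch + "\n" for ch in chars])


if __name__ == "__main__":
    # Toy input standing in for a real frequency list.
    sample = "1\t的\t100\n2\t一\t80\n3\t是\t60\n"
    print(to_pleco_import(extract_chars(sample, limit=2), "Top characters"))
```

To use it on a real list, read the file with `open(path, encoding="utf-8").read()`, pass the text to `extract_chars`, and write the result of `to_pleco_import` to a new text file for import.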

mungouk

Excellent effort...


How difficult would it be to add annotations to the dictionary entries to show the level of HSK vocabulary?

 

In other words, for learners to know... at my level am I expected to know this word?

 

 

 

 

mikelove

That's actually built-in: just go into Organize, tap the (i) button next to a category (on iOS) or long-press it (on Android), and tap the option to Add Tag; that tag will then show up at the top right corner for entries in that category.

 

(we're working on adding this to the popup reader + search results too)

DavyJonesLocker

Interesting comparison between Jun Da's list and the SUBTLEX one for character frequency. Some characters are wildly different in terms of frequency position. Some in the top thousand of Jun Da's 'Combined' list can be up to 2000 places lower in the SUBTLEX one. E.g. 亦 is placed 501 in the Jun Da Combined list and 2583 in the SUBTLEX list.
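A comparison like this can be scripted. The following is a minimal sketch, assuming each list has already been loaded as a Python list of characters ordered from most to least frequent; the function names and the toy data are made up for illustration and are not taken from either source.

```python
def rank_map(chars):
    """Map each character to its 1-based position (first occurrence wins)."""
    ranks = {}
    for i, ch in enumerate(chars, start=1):
        ranks.setdefault(ch, i)
    return ranks


def biggest_rank_gaps(list_a, list_b, top_n=1000, show=10):
    """Characters from the top_n of list_a with the largest rank difference in list_b.

    Returns tuples of (character, rank_in_a, rank_in_b, rank_in_b - rank_in_a).
    Characters missing from list_b are skipped.
    """
    ranks_a = rank_map(list_a)
    ranks_b = rank_map(list_b)
    gaps = []
    for ch, ra in ranks_a.items():
        if ra > top_n:
            continue
        rb = ranks_b.get(ch)
        if rb is None:
            continue
        gaps.append((ch, ra, rb, rb - ra))
    gaps.sort(key=lambda g: abs(g[3]), reverse=True)
    return gaps[:show]


if __name__ == "__main__":
    # Toy lists, not real frequency data.
    list_a = ["的", "一", "亦", "是"]
    list_b = ["的", "是", "一", "不", "人", "亦"]
    for row in biggest_rank_gaps(list_a, list_b, show=2):
        print(row)
```

A positive difference means the character sits lower in the second list than in the first.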

 

It should be noted that Jun Da's lists cover several categories: imaginative, informative, classical, etc.

 

Which begs the question: if trying to go through the top 1000 characters, which list should you choose? I have been using Jun Da's Modern list (combined from the categories of 'Imaginative' and 'Informative' Chinese texts, as the author originally categorised them); however, it is still fairly different from SUBTLEX. SUBTLEX would include a lot of the popular historical TV dramas.

 

Hence the often advertised "top 1000 characters cover 90% of all Chinese" is quite inaccurate, imo.
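A coverage figure of this kind is just the share of all character occurrences in a corpus that the N most frequent characters account for, so it changes with the corpus. Here is a minimal sketch of that arithmetic, assuming you have raw occurrence counts per character; the function name and the numbers are made up for illustration.

```python
def coverage(counts, top_n):
    """Fraction of all occurrences accounted for by the top_n most frequent items."""
    ordered = sorted(counts, reverse=True)
    total = sum(ordered)
    if total == 0:
        return 0.0
    return sum(ordered[:top_n]) / total


if __name__ == "__main__":
    # Toy counts, not real corpus data.
    toy_counts = [50, 30, 15, 5]
    print(coverage(toy_counts, 2))
```

Running the same calculation on counts from two different corpora would show how far the figure moves between them.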

 

Plod on ....

 

 

 

jannesan
20 hours ago, mungouk said:

How difficult would it be to add annotations to the dictionary entries to show the level of HSK vocabulary?

 

Very easy actually; I will update this character dictionary with that when I find the time.

It would be a bit more interesting on a word level though, so I may also update the dictionary posted by BearXiong.

 

4 hours ago, DavyJonesLocker said:

Which begs the question: if trying to go through the top 1000 characters, which list should you choose?

 

I'd argue you shouldn't be going through any frequency list at all to learn characters, but use them only as an indicator of whether a newly encountered word/character is worth learning. But you're right, the question of whether it is worth it is probably best answered by choosing a frequency list based on your interests/use case.

mungouk
19 hours ago, mikelove said:

That's actually built-in,

 

Thanks @mikelove but I meant for the whole dictionary, so every time you look something up you see the HSK level (so you know if you "need to know it" yet).

 

Like MDBG does. 

 

The only flashcards I have are "Uncategorized" and pressing (i) on that says the category can't be tagged...

 

 

DavyJonesLocker
12 minutes ago, jannesan said:

I'd argue you shouldn't be going through any frequency list at all to learn characters, but use them only as an indicator of whether a newly encountered word/character is worth learning. But you're right, the question of whether it is worth it is probably best answered by choosing a frequency list based on your interests/use case.

 

 

 

 

I agree in principle not to go through a frequency list unless there is a very good reason. Personally, I found allocating time each day to handwrite the top 1000 characters, purely for the purposes of remembering them exactly and noting subtle differences (e.g. 已 and 己), to be very helpful. Every student is certainly going to encounter every one of the top 1000 characters within a year of study, irrespective of the source material.

 

(E.g. I was reading a book earlier and saw the term 大冢. I immediately thought 大家 but knew something did not look quite right; I think if I hadn't learned to handwrite a pile of characters I would have been highly confused.)

 

Going from 1000 to 2000+, I think the argument starts to lose weight.

 

Word lists: yes, I agree not to go through them unless you know you are covering, or will cover, the material (e.g. HSK). Although again, the top 1000 words are still worth going through (in conjunction with your other learning), as it's highly likely that a learner of Chinese is going to encounter almost all of them quite frequently within several months.

 

 

mikelove
1 hour ago, mungouk said:

The only flashcards I have are "Uncategorized" and pressing (i) on that says the category can't be tagged...

 

Go to Import/Export / "Install Premade Cards" in the sidebar menu to install our premade HSK lists; you can then create tags for them. Those tags will appear in all dictionary entries for those words, not just when looking at flashcards.
