
I'm Giving Up On The Import Feature!


drahnier


After more than one frustrating hour with ZDT's import function I'm giving up! I simply don't understand what ZDT is complaining about.

The file I'd like to import looks simple enough: UTF-8 encoded, with entries such as

安静 ān jìng quiet; peaceful; calm

安排 ān pái to arrange; to plan; to set up

爸爸 bà ba (informal) father

to list the first three. I attached the complete file (zipped) to this post.

So the general syntax of items in the file according to zdt should be

S TAB P TAB D (see attached image #1).
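One way to narrow this down before running the import is to check that every line really contains exactly two tab characters and three non-empty fields. The sketch below is only a pre-check written for this thread, not part of ZDT; the names `check_line` and `check_text` and the field labels (characters, pinyin, definition) are assumptions about what S, P and D stand for.

```python
from typing import List, Optional, Tuple

# Assumed meaning of the S / P / D columns; adjust if ZDT defines them differently.
FIELD_NAMES = ("characters", "pinyin", "definition")


def check_line(line: str) -> Optional[str]:
    """Return a description of the problem, or None if the line looks well formed."""
    fields = line.rstrip("\r\n").split("\t")
    if len(fields) != len(FIELD_NAMES):
        return f"expected 3 tab-separated fields, found {len(fields)}"
    for name, value in zip(FIELD_NAMES, fields):
        if not value.strip():
            return f"empty {name} field"
    return None


def check_text(text: str) -> List[Tuple[int, str]]:
    """Check every non-blank line; return (line number, problem) pairs."""
    problems = []
    for number, line in enumerate(text.splitlines(), start=1):
        if not line.strip():
            continue  # blank lines are skipped, not reported
        problem = check_line(line)
        if problem is not None:
            problems.append((number, problem))
    return problems


sample = (
    "安静\tān jìng\tquiet; peaceful; calm\n"
    "安排 ān pái to arrange; to plan; to set up\n"  # spaces instead of tabs
    "\n"
    "爸爸\t\t(informal) father\n"                    # pinyin field left empty
)
for number, problem in check_text(sample):
    print(f"Line {number}: {problem}")
```

To run it on a real file, read the file with `open(path, encoding="utf-8-sig").read()` and pass the result to `check_text`; the `utf-8-sig` codec drops a leading byte order mark, which some editors add and which would otherwise end up glued to the first entry.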

On running import I get a list of parsing errors (see attached image #2). The imported list itself contains a lot of false entries (see attached image #3). Something really screws up the parsing but I can't figure out what it is.

Any help would be greatly appreciated.

1774_thumb.attach

1775_thumb.attach

1776_thumb.attach

l1w.zip


  • 2 weeks later...

I'm having the same trials and tribulations with importing data.

I'm now on version 0.7.0 b3, but it still won't read in my file (in S P D format, tab-delimited).

It's failing on at least 50% of the lines. Here's a small sample:

Line 8: 我们 tā men they

Unable to find in current dictionary.

Line 13: 他是美国人吗? Tā shì měi guó rén ma? Is he American?

Unable to parse line.

I think what I need is an "Ignore all errors on the input" check box :lol:


Line 8: 我们 tā men they

Unable to find in current dictionary.

Line 13: 他是美国人吗? Tā shì měi guó rén ma? Is he American?

Unable to parse line.

Line 8 -> 我们 is wo3 men5, not ta1 men5

Line 13 -> Try ZDT's annotator for phrases and sentences.
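For comparing entries like these, it can help to convert tone-marked pinyin (wǒ men) into the numbered form used above (wo3 men5). The following is a minimal sketch using only Python's standard `unicodedata` module; the function names are made up for this post, and whether ZDT itself expects marks or numbers on import is not something this thread settles. It handles plain space-separated syllables only, so punctuation such as the "?" in line 13 would need to be stripped first.

```python
import unicodedata

# Combining marks that carry the tone once a syllable is decomposed (NFD).
TONE_MARKS = {
    "\u0304": "1",  # macron, first tone
    "\u0301": "2",  # acute, second tone
    "\u030c": "3",  # caron, third tone
    "\u0300": "4",  # grave, fourth tone
}


def syllable_to_numbered(syllable: str) -> str:
    """Strip the tone mark from one syllable and append the tone number."""
    tone = "5"  # no mark means neutral tone, written here as 5
    kept = []
    for ch in unicodedata.normalize("NFD", syllable):
        if ch in TONE_MARKS:
            tone = TONE_MARKS[ch]
        else:
            kept.append(ch)  # keeps the diaeresis of ü, which is not a tone mark
    return unicodedata.normalize("NFC", "".join(kept)) + tone


def pinyin_to_numbered(pinyin: str) -> str:
    """Convert space-separated tone-marked syllables to numbered pinyin."""
    return " ".join(syllable_to_numbered(s) for s in pinyin.split())


print(pinyin_to_numbered("wǒ men"))   # wo3 men5
print(pinyin_to_numbered("tā men"))   # ta1 men5
```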

I think what I need is an "Ignore all errors on the input" check box

I also think this could be useful, but I would want to see the entry flagged so that I could research and correct it, if necessary. There could be a global option to show or hide the flag.
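As a rough illustration of that idea (not how ZDT works internally, which this thread does not show), an importer could accept every row and attach a flag instead of rejecting it. Everything in the sketch is hypothetical: the `Entry` class, the `import_lenient` function, and the tiny `reference` dictionary standing in for whatever dictionary the program checks against.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, List, Set, Tuple


@dataclass
class Entry:
    characters: str
    pinyin: str
    definition: str
    flagged: bool = False  # True when the entry needs a manual look
    note: str = ""         # why it was flagged


def import_lenient(
    rows: Iterable[Tuple[str, str, str]],
    reference: Dict[str, Set[str]],
) -> List[Entry]:
    """Import every row; flag mismatches instead of dropping them."""
    entries = []
    for characters, pinyin, definition in rows:
        entry = Entry(characters, pinyin, definition)
        known = reference.get(characters)
        if known is None:
            entry.flagged = True
            entry.note = "not found in reference dictionary"
        elif pinyin not in known:
            entry.flagged = True
            entry.note = "pinyin differs from reference: " + ", ".join(sorted(known))
        entries.append(entry)
    return entries


# Hypothetical stand-in for the program's dictionary.
reference = {"我们": {"wǒ men"}, "他们": {"tā men"}}
rows = [
    ("我们", "tā men", "they"),
    ("他们", "tā men", "they"),
    ("他是美国人吗?", "Tā shì měi guó rén ma?", "Is he American?"),
]
for entry in import_lenient(rows, reference):
    status = "FLAGGED: " + entry.note if entry.flagged else "ok"
    print(entry.characters, status)
```

A show/hide option would then just be a filter on `entry.flagged` when displaying the list.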


Thanks, Luobot. I hadn't even thought it might be checking the characters against the pinyin in the dictionary... so that's why it runs so slowly! :wink:

This is a little too thorough for me. I have thousands of lines of data, much of it in sentences.

I agree it would be ideal (for me at least) if discrepancies were just flagged rather than causing the import to fail.
