Chinese-forums.com

How to decode/recode Chinese gibberish (乱码 or mojibake)


vellocet


I have a plain text file that I know contains Chinese, but when I open it, it displays as gibberish. I know it's some kind of encoding problem, but how can I fix it? I already tried opening it as UTF-8 and that didn't work. Here is a sample of the text. The numbers and time codes are just subtitle numbering and timing, so ignore them.

 

3
00:00:34,761 --> 00:00:37,216
¡]ÄÁÁn¡^

4
00:01:06,281 --> 00:01:08,168
¡]­µ¼ÖÅT°_¡^

5
00:01:20,489 --> 00:01:22,725
§O®`²Û

6
00:01:22,792 --> 00:01:25,792
Åý·P±¡¦Û¥Ñ©b¬y§a

7
00:01:27,401 --> 00:01:29,605
§O®`©È

8
00:01:29,673 --> 00:01:32,804
¤£µM¨S¤Hª¾¹D§A¦b¨º¨à

9
00:01:34,089 --> 00:01:36,707
©ù°_§AªºÀY

10
00:01:36,777 --> 00:01:39,647
Åý·P±¡ºÉ±¡«Å¬ª§a

11
00:01:40,905 --> 00:01:43,458
¤£­n®`²Û

12
00:01:43,529 --> 00:01:46,464
Åý·P±¡¦Û¥Ñ©b¬y§a
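
The sample above looks like Big5 (Traditional Chinese) bytes that were displayed as if they were Latin-1. That is a guess from the look of the characters, not something confirmed in this thread. If the guess is right, the damage can be undone by turning the displayed characters back into bytes and decoding those bytes as Big5. A minimal Python sketch, where `fix_mojibake` and its parameter names are made up for illustration:

```python
def fix_mojibake(garbled: str, shown_as: str = "latin-1", really: str = "big5") -> str:
    """Undo a wrong decode.

    shown_as: the encoding the text was wrongly displayed in (assumed Latin-1).
    really:   the encoding the file was actually saved in (assumed Big5).
    """
    raw = garbled.encode(shown_as)  # recover the original bytes
    return raw.decode(really)       # decode them the intended way


if __name__ == "__main__":
    # One cue copied from the sample above. Big5 is an assumption here.
    print(fix_mojibake("§O®`²Û"))
```

If the gibberish was produced by Windows-1252 rather than Latin-1, pass `shown_as="cp1252"` instead. The two differ only for bytes 0x80 to 0x9F, so either works for the lines shown here.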


I have EditPad, which lets you reopen a file under a different encoding, but none of the options I could see worked.

 

I downloaded this from a Chinese site so it must work for someone.  This has to be a common problem with a common solution.  

 

If I open it in Notepad, I get Chinese, but it's the wrong Chinese.

 

1
00:00:24,361 --> 00:00:28,420
牧羘

2
00:00:29,609 --> 00:00:33,351
牧羘

3
00:00:34,761 --> 00:00:37,216
牧羘

4
00:01:06,281 --> 00:01:08,168
贾臫癬

5
00:01:20,489 --> 00:01:22,725
甡槽

6
00:01:22,792 --> 00:01:25,792
琵稰薄パ゜瑈

7
00:01:27,401 --> 00:01:29,605
甡┤

8
00:01:29,673 --> 00:01:32,804
ぃ礛⊿笵êㄠ

9
00:01:34,089 --> 00:01:36,707
癬繷

10
00:01:36,777 --> 00:01:39,647
琵稰薄荷薄

11
00:01:40,905 --> 00:01:43,458
ぃ璶甡槽

12
00:01:43,529 --> 00:01:46,464
琵稰薄パ゜瑈


Outstanding!  I knew something like this had to exist somewhere!  But gosh, I didn't expect you had to code it yourself.  Thanks though, you've done the world a service.  And me!  Bookmarked!

 

Having decoded the file (finally), I find now that it is in Traditional Chinese. 😒  And the time codes are off.  If it ain't one thing, it's another. I'm off to find a converter - which has been done before by many others.  And VLC has some kind of time shifting for subtitles, which I need to look up.  Thanks to Demonic_Duck for saving the day.  :P
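
For the time codes, the file itself can also be shifted instead of adjusting playback. Below is a rough Python sketch, not taken from this thread, that moves every `HH:MM:SS,mmm` stamp by a fixed number of milliseconds. The name `shift_srt` is made up, and the regex will also touch any dialogue text that happens to look like a time stamp. The Traditional to Simplified step needs a character mapping table, so it is not attempted here.

```python
import re

# Matches SRT-style time stamps such as 00:01:06,281
_TIMESTAMP = re.compile(r"(\d{2}):(\d{2}):(\d{2}),(\d{3})")


def shift_srt(text: str, offset_ms: int) -> str:
    """Shift every time stamp in `text` by offset_ms (negative = earlier).

    Results that would fall before zero are clamped to 00:00:00,000.
    """
    def shift(match: "re.Match[str]") -> str:
        h, m, s, ms = (int(g) for g in match.groups())
        total = ((h * 60 + m) * 60 + s) * 1000 + ms + offset_ms
        total = max(total, 0)
        h, rest = divmod(total, 3_600_000)
        m, rest = divmod(rest, 60_000)
        s, ms = divmod(rest, 1000)
        return f"{h:02d}:{m:02d}:{s:02d},{ms:03d}"

    return _TIMESTAMP.sub(shift, text)


if __name__ == "__main__":
    # Delay one cue from the sample by 1.5 seconds.
    print(shift_srt("00:00:34,761 --> 00:00:37,216", 1500))
```

A constant offset only helps if the subtitles are off by the same amount throughout. If the drift grows over the film, the timings need scaling as well.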

