martindbp, posted May 17, 2022 at 01:13 PM

Hey everyone! I'm a software/ML/computer vision engineer by trade, and I've spent some time building an OCR subtitle extraction algorithm for videos and made the extracted subtitles accessible through a browser extension. My goal is to make pretty much any video of interest online available, whether on YouTube, Netflix or Chinese sites like Bilibili. As of now I'm working only with YouTube, though. You can download it here, and find a short installation/user guide here. For now it's what I would consider "beta" software: it's Chrome only and requires manual installation. The current list of processed shows can be found at browse.zimu.ai. The list is pretty short right now, but I'm processing new videos every day.

As you probably know, there are quite a few similar extensions for soft subs (which are naturally also supported), but I'm trying out a slightly different concept/philosophy for the subtitles. The idea is to display the minimal yet sufficient information for a learner to understand the content in a reasonable time frame. From the start, the pinyin, hanzi and word translations are visible for all words. Gradually you can hide the information you already know, while new, unknown words remain visible by default, hopefully keeping you in flow. If you keep learning until all the subtitles are completely hidden, voilà, you're fluent! At least that's the idea. Naturally, everyone is free to use it however suits them best, and I've tried to include enough settings to keep it flexible.

The extension comes with the standard Anki CSV file export. You can export the usual basic or cloze notes, but I've also added the ability to export the JSON of the whole containing sentence, along with dictionary info, so that you can build very advanced cards in Anki if you wish (example cards are provided in the guide).
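To give a rough idea of what a cloze export with an attached sentence JSON could look like, here is a minimal sketch in Python. It is not the extension's actual code or file format: the function name `export_notes`, the column order and the keys (`words`, `hanzi`, `pinyin`, `translation`, `export`) are all assumptions made up for illustration. Only the `{{c1::...}}` cloze syntax and CSV import are standard Anki features.

```python
import csv
import io
import json


def export_notes(sentences):
    """Return CSV text with one cloze note per word flagged for export.

    Hypothetical column layout: cloze text, word translation, sentence JSON.
    """
    buffer = io.StringIO()
    writer = csv.writer(buffer)  # quotes fields that contain commas or quotes
    for sentence in sentences:
        # The third column carries the whole sentence as JSON, so a card
        # template can render pinyin, translations, etc. as it likes.
        sentence_json = json.dumps(sentence, ensure_ascii=False)
        for i, word in enumerate(sentence["words"]):
            if not word.get("export"):
                continue
            # Wrap only the target word in Anki's cloze markers.
            cloze = "".join(
                "{{c1::" + w["hanzi"] + "}}" if j == i else w["hanzi"]
                for j, w in enumerate(sentence["words"])
            )
            writer.writerow([cloze, word["translation"], sentence_json])
    return buffer.getvalue()


# Made-up example sentence; the keys are assumptions, not the real schema.
example = [{
    "translation": "I like to drink tea.",
    "words": [
        {"hanzi": "我", "pinyin": "wǒ", "translation": "I"},
        {"hanzi": "喜欢", "pinyin": "xǐhuan", "translation": "to like", "export": True},
        {"hanzi": "喝", "pinyin": "hē", "translation": "to drink"},
        {"hanzi": "茶", "pinyin": "chá", "translation": "tea"},
    ],
}]

csv_text = export_notes(example)
```

Writing `csv_text` to a `.csv` file would give one row per flagged word, with the full sentence JSON available as an extra field for a custom card template.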
That said, (deep) knowledge tracing has been a research interest of mine for quite a while, and I see big potential in reducing the time we spend in SRS by helping us encode memories more efficiently and by using inter-card dependencies to improve the scheduling. So at some point I'll probably take a stab at an embedded SRS.

As for funding, I'm making this browser extension available for free. I'm putting as much functionality as I can client-side (in the browser) and optimizing for low cost, so that each additional user has a very low marginal cost. For full disclosure, my philosophy here is to provide something useful to as many people as possible, and to find other ways to support it financially rather than a subscription or locking important features behind a paywall. That might be Patreon donations, selling the OCR as a SaaS, or even VPN/affiliate ads on the browsing site (not in the extension).

So, are there any cool YouTube videos or channels with hard subs (or soft) you've been wanting to watch? Any and all feedback is warmly welcome! Hope you find it useful!