4/6/2023 0 Comments Subler subtitler![]() ![]() Most likely it'll be a common issue that fixes other episodes too.ĥ. So you can start watching your episodes now (after muxing in the new subs), and if you see anything that doesn't make sense, you go back and fix it in the SRT files again and remux again. However, we should be at least 80% done by now unless you're extremely unlucky and the OCR is incredibly bad. There will still be cases that the spell checker misses or things that don't make sense in grammar because the OCR fucked up. Even after doing all this, your subs will still not be 100% perfect. It took me ~3 days of 5 hours each to get through 291 episode of DBZ with the spell checker.Ĥ. But you can imagine it takes quite a while. This isn't so bad because once you correct one word, you can add it to a dictionary for the spell checker and it won't ask you again. You need to be there to tell the spell checker if the word that it thinks is not in the English dictionary should stay because it's a name or some foreign word or not. This is the most time-consuming task on the list, because everything else you can script (which I did and it took less than a day to get through Steps 1-2), but this step you cannot let a computer do automatically. You can't automate this because you need to make sure you're only replacing incorrect words and not names or Japenese words that aren't in the English dictionary (for example in my case with anime). To do that, use a spell checking program like aspell (or whatever you want) to go through each SRT file. Now that most of your common OCR issues are fixed, you still have plenty of spelling mistakes to fix. They need to be something like "Replace 5 followed by letters with S followed by the same letters".ģ. And they have to be complex regex filters, not just simple "replace 5 with S" because that will screw up everything. My final filter file had over 200 replacements, and I had to figure out each one by hand. Create a filter with whatever find/replace program you want (I use sed), and replace all the common issues in the SRT files. Go through the SRT files and look for common issues such as I being written as | or i, or S being written as 5 or vice versa. To fix the OCR errors, you first need to recognize all the common problems. Now you have a bunch of SRT files for all your videos, but they will have loads of spelling errors (no matter how good the OCR is), and also OCR errors (such as I being recognized as a | or incorrect spacing or incorrect newlines).Ģ. Do this with whatever software you like, I used vobsub2srt which uses tesseract as the OCR backend. This is going to be a really high-level explanation.Įxtract all VOBSUBs from files with ffmpeg, then OCR the VOBSUBs to SRT. Here is what I had to go through to get perfect subtitles for an anime I ripped from DVD that had VOBSUBs (Dragon Ball Z Dragon Box Set). It's not as simple as putting it through one program and that's that. I just recently did this and it is much more laborious and time-consuming than you think. It worked great and I recommend it for anyone else struggling with VobSubs. Has anyone else had success with these tools? Are there any other tools I should be using?Įdit: I've found success using SubtitleEdit. This could be a problem with the source files, so I've been trying to get one of the other tools working.Īvidemux - I read that there should be an option to convert VOBSUB under "tools". SubRip - This works the best of the tools I've tried, but it entirely freezes and crashes at 51%. I've tried changing the space detection sensitivity to both ends of the range without any difference. ![]() I have to enter entire lines for them to be recorded correctly and the algorithm doesn't seem to learn and improve the detection, making this unusable. VobSub - I can begin the OCR process, but it seems to fail to detect any spaces. ![]() Almost all of the recommendations I've seen are many years old. I've tried using a few different tools that I see circulating without any success. I've been trying to convert some truly ugly VOBSUBs to SRT subtitles. ![]()
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |