Welcome to Doom9's Forum, THE in-place to be for everyone interested in DVD conversion. Before you start posting please read the forum rules. By posting to this forum you agree to abide by the rules. |
![]() |
#761 | Link |
Registered User
Join Date: Nov 2018
Posts: 33
|
Currently I'm using Tesseract 4.00 with engine mode Tesseract + LSTM.
Unfortunately I have to do massive post work. 'I' is sometimes a 'L' ,'l' or '!'. Music symbol isn't recognized right (it's a P or D). And I don't know what I have overlooked. Is there anything I could improve in my settings? EDIT: I'm using binary image compare now and creating new database for every BluRay. Works. Last edited by FLX90; 7th January 2019 at 23:53. |
![]() |
![]() |
![]() |
#762 | Link | |||
Registered User
Join Date: Jul 2003
Location: Brazil
Posts: 233
|
Hello, dear all.
I got the newest version from Subtitle Edit and try to convert a subtitle format to other. The original subtitle is *.vtt fomat. The output is to be *.srt format. But after the convertion the subtitle in srt show some strange errors. Eg: Quote:
Quote:
Quote:
Do you know how to fix that? I can make manual correction after the convertion to srt, but I would prefer that the program could do it at the first attempt, without any "errors" or strange "symbols" after the task. Thanks for your time. Best regards. Last edited by johner23; 14th January 2019 at 02:07. |
|||
![]() |
![]() |
![]() |
#763 | Link |
Registered User
Join Date: Jan 2019
Posts: 1
|
Only first text is converted on batch mode
Hi
If I convert from sub to srt on batch-mode (/convert) the resulting srt file contains all timeframes but only the first timeframe contains a text. If I do this with the GUI, the whole text is converted and all timeframes contains text. What do I wrong? Thanks Hendi |
![]() |
![]() |
![]() |
#764 | Link |
Registered User
Join Date: Feb 2004
Location: Mars
Posts: 216
|
@Hendi: That's a bug, sorry - should be fixed in latest beta: https://github.com/SubtitleEdit/subt...leEditBeta.zip
Next final version should be out soon... |
![]() |
![]() |
![]() |
#765 | Link | |
Registered User
Join Date: May 2005
Posts: 1,311
|
Quote:
Btw, why OCR-ing?! That's so 1985! ![]()
__________________
Gorgeous, delicious, deculture! |
|
![]() |
![]() |
![]() |
#766 | Link | |
Registered User
Join Date: Feb 2004
Location: Mars
Posts: 216
|
Quote:
https://github.com/SubtitleEdit/subt...leEditBeta.zip SE 3.5.9 should be out soon... |
|
![]() |
![]() |
![]() |
#767 | Link |
Registered User
Join Date: Mar 2016
Posts: 3
|
I'm using SE to convert from bitmap Chinese/Japanese/Korean subs (sup, idx/sub, DVDsub).
There is a possibility to save one sub picture at a time (with right click) while importing , but could you add a function of saving all the sub pictures during OCR? For example, just a new option in the "export" (during OCR): export all the pictures as *.png? (just the images, no xml because it would be more complicated with "dirty" idx/subs (i.e with errors).) It's because after finishing the OCR and saving the OCRed srt, I have to correct the (many) mistakes "offline" , but there is no way to compare the OCRed text to the original pictures. Thanks for your great work! Last edited by dngnt; 10th February 2019 at 09:11. |
![]() |
![]() |
![]() |
#769 | Link |
Registered User
Join Date: Feb 2004
Location: Mars
Posts: 216
|
And SE 3.5.9 is out: https://github.com/SubtitleEdit/subtitleedit/releases
Some of the changes (mostly related to OCR) listed below: * NEW: * Bookmarks - thx OmrSi/marb99 * Image export - option to have single lines top justified - thx joedmartin * IMPROVED: * Improve Binary OCR of comma / apostrophe - thx Tuukka * Improve quote/italic detection in binary OCR - thx Miggu * Add context menu to OCR spell check * Binary OCR auto detect best DB - thx Mr. Rage * FIXED: * Fix missing/bad html tags after "Auto br" - thx iromafia111 * Fix crash in OCR window when closing - thx spetragl * Fix OCR in batch convert - thx danstraughn * Fix crash parsing empty word in OCR via Tesseract - thx Barry |
![]() |
![]() |
![]() |
#771 | Link |
Registered User
Join Date: Feb 2004
Location: Mars
Posts: 216
|
@varekai: Yes, Tesseract302 is included in the installer version as well.
Tesseract 4 is available as an in-program download, but note that T4 does not support italic detection and is a lot slower, but T4 is available in more languages and may work better for small unclear fonts. |
![]() |
![]() |
![]() |
#773 | Link | |
Registered User
Join Date: Mar 2016
Posts: 3
|
Quote:
Subtitle Edit is the best for its import/export versatility for srt, and with these 2 export abilities for sup,idx/sub and DVDsub, it's truly the Swiss knife for subtitling! |
|
![]() |
![]() |
![]() |
Thread Tools | Search this Thread |
Display Modes | |
|
|