Welcome to Doom9's Forum, THE in-place to be for everyone interested in DVD conversion. Before you start posting please read the forum rules. By posting to this forum you agree to abide by the rules. |
3rd May 2020, 21:20 | #921 | Link |
Registered User
Join Date: Feb 2004
Location: Mars
Posts: 428
|
@GCRaistlin:
>1. After loading the OCR results to the main window SE behaves as if an unchanged SRT file is open I cannot re-create this... what file format and how do you open the file exactly? >3. Open a BD SUP... Yeah, that's a feature. Do any real work and SE will prompt. >3. The ability to load DVD SUP files from UI... That works here in main window via File - Open... or drag-n-drop. How/where does it not work? |
3rd May 2020, 22:48 | #923 | Link | |
Registered User
Join Date: Apr 2020
Location: Poland
Posts: 143
|
@GCRaistlin
Quote:
For me in the title of the main window after loading the subtitles after OCR the window name changes to: * D: \ full path \ filename.html \ index.srt - SubtitleEdit... If I am now trying to close the program I get a prompt to save a new file. I don't know why it doesn't work for you. After saving, the window name indicates where to save the index.srt file without *. If I change anything in the text, the program name will start with * again.
__________________
Sorry for my mistakes - I'm using a translator. Last edited by Janusz; 3rd May 2020 at 23:09. |
|
4th May 2020, 12:37 | #925 | Link |
Registered User
Join Date: Feb 2004
Location: Mars
Posts: 428
|
@GCRaistlin: Beta updated: https://github.com/SubtitleEdit/subt...leEditBeta.zip
Fixed dvd sup from cmd line + ass paste + change detection with file+ocr from cmd line + compare issue + image save as number + remember column paste options - hopefully some of that also works you? |
4th May 2020, 17:33 | #926 | Link |
Registered User
Join Date: Apr 2020
Location: Poland
Posts: 143
|
@Nikse555
I don't know how understandable this translator text will be, but I'll try. In connection with the problem of correct recognition by the OCR systems of the lowercase "L" and the "I", please explain briefly the principles which follow Subtitle Edit for automatic correction during OCR. Why i ask?
Finally, my suggestion to consider in the distant or near future. When automating the OCR process, give the opportunity to use (set) a second dictionary. Subtitles are usually a translation of one language into another. However, proper names, first names, last names etc. which do not have equivalents in a given language are often not translated. Using a second language outside the main language - will generate fewer errors, making the process more intelligent. We don't always create or correct subtitles in one language. If the translation is not understandable enough, I am sorry. I wanted to help You and myself.
__________________
Sorry for my mistakes - I'm using a translator. Last edited by Janusz; 4th May 2020 at 19:19. |
4th May 2020, 20:52 | #927 | Link |
Acid fr0g
Join Date: May 2002
Location: Italy
Posts: 2,542
|
I really can't find a pixel space to binary OCR the italic part of this sup. No problems at all with SupRip.
Can you please tell me if you can and what parameters are you using?
__________________
@turment on Telegram |
5th May 2020, 00:28 | #928 | Link | ||
Registered User
Join Date: Jun 2006
Posts: 350
|
Not sure what you mean. The other fixes are confirmed, thank you.
Quote:
My logic is pretty simple: if there is any recognized data (even just one char) SE should ask about discarding if the user press Cancel or tries to close the window. Quote:
Bug: if we open a DVD SUP from the command line and then press Cancel in 'Import/OCR' dialog SE remains open.
__________________
Windows 8.1 x64 Magically yours Raistlin |
||
5th May 2020, 11:47 | #929 | Link | |
Registered User
Join Date: Feb 2004
Location: Mars
Posts: 428
|
@GCRaistlin:
"Fixed dvd sup from cmd line" is about the "*" in the title bar. OK, OCR window should now prompt for save changes if anything has been added. File - Import/OCR' for BD SUP will now also allow dvd sup. I just always use File -> Open... >Bug: if we open a DVD SUP from the command line and then press Cancel in 'Import/OCR' dialog SE remains open. Thx, should now hopefully be fixed. Latest beta updated: https://github.com/SubtitleEdit/subt...leEditBeta.zip Quote:
|
|
5th May 2020, 13:47 | #931 | Link |
Acid fr0g
Join Date: May 2002
Location: Italy
Posts: 2,542
|
@Nikse
This too has problem with Italic. Perhaps there was some regression at some point, because almost all titles I am doing OCR are "cursed". Can you fix the binary compare engine, regarding Italic? You are telling that it's easily fixed by OCR error rules but I can't find any universally suitable.
__________________
@turment on Telegram |
5th May 2020, 15:50 | #932 | Link | |
Registered User
Join Date: Jun 2006
Posts: 350
|
Quote:
Feature requests:
A lot of mistakes in word boundaries detection could be avoided if the vertical lines of a character, if any, were taken into account in the first place. Therefore we need a new setting - 'No of pixels from/to vertical line is space'; this setting takes precedence over simple 'No of pixels is space'. Examples:
__________________
Windows 8.1 x64 Magically yours Raistlin Last edited by GCRaistlin; 5th May 2020 at 15:53. |
|
5th May 2020, 22:24 | #933 | Link |
Registered User
Join Date: Jun 2006
Posts: 350
|
Feature request: use different fonts for list view and text boxes. For text boxes, Courier New is sometimes a better choice than Tahoma as it clearly shows the difference between a double quote and doubled apostrophe (OCR error). But list view looks ugly with Courier.
__________________
Windows 8.1 x64 Magically yours Raistlin |
6th May 2020, 01:22 | #934 | Link |
Registered User
Join Date: Jun 2006
Posts: 350
|
There's some incompatibility with Ditto, the clipboard manager:
__________________
Windows 8.1 x64 Magically yours Raistlin |
6th May 2020, 19:57 | #940 | Link |
Registered User
Join Date: Feb 2004
Location: Mars
Posts: 428
|
@GCRaistlin: Regarding Ditto, I have seen the error like 1 time in 100 pastes and I'm afraid I've no idea what's wrong or how to fix it. Ideas/fixes are welcome.
@tormento: Writing a new image-to-letter-splitter and integrating it into SE is a lot of work - could easily take 14+ days full time. Is SupRip open source? Edit: Thx for the sample files with italic! Surely uses more tilt than I've seen before. |
Thread Tools | Search this Thread |
Display Modes | |
|
|