18th May 2020, 21:07   #989
Janusz
Registered User
 
Join Date: Apr 2020
Location: Poland
Posts: 143
Quote:
Originally Posted by Nikse555
@Janusz: And now really fixed the italic-space-stuff in nOcr: https://github.com/SubtitleEdit/subt...leEditBeta.zip
It's perfect now. These two lines in pol_OCRFixReplaceList.xml

<WordPart from = "ą" to = "ą " />
<WordPart from = "j" to = " j" />

split expressions made of two or even three run-together words back into separate words. Together with the earlier fix for "l" and "I", a text of 1189 lines, almost half of it in italics, is now read almost 100% correctly. Only two mistakes remain. If you add them to your replacements, the accuracy will be 100%.
Really good work, @Nikse555. Thank you again.
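
For anyone wondering why such rules do not put a space after every "ą" or before every "j": here is a rough Python sketch of how a word-part rule could be applied safely. This is only my assumption of the idea, not Subtitle Edit's actual code. The names `WORD_PART_RULES`, `KNOWN_WORDS`, `fix_word` and `fix_line` are made up for the illustration, and the tiny word set stands in for a real Polish dictionary. A split is accepted only when every resulting piece is a known word, and the sketch makes at most one split per word.

```python
# Hypothetical sketch, NOT Subtitle Edit's implementation.
# The two rules mirror the WordPart entries from the post.
WORD_PART_RULES = [("ą", "ą "), ("j", " j")]

# Tiny stand-in dictionary (assumption); a real run would load a full word list.
KNOWN_WORDS = {"są", "tam", "to", "jest", "ja"}


def split_candidates(word, rules):
    """Yield variants of `word` with one rule applied at one position."""
    for src, dst in rules:
        start = word.find(src)
        while start != -1:
            yield (word[:start] + dst + word[start + len(src):]).strip()
            start = word.find(src, start + 1)


def fix_word(word, rules=WORD_PART_RULES, known=KNOWN_WORDS):
    """Split a glued word only if every resulting piece is a known word."""
    if word.lower() in known:
        return word  # already a valid word, leave it alone
    for candidate in split_candidates(word, rules):
        pieces = candidate.split()
        if len(pieces) > 1 and all(p.lower() in known for p in pieces):
            return candidate
    return word  # no safe split found


def fix_line(line):
    """Apply fix_word to each whitespace-separated token of a line."""
    return " ".join(fix_word(w) for w in line.split())


if __name__ == "__main__":
    print(fix_line("sątam i tojest"))
```

Expressions of three glued words would need the check to be repeated on the pieces; the sketch leaves that out to stay short.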
__________________
Sorry for my mistakes - I'm using a translator.