I have another feature request: could we have a checkbox to omit all <i> </i> tags? They are being applied to only half of a line when the whole line is italic, and they are also being added when there are no italic lines at all.
Right now, after I rip a sub, I go through and do a find/replace to delete them all, but it would be great to have that as a feature in Subtitle Edit.
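For anyone doing the same cleanup by hand in the meantime, here is a rough Python sketch of that find/replace step. It is only a workaround script, not how Subtitle Edit does anything internally; the function name `strip_italic_tags` and the sample subtitle text are made up for illustration.

```python
import re

# Matches opening and closing italic tags in any letter case,
# e.g. <i>, </i>, <I>, including stray spaces such as "</ i>".
ITALIC_TAG = re.compile(r"<\s*/?\s*i\s*>", re.IGNORECASE)


def strip_italic_tags(text: str) -> str:
    """Return the text with every <i> and </i> tag removed."""
    return ITALIC_TAG.sub("", text)


# Made-up sample cue where only half of the line was tagged as italic.
sample = (
    "1\n"
    "00:00:01,000 --> 00:00:03,000\n"
    "<i>Half of this line</i> is tagged\n"
)
cleaned = strip_italic_tags(sample)
print(cleaned)
```

To run it on a real file, read the .srt into a string, pass it through `strip_italic_tags`, and write the result back out. Other tags such as <b> or <font> are left alone.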
Very often !! gets detected as ll.
Is this something that can be fixed? Or is there something I can do to help with the detection of exclamation points? Or do I have to wait until Tesseract is updated?
EDIT: On a side note, whatever you did for MS MODI OCR seems to have worked, and it definitely does help!
Here is an example of the ll instead of " or !!
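In case it helps as a stopgap, below is a rough post-OCR cleanup idea sketched in Python. It is purely a guess at one possible heuristic, not anything Tesseract or Subtitle Edit actually does: a word ending in ll is changed to end in !! only when the word itself is not in a word list but the word without the ll is. The tiny `KNOWN_WORDS` set and the name `fix_bangs` are hypothetical; a real run would need a proper dictionary for the subtitle language.

```python
import re

# Hypothetical mini word list for illustration only; a real run would
# load a full dictionary for the subtitle language.
KNOWN_WORDS = {"stop", "help", "no", "all", "well", "call", "tell", "will"}

# A token is a run of ASCII letters; punctuation and spaces are left untouched.
TOKEN = re.compile(r"[A-Za-z]+")


def fix_bangs(line: str, known_words=KNOWN_WORDS) -> str:
    """Replace a trailing 'll' with '!!' when that turns a non-word into a word."""

    def repl(match: re.Match) -> str:
        word = match.group(0)
        stem = word[:-2]
        if (
            word.endswith("ll")
            and word.lower() not in known_words  # "Stopll" is not a real word
            and stem.lower() in known_words      # but "Stop" is
        ):
            return stem + "!!"
        return word

    return TOKEN.sub(repl, line)


print(fix_bangs("Stopll We will call for help"))
```

Real words such as "will" and "call" are left alone because they are in the list, and contractions like "I'll" are untouched because the bare "ll" has no stem to look up. The heuristic will still miss cases where both forms are valid words, so the results would need a manual check.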