Welcome to Doom9's Forum, THE in-place to be for everyone interested in DVD conversion.

Before you start posting please read the forum rules. By posting to this forum you agree to abide by the rules.

 

Go Back   Doom9's Forum > General > Subtitles

Reply
 
Thread Tools Search this Thread Display Modes
Old 23rd October 2012, 05:25   #1  |  Link
peppi_le_piou
Registered User
 
Join Date: Dec 2005
Posts: 70
Special Characters not showing properly

I am using VirtualDubMod v1.5.10.2 with the TextSub 2.23 plugin.

I have some subs that I want to put into a video file but some of the European characters are not recognised.

Using Notepad I typed in, for example, ....fraülein.... but adding the TextSub Filter and browsing to my subs file, in the VDMod preview window I see .....fraA¼lein.....

The subs format is .ssa. Everything else that I have done using override codes and fonts and style, etc is fine except for this problem.

Any help please?

ps.... I just noticed that using VLC to play the video file with the same, but seperate .ssa subs file (not hard coded) that it does display these characters correctly.

Last edited by peppi_le_piou; 23rd October 2012 at 05:35.
peppi_le_piou is offline   Reply With Quote
Old 23rd October 2012, 09:51   #2  |  Link
Groucho2004
 
Join Date: Mar 2006
Location: Barcelona
Posts: 5,034
You saved the file in UTF-8 format and TextSub doesn't seem to be able to interpret this properly. Save the file to ANSI (CP-1252 in your case I guess).

Edit: Actually, TextSub should be able to read UTF-8. I have no problem with UTF-8 .srt files. Anyway, I'm certain that there are many threads here about the same problem.

Last edited by Groucho2004; 23rd October 2012 at 10:08.
Groucho2004 is offline   Reply With Quote
Old 23rd October 2012, 13:07   #3  |  Link
filler56789
SuperVirus
 
filler56789's Avatar
 
Join Date: Jun 2012
Location: Antarctic Japan
Posts: 1,294
First thing, check whether the UTF8-coded file starts with a "B.O.M" --- if it doesn't, add the UTF-8 B.O.M. to it,
OR... convert it to real Unicode (UCS2, UTF16, whatever).

Secondly, TextSub 2.23 is outdated, upgrade to VSFilter.dll 2.39 at least.

Lastly, friends don't let friends use VirtuaDubMod.
filler56789 is offline   Reply With Quote
Old 23rd October 2012, 17:23   #4  |  Link
peppi_le_piou
Registered User
 
Join Date: Dec 2005
Posts: 70
Where do I look for this "B.O.M" thing. I don't see a reference to it in my ssa file (note that I did mention that it displays ok using VLC, so I don't think that it is my file problem).
peppi_le_piou is offline   Reply With Quote
Old 23rd October 2012, 17:54   #5  |  Link
the_weirdo
Yes, I'm weird.
 
the_weirdo's Avatar
 
Join Date: May 2010
Location: Southeast Asia
Posts: 271
Quote:
Originally Posted by peppi_le_piou View Post
Where do I look for this "B.O.M" thing.
Notepad++ can do that. (If on its status bar displays "UTF-8" then that means "UTF-8 with BOM"). However, like filler56789 has said, you should use VSFilter 2.39+. At least in my experiences, it works with "UTF-8 with BOM" subtitles just fine.
__________________
“Never argue with stupid people, they will drag you down to their level and then beat you with experience.” — Mark Twain
the_weirdo is offline   Reply With Quote
Old 24th October 2012, 03:45   #6  |  Link
peppi_le_piou
Registered User
 
Join Date: Dec 2005
Posts: 70
I did a test, wrote 2 lines in Notepad, one for STYLES: and one for DIALOGUE:
Found TextSub 2.39 and added it to VirtualDubMod Plugin folder, all showing good now in the video.
Saved one as ANSI.ssa and the other as UTF-8.ssa, both worked.
Nothing to do with "BOM" when I open them again with Notepad....(still don't know what that means).


Thanks for all of the suggestions.
peppi_le_piou is offline   Reply With Quote
Old 24th October 2012, 09:32   #7  |  Link
Groucho2004
 
Join Date: Mar 2006
Location: Barcelona
Posts: 5,034
Quote:
Originally Posted by peppi_le_piou View Post
Nothing to do with "BOM" when I open them again with Notepad....(still don't know what that means)..
In this context, BOM stands for byte order mark and is a sequence at the beginning of a file indicating the type (UTF-8, UTF-16,...) and for some types also the byte order (little/big endian).
For UTF-8 it should be "EF BB BF" (hex).

TextSub in VSFilter.dll needs the BOM in UTF-8 and UTF-16 files in order to render the characters properly.

Last edited by Groucho2004; 24th October 2012 at 09:50.
Groucho2004 is offline   Reply With Quote
Reply

Thread Tools Search this Thread
Search this Thread:

Advanced Search
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump


All times are GMT +1. The time now is 04:52.


Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2022, vBulletin Solutions Inc.