Welcome to Doom9's Forum, THE in-place to be for everyone interested in DVD conversion. Before you start posting please read the forum rules. By posting to this forum you agree to abide by the rules. |
![]() |
#1981 | Link |
Registered User
Join Date: Jul 2016
Location: Mansfield, Ohio (formerly, Silicon Valley in California)
Posts: 428
|
In waveform, there really needs to be a way to continuously loop between _any_ two points. Press and hold a key, click point A, click point B, and the waveform is looped, A-B, until the key is released. While the key is held, either point can be dragged and the loop responds.
|
![]() |
![]() |
![]() |
#1982 | Link |
Registered User
Join Date: May 2009
Location: Belgium
Posts: 1,761
|
Hi Nikse,
is there a way to replace an uppercase by a lowercase when it follows a coma and a space ? For example ; Code:
Hello, How are you Hunter ? Code:
Hello, how are you Hunter ? Code:
(\,\s)([A-Z]) Code:
$1\l$2 edit : I finally added a case for each letter ; Code:
(\,\s)(A) Code:
$1a Last edited by Music Fan; 25th March 2025 at 09:56. |
![]() |
![]() |
![]() |
#1983 | Link |
Registered User
Join Date: Jun 2006
Posts: 372
|
The default DirectShow Video Player has an issue with audio sync. mpv library that SE downloads doesn't work on Windows 8.1 x64. Windows 8.1 users should replace it with mpv-dev-x86_64-20240922-git-71f2220.7z manually.
__________________
Windows 8.1 x64 Magically yours Raistlin |
![]() |
![]() |
![]() |
#1984 | Link |
Registered User
Join Date: Jul 2016
Location: Mansfield, Ohio (formerly, Silicon Valley in California)
Posts: 428
|
Faster editing
I'm editing the subtitles for "The Ghost and Mrs. Muir" [1947], DVD. Timing wise, they are a mess. And much of the audio is too soft to see it in waveforms.
With Waveforms losing half their resolution (by design), and with no function (by design) that loops like this: Push key, click point A, click point B, the audio loops A-to-B, drag A (as the audio loops) in order to find where an utterance starts, drag B (as the audio loops) in order to find where an utterance ends, release key, I instead have to drag A, play, drag A again, play, drag A again, etc., drag B, play, drag B again, play, drag B again, etc. Without smarter functions, better thought out and designed functions, editing just takes forever. I am in despair. I suggest better operations here and get no responses. Does no one give a sh!t? READ ME: See https://forum.doom9.org/showthread.p...45#post2017445 for the resolution of this issue. Last edited by markfilipak; 11th April 2025 at 03:54. |
![]() |
![]() |
![]() |
#1985 | Link | |
Registered User
Join Date: Jan 2025
Posts: 54
|
Quote:
I have had a look, and it's all out there, ready to be got...
__________________
Main Systems:- Threadripper 7970X on Asus Pro WS TRX50-Sage WiFi Ryzen 9 9950X3D on MSI Carbon X670E Ryzen 9 7950X on Gigabyte Aorus Elite B650 Intel 13900KF on MSI Tomahawk B660 |
|
![]() |
![]() |
![]() |
#1986 | Link |
Registered User
Join Date: Jul 2016
Location: Mansfield, Ohio (formerly, Silicon Valley in California)
Posts: 428
|
Faster editing
Good grief. Thank you, but my comment is not about the movie. It's about how poorly thought out SE's editing functions are, and how my suggestions get no response. Correcting subtitle times in waveforms is crude and incredibly tedious because the editing functions are crude.
Last edited by markfilipak; 8th April 2025 at 23:18. |
![]() |
![]() |
![]() |
#1987 | Link | |
Registered User
Join Date: Jan 2025
Posts: 54
|
Quote:
I've only just recently started using Whisper.... It's all very time consuming, at the best of times. Good luck.
__________________
Main Systems:- Threadripper 7970X on Asus Pro WS TRX50-Sage WiFi Ryzen 9 9950X3D on MSI Carbon X670E Ryzen 9 7950X on Gigabyte Aorus Elite B650 Intel 13900KF on MSI Tomahawk B660 Last edited by TR-7970X; 9th April 2025 at 00:09. |
|
![]() |
![]() |
![]() |
#1988 | Link |
Banana User
Join Date: Sep 2008
Posts: 1,131
|
Can you PM me the audio and timestamps where "audio is too soft to see"?
__________________
InpaintDelogo, DoomDelogo, JerkyWEB Fixer, Standalone Faster-Whisper - AI subtitling |
![]() |
![]() |
![]() |
#1989 | Link | |
Registered User
Join Date: Jul 2016
Location: Mansfield, Ohio (formerly, Silicon Valley in California)
Posts: 428
|
Faster editing
Quote:
Sometimes the audio is just a flat line but there's actually several frames of utterance there -- sometimes _seconds_ of utterance. Sometimes the utterance is buried in music, so it's all just jagged. If you've tried to set subtitles precisely (meaning: within 10 frames or so), you've run across this problem. You cannot rely on the waveform to show you where an utterance starts and ends. You have to hear it, and it's best to hear it in a loop and to have the power to move the cues while hearing the loop. Right now there's no good way to audition an utterance, so there's no good way to set in- and out-cues quickly. I have posted a couple of ways to speed up editing. The latest also has "Faster editing" as the subject. I conservatively estimate that providing that function would speed up editing in the waveform window by at least 10x. My audition between points A & B (looping, with A & B both actively dragable) is not the same as simply looping from in-cue to out-cue. Please, read what I wrote and I'm sure you will 'get it'. If you don't 'get it', ask. My proposed method includes a button assignment, clicking A, clicking B, draging A and/or draging B while hearing the audition, and releasing the button. That audition is then automatically followed by setting of the length of in-pad and out-pad with and without intervening shot change. In other words, everything beautify does, but beautify is incapable of listening to utterances. READ ME: See https://forum.doom9.org/showthread.p...45#post2017445 for the resolution of this issue. Last edited by markfilipak; 11th April 2025 at 03:55. |
|
![]() |
![]() |
![]() |
#1990 | Link | |||
Registered User
Join Date: Jul 2016
Location: Mansfield, Ohio (formerly, Silicon Valley in California)
Posts: 428
|
Quote:
Quote:
Quote:
|
|||
![]() |
![]() |
![]() |
#1991 | Link |
Registered User
Join Date: Jan 2025
Posts: 54
|
"Whisper" is an "add on" for SE, that performs an audio to text operation, that is, it creates subtitles from audio.
However, if your video/audio isn't "loud" enough Whisper may not be able to do it's job. I've tried it on a couple of movies that I can't get any subtitles for, and it definitely does a pretty good job...there would be some reviewing & editing, but at least it's a very good start. https://www.youtube.com/watch?v=4YZ0...el=DavidMbugua https://www.youtube.com/watch?v=ZDXy...ingwithClaudia
__________________
Main Systems:- Threadripper 7970X on Asus Pro WS TRX50-Sage WiFi Ryzen 9 9950X3D on MSI Carbon X670E Ryzen 9 7950X on Gigabyte Aorus Elite B650 Intel 13900KF on MSI Tomahawk B660 Last edited by TR-7970X; 9th April 2025 at 03:04. |
![]() |
![]() |
![]() |
#1992 | Link | |
Registered User
Join Date: Jul 2016
Location: Mansfield, Ohio (formerly, Silicon Valley in California)
Posts: 428
|
Quote:
I've watched quite a few YouTubes, but they weren't useful. All the ones I've seen review how to use SE, not how to deal with difficult subs, and not with how SE can be improved. |
|
![]() |
![]() |
![]() |
#1993 | Link | |
Registered User
Join Date: Jan 2025
Posts: 54
|
Quote:
I'm actually running an old movie that I can't get any subs for, and I'm using a "bigger" library/model, and it's taking forever, I hope it finds everything & accurately too.
__________________
Main Systems:- Threadripper 7970X on Asus Pro WS TRX50-Sage WiFi Ryzen 9 9950X3D on MSI Carbon X670E Ryzen 9 7950X on Gigabyte Aorus Elite B650 Intel 13900KF on MSI Tomahawk B660 |
|
![]() |
![]() |
![]() |
#1994 | Link | ||
Registered User
Join Date: Jul 2016
Location: Mansfield, Ohio (formerly, Silicon Valley in California)
Posts: 428
|
Quote:
Quote:
|
||
![]() |
![]() |
![]() |
#1995 | Link | |
Registered User
Join Date: Jan 2025
Posts: 54
|
Quote:
I generally use gMKVExtractGUI. I have done a couple of tests with a basic Whisper model, and despite the odd typo or misinterpretation, the timing was pretty good. I will let you know how this current job turns out, it's STILL going, it's been well over 2 hours for a movie that 1.5 hours But if turns out good, then it's better than the alternative, I guess.
__________________
Main Systems:- Threadripper 7970X on Asus Pro WS TRX50-Sage WiFi Ryzen 9 9950X3D on MSI Carbon X670E Ryzen 9 7950X on Gigabyte Aorus Elite B650 Intel 13900KF on MSI Tomahawk B660 |
|
![]() |
![]() |
![]() |
#1996 | Link | |
Registered User
Join Date: Jan 2025
Posts: 54
|
Quote:
You're saying that the audio is "soft"...what if you extracted the audio and amplified it, and then see if the waveform process works for you !!
__________________
Main Systems:- Threadripper 7970X on Asus Pro WS TRX50-Sage WiFi Ryzen 9 9950X3D on MSI Carbon X670E Ryzen 9 7950X on Gigabyte Aorus Elite B650 Intel 13900KF on MSI Tomahawk B660 |
|
![]() |
![]() |
![]() |
#1997 | Link | |
Registered User
Join Date: Jul 2016
Location: Mansfield, Ohio (formerly, Silicon Valley in California)
Posts: 428
|
Yes. I'm satisfied with it. Not perfect, but very good. Kudos to Nik.
Quote:
Last edited by markfilipak; 9th April 2025 at 07:34. |
|
![]() |
![]() |
![]() |
#1998 | Link | |
Registered User
Join Date: Jul 2016
Location: Mansfield, Ohio (formerly, Silicon Valley in California)
Posts: 428
|
Faster editing
Quote:
2) I would have to mux the louder audio into the movie at the beginning, and mux it out at the end. 3) Doing so would not improve the situation -- I still have to listen -- and would only add more time to the effort. The problem isn't that I can't hear the utterances. The problem is that I can't see the actual start and end of the utterances. That's mainly (partly) because waveforms could have twice it's current resolution, but doesn't. The solution is one that facilitates setting in- and out-cues while simultaneously listening, and doing so much more rapidly than is currently possible. Compare these methods: Current SE: A is an in-cue, B is an out-cue. Audio is the sub. Click-drag A, press a key to listen to A plus a little bit, release key. Click-drag A again, repeat the hunt until A coincides with the start of the utterance. Click-drag B, press a key to listen to the whole subtitle in order to hear the end, release key. Click-drag B again, repeat the hunt until B coincides with the end of the utterance. Manually add in-padding and out-padding by again dragging A, and again dragging B. It takes many clicks, many drags, and many listen-key presses to accomplish this. Proposed SE: A is an out-cue, B is an in-cue. Audio is the space between subs. Press and hold a key, click A, click B, (SE continuously loops A-to-B). Click-drag A while audio loops and drop it where utterance A ends. Click-drag B while audio loops and drop it where utterance B begins. Release key, (SE automatically adds out- and in-padding while taking shot changes into account). It takes one mode-key press-and-hold, two clicks, and two drags to accomplish this. You see, the proposed is not editing subs, it's editing the spaces between subs! Large gaps between subs exist of course. For them, set A & B using the current, hunting method, above. However, small gaps greatly outnumber large gaps in real videos, so the proposed will work in the vast majority of cases. READ ME: See https://forum.doom9.org/showthread.p...45#post2017445 for the resolution of this issue. Last edited by markfilipak; 11th April 2025 at 03:56. |
|
![]() |
![]() |
![]() |
#1999 | Link | |
Registered User
Join Date: Jan 2025
Posts: 54
|
Quote:
However, I thought I saw that you can export the audio to a text file....and also grab the subs from just the audio track. I ended up stopping that Whisper run, @ 4 hours, it kept what it had done, and it got up to just over an hour thru the movie, there was a lot of extra stuff generated (not needed), but the timing was pretty good, and there weren't too many typos. I'm going to try a different library/model... Has the author of SE got a "git" page ??? maybe you need to post your concerns there, not here.... I might try Whisper on the "The Ghost and Mrs Muir" that I got the other day, even tho it came with subs.
__________________
Main Systems:- Threadripper 7970X on Asus Pro WS TRX50-Sage WiFi Ryzen 9 9950X3D on MSI Carbon X670E Ryzen 9 7950X on Gigabyte Aorus Elite B650 Intel 13900KF on MSI Tomahawk B660 |
|
![]() |
![]() |
![]() |
#2000 | Link | ||||||
Registered User
Join Date: Jul 2016
Location: Mansfield, Ohio (formerly, Silicon Valley in California)
Posts: 428
|
Quote:
Quote:
Quote:
Quote:
Quote:
Quote:
|
||||||
![]() |
![]() |
![]() |
Thread Tools | Search this Thread |
Display Modes | |
|
|