So that's PRE-AIKAClassic Aika.. almost 10 years old! I love her old stuff. Not a lot of dialogue, but it was worth doing. I've managed to piece together a 4.6gb version of this video and it's one of my favorites. KV-119.
I use Quicktime to convert, into m4a. I think it's better than using mp3. Although I think it takes a little while...But m4a is more precise imo. Also do you know if is there any free alternative to movavi for mac?I use a movavi suite in it a file converter is included with the program that it includes it also has the option to upload the audio of the file when converting to an audio file (m4a) it even has the option to eliminate noise and also the option to equalize the audio of voice (that the loudest audio of voices are equal).
Aika PRED-405. I'm not usually into the 'drugging the girl' (pheremones) videos, but some of the scenes were really good. Another Whisper sub. I tried three different levels of VAD thresholds, and the higher I got the more unique lines were found. However, way more time errors and repeats. I just tested out a PIYO video and got 3600 lines, but with only 200 or so repeats. Still better than any other methods I've used and I'm happy even with only 500 or so lines here. Enjoy!
All 1080p has corrupted audio, you should download 4K version that's not corrupted.""Danger, Danger ... Will Robinson! (In case any of you remember "Lost-in-Space"!) I have been trying to use Whisper to Translate TOEN-049. For the first time the srt timing is off, even after correcting for the beginning of the video, it slips again a couple of times! Arrrg! It seems my copies of TOEN-049 from supposedly from differt sites all contain the same hiccup between the video stream and the audio. Anyway, I'm trying to correct and edit my subtitle (probably for my own use, because I kinda like the movie). Just wondering if anyone else has encountered similar problems. I don't think it's the fault of Whisper BTW.
It's strange because the video players I have seem to sync back up and their audio and video will continue to match. But if you look carefully you can see the video fuzz out at several places in the video and then snap back. Apparently this is something that Whisper doesn't like?
Hi - may I know the logical explanation on why m4a is better than mp3? So that I will also use m4a in the future.I use Quicktime to convert, into m4a. I think it's better than using mp3. Although I think it takes a little while...But m4a is more precise imo. Also do you know if is there any free alternative to movavi for mac?
Hi - may I know the logical explanation on why m4a is better than mp3? So that I will also use m4a in the future.
It's a more modern audio compression format(m4a is aac) than mp3 and it's better at compressing than mp3 is. But opus beats both since it's even more modern and compresses even better so if you're going to convert it, which I wouldn't unless you're modifying the audio in some way, then better use that instead.Hi - may I know the logical explanation on why m4a is better than mp3? So that I will also use m4a in the future.
""Danger, Danger ... Will Robinson! (In case any of you remember "Lost-in-Space"!) I have been trying to use Whisper to Translate TOEN-049.
Thanks but I think I have corrupted copies of the movie as this subtitle file acts the same way about 57 mins into the movie the subtitle and the video are out of sync. I have had difficulty locating another copy of the movie to use but will keep trying.There is an existing sub for TOEN-049. Check it out if this does the job:
I used .5, .6, and .8. The .8 had the most lines, but was off by up to 30-seconds timewise. It wasn't a good test movie as there was a lot of mumbling. I'm going to try something else a little clearer.Can I ask which VAD thresholds you tried? It defaults to .4. Was wondering how high to raise or lower it.
I used .5, .6, and .8. The .8 had the most lines, but was off by up to 30-seconds timewise. It wasn't a good test movie as there was a lot of mumbling. I'm going to try something else a little clearer.
That's odd. I just tried processing a file at .6 then .2, and the .2 gave more lines. One would think if it is a detection threshold for audio, the lower number would catch more.
Wow, that is extremely strange. If the code remains the same, it should return the same result with the same input every time.The problem with comparing setting is that whisper isn't reliable. Even using the exact same settings, you'll get a different number of lines every time(never tested the same audio more than 4 times though but that's a lot already).
No idea how it can sometimes fail and sometimes not with the same everything but it does.
May i know the average size of an audio that is extracted from 1-hour video? Because when i am converting a 1-hour video to mp3 it has an average size of 120mb. I wonder what is the size of an extracted audio compared to mp3. Thank you for your explanation btw.It's a more modern audio compression format(m4a is aac) than mp3 and it's better at compressing than mp3 is. But opus beats both since it's even more modern and compresses even better so if you're going to convert it, which I wouldn't unless you're modifying the audio in some way, then better use that instead.
Whisper is going to read the original audio just fine so just demux(extract) it instead, zero quality loss this way.
May i know the average size of an audio that is extracted from 1-hour video? Because when i am converting a 1-hour video to mp3 it has an average size of 120mb. I wonder what is the size of an extracted audio compared to mp3. Thank you for your explanation btw.![]()
I used mp3 in the beginning, changed to mp4 and now wav. Whatever input you give, it converts it to wav mono.About Whisper. Does Whisper Collab really accept other formats than mp3? It's the format that is written on the audio_path: line and I can't find any info on audio formats that can be used in the help.