This is a long message but please read.
Several Months ago I posted a very large (file size) group of .srt files. Most of them were reasonably accurate with the occasional hullucination. However I found that after uploading those files that there were a bunch of files between 0 kb and 5 kb. There are many reasons that a subtitle could be that small: almost no dialog (Many compilations would fall into that category) or other explanations are that WhisperAI started hallucinating after a long dialogless section, and continuted to hallucinate after the dialog resumed. It is an endless game of accuracy vs hallucination (Most of the hallucinations whisper has 'learned' to inject phrases that it has gleaned by watching youtube videos which explains comments like 'subscribe to my channel' Every time you try to tweak one problem away, another one arises so it is not a matter of tweaking until you get it perfect, it is a matter of tweaking until it is good enough, and "good enough' is highly subjective.
The total files I have added many new files and deleted others so this new list may have a few that are the same as my originals but there are lots of new files here.
And so, I decided to give the sub-creation process another go with all of the files that I had created that had subs that were >5kb. I did some extra tweaking and the attached zip is the result of about 5 Months processing. NOt perfect, but good enough by my standards.
For really new people who may use these, My file names are slightly odd because of an attempt to modify the names, for the earlier set of subs, to only include the code with no extraneous naming, IE, ABC-123 some stupid fuck decided to add a novel after the code.mp4 became ABC-123.mp4 My renaming process was not perfect so I ended up with many files that may look like; ABC-123.mp4 s.mp4
This is where you will have to do a little work. If your file, ABC.123 has other characters you will have to rename the 2 files to be the same for them to get recognized by your media player.
Another note for the people new to subtitle files. You can easily edit these .srt files. Open one in Word, Open Office, or some other word-processing program, and just edit the text as you want it. So, for example, if the .srt, at some point says, 'Subsribe to my channel" You could open the .srt and change that text to whatever you like or you could use the find and replace option in your word-processing program and replace "Subscribe to me channel" with a blank, or some other comment if you want. Do that for 'all' and it will remove all instances of that particular comment. I'm only really describing this for people that are unfamiliar with the technology, the point is that you can easily edit these files
My original set of .srt's were accompanied with the cover image for easy referance. This zip is .srt's only so the entire 21k is just .srt files. My originals, with covers are still available on Akiba in an earlier post.
One last message for the purists in this thread. I know these are not perfect. After Thousands of tweaks to Faster-Whisper AI and a constant push and pull of balancing accuracy with hallucinations. This is the best result that I can achieve. Ultimately most JAV Dialog involves an endless repettition of the same phrases but when there is a clear conversation/dialog, my settings will usually translate these accurately and without hallucination. That is not perfect, but it is good enough for me.
And so, after much babbling, here are the files. Please enjoy.
Download 21000-SRTS.zip fast and secure
rapidgator.net