Gokkun

Does anyone know who either of these 2 girls is? They have names on the images, but I think those are fake names for a Soft On Demand title, SDDM-617. They sometimes give fake names for videos that they are trying to portray as amateur/employee. In any case, both of them look sort of familiar. The video is from 2005, so it may be long past figuring out, but if anyone knows who either of these girls is, please let me know. The second one especially is incredibly cute. Thanks.
[Attachments: 3800995, 3800996]
1st girl, I entered her name into AI and asked how it would be written in katakana, and it came out as “Yua Takase.” There’s no record under JAVLibrary, but I did find another video of her https://projavhard.com/tag/高瀬ゆあ
 
Wow, thanks Krack. She must be a one-off. I did some searching as well: I downloaded the cover that you linked to and uploaded it to Google Images. This is a good way to get the correct translation, as OCR can sometimes glitch. Google also gave me Yua Takase, so it seems that she's disappeared into the ether. Thanks so much for having a go at this, I really appreciate it.
 
I had a conversation with someone recently about Japanese creativity, and today I discovered this. Who but the Japanese would think of eating cum off of a harmonica? WHO? PSS-019 [Attachment: harm.jpg]
 
This is a long message but please read.

Several months ago I posted a very large (file size) group of .srt files. Most of them were reasonably accurate, with the occasional hallucination. However, after uploading those files I found that a bunch of them were between 0 kb and 5 kb. There are many reasons a subtitle file could be that small. One is that there is almost no dialog (many compilations fall into that category). Another is that WhisperAI started hallucinating after a long dialogless section and continued to hallucinate after the dialog resumed. It is an endless game of accuracy vs hallucination. (In most of the hallucinations, Whisper injects phrases that it has 'learned' from YouTube videos, which explains comments like 'subscribe to my channel'.) Every time you tweak one problem away, another one arises, so it is not a matter of tweaking until you get it perfect; it is a matter of tweaking until it is good enough, and 'good enough' is highly subjective.
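
For anyone who wants to check their own folder for suspiciously small subs, here is a rough Python sketch. It is not the process used for the zip in this post. The function name `find_small_subtitles` and the 5 kb cutoff (taken as 5 * 1024 bytes) are assumptions for illustration only.

```python
from pathlib import Path


def find_small_subtitles(folder, max_bytes=5 * 1024):
    """Return .srt files under `folder` that are at most `max_bytes` in size.

    The default cutoff is an assumed reading of "5 kb"; adjust it as needed.
    """
    return sorted(
        path
        for path in Path(folder).rglob("*.srt")  # also searches subfolders
        if path.is_file() and path.stat().st_size <= max_bytes
    )
```

Calling `find_small_subtitles("D:/subs")` (a made-up path) gives you a list of files worth opening by hand. A small file is only a hint, since a title with almost no dialog will also produce a tiny .srt.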

And so I decided to give the sub-creation process another go with all of the files that I had created that had subs that were >5kb. I did some extra tweaking, and the attached zip is the result of about 5 months of processing. Not perfect, but good enough by my standards.

For really new people who may use these: my file names are slightly odd because, for the earlier set of subs, I tried to modify the names to include only the code with no extraneous naming, i.e., ABC-123-some-stupid-fuck-decided-to-add-a-novel-after-the-code.mp4 became ABC-123.mp4. My renaming process was not perfect, so I ended up with many files that look like this: ABC-123.mp4 s.mp4

This is where you will have to do a little work. If your file, ABC-123, has other characters in its name, you will have to rename one of the 2 files (the video or the .srt) so that both names match apart from the extension. Otherwise your media player will not recognize the subtitle.
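
If you have a lot of files to match up, the code-only naming can be sketched in Python like this. It is not the renaming process described above, just an illustration. The pattern assumes a code is 2 to 6 letters, an optional separator, then 2 to 5 digits at the start of the name, and the function name `code_only_name` is made up for this example.

```python
import re
from pathlib import Path

# Assumed code shape: letters, optional separator, digits, at the start of the name.
CODE_PATTERN = re.compile(r"^([A-Za-z]{2,6})[-_. ]?(\d{2,5})")


def code_only_name(filename):
    """Return 'ABC-123.ext' for a name that starts with a code, else None."""
    path = Path(filename)
    match = CODE_PATTERN.match(path.stem)
    if match is None:
        return None  # no recognizable code, leave this file alone
    letters, digits = match.groups()
    return f"{letters.upper()}-{digits}{path.suffix.lower()}"
```

The function only computes the new name and does not touch any files, so you can print the results and look them over before renaming anything. It handles both the long-title case and the leftover "ABC-123.mp4 s.mp4" case, but codes that do not fit the assumed pattern will come back as None.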

Another note for the people new to subtitle files: you can easily edit these .srt files. Open one in a plain-text editor, or in Word, Open Office, or some other word-processing program (just make sure you save it back as plain text with the .srt extension), and edit the text as you want it. So, for example, if the .srt at some point says "Subscribe to my channel", you could open the .srt and change that text to whatever you like, or you could use the find-and-replace option in your program and replace "Subscribe to my channel" with a blank, or some other comment if you want. Do that for 'all' and it will remove all instances of that particular comment. I'm only really describing this for people that are unfamiliar with the technology; the point is that you can easily edit these files.
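
The same find-and-replace idea can be done in bulk with a few lines of Python. This is a minimal sketch and the function name `strip_phrases` is made up. It blanks out the phrase text and leaves the cue numbers and timestamps untouched, so a cue that contained only the phrase ends up empty.

```python
import re


def strip_phrases(srt_text, phrases):
    """Remove every occurrence of each phrase from the subtitle text, ignoring case."""
    for phrase in phrases:
        # re.escape keeps punctuation in the phrase from being read as regex syntax
        srt_text = re.sub(re.escape(phrase), "", srt_text, flags=re.IGNORECASE)
    return srt_text
```

To use it on a file, read the text with `Path("ABC-123.srt").read_text(encoding="utf-8")`, pass it through `strip_phrases` with your own list of phrases, and write the result back with `write_text`. Keep a copy of the original until you have checked the output.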

My original set of .srt files was accompanied by the cover images for easy reference. This zip is .srt files only, so the entire 21k is just .srt files. My originals, with covers, are still available on Akiba in an earlier post.

One last detail that I forgot to mention. This new group of .srt files consists of some of the original group that I uploaded several months ago (some titles have been removed from that original list) and a bunch of new titles that I did not have when I originally created the massive .srt zips.

And so, after much babbling, here are the files. Please enjoy.


 
for subs i kinda cheat a bit by using the Faster-Whisper-XXL Windows standalone version. it takes about 10 mins per video. it only gives a very basic idea of what’s being spoken, but most of the time that’s good enough for me.

i tried WhisperJav too, but for some reason it’s very slow on my budget gpu, and the results aren’t much better anyway. i did find out that CloudAI does a better job at getting the context right.

so now i use Faster-Whisper-XXL just for transcription, then translate it using DeepSeek through WhisperJav. i run both subs on the video, top and bottom, and compare them. DeepSeek seems to do a better job, but nothing groundbreaking. worth the little pennies paid, i guess. i put $2 into it and have translated about 100 movies so far, with around 40 cents left.
 
Yeah, I totally agree. What is "good enough" for me? That is the million dollar question. And as I said in my over-long description, the vast bulk of JAV 'dialog' is just the same thing over and over and over. For me, the only time dialog matters is during interviews or, to a large degree, the pornless scenes. In these scenes I am often curious about the comments. My 'fetish', I suppose you could call it, is post-action interviews. I am interested in what the girls have to say after they've just been bukkaked or swallowed a ton of cum. If anyone has some suggestions on that score, I would be happy to have them. But back to the point: you are completely right, Krack, most JAV titles don't really need comprehensive subs. There is probably about 1% of all of the titles where I even give a fuck what is being said; otherwise I already know what is being said. However, my objective in doing these was not so much to please myself as it was a project to contribute to the community.
 
i love those before-and-after interviews. it was probably all just for the camera, but it worked on me too lol. also, storylines like “your gf won’t swallow? i will do it then” like in MIAA-735 at 17m12s. this part is hard for AI to get right, so i usually use it as a benchmark to get an idea of how good a model or my settings are. it probably means nothing, because doing well here doesn’t guarantee the same results for the rest of the video or for other videos.

from my limited testing, i feel like offline methods are already good enough for speech detection and transcription. there aren’t that many things that can go wrong there. the real limitation seems to be translation with quantized offline models. so uploading jp subs and letting cloud AI help with the translation/context part might give better results. just a suggestion for anyone who hasn’t tried this setup yet.
 
One of the funniest things about this whole insane project is that the technology is in its nascent stage. I don't expect that it will be very long, technology-wise, before Whisper et al. and all of my loony hard work count for nothing, because browsers, media players, et cetera will do this themselves (VLC has been promising an on-the-fly translator for a while now). Tbh, I hope that it is soon, because from just about every angle, from install on down, WhisperAI is FUCKED!
 

LLPlayer can translate on the fly. It uses the Faster-Whisper-XXL engine as well. It takes around 20 seconds for the subtitles to appear after loading a video, so it’s not instantaneous, but the timestamps are accurate.

For browsers, there is ViiTor Translate. It’s free but requires an email account. The translation is pretty rough, but gets the general idea across.
 

Attachments: Clipboard01.jpg, Clipboard02.jpg