Post your JAV subtitle files here - JAV Subtitle Repository (JSP)★NOT A SUB REQUEST THREAD★

Attention: Re. my marathon Subtitle post. I have discovered a few glitches with my subs. I have a post-processing script that was supposed to remove Hallucinations as well as clean up the subs and replace key words. Apparantly somewhere during the 10 Billion edits to my script I lost contact with it and it was not being referanced. Net result, my subs did not have all the groovy clean-up it was supposed to have. SO, I am starting over, sigh. I am deleting all of my generated .srt files, I am in the process of removing all of the extraneous text from the filenames, There's a few exceptions but the vast majority of my files will now be named ???-??? and nothing more, perhaps one character. The new subs will reflect these changes. It is a pain-in-the-ass but if this job is going to be done right then this is the solution. So..... if you've downloaded my zips, keep em if you want, but I suggest deleting them because my new files will be better. I will be posting them without cover images so the files should be smaller too. I'm re-encoding "A" right now, should be a couple of days and I'll get that uploaded. Sorry for the inconvenience. Cheers.

It's good that you caught it.

I think you might also have a situation where some audio extractions have failed. I was just running stats on the A set and got this below. There are many files which just are filled with repetiotions. I contribute that to audio failure, I guess.



1749850353504.png
 
It's good that you caught it.

I think you might also have a situation where some audio extractions have failed. I was just running stats on the A set and got this below. There are many files which just are filled with repetiotions. I contribute that to audio failure, I guess.



View attachment 3680909
A lot of these are tricky files , for example the Agemix files, these are almost always compillation videos which edit out all of the bits with dialog and just get to the ,, OH you have to forgive me for this awful pun, but they just get to the climax. So you could have a 4 Hour, 8 Hour, etc. video with almost no dialog so even the most finely tuned Whisper is still going to end up with those damned hallucinations. I'm always tweaking whisper, I know that you know the story, but without knowing Japanese and with over 20Thousand files to deal with I have to just take the best that I can get. Presently I'm mostly happy with the results that I am getting. One little issue is that occasionally whisper/VAD will completely ignore dialog that is obvious, clearly spoken and without any surface noise. I have tweaked and tweaked and tweaked my files to try to resolve this but the trade-off is more hallucinations, I hate hallucinations so my current configs are set to agressively remove hallucinations and if there are bits of dialog missed, shrug... Oh, One other thing, it would be interesting to reevaluate these files after my re-scan of everything, If you're up to it, see what you find after I've run everything through again, I expect to be done with 'A' maybe Tomorrow if all goes well, I have a ton of other background work todo, renaming all my files, I'm extracting all of the useless text after the filename, One problem with that is I am blanket-removing all text past 11 Characters, 11 because there are files named longer than others so I don't want to remove some of the code but the end result is I'm getting a lot of files named, ASW-234 de , The de is a result of that 11 character issue, most of the new files will have this but at least all of that idiotic long text will be gone. IN any case, I've babbled too long, Thanks for the excellent feedback Mei.
 
Last edited:
I'm extracting all of the useless text after the filename, One problem with that is I am blanket-removing all text past 11 Characters, 11 because there are files named longer than others so I don't want to remove some of the code but the end result is I'm getting a lot of files named, ASW-234 de , The de is a result of that 11 character issue, most of the new files will have this but at least all of that idiotic long text will be gone. IN any case, I've babbled too long, Thanks for the excellent feedback Mei.

Is there a reason you don't use tools like JavLuv or Javinizer to rename / normalise the file names?
 
Is there a reason you don't use tools like JavLuv or Javinizer to rename / normalise the file names?
I use renamer. I was unaware taht Javluv could rename correctly, My problem is there are codes with 7 characters all the way up to over 20 characters, like the Caribbean.com files or the fc2-ppv-xxxxxxxxxxx but most are xxx-xxx or xxx-xxxxx or xx-xxx. there are all kinds of variations in the names so it is , from my knowledge base, only possible to generalize if you're removing characters from a spacific point. Are you saying that JAVluv can batch rename like that? I don't want to rename each file individually, that would take forever. I want to rename by folder, for istance, earlier tonight I renamed all of my "M" files, that's over 8000 files, some wiht 6 characters, some with 7, some wiht 8, so the way, again I may be ignorant of the possibilities, but to rename everything and not lose characters from files with longer names is to set my renamer to remove all characters beyond 11. Net result, some have a few extra characters but if you know a way to batch rename and have them all correct, fantastic. IN the mean time I will try and figure out what you're referring to with Javluv. Thanks Mei
 

[Reducing Mosaic]RCT-918 Incest Mother And Son Chinguri Intercrural Sex​

1749869833285.png

[Reducing Mosaic]RCT-957 Incest Mother And Son In Chinguri Intercrural Sex 2​

1749870188347.png
Here are Subs for two "oldie", reduced mosaic Rocket JAVs. I used WhisperJAV0.7 to create these Subs and I attempted to clean them up a bit and re-interpreted some of the meaningless/ "lewd-less" dialog. Again, I don't understand Japanese so my re-interpretations might not be totally accurate but I try to match what is happening in the scene. Anyway, enjoy and let me know what you think.

Casting could have been better...most of the mothers look younger than their "sons"! :)
 

Attachments

GIRO-048 Married woman accidentally inhales aphrodisiac instead of anesthetic at the dentist

Cover.jpg
  • Code: GIRO-048
  • Release date: 2019-05-24
  • Studio: Prestige
  • Label: GIRO
  • Tags: Drug, Hospital / Clinic, Married Woman
  • Series: Gets with aphrodisiac!
  • Actor: Atsushi Hosoda, Taro Iwashita, Yakan
  • Actress: Ayasaki Momoka, Komine Hinata, Mochida Shiori, Tachibana Mary
  1. Audio extracted and transcribed using WhisperWithVAD_Pro - both original speed audio and slowed audio.
  2. The slowed down audio transcription was adjusted back to the correct timing.
  3. Both transcription versions were AI translated using DeepSeek v3 0324.
  4. The "best" translation of each line from the two versions was used in the final translation.
I don't speak nor read Japanese so there are certain to be lots of mistakes.
It'd be extra-shiny if mistakes could be pointed out or corrected.
 

Attachments

SRT's "A" 1300 .srt files. I am trying a new method so I need to get some feedback to make sure that it is working correctly. In my test I get an error message that google couldn't scan the zip but I downloaded anyway because I know that it is fine since I made it. In any case this is Part A of the revised .srt files. My renaming convention is not perfect but I've re-named all of the files to be almost, but not quite entirely, just the code name of the video. I can't get it perfect because to batch rename I have to set a number of characters to delete the rest, so, "ABC-123 Big yappy dog in the yard with a bowl of stew" becomes "ABC-123" The problem is that if I set my renamer to remove everything after the 7th character then anything with a long code name like, "AGEMIX-123" becomes, "AGEMIX-" so that's no good. Whisper AI fails to encode some files with long names or odd characters or for some other reason that I can't figure out. If you shorten the filename, ta-da, works like a dream. I tried using Powertoys renaming feature to remove all odd characters but that didn't solve the problem, SO This is what ya get. To accomodate for the variance in file name I have given the perameters a wide berth so, back to the dog title, it may now have become, "ABC-123 Big" I'm going through all of this description because the next 25 Thousand srt files will be named this way, if you don't like it, don't download it. Which brings me to a new topic. To accomodate a few people who have asked for an alternative download location I am testing this option out. There are no Cover images so the filesize should be managable. This one, the A's is about 10 Meg. Please would a couple of you let me know if you were able to download these successfully. Thanks, https://drive.google.com/file/d/1LUKEkYrXshW_mYEiFvKlay0zskvzBK70/view?usp=sharing
 
  • Like
Reactions: Imscully and mei2
If all your filenames are like this that's a pretty easy regex "[a-zA-Z]{3,6}-\d{3,4}"

Regex is how you deal with stuff like this, if there's different formats you just set up fallbacks. I have a big script that does my filenames but I doubt it will help you much.
 
SRT's "A" 1300 .srt files. I am trying a new method so I need to get some feedback to make sure that it is working correctly. In my test I get an error message that google couldn't scan the zip but I downloaded anyway because I know that it is fine since I made it. In any case this is Part A of the revised .srt files. My renaming convention is not perfect but I've re-named all of the files to be almost, but not quite entirely, just the code name of the video. I can't get it perfect because to batch rename I have to set a number of characters to delete the rest, so, "ABC-123 Big yappy dog in the yard with a bowl of stew" becomes "ABC-123" The problem is that if I set my renamer to remove everything after the 7th character then anything with a long code name like, "AGEMIX-123" becomes, "AGEMIX-" so that's no good. Whisper AI fails to encode some files with long names or odd characters or for some other reason that I can't figure out. If you shorten the filename, ta-da, works like a dream. I tried using Powertoys renaming feature to remove all odd characters but that didn't solve the problem, SO This is what ya get. To accomodate for the variance in file name I have given the perameters a wide berth so, back to the dog title, it may now have become, "ABC-123 Big" I'm going through all of this description because the next 25 Thousand srt files will be named this way, if you don't like it, don't download it. Which brings me to a new topic. To accomodate a few people who have asked for an alternative download location I am testing this option out. There are no Cover images so the filesize should be managable. This one, the A's is about 10 Meg. Please would a couple of you let me know if you were able to download these successfully. Thanks, https://drive.google.com/file/d/1LUKEkYrXshW_mYEiFvKlay0zskvzBK70/view?usp=sharing
I have a program that scan the folder, find possible movie information on internet and export to an HTML file with title and image. I will try with your folder in a few days. Currently I'm busy with some improvement.

It might look like this (attached file)
 

Attachments

I have a program that scan the folder, find possible movie information on internet and export to an HTML file with title and image. I will try with your folder in a few days. Currently I'm busy with some improvement.

It might look like this (attached file)
Hiya Theydon... that looks like an interesting program. I will look at your html file some more. I don't think that it would solve my naming issue but this looks like it may have other uses. You may want to point this to Tmpguy, who developed Javluv (do you know about JAVLUV?, if not, you need to get it) Search for Tmpguy on AKIBA-online and at the bottom of his posts is a link to javluv. Definitely check that out. That said, the movie information is very interesting. Thanks for sending this. Cheers.
 
Don't you love it when you are finished translating scene 1 and 2 starting on scene 3 and you the only thing you can hear is a running shower and no dialogue.
 
Don't you love it when you are finished translating scene 1 and 2 starting on scene 3 and you the only thing you can hear is a running shower and no dialogue.
There are a Million little complications with Whisper. My .bat and .py files scan a video first with Vad-True/Large-v3 and all kinds of tweaks to VAD filters but at some point the intent to catch whispered dialog gives you Hallucination. Carrying on, if VAD-True yields a .srt less that 10K, then it doubles back and re-scans the file with VAD-False and Large-V3. It is working reasonably well but there's always little issues that pop up here and there. I don't think that you can get perfect unless you manually edit and you have to know Japanese to do that. I hope that my files will be 'good enough' for most people but they will never be perfect.. Cheers/
 
Don't you love it when you are finished translating scene 1 and 2 starting on scene 3 and you the only thing you can hear is a running shower and no dialogue.
mdx_kim2 could help with that, it's a better voice extraction method than what faster-whisper uses (which is HT demucs I think ?). I use whisperXXL