The Akiba-online English Sub Project★NOT A SUB REQUEST THREAD★

avatarthe

Well-Known Member
Feb 1, 2008
184
277
Gentlemen, I would like to propose an exciting project which could benefit us all…



The Akiba-online English Sub Project

I’ve been reading the Thai JAV site AVCollectors.com and the Korean JAV site Jamakav.com, and both have ongoing mechanisms in place to encourage users to make Thai or Korean subtitles of JAV films. Jamakav.com has a point system where you can gain points by uploading subtitles of films, preferably in Korean, but in foreign languages as well. Other members can then take the foreign subs, translate them into Korean and upload them. AVCollectors.com has an even better system where people just upload foreign subtitles (usually Chinese subs), other members pick the film(s) they want to translate, and those films get labeled “in progress” so no one else duplicates the effort by translating the same film. Once the film is translated the subs get posted for everyone to enjoy!

I suggest we follow AVCollectors.com’s model and start producing English language subtitles!

You don’t need to understand Japanese (or Chinese) to do the translation. I’ve attached a zip file containing Chinese language subtitles for a ton of JAV films released this past year! This collection comes from PRIDER, an AVCOLLECTORS.com member who sorted and bundled these together.

If you want to take part in this great social experiment in cooperation and sexy Japanese ladies, you will need the attached zip file containing the Chinese subtitles, and you will also need to download the following:



1. Subtitle Edit https://www.nikse.dk/SubtitleEdit

This great piece of freeware will not only help you edit your subtitles and tweak the timing, but it can also use the Google translation service to do a rough translation of the Chinese subtitles into English.

[IF YOU WOULD LIKE TO WRITE A SHORT OVERVIEW OF SUBTITLE EDIT, TOUCHING ON USING IT TO TRANSLATE AND EDIT SUBTITLES IN TRANSLATOR MODE, WHICH I CAN INSERT HERE, PLEASE PM ME]

2. pyTranscriber https://github.com/raryelcostasouza/pyTranscriber

This nifty followup to autosub can be used to generate automatic transcription / automatic subtitles for audio/video files through a friendly graphical user interface.

[IF YOU WOULD LIKE TO WRITE A SHORT OVERVIEW OF PYTRANSCRIBER, TOUCHING ON ITS USE AND ISSUES, WHICH I CAN INSERT HERE, PLEASE PM ME]



The Process:

I think the best way to translate a film is to start by making a computer translation of the Chinese subtitles, either using Subtitle Edit (which may only allow a certain number of lines to be translated per IP address each day) or the more manual process using Google Chrome outlined here ( https://www.akiba-online.com/thread...e-repository-jsp.1466451/page-54#post-3998837 )



Then watch the film in Subtitle Edit with the translated subtitles and correct the obvious mistakes as you see them. Often gender pronouns are reversed: “he” becomes “she”, “girlfriend” becomes “boyfriend”. Some sentences are a grammatical mess, but you can understand what they are trying to say from the context, so these can be quickly fixed. Find/replace can be used to fix poorly translated slang: change “Meat stick” (yes, this is a frequent translation) to “cock”, “Making rice” to “Cumming”, etc.

After you’ve tidied up the translated Chinese subtitles, run the film through pyTranscriber to produce a set of Japanese subtitles. Computer translate the Japanese subtitles into English. Finally, open both sets of subtitles in Subtitle Edit in TRANSLATOR mode; this will allow you to see the film along with both sets of subs, time synced. Edit the subs together, adding in lines that were missing from the translated Chinese subs and correcting lines which didn’t make sense by looking at the other translation.
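As a rough illustration of the "adding in lines that were missing" step, here is a minimal Python sketch that lists cues present in the audio-based subtitles but absent (by timing) from the Chinese-based ones. It assumes both files have already been parsed into (start_ms, end_ms, text) tuples; the function names and the sample cues are hypothetical, and Subtitle Edit itself does this comparison visually rather than with code like this.

```python
def overlaps(a, b):
    """True if two (start_ms, end_ms, text) cues share any span of time."""
    return a[0] < b[1] and b[0] < a[1]


def missing_cues(primary, secondary):
    """Return the cues in `secondary` that overlap no cue in `primary`."""
    return [c for c in secondary if not any(overlaps(c, p) for p in primary)]


# Hypothetical sample data: cues from the Chinese-based translation and
# from the audio-based (pyTranscriber) translation of the same film.
chinese_based = [(1000, 3000, "Hello."), (8000, 9500, "See you tomorrow.")]
audio_based = [
    (1200, 2800, "Hi."),
    (4000, 6000, "Are you home?"),
    (8100, 9400, "See you."),
]

for start, end, text in missing_cues(chinese_based, audio_based):
    print(f"{start}-{end} ms: {text}")
```

Anything this prints is only a candidate: automatic transcription can also invent lines from background noise, so each one still needs checking against the film.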

When you’re all done, attach the subtitles to this thread and PM me. Let’s see if we can cooperate and make some subtitles!

Below is a list of the Chinese subs included in the linked zip file. If someone is already translating a file I will place their username next to the movie code number, so please check before you duplicate work.

Here's the Zipfile
(CHN-SRT pack 2020-A)
https://mega.nz/#!2F0jkJQB!2KND5TGn1f3DORa2L2nA0w0Dzhc-GVjOxjdFC3mXI9s

Here’s the list…

DEMO (not included in zip file)

MIDE-749 - Miura Sakura
(SUBS by The Avatar, SOURCE - Korean subs & PyTranscriber)
(Progress Complete)
MIDE-749-poster.jpg
https://www.akiba-online.com/thread...glish-sub-project.1920331/page-2#post-4014022

Attackers

ATID-375 - Minori Kawana
ATID-376 - Touka Rinne
ATID-377 - Iroha Natsume
ATID-378 - Nanase Kase
ATID-379 - Tsumugi Akari
ATID-380 - Yuka Arai
ATID-381 - Naoko Akase
ATID-382 - Rui Hiiragi
ATID-384 - Eimi Fukada
ATID-385 - Mizuki Reika
ATID-386 - Rumi Orikasa
ATID-387 - Nono Yuuki
ATID-388 - Hikari Ninomiya
ATID-389 - Sachiko
ATID-390 - Rin Chibana
ATID-391 - Minori Kawana
ATID-392 - Iroha Natsume
ATID-393 - Shouko Akase
ATID-394 - Akari Tsumugi
ATID-395 - Eimi Fukada

Kawaii

CAWD-019 - Moko Sakura
CAWD-020 - Ito Mayuki
CAWD-021 - Ichika Kasagi
CAWD-022 - Aika Usagi
CAWD-023 - Mirina Kousaka
CAWD-024 - Erika Arimura
CAWD-025 - Erika Misumi
CAWD-026 - Ayame Chinen
CAWD-027 - Aya Mochizuki
CAWD-028 - Mei Osaka
CAWD-029 - Moko Sakura _ Yui Nagase
CAWD-030 - Maiyuki Ito
CAWD-032 - Megumi Ayane
CAWD-033 - Kotome Himeno
CAWD-034
CAWD-035 - Akina Kokono
CAWD-036 - Mio Fukada

Honnaka

HND-732
HND-733 - Miu Nanase
HND-735 - Rich Valentine
HND-736 - Suzu Nonomiya
HND-737 - Sora Shiina
HND-738 - Kurumi Kashiwagi
HND-739 - Eimi Fukada
HND-740 - Nanami Matsumoto
HND-741 - Yui Nagase
HND-743 - Shouko Akase
HND-744 - Akari Neo
HND-745 - Akari Mitani
HND-746 - Aoi Kururigi
HND-748 - Nao Jinguuji
HND-749 - Hikaru Takagi
HND-750 - Megumi Ayane
HND-751 - Nagi Aiiro
HND-752 - Erika Arimura
HND-753 - Yuu Matsui
HND-754 - Momoe Takanashi
HND-756 - Eri Akira
HND-757 - Kaho Imai
HND-758 - Eimi Fukada
HND-760 - Akari Mitani
HND-761 - Touka Rinne
HND-762 - Shiori Kamisaki
HND-763 - Hayashi Aizai
HND-764 - Maho Mitsuhashi
HND-765 - Nao Jinguuji
HND-766 - Sora Shiina
HND-767 - Akari Neo
HND-768 - Minako Komukai
HND-769 - Mami Nagase
HND-770 - Kanon Kanon
HND-771 - Mitsuki Nagisa

IDEAPOCKET

IPX-393 - Maron Natsuki
IPX-395 - Tsubasa Amami
IPX-397 - Kokona Yuutsuki
IPX-398 - Karen Kaede
IPX-400 - Kana Momonogi
IPX-401 - Yume Nishinomiya
IPX-402 - Minami Aizawa
IPX-403 - Nanami Misaki
IPX-404 - Akari Tsumugi
IPX-405 - Minori Hatsune
IPX-407 - Yui Sanda
IPX-408 - Narumi Hirose
IPX-410 - Rin Monami
IPX-411 - Marin Natsuki
IPX-412 - Tsubasa Amami
IPX-413 - Kokona Yuutsuki
IPX-414 - Karen Kaede
IPX-416 - Kana Momonogi
IPX-417 - Yume Nishinomiya
IPX-418 - Nanami Misaki
IPX-419 - Akari Tsumugi
IPX-421 - Akari Neo
IPX-424 - Maron Natsuki
IPX-425 - Tsubasa Amami
IPX-426 - Kokona Yuutsuki
IPX-428 - Momo Sakura
IPX-429 - Kana Momonogi
IPX-430 - Tsumugi Akari
IPX-431 - Yume Nishimiya
IPX-432 - Nanami Misaki
IPX-434 - Yui Mita
IPX-436 - Narumi Hirose
IPX-438 - Momo Sakura
IPX-439 - Kana Momonogi
IPX-440 - Tsubasa Amami
IPX-441 - Yume Nishimiya _ Tsubasa Hachino

Madonna

JUL-028 - Momoko Isshiki
JUL-030 - Kaori Iiyama
JUL-031 - Tsubaki Amano
JUL-032 - Rui Hiiragi
JUL-033 - Ayane Haruka
JUL-034 - Rei Aoki
JUL-035 - Yui Nagase
JUL-036 - Shou Aoyama
JUL-037 - Juri Yamaguchi
JUL-038 - Hikari Kisaki
JUL-039 - Amina Nanjo
JUL-040 - Ryo Hitomi
JUL-041 - Kato Saki
JUL-042 - Saori Yagami
JUL-043 - Eri Akira
JUL-044 - Rumi Mochizuki
JUL-045 - Kaho Imai
JUL-046 - Maki Tomoda
JUL-047 - Shouko Akase
JUL-048 - Aka Mukai
JUL-049 - Kanon Kanon
JUL-050 - Arare Mochizuki
JUL-051 - Yuka Oshima
JUL-052 - Rin Chibana
JUL-053 - Honoka Tsujii
JUL-054 - Akari Mitani
JUL-055 - Shiori Sano
JUL-056 - Suzuna Mizukawa
JUL-057 - Nao Jinguuji
JUL-058 - Minami Kurisu
JUL-060 - Sena Maikawa
JUL-061 - Tsubaki Amano
JUL-062 - Momoko Isshiki
JUL-063 - Yukino Ooki
JUL-064 - Eimi Fukada
JUL-065 - Yuuko Shiraki
JUL-066 - Akari Neo
JUL-067 - Maki Hojo
JUL-068 - Airi Yukina
JUL-069 - Ayane Haruka
JUL-072 - Chiho Rukawa
JUL-073 - Saya Tanihana
JUL-074 - Ayumi Miura
JUL-075 - Sho Aoyama
JUL-076 - Juri Yamaguchi
JUL-077 - Mio Kusakari
JUL-078 - Saori Yagami
JUL-079 - Rumi Mochizuki
JUL-080 - Maki Tomoda
JUL-081 - Amina Nanjou
JUL-082 - Kurea Hasumi
JUL-083 - Arisa Nishimura
JUL-084 - Ai Mukai
JUL-085 - Haruna Kawakita
JUL-086 - Yuka Oshima
JUL-087 - Reiko Sawamura
JUL-088 - Sano Shiori
JUL-089 - Rika Aimi
JUL-090 - Iori Yoshioka
JUL-091 - Nao Jinguuji
JUL-092 - Ayane Sezaki
JUL-093 - Tsubaki Kanno
JUL-094 - Kana Mito
JUL-095 - Ena Koume
JUL-096 - Maki Hojo
JUL-097 - Akari Neo
JUL-098 - Minami Kurisu
JUL-099 - Ryo Hitomi
JUL-100 - Yiko Shiraki
JUL-101 - Sena Maikawa
JUL-102 - Yukino Oki
JUL-103 - Megumi Meguro
JUL-104 - Ayane Haruka
JUL-105 - Yuki Nanao
JUL-106 - Airi Kijima
JUL-109 - Rana Kawai
JUL-110 - Chiho Rukawa

Tameike Goro

MEYD-542 - Nao Nakano
MEYD-543 - Nanami Kawakami
MEYD-544 - Airi Kijima
MEYD-545 - Asami Nagase
MEYD-546 - Yui Nagase
MEYD-547 - Kurumi Kashiwagi
MEYD-548 - Eimi Fukada
MEYD-549 - Shoko Akase
MEYD-550 - Meguri
MEYD-551 - Nao Nakano
MEYD-552 - Nanami Kawakami _ Airi Kijima
MEYD-553 - Koharu Sakino
MEYD-554 - Megumi Meguro
MEYD-555 - Eimi Fukada
MEYD-556 - Kurumi Kashiwagi
MEYD-557 - Meiko Nakao
MEYD-558 - Sanai Hamabe

MOODYZ

MIAA-171 - Eimi Fukada
MIAA-172 - Yui Nagase
MIAA-173 - Mio Kimishima _ Rino Kirishima
MIAA-175 - Kanon Kanon
MIAA-176 - Arare Mochizuki
MIAA-177 - Yu Shinoda
MIAA-178 - Kurumi Kashiwagi
MIAA-180 - Ruka Inaba _ Rui Hiiragi
MIAA-181 - Mitsuki Nagisa
MIAA-182 - Kurumi Kashiwagi
MIAA-184 - Akari Neo
MIAA-185 - Kanon Kanon
MIAA-188 - Mitsuki Nagisa
MIAA-191 - Yui Nagase
MIAA-192 - Shizuku Kurosaki
MIAA-193 - Kaho Imai
MIAA-194 - Arare Mochizuki
MIAA-195 - Kurumi Kashiwagi
MIAA-196 - Mio Kimijima
MIAA-197 - Mizuki Yayoi
MIDE-697 - Minami Hatsukawa
MIDE-698 - Ena Koume
MIDE-699 - Mizuki Aiga
MIDE-700 - Kana Yume
MIDE-701 - Miu Nakamura
MIDE-702 - Mia Nanasawa
MIDE-704 - Tsubomi
MIDE-705 - Sakura Miura
MIDE-706 - Fumika Nakayama
MIDE-707 - Minami Hatsukawa
MIDE-708 - Mizuki Aiga
MIDE-709 - Shoko Takahashi
MIDE-710 - Nana Yagi
MIDE-711 - Mia Nanasawa
MIDE-713 - Tsubomi
MIDE-714 - Miyu Misaki
MIDE-715 - Sakura Miura
MIDE-716 - Kana Yume

Premium

PRED-193 - Aika Yamagishi
PRED-194 - Sera Asaka
PRED-195 - Yuki Takeuchi
PRED-196 - Eimi Fukada
PRED-197 - Kotone Fuyue
PRED-198 - Yu Shinoda
PRED-199 - Yui Hatano _ Mio Kimijima
PRED-200 - Aika Yamagishi
PRED-203 - Kurea Hasumi
PRED-204 - Momoe Takanashi
PRED-205 - Mai Kaede
PRED-207 - Yuki Takeuchi
PRED-208 - JULIA
PRED-209 - Kurumi Kashiwagi
PRED-211 - Yu Shinoda _ Kurea Hasumi
PRED-212 - An Ito
PRED-215 - Eri Akira
PRED-216 - An Ito
PRED-217 - Mitsuki Nagisa
PRED-218 - Eimi Fukada

S-1

SSNI-282 - Aoi Tsukasa
SSNI-283 - Aika Yumeino
SSNI-284 - Arina Hashimoto
SSNI-285 - Nanami Matsumoto
SSNI-286 - Yura Kano
SSNI-289 - Miru Sakamichi
SSNI-295 - Aoi
SSNI-296 - Miharu Usa
SSNI-297 - Koharu Suzuki
SSNI-298 - Shunka Ayami
SSNI-299 - Azusa Oto
SSNI-300 - Fumika Hatsune
SSNI-301 - Yua Mikami
SSNI-302 - Nami Hoshino
SSNI-303 - Aoi Tsukasa
SSNI-304 - Aika Yumeno
SSNI-305 - Arina Hashimoto
SSNI-306 - Nanami Matsumoto
SSNI-307 - Nene Yoshitaka
SSNI-308 - Yura Kano
SSNI-309 - Saika Kawakita
SSNI-310 - Miru Sakamichi
SSNI-594 - Nene Yoshitaka
SSNI-597 - Iga Mako
SSNI-598 - Jun Kakei
SSNI-599 - Marin Hinata
SSNI-600 - Mahoro Kamiki
SSNI-601 - Shiori Kotoi
SSNI-602 - Shinna Amin
SSNI-603 - Moe Amatsuka _ Miru Sakamichi
SSNI-604 - Hiyori Yoshioka
SSNI-605 - Moe Amatsuka
SSNI-606 - Saki Okuda
SSNI-608 - Miru Sakamichi
SSNI-610 - Hotaru Nogi
SSNI-611 - Rika Aimi
SSNI-612 - Nodoka Sahane
SSNI-613 - Mao Mashiro
SSNI-615 - Rin Kira
SSNI-616 - Ema Shiiba
SSNI-617 - Sora Amakawa
SSNI-619 - Aoi Tsukasa
SSNI-620 - Aika Yumeno
SSNI-621 - Arina Hashimoto
SSNI-623 - Yura Kanno
SSNI-624 - Ichika Hoshimiya
SSNI-625 - Jun Kakei
SSNI-627 - Shihori Kotoi
SSNI-628 - Amin Arana
SSNI-629 - Hiyori Yoshioka
SSNI-630 - Moe Amatsuka
SSNI-631 - Saki Okuda
SSNI-633 - Miru Sakamichi
SSNI-634 - Shion Yumi
SSNI-635 - Hotaru Nogi
SSNI-636 - Rika Aimi
SSNI-639 - Riko Shiraha
SSNI-640 - Rin Kira
SSNI-641 - Ema Shiiba
SSNI-642 - Sora Amakawa
SSNI-643 - RION
SSNI-644 - Yua Mikami
SSNI-645 - Tsukasa Aoi
SSNI-647 - Arina Hashimoto
SSNI-648 - Nene Yoshitaka
SSNI-652 - Jun Kakei
SSNI-653 - Marin Hinata
SSNI-654 - Shihori Kotoi
SSNI-655 - Amin Arana
SSNI-656 - Hiyori Yoshioka
SSNI-657 - Riho Hori
SSNI-659 - Moe Amatsuka
SSNI-660 - Saki Okuda
SSNI-661 - Minami Kojima (SUBS by slim_beech, SOURCE - Chinese subs) (Progress ???)
SSNI-662 - Miharu Usa
SSNI-663 - Miru Sakamichi
SSNI-664 - Shion Yumi
SSNI-665 - Hotaru Nogi
SSNI-666 - Nodoka Sahane
SSNI-668 - Riko Shiraha
SSNI-669 - Rin Kira
SSNI-670 - Sora Amakawa
SSNI-671 - Rion
SSNI-672 - Amu Hanamiya
SSNI-674 - Yua Mikami
SSNI-675 - Tsukasa Aoi
SSNI-676 - Aika Yumeno
SSNI-678 - Nene Yoshitaka
SSNI-679 - Yura Kano
SSNI-680 - Ichika Hoshimiya
SSNI-681 - Mako Iga
SSNI-683 - Marin Hinata
SSNI-684 - Shihori Kotoi
SSNI-685 - Amin Arana
SSNI-686 - Hiyori Yoshioka
SSNI-687 - Kyouko Shuri
SSNI-689 - Moe Amatsuka

SOD Create

STARS-173
STARS-139 - Yume Takeda
STARS-140 - Makoto Toda
STARS-143 - Maria Wakui
STARS-144 - Shou Nishino
STARS-145 - Ichika Nagano
STARS-146
STARS-147 - Yuna Ogura
STARS-148 - Tadai Mahiro
STARS-149 - Chiharu Minagawa
STARS-151 - Meru Ishihara
STARS-164 - Yuna Ogura _ Suzu Honjou
STARS-165 - Tadai Mahiro
STARS-166 - Tina Nanami
STARS-167 - Meru Ishihara
STARS-168 - Hikari Aozora
STARS-169 - Mana Sakura
STARS-170 - Makoto Toda _ Ruru Arisu
STARS-171 - Yuna Ogura _ Suzu Honjou
STARS-172 - Maria Wakui
STARS-174 - Ichika Nagano
STARS-175 - Hinata Koizumi
STARS-177 - Yuzu Shirakawa
STARS-178 - Masami Ichikawa
STARS-179 - Yuna Ogura
STARS-180 - Mahiro Tadai
STARS-181 - Tina Nanami
STARS-184 - Kogawa Iori
STARS-185 - Mana Sakura
STARS-186 - Makoto Toda
STARS-187 - Suzu Honjou
STARS-191 - Meru Ishihara
STARS-194 - Masami Ichikawa
STARS-195 - Yuna Ogura
STARS-196 - Mahiro Tadai
STARS-197 - Tina Nanami
STARS-200 - Mana Sakura
STARS-201 - Makoto Toda
STARS-203 - Maria Wakui
STARS-205 - Ichika Nagano

WANZ FACTORY

WANZ-900 - Rin Ayane
WANZ-902 - Sora Shiina
WANZ-903 - Matsuri Kiritani
WANZ-905 - Eimi Fukada
WANZ-907 - Rui Hiiragi
WANZ-908 - Mami Nagase
WANZ-909 - Sora Shiina
WANZ-910 - Tsubomi
WANZ-911 - Kaho Imai
WANZ-912 - Matsuri Kiritani
WANZ-913 - JULIA
WANZ-915 - Kanon Kanon
WANZ-916 - Shouko Akase
WANZ-917 - Aika Yamagishi
WANZ-918 - Tsubomi
WANZ-919 - Mikan Kururugi
WANZ-920 - Sora Shiina
WANZ-921 - Matsuri Kiritani
WANZ-923 - Ai Sayama
WANZ-924 - Mizuki Yayoi
 

Attachments

  • ssni661pl.jpg
    ssni661pl.jpg
    162.4 KB · Views: 322

George1980

Well-Known Member
Oct 24, 2016
79
300
I'm interested. I've been downloading Chinese subbed videos so I can rip the subtitles. Although now I don't know if they've already been ripped lol! I should look into these sites a bit more.
 

ding73ding

Akiba Citizen
Oct 25, 2009
2,331
2,065
Awesome! So nice to get a batch of new softsubs. I can help a little bit... but for next couple weeks I will be very busy so I won't be starting translating for a while. Wish you all the best.
 

George1980

Well-Known Member
Oct 24, 2016
79
300
BTW, what are the acceptable metrics? Do the translations need to be perfect or just machine translated?

EDIT: How do I upload subtitles on Jamakav?
 

avatarthe

Well-Known Member
Feb 1, 2008
184
277
George1980 said: BTW, what are the acceptable metrics? Do the translations need to be perfect or just machine translated?
Better than just machine translation. Machine translated, human corrected and tweaked following the process outlined above.
 

ding73ding

Akiba Citizen
Oct 25, 2009
2,331
2,065
George1980 said: BTW, what are the acceptable metrics? Do the translations need to be perfect or just machine translated?
Machine translation certainly has positive value. For many users, the difference between 0 and 0.1 is infinity, but for other users 0.1 is very close to 0. I suggest any machine-translated sub gets tagged in the filename, for example SSNI-678.machine.en.srt
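The naming convention suggested above is easy to apply mechanically. The following Python sketch builds and recognizes such filenames; the helper names `tagged_name` and `is_machine` are made up for illustration, and the exact tag order (code, "machine", language, extension) is simply taken from the SSNI-678.machine.en.srt example.

```python
from pathlib import Path


def tagged_name(video_code, lang="en", machine=True):
    """Build a subtitle filename such as 'SSNI-678.machine.en.srt'."""
    parts = [video_code]
    if machine:
        parts.append("machine")  # marks an uncorrected machine translation
    parts += [lang, "srt"]
    return ".".join(parts)


def is_machine(filename):
    """True if the filename carries the 'machine' tag between dots."""
    return "machine" in Path(filename).name.lower().split(".")


print(tagged_name("SSNI-678"))                 # machine-translated draft
print(tagged_name("SSNI-678", machine=False))  # human-corrected version
```

With a tag like this, a human-corrected file can simply drop the "machine" part, and anyone browsing a folder can filter drafts from finished subs by name alone.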
 

ding73ding

Akiba Citizen
Oct 25, 2009
2,331
2,065
Hey, one way I can help is coding. I notice some folks have problems with SRT files broken into parts, which are time consuming to splice together again. It's a trivial operation for me using Python (I wrote a little SRT toolkit and once in a while add new tools to it), but it's a little technical to use (technical in the sense that you, the user, have to perform the splicing in Command Prompt). I am not keen to create any kind of user interface.

I just took a look at the Google Translate API, and it looks pretty easy to invoke from Python ... will see if I can squeeze out some time to add that to my SRT toolbox. Or if you guys have some requests I will consider writing a program to make your life easier.
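For readers curious what splicing SRT parts involves, here is a minimal, standard-library-only Python sketch. It is not the toolkit mentioned above (which is unreleased); the function names are hypothetical, and it assumes each part's timestamps restart at zero, so you have to supply the offset (in milliseconds) at which each part begins in the full video.

```python
import re

TIME = re.compile(r"(\d+):(\d{2}):(\d{2}),(\d{3})")


def to_ms(stamp):
    """Convert an SRT timestamp 'HH:MM:SS,mmm' to milliseconds."""
    h, m, s, ms = map(int, TIME.match(stamp).groups())
    return ((h * 60 + m) * 60 + s) * 1000 + ms


def to_stamp(ms):
    """Convert milliseconds back to 'HH:MM:SS,mmm'."""
    s, ms = divmod(ms, 1000)
    m, s = divmod(s, 60)
    h, m = divmod(m, 60)
    return f"{h:02d}:{m:02d}:{s:02d},{ms:03d}"


def parse(srt_text):
    """Return a list of (start_ms, end_ms, text) cues from an SRT string."""
    cues = []
    for block in re.split(r"\n\s*\n", srt_text.strip()):
        lines = block.splitlines()
        if len(lines) < 2 or "-->" not in lines[1]:
            continue  # skip anything that is not a well-formed cue
        start, end = (to_ms(t.strip()) for t in lines[1].split("-->"))
        cues.append((start, end, "\n".join(lines[2:])))
    return cues


def splice(parts, offsets_ms):
    """Shift each part by its offset, concatenate, and renumber the cues."""
    merged = []
    for srt_text, offset in zip(parts, offsets_ms):
        merged += [(s + offset, e + offset, t) for s, e, t in parse(srt_text)]
    return "\n".join(
        f"{i}\n{to_stamp(s)} --> {to_stamp(e)}\n{t}\n"
        for i, (s, e, t) in enumerate(merged, 1)
    )
```

For example, if part 2 starts 60 seconds into the film, `splice([part1_text, part2_text], [0, 60_000])` returns one SRT string with continuous numbering and shifted times for part 2.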
 

ding73ding

Akiba Citizen
Oct 25, 2009
2,331
2,065
See... I have no free time... but one coffee break can turn into a rabbit hole. Here is a Google Translate version of MIDE-709... it's pretty rubbish. Can't blame Google; the style, lexicon and grammar of Chinese subs are pretty particular.

And to show you how simple it is in Python, here is the complete code I wrote; yep, it took me less than 30 minutes:

Code:
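# srttool is my own (unreleased) SRT toolkit; googletrans is a third-party package
import srttool as st
from googletrans import Translator

# Read the Chinese SRT; the third field of each entry (s[2]) is the subtitle text
s1 = st.readsrt(r"J:\fun\av\srt\2020\MOODYZ\MIDE-709 - Shoko Takahashi\MIDE709.FHD.srt")

# Send only the text lines to Google Translate (Simplified Chinese -> English)
translator = Translator()
outs = translator.translate([s[2] for s in s1], src='zh-cn', dest='en')
# Keep the first two fields of each entry unchanged and swap in the translated text
strans = [(s[0], s[1], t.text) for (s,t) in zip(s1,outs)]

# Write the result, tagged as a machine translation in the filename
st.writesrt(strans, r"J:\fun\av\srt\2020\MOODYZ\MIDE-709 - Shoko Takahashi\MIDE-709.machine.en.srt")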

(no you can't run this code since I'm not ready to release srttool yet)

EDIT: so it would be easy (like another coffee break) to put the above code in a loop that crawls through the directories and machine-translates all 417 vids (I scanned 455 SRT files, but I think the extra 38 files are identical duplicates).
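A directory crawl that skips identical duplicates could look roughly like the sketch below. This is not the poster's program: the function names are hypothetical, "identical" is taken to mean byte-for-byte identical content (compared via SHA-256), and the `*.srt` pattern is case-sensitive on some systems.

```python
import hashlib
from collections import defaultdict
from pathlib import Path


def group_identical(root, pattern="*.srt"):
    """Map content hash -> list of paths under `root` with identical bytes."""
    groups = defaultdict(list)
    for path in sorted(Path(root).rglob(pattern)):
        digest = hashlib.sha256(path.read_bytes()).hexdigest()
        groups[digest].append(path)
    return groups


def unique_files(root, pattern="*.srt"):
    """Return one representative path for each distinct file content."""
    return [paths[0] for paths in group_identical(root, pattern).values()]
```

Looping over `unique_files(r"J:\fun\av\srt\2020")` instead of every file would avoid translating the same subtitle twice, which matters when the translation service limits daily usage.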

The only hiccup is that Google seems not to like being spammed, according to both the creator of googletrans and @avatarthe, so I can write the program but I don't want to run it on any of my machines. Frankly I am paranoid about Google. So if any of you want to take this on, we can work together: I can deal with all the programming technicalities, but you have to learn how to run Python scripts from the command prompt, using your own machine/IP address. And of course you have to share the fruits.

EDIT (2): I cringed the first time I read avatarthe's post; so unbelievably painful. You should feed pure text to any machine translator, instead of formatted text (esp. SRT with time codes). Take the Chinese SRT, split it up into time codes and pure text, feed the text to be machine translated, and re-merge the time codes with the English text. That's what my program does.

If you would otherwise spend time undoing Chrome's messing up of time codes, or splicing broken parts by hand, it's better to learn some old-school computer stuff (how to do things at a command prompt) and use my programs. I can write some powerful programs but I don't want Google to tag me (my IP) as a spammer, so someone else has to run them, and I don't want to make a GUI 'cause there's no fun or profit in it.
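The split / translate / re-merge idea can be sketched with the standard library alone. This is an illustration, not the poster's srttool: the function names are hypothetical, and `translate_lines` stands for any function you supply that takes a list of strings and returns a list of translated strings of the same length (for example a thin wrapper around a translation service).

```python
import re


def split_srt(srt_text):
    """Return (timecodes, texts): two parallel lists, one entry per cue."""
    timecodes, texts = [], []
    for block in re.split(r"\n\s*\n", srt_text.strip()):
        lines = block.splitlines()
        if len(lines) >= 2 and "-->" in lines[1]:
            timecodes.append(lines[1].strip())
            # Join multi-line cues so each cue is translated as one unit.
            texts.append(" ".join(lines[2:]))
    return timecodes, texts


def merge_srt(timecodes, texts):
    """Rebuild an SRT string from time codes and (translated) texts."""
    if len(timecodes) != len(texts):
        raise ValueError("translator must return one string per cue")
    return "\n".join(
        f"{i}\n{tc}\n{txt}\n"
        for i, (tc, txt) in enumerate(zip(timecodes, texts), 1)
    )


def translate_srt(srt_text, translate_lines):
    """Translate only the text of each cue; time codes pass through untouched."""
    timecodes, texts = split_srt(srt_text)
    return merge_srt(timecodes, translate_lines(texts))
```

Because the translator never sees the time codes, it cannot mangle them, which is the whole point of the approach described above.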
 

Attachments

  • MIDE-709.machine.en.srt.txt
    32.7 KB · Views: 548

Arny Jackson

Well-Known Member
Oct 9, 2019
147
792
Great project !! Enjoy my new eng subs !!!
 

Attachments

  • Subbed.rar
    240.9 KB · Views: 2,331
  • Capturehjhjh.JPG
    Capturehjhjh.JPG
    57.7 KB · Views: 977

wowsers

New Member
Mar 12, 2019
21
7
Great idea well presented.

But what is the value of translating Chinese->Japanese->English?

You can get a good translation direct from Chinese using Yandex. I must be missing something.
 

George1980

Well-Known Member
Oct 24, 2016
79
300
I've been using Yandex to translate. It comes out with better grammar than the other translators, though it does get it wrong every once in a while.
 

avatarthe

Well-Known Member
Feb 1, 2008
184
277
wowsers said: Great idea well presented.

But what is the value of translating Chinese->Japanese->English?

You can get a good translation direct from Chinese using Yandex. I must be missing something.

George1980 said: I've been using yandex to translate. It comes out with better grammar than the other translators, though it does get it wrong every once in a while.

No, not double translation (Chinese -> Japanese -> English) but TWO translations: the first is Chinese to English (from the Chinese subtitles), the second is Japanese to English, using pyTranscriber on the movie. If all translations were perfect these 2 translations would be exactly the same, but obviously they are not, and by comparing the 2 English translations you can better understand what was actually being said in Japanese. You can use this insight to fix errors and omissions in the translated subtitles. I’ll do one tonight and post it so people can see how much better the subtitles are. Also, I have noticed machine translations of Thai or Korean translations (which were made from Chinese subtitles) are so much better than machine translations of the original Chinese subtitles. This may be due to better translation algorithms or, more importantly, because the Thai and Korean translators are going through a double-script technique like the one I suggest, making their subs more accurate and easier for the machine to translate.
 

DarkMask

Active Member
Jul 26, 2012
119
83
You should create a GitHub repo where people can upload translations so we can have an index instead of a thread where things will get buried.
 

avatarthe

Well-Known Member
Feb 1, 2008
184
277
Quoted question: where do you get chinese subtitle? I want to get more
1. First, download a film with Chinese hard subs (these files usually have a “c” added to the name, like “Pppd-807-c”).
2. Then use VideoSubFinder to extract the subs and ABBYY FineReader to do the OCR.


https://sourceforge.net/projects/videosubfinder/

The biggest issue is that ABBYY FineReader is a commercial piece of software that costs $199...
 

avatarthe

Well-Known Member
Feb 1, 2008
184
277
Ok, I was hoping for more interest in this project...

To try and bolster interest and show how I think this should work, I have started translating MIDE-749. These subtitles are not included in the zip file, but I have a thing for Miura Sakura and her unreal body. She does a great job in this film as the highly inappropriate and very horny big sister. The source for the subs is a set of Korean subs which were translated from Chinese subs by Epeudi at Jamakav. I am using a machine translation of his subs along with a pyTranscriber transcription of the film to make my subs.
They should be finished tomorrow and I'll upload them here.


mide-749-poster-jpg.2058876
 

ding73ding

Akiba Citizen
Oct 25, 2009
2,331
2,065
Quite by chance I was playing with your recommended VideoSubFinder for the first time and I picked a hardsub version of the same vid. Managed to get good RGBImage but not TXTImage, so I cleaned out some spurious RGBImage (cut over 200, leaving 459 caps) and generated an empty sub file. At that point I stopped because I am not sure where I want to go with this. Having gone through the frame grabs (RGBImage) of scene 1 and the beginning of scene 2, I got a very good idea of the story and don't feel I need to even follow all the subtitles, much less translate all 459 lines to English.

Well if you are doing most of the work and obviously you are neither fluent in Japanese, Chinese, nor Korean I think I can offer a little help. Only a little. If you have difficulty with some lines, mark them with hash signs (###) and I will manually translate/correct for you.

(I think the Chinese sub is fairly good, but definitely machine translation will make a stinky pile out of it.)
 

avatarthe

Well-Known Member
Feb 1, 2008
184
277
Take a look at my subs tomorrow and let me know what you think
 

ding73ding

Akiba Citizen
Oct 25, 2009
2,331
2,065
Another effort I made today... which hasn't borne any fruit yet... is trying my Google Translate program on some old Korean softsubs. So I picked a Korean SRT file for a vid that is in my collection but lacks a Chinese sub (soft or hard). ('Cause if I had a Chinese sub for it, I wouldn't need machine translation.)

Actually, one big problem is that I have a few folders of Korean subs with really badly named files. These folders were mostly duplicated but not totally, and bad filenames (random formats and lots of Korean) make them nearly useless. So first I wrote a program to pick out the video code from the randomly formatted filenames and rename all the files to "video-code (n).srt/.smi/.ass" etc. With consistently formatted filenames I can now merge all the folders together and eliminate identical files (but not different files for the same vid). In the end I had well over 2000 files for only 789 different video codes. Anyway, that's big progress just for having an easy way to find out if any vid is already Korean soft-subbed. Combined with my JAV database it's now also easy to, say, find all Chinese or Korean softsubs (in my collection) by performer.

Anyway, I picked a slightly random softsub for KR->EN machine translation: TEK-072. Fed it into my program and got some garbage English that is, well, definitely English, but not making any sense at all (so many repeats of the word "hate" make no sense for a MUTEKI vid of Yua Mikami), and then I tried to fiddle with a little something and retried the translation. Boom... I broke Google Translate. At least I broke it for myself. I'm pretty sure I've used up the unknown daily limit of Google Translate, because the exact same program and data that were working previously no longer work.
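The renaming step described above might be sketched as follows. This is a guess at the approach, not the poster's program: the regular expression assumes a video code is 2 to 6 letters, an optional separator, then 3 to 5 digits, and it takes the first such match in the filename, so names that start with a site tag containing letters and digits would be misread. The function names are hypothetical.

```python
import re
from collections import Counter
from pathlib import Path

# Assumed code shape: letters, optional '-', '_' or space, then digits.
CODE = re.compile(r"([A-Za-z]{2,6})[-_ ]?(\d{3,5})")


def video_code(filename):
    """Return a normalized code such as 'MIDE-709', or None if none is found."""
    m = CODE.search(filename)
    if not m:
        return None
    return f"{m.group(1).upper()}-{m.group(2)}"


def planned_names(filenames):
    """Map each recognizable filename to 'CODE (n).ext', numbering per code."""
    seen = Counter()
    plan = {}
    for name in sorted(filenames):
        code = video_code(name)
        if code is None:
            continue  # leave unrecognized files untouched
        seen[code] += 1
        plan[name] = f"{code} ({seen[code]}){Path(name).suffix.lower()}"
    return plan
```

`planned_names` only returns the proposed mapping; reviewing that mapping before calling `Path.rename` on each entry is a cheap safeguard against a bad regex match renaming files incorrectly.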

So my mini conclusion is: quality is not quite passable, quantity is very very limited, not enough to translate a vid (664 lines) 3 or 4 times.
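One way to stretch a limited daily quota is to cache every translated line on disk, so that re-running a job after a small tweak only sends lines that have not been translated before. The sketch below is an assumption about how this could be done, not something from the thread: `translate_cached` and the JSON cache file are hypothetical, and `translate_lines` is any function you supply that maps a list of strings to a list of translations of the same length.

```python
import json
from pathlib import Path


def translate_cached(lines, translate_lines, cache_path):
    """Translate `lines`, reusing results stored in a JSON cache file."""
    cache_file = Path(cache_path)
    cache = {}
    if cache_file.exists():
        cache = json.loads(cache_file.read_text(encoding="utf-8"))
    # Only unseen lines are sent to the service (deduplicated, order kept).
    missing = list(dict.fromkeys(l for l in lines if l not in cache))
    if missing:
        cache.update(zip(missing, translate_lines(missing)))
        cache_file.write_text(
            json.dumps(cache, ensure_ascii=False, indent=1), encoding="utf-8"
        )
    return [cache[l] for l in lines]
```

Subtitle files tend to repeat short lines, so even within a single 664-line file the deduplication alone can reduce how much text is sent.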
 

Arny Jackson

Well-Known Member
Oct 9, 2019
147
792
Just another Miura fan here! Glad to see you working on MIDE-749. I have subs for almost all the Miura movies; MIDE-540 is the hottest! If you need any subs of her, ask me! Waiting for the finished product!!