Super-Massive SRT dump "D" Stay tuned for revised info
Last edited:
Ty IMScully. You are one of the few ppl that give me a thumbs up and I appreciate it.Super-Massive SRT dump "D" Stay tuned for revised info
I did some testing with the flash and pro 2.5 model, 2 or 3 months ago, which in the AI universe is a century ago. It was a lil worse imo.Anyone done some testing how gemini models compare to deepseek for translation?
Interesting. I did some testing this week and in my opinion based on 2 examples (chinese source), they both are equal in quality. Some lines are better with deepseek and some are better with gemini 2.5 flash.I did some testing with the flash and pro 2.5 model, 2 or 3 months ago, which in the AI universe is a century ago. It was a lil worse imo.
The flash version is really cheap and very fast, which imo is really an option if you don't mind the slightly worse translation.
Apparently the gemini pro 2.5 06-05 version is a lot better for translation ,but it's still waaay too pricy, so haven't even tried it, and my €300 free credits have expired. There's no flash version yet afaik, which will should be a lot cheaper.
As I also do like 70% of my translations on chinese sources (30% Japanese transcriptions), gonna be hard to beat deepseek as well. As I assume it's more trained on Chinese.
Wow! You're rocking it, Dude. Thanks for sharing.Well folks, here is a new installment in the Super-Mucho-Mega SRT Adventure. What follows is E-G and is about 1600 .srt's. I'm kind of tired of doing this project so I may take a little break. Hope these are useful to some of you. Cheers.
E: https://drive.google.com/file/d/18lW0IFS4bOavNxVCXsjwSpBeuG77_Bg0/view?usp=sharing
F: https://drive.google.com/file/d/1SMEpdOj88ilzRRFxAiDFX5SYNKqpDfKF/view?usp=sharing
G: https://drive.google.com/file/d/1b0dKY-lChc4J5NSygbtru9j8tcNhWzJf/view?usp=sharing
I have seen this a bunch of times Freespirit and I have gone through a ton of tweaks with my .bat and.py scripts to try to minimize these types of things. I have absolutely no idea how this happens. Once in a while it will 'hallucinate' this for half of a video. IF you ever find a solutione would you please let me know. Thanks.Anyone know why when you use whisper in Large mode to get subtitles it has things like: "Naokiman Show Instagramуют" "Please subscribe to the channel." "Thank you for watching until the end today" "See you in the next video." and "Thank you for watching!" in the text when it clearly isn't being said in the video?
Anyone know why when you use whisper in Large mode to get subtitles it has things like: "Naokiman Show Instagramуют" "Please subscribe to the channel." "Thank you for watching until the end today" "See you in the next video." and "Thank you for watching!" in the text when it clearly isn't being said in the video?
Thanks Scully, I was starting to think that only a few people gave a crap about them and was toying with the idea of just pm'ing those people with the links instead of just posting them in the open forum. I'll ponder that but ultimately if I go that route you are definitely on the list. Cheers.Wow! You're rocking it, Dude. Thanks for sharing.
That is interesting Sam. I have my configs set to minimize AI Guessing/learning and only translating exactly what it 'hears' . I don't see it much but I suppose you can't totally rely on Whisper to not 'guess' now and then. Thanks for the response.Because the AI model has been trained on youtube videos and all AI do is guessing so since they say that a lot, it's a go to phrase it'll use when uncertain about something, especially if it matches when it's usually said, either at the beginning or end of it.
Yeah, that makes a lot of sense. One day, Probably past my expiry date, but One day the technology will be better at context. Right now it reads text and puts the dots together. Sometimes I wonder if I should even bother with this subtitle project because who knows what the technology will be like in a Year, or 6 Months, or Tomorrow Morning.I don't know how those settings work exactly, but what we have as AI doesn't think at all, it's all educated guesses from doing/associating something over and over again(when I see this, that is the expected result) so if the data it's trained on has issues, those issues are passed on to your results, it doesn't know it's just guessing since for it, what it knows is what it is.
Since most youtube video will have that sentence or something similar at the end, the AI learns that this is the normal thing to say at the end of something and if you use a VAD, you end up with a lot of ends since it's all split into chunks.
They are getting better but it's still happening.
Anyone know why when you use whisper in Large mode to get subtitles it has things like: "Naokiman Show Instagramуют" "Please subscribe to the channel." "Thank you for watching until the end today" "See you in the next video." and "Thank you for watching!" in the text when it clearly isn't being said in the video?
These files have been removed from Google Drive and will be reposted, with a few tweaks, later today.Wow! You're rocking it, Dude. Thanks for sharing.