Before it can only process one file then it will say out of memory error. Now it'll keep continuing after each file without out of memory errors.What's your edit? I'm using the one that you've uploaded before
Before it can only process one file then it will say out of memory error. Now it'll keep continuing after each file without out of memory errors.What's your edit? I'm using the one that you've uploaded before
I'll try it later, thank you for the update, currently using the previous one. It was so convenient but haha I need to manually clean out the moan soundBefore it can only process one file then it will say out of memory error. Now it'll keep continuing after each file without out of memory errors.
Yup, the initial_prompt tells Whisper to transcribe everything even moans. You don't have to manually clean it. Use my clean_japanese.py. It will automatically clean all repetitions, moaning, laughing, and garbage.I'll try it later, thank you for the update, currently using the previous one. It was so convenient but haha I need to manually clean out the moan sound
So I should just leave it as is for now, and it will be cleaned up later? I’ve used up my runtime for today, but I’ll continue working on the file in Colab tomorrow.Yup, the initial_prompt tells Whisper to transcribe everything even moans. You don't have to manually clean it. Use my clean_japanese.py. It will automatically clean all repetitions, moaning, laughing, and garbage.
Keep the audio file, json and partial filenames the same and it will load the partial srt tomorrow and continue appending to it. Or upload those files to a different Colab account and it will also continue.So I should just leave it as is for now, and it will be cleaned up later? I’ve used up my runtime for today, but I’ll continue working on the file in Colab tomorrow.
Thank you so much for this, can I used it in Linux Mint? I ditched Windows 3 years ago and never go back to that OS.Having Python installed on your PC will save you so much time. I have lots of Python scripts to share. I have Python 3.10.11 installed for reference.
My clean_japanese.py removes repetitions, cleans up moaning, deletes garbage lines and does replacements of mis-transcribed phrases with whatever you specify.
View attachment 3682384
Sharing my Python Selenium script to automate translation of srt files on DeepL. It doesn't use any API, just uses the free website. It mimics how a user might manually do it. You don't have to move your mouse, it does all the work! Need to download chromedriver.exe and set it for example driver_path=r'C:\Program Files (x86)\chromedriver.exe' My script takes the srt file, removes the index numbers, timestamps and only pastes the Chinese/Japanese text to the input box, then scrolls to the center, then clicks the copy button, then clears the textbox and keeps repeating. At the end, all English text will combine to form the final srt. Note there is a limit on DeepL, a ~900 line subtitle file might barely finish till completion, sometimes DeepL will have a popup saying something like, "You have been translating too much. Please subscribe to Pro". The script will also break whenever DeepL changes the look of their website.
Download: https://drive.google.com/drive/folders/18xVzdHG8X1PoLZWbvaYliADLnY3kNr_L
View attachment 3682385
Yes both scripts will work. If you are not able to start the script, then you missed installing something. Ask Claude, Gemini, Perplexity or ChatGPT for installation help.Thank you so much for this, can I used it in Linux Mint? I ditched Windows 3 years ago and never go back to that OS.