Well, I tried to formulate a cohesive response for you, but my tangential babbling was characteristically all over the place, so I employed DeepSeek to help me respond clearly. Here is the reply from DeepSeek:
Response to Faster-Whisper-XXL Inquiry:
1. Setup Instructions
- Ensure you have Python installed (tested with Python 3.8+).
- Install the required dependencies:
    pip install faster-whisper regex
- Place all three files (clean_subs.py, transcribe.py, and whisper.bat) in the same directory as your media files.
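If you want to confirm the setup before running anything, a small check like the one below can help. This is only a sketch: the function name check_setup is made up for illustration, and it assumes the three files sit in the folder you pass in.

```python
import importlib.util
import sys
from pathlib import Path

REQUIRED_MODULES = ["faster_whisper", "regex"]  # import names of the pip packages
REQUIRED_FILES = ["clean_subs.py", "transcribe.py", "whisper.bat"]


def check_setup(folder="."):
    """Return a list of problems found; an empty list means every check passed."""
    problems = []
    if sys.version_info < (3, 8):
        problems.append("Python 3.8 or newer is required")
    for name in REQUIRED_MODULES:
        # find_spec returns None when a top-level package is not installed
        if importlib.util.find_spec(name) is None:
            problems.append(f"missing package: {name}")
    for name in REQUIRED_FILES:
        if not (Path(folder) / name).is_file():
            problems.append(f"missing file: {name}")
    return problems


if __name__ == "__main__":
    for line in check_setup() or ["Setup looks complete."]:
        print(line)
```

Run it from the folder that holds your media files; anything it prints other than "Setup looks complete." is something to fix first.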
2. Usage Options
Option A: Automated Processing (Recommended)
- Simply double-click whisper.bat. This will:
  - Transcribe all supported media files (MP4, AVI, MKV, etc.) in the directory using Faster-Whisper (large-v3 model by default).
  - Automatically clean the generated .srt files using clean_subs.py.
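To give a rough idea of what the transcription step does, here is a minimal sketch using the faster-whisper Python API. It is not the actual transcribe.py: the extension list, the helper names, and the device and compute_type values are assumptions chosen for illustration.

```python
from pathlib import Path

# Assumed subset of extensions; the post only says "MP4, AVI, MKV, etc."
MEDIA_EXTENSIONS = {".mp4", ".avi", ".mkv"}


def find_media(folder="."):
    """Return media files in `folder`, matched by extension (case-insensitive)."""
    return sorted(
        p for p in Path(folder).iterdir()
        if p.is_file() and p.suffix.lower() in MEDIA_EXTENSIONS
    )


def srt_timestamp(seconds):
    """Format a time in seconds as an SRT timestamp, HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, millis = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def transcribe_folder(folder=".", model_name="large-v3", language=None):
    """Write one .srt next to each media file and return the paths written."""
    # Imported here so the helpers above work without faster-whisper installed.
    from faster_whisper import WhisperModel

    # The model is loaded once and reused for every file.
    model = WhisperModel(model_name, device="auto", compute_type="default")
    written = []
    for media in find_media(folder):
        segments, _info = model.transcribe(
            str(media), language=language, vad_filter=True
        )
        out_path = media.with_suffix(".srt")
        with open(out_path, "w", encoding="utf-8") as handle:
            for index, seg in enumerate(segments, start=1):
                handle.write(f"{index}\n")
                handle.write(
                    f"{srt_timestamp(seg.start)} --> {srt_timestamp(seg.end)}\n"
                )
                handle.write(f"{seg.text.strip()}\n\n")
        written.append(out_path)
    return written
```

Calling transcribe_folder(".") would process the current folder. The first run downloads the model, so expect a wait.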
Option B: Manual Cleaning (For Existing SRT Files)
If you already have an .srt file (e.g., from Faster-Whisper-XXL), run:
    python clean_subs.py input.srt output.srt
This will apply the adult-content-optimized cleaning rules to your subtitles.
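The actual rules inside clean_subs.py are not reproduced here, so the following is only a generic sketch of what an SRT cleaning pass can look like. It drops empty cues, drops a cue that repeats the previous cue's text (a common Whisper loop artifact), and renumbers what is left. It uses the standard-library re module rather than the regex package, and all function names are illustrative.

```python
import re
import sys

TIMING = re.compile(r"\d{2}:\d{2}:\d{2},\d{3} --> \d{2}:\d{2}:\d{2},\d{3}")


def parse_srt(text):
    """Return a list of (timing_line, body_text) cues."""
    cues = []
    normalized = text.replace("\r\n", "\n").strip()
    for block in re.split(r"\n\s*\n", normalized):
        lines = [line.strip() for line in block.split("\n")]
        for i, line in enumerate(lines):
            if TIMING.match(line):
                body = "\n".join(l for l in lines[i + 1:] if l)
                cues.append((line, body))
                break
    return cues


def clean_cues(cues):
    """Drop empty cues and cues that repeat the previous cue's text."""
    cleaned = []
    for timing, body in cues:
        if not body:
            continue
        if cleaned and cleaned[-1][1].lower() == body.lower():
            continue  # consecutive repeat; the earlier cue's timing is kept
        cleaned.append((timing, body))
    return cleaned


def format_srt(cues):
    """Serialize cues back to SRT text, renumbering from 1."""
    return "".join(
        f"{i}\n{timing}\n{body}\n\n" for i, (timing, body) in enumerate(cues, 1)
    )


if __name__ == "__main__" and len(sys.argv) == 3:
    with open(sys.argv[1], encoding="utf-8") as src:
        result = format_srt(clean_cues(parse_srt(src.read())))
    with open(sys.argv[2], "w", encoding="utf-8") as dst:
        dst.write(result)
```

Saved under a name of your choosing, it takes the same two arguments as the command above: the input .srt and the output .srt.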
3. Notes on Faster-Whisper-XXL
- "XXL" is not an official Faster-Whisper model size. The standard options are:
  - tiny, base, small, medium, large-v1, large-v2, large-v3.
- If your tool uses a custom model, ensure it outputs standard .srt files. The cleaning script will still work as long as the format is valid.
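If you are unsure whether a file from another tool counts as "standard" SRT, a quick format check along these lines may help. It is a sketch with an illustrative function name, and it only tests the basics: a numeric index line, a well-formed timing line, and an end time that is not before the start time.

```python
import re

TIMING = re.compile(
    r"^(\d{2}):(\d{2}):(\d{2}),(\d{3}) --> (\d{2}):(\d{2}):(\d{2}),(\d{3})$"
)


def _to_ms(hours, minutes, seconds, millis):
    """Convert the four captured timestamp fields to milliseconds."""
    return ((int(hours) * 60 + int(minutes)) * 60 + int(seconds)) * 1000 + int(millis)


def validate_srt(text):
    """Return a list of format problems; an empty list means none were found."""
    errors = []
    # Strip a UTF-8 byte order mark if present, then normalize line endings.
    normalized = text.lstrip("\ufeff").replace("\r\n", "\n").strip()
    blocks = re.split(r"\n\s*\n", normalized)
    for number, block in enumerate(blocks, start=1):
        lines = block.split("\n")
        if len(lines) < 2:
            errors.append(f"cue {number}: expected an index line and a timing line")
            continue
        if not lines[0].strip().isdigit():
            errors.append(f"cue {number}: index line is not a number")
        match = TIMING.match(lines[1].strip())
        if not match:
            errors.append(f"cue {number}: malformed timing line")
            continue
        fields = match.groups()
        if _to_ms(*fields[4:]) < _to_ms(*fields[:4]):
            errors.append(f"cue {number}: end time is before start time")
    return errors
```

An empty list from validate_srt means the file passed these basic checks, which is a reasonable sign the cleaning script can read it.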
4. Customization
- To change the model (e.g., to large-v2), edit whisper.bat and update:
    set "MODEL=large-v3"
- For language support, modify DEFAULT_LANG in the batch file or specify it manually:
    python transcribe.py input.mp4 ja
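For anyone curious how a script can accept the file and an optional language code the way the command above does, here is a minimal argparse sketch. The DEFAULT_LANG value and the --model flag are assumptions for illustration; the real transcribe.py may read its arguments differently.

```python
import argparse

DEFAULT_LANG = "en"  # hypothetical default; the real one is set in the batch file


def parse_args(argv=None):
    """Parse `<media> [language] [--model NAME]` from argv (or sys.argv)."""
    parser = argparse.ArgumentParser(description="Transcribe one media file.")
    parser.add_argument("media", help="path to the media file, e.g. input.mp4")
    parser.add_argument(
        "language",
        nargs="?",  # optional positional: falls back to DEFAULT_LANG
        default=DEFAULT_LANG,
        help="language code such as ja or en",
    )
    parser.add_argument("--model", default="large-v3", help="model size to load")
    return parser.parse_args(argv)
```

With this, parse_args(["input.mp4", "ja"]) gives language "ja", and leaving the language off falls back to the default.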
5. Troubleshooting
- If errors occur, check whisper_results.log (for transcription) or the console output (for cleaning).
- Ensure your GPU drivers/CUDA are properly configured if using device="cuda".
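One way to check the GPU side is to ask CTranslate2, the engine that faster-whisper runs on, how many CUDA devices it can see. The sketch below falls back to the CPU when the count is zero or the library is missing; the function names are illustrative.

```python
def cuda_device_count():
    """Return the number of CUDA devices CTranslate2 reports, or None if unavailable."""
    try:
        import ctranslate2  # installed as a dependency of faster-whisper
    except ImportError:
        return None
    try:
        return ctranslate2.get_cuda_device_count()
    except RuntimeError:
        # Treat a driver or runtime failure as "no usable GPU".
        return 0


def pick_device():
    """Choose "cuda" when at least one GPU is visible, otherwise "cpu"."""
    return "cuda" if cuda_device_count() else "cpu"


if __name__ == "__main__":
    print("CUDA devices:", cuda_device_count())
    print("Suggested device:", pick_device())
```

If this reports zero devices on a machine that has an NVIDIA card, the drivers or CUDA libraries are the first thing to look at.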
One last little detail. If you do not want to use my whisper.bat, you can open each of these files in a plain-text editor such as Notepad, compare each setting (the VAD parameters, for example), and edit them manually. This is a good way to learn what each argument does. For example, I have my compute type set to float16, but float32 can yield more accurate output at the cost of much slower processing. I'm using 8 CPU threads even though my system is 24-core/32-thread, so you may want to tinker with these settings depending on your system. All that said, my personal suggestion is to make sure that whisper.bat is correctly pointing at your Faster-Whisper install directory; aside from that, it should be a simple double-click process. Good luck.
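To show where those settings end up, here is a rough sketch of how the compute type, thread count, and VAD parameters map onto the faster-whisper Python API. The helper names and the specific VAD values are placeholders for illustration, not the values from my files.

```python
def model_settings(device="cuda", prefer_accuracy=False, cpu_threads=8):
    """Build keyword arguments for WhisperModel from a few simple choices."""
    if prefer_accuracy:
        compute_type = "float32"  # slower, full precision
    elif device == "cuda":
        compute_type = "float16"  # faster on GPU, half precision
    else:
        compute_type = "int8"  # a common fast choice on CPU
    return {
        "device": device,
        "compute_type": compute_type,
        "cpu_threads": cpu_threads,
    }


# Placeholder VAD values; both keys are options of faster-whisper's VAD filter.
VAD_PARAMETERS = {"min_silence_duration_ms": 500, "speech_pad_ms": 400}


def load_and_transcribe(media_path, model_name="large-v3", **choices):
    """Load the model with the chosen settings and return (segments, info)."""
    # Imported here so model_settings can be tried without faster-whisper installed.
    from faster_whisper import WhisperModel

    model = WhisperModel(model_name, **model_settings(**choices))
    return model.transcribe(
        media_path, vad_filter=True, vad_parameters=VAD_PARAMETERS
    )
```

For example, model_settings(prefer_accuracy=True) switches to float32, and passing cpu_threads=16 raises the thread count for CPU work.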