I'm sure I saw a recent post here about the new
Qwen3-ASR model. I would have replied to that post but now I can't find it.
I had a play with the two sizes of the Qwen model and (with a quick first try) it doesn't seem that great at transcription compared to Whisper Large v2. I'm hoping it's just the settings I'm using that give me the underwhelming results and it can actually do better than my first attempt.
The zip attached has transcribed and translated subtitles for this interview (
Actress Japan: Nozomi Ishihara) on YouTube. I will give the models another go with some longer JAV-sourced audio when I get time.