Hi - how can you make the audio file (mp3) with Audacity to get better transcription results? I mean, what do you edit or tweak so that it is better recognized by whisper / pytranscriber / or any software you use to transcribe?

My simple advice is to increase the volume to a reasonable level, add a reasonable amount of bass, listen to it, then export to mp3, and then use that mp3 for pytranscriber. I used this a lot with some personal settings.
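If you would rather script the simple advice than click through Audacity, here is a rough sketch using ffmpeg instead (that is my swap, not what was described above). It assumes ffmpeg is installed with mp3 support. The function names, the file names, and the default gain values (6 dB volume, 4 dB bass) are made up for illustration, so listen to the result and adjust to taste.

```python
import subprocess


def build_boost_command(src, dst, volume_db=6.0, bass_db=4.0):
    """Build an ffmpeg command that raises volume and adds bass.

    The default gains are illustrative assumptions, not recommended settings.
    """
    # "volume" and "bass" are ffmpeg audio filters; g is the bass gain in dB.
    audio_filter = f"volume={volume_db}dB,bass=g={bass_db}"
    return ["ffmpeg", "-i", src, "-af", audio_filter, dst]


def boost(src, dst, volume_db=6.0, bass_db=4.0):
    """Run the command; ffmpeg picks the mp3 encoder from the .mp3 extension."""
    subprocess.run(build_boost_command(src, dst, volume_db, bass_db), check=True)
```

For example, boost("episode.wav", "episode_boosted.mp3") would write the boosted mp3, which you can then listen to and feed to pytranscriber.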
My difficult advice is to break the file into scenes using muxtool, and then apply the simple advice to each one. When I say scenes... I mean scenes that are usually 20-40 min long.
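As a rough alternative to muxtool, ffmpeg's segment muxer can also cut a long file into pieces. This is only a sketch under assumptions: the function name and the 30-minute default are hypothetical, and it cuts at fixed intervals rather than at real scene boundaries, so you may still need to adjust the cut points by hand.

```python
def build_split_command(src, pattern, minutes=30):
    """Build an ffmpeg command that splits audio into fixed-length chunks.

    pattern is an output template such as "scene_%03d.mp3".
    The 30-minute default is an assumption picked from the 20-40 min range.
    """
    seconds = int(minutes * 60)
    return [
        "ffmpeg", "-i", src,
        "-f", "segment",                 # use the segment muxer
        "-segment_time", str(seconds),   # target length of each chunk
        "-c", "copy",                    # copy the stream, no re-encoding
        pattern,
    ]
```

You can run the returned list with subprocess.run(cmd, check=True) and then apply the simple advice to each chunk.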
My difficult advice #2 I won't even recommend. It's not super difficult, but it's annoying.
Remember, NOTHING is 100% perfect, and I used pytranscriber mostly for timestamps.