More Accurate Speech Recognition with whisper.cpp
I have been using OpenAI's whisper for a while to convert audio files to text. For example, to generate subtitles for a file, I used:
whisper "$INPUT_FILE" -f srt --model turbo --language en
Especially on long files, its behavior would sometimes drift over time, producing either extremely long or extremely short sentences (runaway output).
Also, whisper took a long time to run.
Luckily, there is whisper-cpp. On my system with an M2 Pro chip, it can now run speech recognition on a 40-minute audio file in a few minutes instead of half an hour.
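As a rough sketch of what the equivalent subtitle run might look like with whisper.cpp: the binary name (whisper-cli), the model path, and the intermediate file name below are my assumptions rather than anything from the setup described above, and the binary name may differ between whisper.cpp versions. whisper.cpp expects 16 kHz mono WAV input, so the sketch converts the file with ffmpeg first.

```shell
#!/bin/sh
# Sketch only: transcribe one audio file to SRT with whisper.cpp.
# Assumptions: ffmpeg is installed, the whisper.cpp binary is called
# whisper-cli, and MODEL points at a ggml model you have downloaded.
FFMPEG_BIN="${FFMPEG_BIN:-ffmpeg}"
WHISPER_BIN="${WHISPER_BIN:-whisper-cli}"
MODEL="${MODEL:-models/ggml-large-v3-turbo.bin}"  # hypothetical path

transcribe() {
    input="$1"
    base="${input%.*}"   # strip the extension: talk.mp3 -> talk

    # Convert to 16 kHz mono 16-bit PCM WAV, the format whisper.cpp reads.
    # A separate name avoids overwriting an input that is already a .wav.
    "$FFMPEG_BIN" -y -i "$input" -ar 16000 -ac 1 -c:a pcm_s16le "$base.16k.wav" &&
    # -l sets the language, -osrt requests SRT output, and -of sets the
    # output path without extension, so the result is written to $base.srt.
    "$WHISPER_BIN" -m "$MODEL" -f "$base.16k.wav" -l en -osrt -of "$base"
}
```

After sourcing this, calling transcribe "$INPUT_FILE" should leave an .srt file next to the input. The intermediate .16k.wav file can be deleted afterwards.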