Closed
Description
I am running whisper.cpp inside Docker. As a POC, I translated 800 WAV files.
In a few cases (fewer than 5), the client gets stuck in an endless loop at a certain point in the audio, repeating the same sentence.
If I tell it to start one second after the loop point, it transcribes the audio as expected.
A few notes:
- The audio is in English
- It happens across all models
- With the Python version, translation works fine.
- It is consistent: it always happens at the same point in the audio
This is the output that I get (I changed some of the text for privacy reasons):
.....
[00:00:30.000 --> 00:00:32.000] Some valid text bla bla
[00:00:32.000 --> 00:00:35.000] Some valid text bla bla
[00:00:35.000 --> 00:00:53.000] Some valid text bla bla
[00:00:53.000 --> 00:01:09.000] Some valid text bla bla
[00:01:09.000 --> 00:01:30.000] We were working at high school.
[00:01:30.000 --> 00:01:45.000] We were working at high school.
[00:01:45.000 --> 00:02:06.000] We were working at high school.
[00:02:06.000 --> 00:02:21.000] We were working at high school.
[00:02:21.000 --> 00:02:40.000] We were working at high school.
...
It goes on like that until the end.
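
For anyone who wants to find the affected files in a large batch, here is a minimal sketch of how the repetition could be detected from the saved output. It assumes the output has the `[start --> end] text` format shown above; the helper names (`parse_segments`, `find_loop_start`) and the `min_repeats` threshold are hypothetical and not part of whisper.cpp.

```python
import re

# Matches lines such as:
# [00:01:09.000 --> 00:01:30.000] We were working at high school.
SEGMENT_RE = re.compile(
    r"\[(\d+):(\d{2}):(\d{2})\.(\d{3}) --> (\d+):(\d{2}):(\d{2})\.(\d{3})\]\s*(.*)"
)


def to_ms(hours, minutes, seconds, millis):
    """Convert timestamp fields (as strings) to milliseconds."""
    return ((int(hours) * 60 + int(minutes)) * 60 + int(seconds)) * 1000 + int(millis)


def parse_segments(output):
    """Return a list of (start_ms, end_ms, text) for every timestamped line."""
    segments = []
    for line in output.splitlines():
        match = SEGMENT_RE.match(line.strip())
        if match:
            groups = match.groups()
            segments.append((to_ms(*groups[0:4]), to_ms(*groups[4:8]), groups[8].strip()))
    return segments


def find_loop_start(segments, min_repeats=3):
    """Return the start time (ms) of the first run of at least `min_repeats`
    consecutive segments with identical text, or None if there is no such run.
    The threshold is an assumption; legitimate speech can also repeat."""
    run_start = 0
    for i in range(1, len(segments) + 1):
        # A run ends at the end of the list or when the text changes.
        if i == len(segments) or segments[i][2] != segments[run_start][2]:
            if i - run_start >= min_repeats:
                return segments[run_start][0]
            run_start = i
    return None
```

Calling `find_loop_start(parse_segments(text))` on the saved output of one file gives the time in milliseconds where the repetition begins, which can then be used to pick the point to restart from. Note that anonymized placeholder lines with identical text, like the ones in the log above, would also count as a repetition.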