
Transcription of 1 minute file takes 20 - 40 seconds #113

Open
martin-opensky opened this issue Oct 12, 2024 · 2 comments

@martin-opensky

Hello,

I am using the faster-whisper-server on a Mac M1 with the following start command:

docker run --publish 8000:8000 --volume ~/.cache/huggingface:/root/.cache/huggingface fedirz/faster-whisper-server:latest-cpu

And I'm seeing no performance improvement over the original OpenAI model with the following command:
curl http://localhost:8000/v1/audio/transcriptions -F "file=@test.wav" -F "stream=false" -F "language=en" -F "model=Systran/faster-whisper-small"

This is a 1-minute file, which takes between 20 and 40 seconds to transcribe depending on the model size.

Transcribing the same audio file using Systran's faster-whisper directly takes around 1-3 seconds.
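
For context, this is roughly what I mean by using faster-whisper directly (a minimal sketch of the Python API; the compute type is an assumption, adjust to your setup):

    from faster_whisper import WhisperModel

    # Load the same small model on CPU; int8 is an assumed quantization setting.
    model = WhisperModel("Systran/faster-whisper-small", device="cpu", compute_type="int8")

    # transcribe() returns a lazy generator, so join the segments to force decoding.
    segments, info = model.transcribe("test.wav", language="en")
    print(" ".join(segment.text for segment in segments))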

I'm really unsure why this would be the case. Can anyone shed some light on what may be causing this?

Many thanks
Martin

fedirz (Owner) commented Oct 12, 2024

Try running the following command instead:

docker run --publish 8000:8000 --env WHISPER__INFERENCE_DEVICE=auto --volume ~/.cache/huggingface:/root/.cache/huggingface fedirz/faster-whisper-server:latest-cpu

Please let me know if that improves the inference speed or not.
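
If it helps, here's a small sketch for timing the request end to end so we're comparing like with like (assumes the Python requests package; the endpoint and form fields are taken from your curl command above):

    import time
    import requests

    start = time.perf_counter()
    with open("test.wav", "rb") as f:
        response = requests.post(
            "http://localhost:8000/v1/audio/transcriptions",
            files={"file": f},
            data={"language": "en", "model": "Systran/faster-whisper-small"},
        )
    print(response.json())
    print(f"elapsed: {time.perf_counter() - start:.1f}s")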


yimejky commented Nov 26, 2024

I believe this is happening because the CPU version in Docker cannot utilize more than one thread. I even tried overriding the config via environment variables, and it's still the same :(

docker run \
    -p 8000:8000 \
    --cpus=10 \
    -e WHISPER__INFERENCE_DEVICE=auto \
    -e WHISPER__CPU_THREADS=10 \
    -e OMP_NUM_THREADS=10 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --name faster-whisper-server fedirz/faster-whisper-server:latest-cpu
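
For reference, the knob these variables are supposed to reach is the cpu_threads argument of WhisperModel when calling the library directly (a short sketch; the model name is taken from the thread above):

    from faster_whisper import WhisperModel

    # Direct-library equivalent of WHISPER__CPU_THREADS=10:
    # cpu_threads sets the number of CPU threads used for inference.
    model = WhisperModel(
        "Systran/faster-whisper-small",
        device="cpu",
        cpu_threads=10,
    )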
