Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

OOM when using VAD #1193

Closed
Troffifi opened this issue Dec 6, 2024 · 2 comments · Fixed by #1198
Closed

OOM when using VAD #1193

Troffifi opened this issue Dec 6, 2024 · 2 comments · Fixed by #1198

Comments

@Troffifi
Copy link

Troffifi commented Dec 6, 2024

Hi, does somebody else experience issues with memory consumption when transcribing audio files containing a lot of speech (~ 4 hours long)? I am running the latest version of faster-whisper in a Kubernetes pod on a g4dn AWS instance. The server has 4 cores, 1 GPU, and 16GB RAM, but the pod is limited to 2 cores. The base image is pytorch/pytorch:2.5.1-cuda12.4-cudnn9-runtime and as per this pinned issue the installed versions should be compatible:

  • python 3.11
  • torch 2.5.1+cu124
  • ctranslate2 4.5.0
  • cuda 12.4
  • cudnn 9.1.0.7

The process gets killed during the transcription phase when VAD is enabled. I tried the solution described here, but it doesn't help. See the logs attached. Anyone has any idea what could be the cause of the OOM?

libraries.txt
logs on sigkill.txt

Purfview added a commit to Purfview/faster-whisper that referenced this issue Dec 10, 2024
Reported problems:
SYSTRAN#1193
SYSTRAN#1169

VAD implementations consumes humongous memory amounts [original Silero doesn't have this problem]

This PR should fix the OOM problem.
Alt solution could be removing 'lru_cache'.
@Purfview
Copy link
Contributor

Try this fix -> #1198

@Troffifi
Copy link
Author

I still have to run some tests, but I tried it and it looks promising. The latest commit has fixed it. Thank you @Purfview! I will let you know if the issue persists.

MahmoudAshraf97 pushed a commit to Purfview/faster-whisper that referenced this issue Dec 12, 2024
Reported problems:
SYSTRAN#1193
SYSTRAN#1169

VAD implementations consumes humongous memory amounts [original Silero doesn't have this problem]

This PR should fix the OOM problem.
Alt solution could be removing 'lru_cache'.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants