Add vram flushing support #256

aidancrowther · 2024-12-09T19:16:30Z

This change implements a monitoring thread to unload the whisper model after a user set timeout period. Timeout defaults to being disabled, and can be set with the IDLE_TIMEOUT environment variable.

Once unloaded some residual VRAM allocation appears to remain (~0.25GB), but memory usage remains consistent across reloads, leading me to believe that this is a limitation of Docker.

This closes #216 and closes #196

Implement automatic VRAM clearing after a specified period of idleness. * Add a mechanism to track the last activity time and implement a background thread to monitor idleness and clear VRAM after five minutes of inactivity in `app/faster_whisper/core.py` and `app/openai_whisper/core.py`. * Update the `transcribe` and `language_detection` functions in both core files to reset the last activity time upon invocation. * Add a function to fully release the model from memory using `del`, `torch.cuda.empty_cache()`, and `gc.collect()` in both core files. * Add configuration options for the idleness timeout period and enabled/disabled state in the environment variables in `app/webservice.py`.

…out by default

docs/environmental-variables.md

Co-authored-by: Forest Anderson <forestkzanderson@gmail.com>

AngelOnFira and others added 2 commits December 9, 2024 11:25

Re-enable model after unloading

e104eca

AngelOnFira mentioned this pull request Dec 9, 2024

Add VRAM flush when idle #255

Closed

aidancrowther added 2 commits December 9, 2024 14:39

Add environment variable definition to documentation and disable time…

076f99e

…out by default

Update Changelog

3121cd6

AngelOnFira reviewed Dec 9, 2024

View reviewed changes

docs/environmental-variables.md Outdated Show resolved Hide resolved

aidancrowther and others added 3 commits December 9, 2024 14:51

Update docs/environmental-variables.md

f01ea25

Co-authored-by: Forest Anderson <forestkzanderson@gmail.com>

Merge branch 'main' into add-vram-flush

252e5a7

Merge branch 'main' into add-vram-flush

9e8f8e2

ahmetoner approved these changes Dec 16, 2024

View reviewed changes

ahmetoner merged commit 7d3e887 into ahmetoner:main Dec 16, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add vram flushing support #256

Add vram flushing support #256

aidancrowther commented Dec 9, 2024 •

edited

Loading

Add vram flushing support #256

Add vram flushing support #256

Conversation

aidancrowther commented Dec 9, 2024 • edited Loading

aidancrowther commented Dec 9, 2024 •

edited

Loading