Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve support for distil-large-v3 #1982

Merged
merged 1 commit into from
Mar 21, 2024

Conversation

sanchit-gandhi
Copy link
Contributor

While the original Distil-Whisper models were trained on timestamps, they degraded significantly after 15-seconds of audio (due to the shorter distribution of audio we trained on). The implementation in Whisper cpp thus deactivated timestamps entirely for these checkpoints.

The latest Distil-Whisper release, distil-large-v3, addresses this problem, giving a model that accurately predicts timestamps in full 30-second windows, much like the original Whisper models.

To improve support for this new checkpoint, we allow timestamp prediction for the new checkpoint in this PR.

@ggerganov ggerganov merged commit fff24a0 into ggerganov:master Mar 21, 2024
46 checks passed
@sanchit-gandhi sanchit-gandhi deleted the distil-large-v3 branch March 21, 2024 17:34
jiahansu pushed a commit to WiseSync/whisper.cpp that referenced this pull request Apr 17, 2024
viktor-silakov pushed a commit to viktor-silakov/whisper_node_mic.cpp that referenced this pull request May 11, 2024
iThalay pushed a commit to iThalay/whisper.cpp that referenced this pull request Sep 23, 2024
iThalay pushed a commit to iThalay/whisper.cpp that referenced this pull request Sep 23, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants