Performance metrics for evaluating time segmentation of Whisper models #182
Unanswered
mafaisalpg
asked this question in
Q&A
Replies: 1 comment 1 reply
-
Yes, we studied how to evaluate timestamps quality. Unfortunately there is no open source code to compute those metrics. |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hello everyone,
I like to compare different Whisper models on my own datasets changing some parameters like detect_disfluencies, etc. My output looks like
start, end, text
instead of word level timestamp.
Questions
What are metrics (MSE, MAE, etc.) best fit for this purpose?
Is there any Python implementation which I can reuse?
Is there any paper or article have done such comparison?
Thanks
Beta Was this translation helpful? Give feedback.
All reactions