CoreML conversion and model execution takes 11 hours #1307
Comments
Unfortunately, there's not much we can do about it. Please refer to #1278 for more details.
You might want to give the latest
Okay, that is cool. How can I compile with Metal support? Is it just using
Yes, you can just use Lines 38 to 42 in 7e1592d
Okay great, then I already have it.
Absolutely! Feel free to create a pull request once you're done, and I'd be happy to add it to the readme list.
After compiling this library with WHISPER_COREML=1 set, I tried converting the large model using the included script. However, it takes 11 hours to compile, and then another 11 hours on the first run. I have encountered this issue before with smaller Core ML models, and the solution there was to upgrade coremltools to 7.0b2, which I did here too. That hasn't worked, however: on a fresh install it still takes 11 hours for the compile and for the initial run.
The reason I suspect this is an issue with this repo and not with coremltools itself is that coreml-encoder-large.mlpackage takes only 2 minutes to compile, while ggml-encoder-large.mlmodelc takes the 11 hours and ggml-encoder-tiny.mlmodelc takes under a minute.
So if it were the same coremltools issue I encountered before, all three compilations would have taken an extremely long time, regardless of model size. In this case, only ggml-encoder-large.mlmodelc seems to have an issue.
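To compare the steps on equal footing, each one can be timed separately. The following is a minimal Python sketch, not part of this repo: `time_command` is a hypothetical helper, and the `xcrun coremlc compile` invocation shown in the comment is an assumption about what the conversion script runs, with placeholder paths.

```python
import subprocess
import sys
import time


def time_command(cmd):
    """Run cmd (a list of arguments) and return (exit_code, elapsed_seconds)."""
    start = time.perf_counter()
    result = subprocess.run(cmd, check=False)
    return result.returncode, time.perf_counter() - start


# Demonstration with a trivial command so the sketch runs anywhere.
# To time a real compile step, pass something like the following instead
# (assumed command and placeholder paths, adjust to your setup):
#   ["xcrun", "coremlc", "compile",
#    "models/coreml-encoder-large.mlpackage", "models/"]
code, elapsed = time_command([sys.executable, "-c", "pass"])
print(f"exit code {code}, elapsed {elapsed:.2f} s")
```

Running the same helper over the .mlpackage conversion, the .mlmodelc compile, and the first inference run would show which step accounts for the 11 hours.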
Specifications:
OS: macOS Ventura
CPU: ARM64 M1
RAM: 16GB
Version: whisper.cpp latest commit
Python: ARM 3.9.7 and Universal 3.9.6
Coremltools: 7.0b2 and 6.3.0