-
Notifications
You must be signed in to change notification settings - Fork 157
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ray.exceptions.ActorDiedError: The actor died unexpectedly before finishing this task. #8
Comments
Try Ray version |
2.38 |
Upgrade! |
Thanks! I tried both ray 2.41 and 2.42, but I'm still getting the same error. |
What version of vLLM are you using? |
0.6.3 |
You have the same exact version I'm using atm... Apologies, not sure how to resolve your bug, but if you do, lmk and can put bug fix in the PR? |
I've found out that it's related to H20. One can fix it by updating nvidia-cublas-cu12 following the method in this issue vllm-project/vllm#4392.
|
I tried to run the
run_deepscaler_1.5b_8k.sh
script following the steps in the README. However, I always encounter this error.I thought it was due to CUDA OOM, so I tried reducing the length from 8192 to 1024, but the error still persists.
I'm wondering if you've ever encountered this issue and how you resolved it. I am runnig this code on 8xH20(96G).
Here is the logs:
The text was updated successfully, but these errors were encountered: