Tensor Parallelism v2 #3335
This reverts commit f154ea6.
…omposer into mvpatel2000/tp-v3
Could you add the e2e run that Daniel ran, which failed in your v1 PR?
Added to description!
Should we gate the old tests to world size 2? I don't see a reason to test both world size 2 and 4.
…omposer into mvpatel2000/tp-v3
What does this PR do?
Restores #3269, which was originally reverted. Fixes the following issues:
0-16gpu-save-l4Gcli
Follow-on Tasks: