Adopt MatX v0.4.0 #921
Conversation
The build error is caused by a compute capability issue: https://stackoverflow.com/questions/74201452/cuda-11-8-fails-to-compile-atomiccas-for-16-bit-unsigned-integers-is-cudas-doc. Looking for a workaround.
@cliffburdick has fixed the compilation error, but the PR does not add support for half-precision reductions on the Pascal architecture.
Please let us know if there's a use case for this on Pascal. Specifically, I believe this would only be argmax/argmin with fp16/bf16.
Thanks for the quick fix!
Closing this PR, as the goal was to move from tracking a commit hash to an official release tag.
Those are just max/min and will use cub, so I don't think you're using anything that would be affected. |
Description
fixes #909
Checklist