First version of developer overview #537
Great Work!
I took a cursory look through the document.
This is a really great and important initiative! This is great work and something that we can continue to build on and add to in the future. Thanks for putting this together, @senior-zero!
I only have a few minor suggestions.
In general, I was wondering if we want to agree on and communicate some fundamental contractual details, e.g., that Device-scope algorithms do not synchronise the host with the device (cudaDeviceSynchronize or cudaStreamSynchronize).
Thanks for sharing this document, @senior-zero! I have a small handful of comments. I learned a lot from reading this!
0d8336c to c4be103
Co-authored-by: Jake Hemstad <jhemstad@nvidia.com>
Co-authored-by: Bradley Dice <bdice@bradleydice.com>
Co-authored-by: Elias Stehle <3958403+elstehle@users.noreply.github.com>
c4be103 to d99e2fb
This PR briefly explains the current CUB design. The document is intended to help contributors. The coming PTX dispatch changes will require changes to this document. Having a diff of the design overview will ease the transition period (there are still a few PRs in flight that apply the existing dispatch logic).