Add multi-nodes example & update doc #455
Draft
+79
−3
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Documentation update:
docs/design/mscclpp-dsl.md
: Updated the link to the examples folder to reflect the correct path.New example script:
python/examples/allgather_allpairs_multinodes_packets.py
: Added a new example script demonstrating the allgather all-pairs algorithm across multiple nodes using packet communication.IR module improvements:
python/mscclpp/language/ir.py
: Refined the sorting criteria for GPU instance channels and thread block channels to include the channel type, ensuring a more accurate order.Debugging enhancements:
src/executor/executor.cc
: Added a debug log to indicate the start of communication collective execution with details about the execution plan and collective.src/include/debug.h
: Introduced a new debug log subsystem identifierMSCCLPP_EXECUTOR
for logging executor-related information.