When should mcore_gpt: True be used? #9728
Unanswered
dementrock
asked this question in
Q&A
Replies: 1 comment
-
Megatron core should be used when possible. It is the code path that we test the most and code that is in both NeMo and Megaton core will eventually be deprecated and removed in favor of importing the code from mcore. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
This flag is used in some of the launch files like https://github.com/NVIDIA/NeMo/blob/main/examples/nlp/language_modeling/conf/megatron_gpt_config.yaml. I know this flag causes different implementation of the model to be used, but can their differences and recommendations for when to use one vs another be explained in more detail in the doc?
Beta Was this translation helpful? Give feedback.
All reactions