
Add MicroLlama training support #1457

Merged: 7 commits into Lightning-AI:main on Jun 4, 2024

Conversation

keeeeenw (Contributor) commented on Jun 1, 2024

Hello,

I was previously contacted by your team regarding the addition of MicroLlama support to the latest litgpt codebase. Please review and accept this pull request if you agree with my approach. Otherwise, I welcome any questions and comments.

Additionally, I have not yet tested this change with Lightning AI Studios. I will work on any necessary code adjustments to support the studio and update the documentation as needed.

I made the following changes:

  1. Added microllama.yaml for the training configuration
  2. Modified config.py to add the MicroLlama model config (see the sketch after this list)
  3. Modified pretrain.py to import and handle the MicroLlama data
  4. Added microllama.py for data loading during training
  5. Fixed a bug in prepare_slimpajama.py by adding the is_generator attribute; data processing with the latest litdata crashes if this attribute is missing
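
For reference, model definitions in litgpt's config.py are plain dicts appended to the module-level configs list. Below is a rough sketch of what the MicroLlama entry looks like; the hyperparameter values are illustrative for a ~300M Llama-style model and the hf_config org/name is an assumption, so please check the merged config for the exact values.

```python
# Sketch only, not the literal merged diff: shape of a model entry in litgpt/config.py.
# Hyperparameters below are illustrative values for a ~300M Llama-style model.
micro_llama_300m = dict(
    name="micro-llama-300M",                            # model_name used by `litgpt pretrain`
    hf_config=dict(org="keeeeenw", name="MicroLlama"),  # assumed Hugging Face repo (see the tokenizer fix below)
    block_size=2048,
    vocab_size=32000,
    n_layer=12,
    n_head=16,
    n_embd=1024,
    rotary_percentage=1.0,
    parallel_residual=False,
    bias=False,
    norm_class_name="RMSNorm",
    mlp_class_name="LLaMAMLP",
    intermediate_size=5632,
)
# In config.py this dict is appended to `configs`, which in turn populates `name_to_config`.
```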

Data Processing:
Same as TinyLlama, but you don't need the Starcoder data.

Training:
litgpt pretrain --config config_hub/pretrain/microllama.yaml
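
As a quick sanity check that the prepared shards load with the current litdata, something like the following can be used; the input_dir path here is just an example and should point at wherever the prepare script wrote its output.

```python
# Hypothetical sanity check for the optimized SlimPajama output; the path is an
# example and should point at the directory produced by prepare_slimpajama.py.
from litdata.streaming import StreamingDataset, TokensLoader

dataset = StreamingDataset(
    input_dir="data/slimpajama/train",
    item_loader=TokensLoader(block_size=2048 + 1),  # block_size + 1 for the input/target shift
    shuffle=True,
)
print(f"{len(dataset)} samples, first sample shape: {dataset[0].shape}")
```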

keeeeenw (Contributor, Author) commented on Jun 2, 2024

It looks like the unit test failed because I did not link hf_config to my Hugging Face repo correctly, so the tokenizer cannot be fetched:

 hf_config=dict(org="MicroLlama", name="MicroLlama-300M{}"),

Let me make an update.
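
One quick way to check the corrected mapping locally (assuming the model lives at keeeeenw/MicroLlama on the Hub; this check is my assumption and not part of the PR):

```python
# Assumed check, not part of the PR: confirm the org/name pair in hf_config
# resolves to a real Hugging Face repo the tests can fetch the tokenizer from.
from huggingface_hub import hf_hub_download

path = hf_hub_download(repo_id="keeeeenw/MicroLlama", filename="config.json")
print(path)  # downloading any known file proves the repo id resolves
```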

rasbt (Contributor) commented on Jun 2, 2024

Awesome, this is great! Really appreciate it! And that's a really clean PR. I will test this either tonight or tomorrow before merging.

rasbt (Contributor) left a comment


This all looks good to me. One minor suggestion: instead of duplicating most of the code in the MicroLlama data module, what do you think about making a LlamaDataModule base class with a Starcoder toggle, from which the TinyLlama and MicroLlama subclasses can then be derived? Basically just a minor refactoring to avoid code duplication; a rough sketch is below. Let me know in case you want my help with this.
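
Roughly what I have in mind (just a simplified sketch; the real data modules also handle tokenizer setup and the litdata streaming dataloaders):

```python
from dataclasses import dataclass
from pathlib import Path


@dataclass
class LlamaDataModule:
    """Shared SlimPajama (+ optional Starcoder) pretraining data logic."""

    data_path: Path = Path("data/")
    seed: int = 42
    num_workers: int = 8
    use_starcoder: bool = True  # the "Starcoder toggle"

    def prepare_data(self) -> None:
        # Only verify that the prepared datasets exist; the loading logic stays as-is.
        required = ["slimpajama/train", "slimpajama/val"]
        if self.use_starcoder:
            required.append("starcoder")
        for subdir in required:
            if not (self.data_path / subdir).is_dir():
                raise FileNotFoundError(f"Expected prepared data at {self.data_path / subdir}")


@dataclass
class TinyLlama(LlamaDataModule):
    use_starcoder: bool = True   # SlimPajama + Starcoder


@dataclass
class MicroLlama(LlamaDataModule):
    use_starcoder: bool = False  # SlimPajama only
```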

rasbt requested a review from williamFalcon as a code owner on June 2, 2024 22:21
rasbt removed the request for review from williamFalcon on June 2, 2024 22:22
keeeeenw (Contributor, Author) commented on Jun 2, 2024

Thanks for the review and for updating the doc! Let me refactor the code today! I can run some basic checks on my side, but I will need your help to regression-test TinyLlama.

(Resolved review comments on litgpt/config.py and litgpt/pretrain.py)
Andrei-Aksionov (Contributor) commented

Thanks @keeeeenw for the PR 👍

rasbt (Contributor) commented on Jun 3, 2024

> Let me refactor the code today! I can run some basic checks on my side, but I will need your help to regression-test TinyLlama.

Awesome! Thanks for the quick update! I will test this today.

morphpiece commented

Point no. 5 of the OP brought me here. Both prepare_slimpajama() and prepare_starcoder() crash because the is_generator attribute is missing from their DataRecipe classes.

keeeeenw (Contributor, Author) commented on Jun 3, 2024

> Point no. 5 of the OP brought me here. Both prepare_slimpajama() and prepare_starcoder() crash because the is_generator attribute is missing from their DataRecipe classes.

Yeah, I suspected prepare_starcoder could have the same issue, but I don't have the Starcoder dataset locally to test any fixes. Let me add the same change to prepare_starcoder; the sketch below shows the shape of the fix. Please let me know if it works for you.
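
Roughly, the fix just adds the flag that newer litdata versions expect on every recipe class. This is a sketch only, not the literal diff; the import path and the True/False choice depend on how prepare_item is written, and the same flag goes on the Starcoder recipe in prepare_starcoder.py.

```python
# Sketch of the fix: newer litdata expects every DataRecipe subclass to expose
# an `is_generator` attribute, and the prepare_* scripts crash with an
# AttributeError when it is missing.
from litdata.processing.data_processor import DataChunkRecipe


class SlimPajamaDataRecipe(DataChunkRecipe):
    # prepare_item below is written as a generator (it yields items), hence True;
    # a recipe whose prepare_item returns a single item would use False instead.
    is_generator = True

    def prepare_structure(self, input_dir):
        ...  # unchanged: list the .jsonl.zst shards under input_dir

    def prepare_item(self, filepath):
        # unchanged logic: read each JSONL line, tokenize it, and yield the token ids
        yield from ()
```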

(Resolved review comments on litgpt/data/llama_data.py and litgpt/pretrain.py)
awaelchli (Contributor) commented

Thanks for contributing the model and training config!

rasbt (Contributor) commented on Jun 4, 2024

LGTM, big thanks for this PR!

keeeeenw (Contributor, Author) commented on Jun 4, 2024

Great! Thank you all for the review and thoughtful comments. According to https://lightning.ai/blog/contribute-to-lightning/, you will merge my change, right? Please let me know if there is anything else you need from my side!

rasbt (Contributor) commented on Jun 4, 2024

Yes, this is great and ready to merge! In fact, I will merge it now! Thanks again, we really appreciate the effort!

rasbt merged commit fa88952 into Lightning-AI:main on Jun 4, 2024
9 checks passed