-
Notifications
You must be signed in to change notification settings - Fork 637
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Chatllama] error when load dataset when use deepspeed #229
Comments
I got the bug too. Has any one debug. |
@bino282 thank you for reaching out. We know that currently we have some issue with DeepSpeed we already working to fix it. Could you please share with us your current setup? |
@PierpaoloSorbellini The setup as following: REQUIREMENTS = [ this_directory = Path(file).parent setup( |
I was able to fix the problem "Training data must be a torch Dataset". The parameter training_data of deepspeed.initialize must be altered to |
hi, when I use deepspeed , I encountered this error:
[2023-03-09 10:46:33,647] [INFO] [logging.py:77:log_dist] [Rank 0] DeepSpeed Flops Profiler Enabled: False
Traceback (most recent call last):
File "/datahdd/nhanv/Projects/NLP/chatllama/artifacts/main.py", line 50, in
actor_trainer = ActorTrainer(config.actor)
File "/home/ntq/miniconda3/envs/textgen/lib/python3.10/site-packages/chatllama/rlhf/actor.py", line 324, in init
) = deepspeed.initialize(
File "/home/ntq/miniconda3/envs/textgen/lib/python3.10/site-packages/deepspeed/init.py", line 125, in initialize
engine = DeepSpeedEngine(args=args,
File "/home/ntq/miniconda3/envs/textgen/lib/python3.10/site-packages/deepspeed/runtime/engine.py", line 319, in init
self.training_dataloader = self.deepspeed_io(training_data)
File "/home/ntq/miniconda3/envs/textgen/lib/python3.10/site-packages/deepspeed/runtime/engine.py", line 1674, in deepspeed_io
raise ValueError("Training data must be a torch Dataset")
ValueError: Training data must be a torch Dataset
How can I fix it?
The text was updated successfully, but these errors were encountered: