[Chatllama] Should I add a <end_of_text> at the end of sentence? #260

bino282 · 2023-03-13T04:08:17Z

When I train an actor model with bloom-560M, I notice that the model generates repeated text at the end. It always generates the full number of tokens set by max_length and never stops early. Should I add an <end_of_text> token at the end of each sentence when training?

PierpaoloSorbellini · 2023-03-14T09:19:13Z

@bino282 Yes, with HF models the text is repeated. We are aware of the problem and will release a patch for it very soon. We will get back to you as soon as possible.

allaccs · 2023-03-27T01:16:37Z

I am having the same problem with repeated text. What is the current workaround?

PierpaoloSorbellini · 2023-04-03T14:41:16Z

Hi @bino282 @allaccs
Yes, we have the same issue with some HF models.
Currently we have tried adding an EOS token to each sequence so that the model learns where to put this token,
and we added some parameters to the generate function of the actor from HF that should help remove the repetition.
The latest version is in PR #306. Please refer to that and contact me again if you notice that the problem persists.
Thanks for the feedback!
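
For readers looking for a workaround in the meantime, here is a minimal sketch of the two mitigations described above: appending EOS to each training sequence and passing anti-repetition parameters to `generate`. It is not the code from PR #306. The helper names `append_eos` and `build_generation_kwargs`, the stand-in tokenizer, and the specific values (`repetition_penalty=1.2`, `no_repeat_ngram_size=3`) are assumptions chosen for illustration. Only the keyword names are standard Hugging Face `generate` arguments.

```python
from types import SimpleNamespace


def append_eos(texts, eos_token):
    """Append the EOS token to every training text that does not already end with it."""
    return [t if t.endswith(eos_token) else t + eos_token for t in texts]


def build_generation_kwargs(tokenizer, max_new_tokens=128):
    """Build keyword arguments for a Hugging Face `model.generate` call.

    The numeric values are illustrative assumptions and should be tuned.
    """
    # Some tokenizers define no pad token; fall back to EOS in that case.
    pad_id = tokenizer.pad_token_id
    if pad_id is None:
        pad_id = tokenizer.eos_token_id
    return {
        "max_new_tokens": max_new_tokens,
        "eos_token_id": tokenizer.eos_token_id,  # lets generation stop early
        "pad_token_id": pad_id,
        "repetition_penalty": 1.2,   # values above 1.0 discourage repeated tokens
        "no_repeat_ngram_size": 3,   # forbid repeating any 3-gram
    }


# Hypothetical stand-in so the sketch runs without downloading a model.
# With transformers installed, a real tokenizer would come from
# AutoTokenizer.from_pretrained(...) instead.
demo_tokenizer = SimpleNamespace(eos_token="</s>", eos_token_id=2, pad_token_id=None)

train_texts = append_eos(["Hello world", "Already done</s>"], demo_tokenizer.eos_token)
gen_kwargs = build_generation_kwargs(demo_tokenizer)
```

With a real model and tokenizer, the call would be along the lines of `model.generate(**inputs, **gen_kwargs)`. The EOS token only stops generation early if the model has actually seen it at the end of its training sequences, which is why both steps are used together.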

PierpaoloSorbellini changed the title ~~Should I add a <end_of_text> at the end of sentence?~~ [Chatllama] Should I add a <end_of_text> at the end of sentence? Mar 14, 2023
