FIRE sampling added. #58

laonahongchen · 2024-12-19T19:41:48Z

Add FIRE sampling from paper Flaming-hot Initiation with Regular Execution Sampling for Large Language Models (https://arxiv.org/abs/2410.21236) to the PPO trainer loop as an additional option to turn on.

eric-haibin-lin

Sorry for the late review and thanks for the contribution! Would you mind introducing a new class inheriting vllmRollout, add the custom sampling logics? This way it'd be easier to maintain the codebase

laonahongchen · 2025-02-20T03:39:52Z

Hi, Sorry for the delay as I find it extremely challenging to update the code with the latest version of the verl as many things have changed and I am not able to access the original test environment again.

I have updated the code logic following the suggestion from @eric-haibin-lin above, and also merge the latest commits. Please take another round of review and let me know if there is any further suggestions. Thanks!

verl/workers/fsdp_workers.py

eric-haibin-lin

thank u. could you fix the errors reported from CI?

CLAassistant · 2025-02-26T00:33:05Z

All committers have signed the CLA.

laonahongchen · 2025-02-26T00:42:46Z

Hi,

Thanks for the info. Again, I am not so familiar with the CI system, so I am wondering is the current status ready to go after my last commit? Or is there anything I still need work on as the only failing check I see seems to be caused by random shutdown from the server.

laonahongchen added 2 commits December 19, 2024 11:38

Update ppo_trainer.yaml

07bc2dc

Update vllm_rollout.py

24b8941

willem-bd requested a review from PeterSH6 December 24, 2024 11:10

eric-haibin-lin reviewed Jan 13, 2025

View reviewed changes

laonahongchen added 2 commits February 19, 2025 18:42

Merge remote-tracking branch 'volcengine/main'

1727abf

Merge FIRE sampling with latest version of verl.

f1f676b

eric-haibin-lin reviewed Feb 20, 2025

View reviewed changes

verl/workers/fsdp_workers.py Show resolved Hide resolved

Simplify changes to merge into main verl.

7d1a729

eric-haibin-lin reviewed Feb 21, 2025

View reviewed changes

Change format to try to meet style requirement.

d622462

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FIRE sampling added. #58

FIRE sampling added. #58

laonahongchen commented Dec 19, 2024

eric-haibin-lin left a comment •

edited

Loading

laonahongchen commented Feb 20, 2025

eric-haibin-lin left a comment

CLAassistant commented Feb 26, 2025 •

edited

Loading

laonahongchen commented Feb 26, 2025

FIRE sampling added. #58

Are you sure you want to change the base?

FIRE sampling added. #58

Conversation

laonahongchen commented Dec 19, 2024

eric-haibin-lin left a comment • edited Loading

Choose a reason for hiding this comment

laonahongchen commented Feb 20, 2025

eric-haibin-lin left a comment

Choose a reason for hiding this comment

CLAassistant commented Feb 26, 2025 • edited Loading

laonahongchen commented Feb 26, 2025

eric-haibin-lin left a comment •

edited

Loading

CLAassistant commented Feb 26, 2025 •

edited

Loading