
[Feature request] Request to Update Forked Megatron-LM Repository with Flash-Attention Improvement #1766

Closed
leocnj opened this issue Jul 24, 2023 · 3 comments

Comments

leocnj commented Jul 24, 2023

Dear Accelerate Developers,

I would like to express my gratitude for your continued work on and contributions to this indispensable tool.

As it currently stands, we are using a forked version of Megatron-LM (https://github.com/huggingface/Megatron-LM), which lags significantly behind the upstream repository (NVIDIA:main) by 524 commits. Among the missing updates, one commit stands out for its potential to significantly speed up Transformer training: the Flash-Attention integration from Tri Dao.

On January 11, 2023, Tri Dao's pull request (https://github.com/NVIDIA/Megatron-LM/pull/267), which integrated Flash-Attention into Megatron-LM, was merged. Tri Dao has since released the second version of Flash-Attention.

Given the efficiency gains that Flash-Attention brings to Transformer training, I believe its integration would be highly beneficial for the many Accelerate users who rely on Megatron-LM. I therefore kindly request that you consider updating the forked Megatron-LM to a more recent version that incorporates the changes from PR 267.
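
For illustration, once the fork includes PR 267, enabling Flash-Attention from Accelerate could look roughly like the sketch below. This is only a guess at the eventual usage, not a confirmed API: it assumes the forked Megatron-LM exposes the upstream `--use-flash-attn` flag and that `MegatronLMPlugin`'s `other_megatron_args` passthrough can forward it; the parallelism values are placeholders.

```python
# Minimal sketch, NOT the confirmed API of the updated fork. Assumptions:
# (1) the forked Megatron-LM exposes the `--use-flash-attn` flag introduced
#     by NVIDIA/Megatron-LM PR 267, and
# (2) Accelerate's MegatronLMPlugin forwards extra flags through its
#     `other_megatron_args` dict.
from accelerate import Accelerator
from accelerate.utils import MegatronLMPlugin

megatron_lm_plugin = MegatronLMPlugin(
    tp_degree=2,            # illustrative parallelism settings
    pp_degree=2,
    num_micro_batches=4,
    other_megatron_args={"use_flash_attn": True},  # hypothetical passthrough of the PR 267 flag
)

# Assumes the Megatron-LM integration is enabled (e.g. via `accelerate config`).
accelerator = Accelerator(megatron_lm_plugin=megatron_lm_plugin)
```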

Looking forward to your response and potential plan of action on this matter.

Best regards,

sgugger (Collaborator) commented Jul 24, 2023

cc @pacman100

leocnj (Author) commented Jul 27, 2023

@pacman100, after some tweaks, I have just created a PR implementing the requested functionality. Could you please take a look?

@github-actions

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

github-actions bot closed this as completed on Sep 2, 2023