enforce recompute flag on fsdpa quantization #133

dudilester · 2024-03-28T13:23:04Z

Currently fp8fsdpa quantization supported only when flash_attention_recompute is True

astachowiczhabana · 2024-06-12T10:53:53Z

dudilester · 2024-06-13T11:40:30Z

…ers (#133) * Update transformer_engine._convert_model to skip LoRA layers * Remove print statement * Add check for peft module availability

…ers (#133) (#163) * Update transformer_engine._convert_model to skip LoRA layers * Remove print statement * Add check for peft module availability

enforce recompute flag on fsdpa quantization

b865c6c

dudilester requested review from MrGeva, ulivne and bgoldberg-habana March 28, 2024 13:23

dudilester requested review from mandy-li and libinta as code owners March 28, 2024 13:23

dudilester requested a review from a user March 28, 2024 13:23

MrGeva approved these changes Mar 28, 2024

View reviewed changes

MrGeva merged commit 7043900 into habana-main Mar 28, 2024

dudilester added a commit that referenced this pull request Mar 31, 2024

enforce recompute flag on fsdpa quantization (#133)

c46fd33

astachowiczhabana pushed a commit that referenced this pull request Apr 5, 2024

enforce recompute flag on fsdpa quantization (#133)

b05a8e0

astachowiczhabana pushed a commit that referenced this pull request Apr 5, 2024

enforce recompute flag on fsdpa quantization (#133)

4f0cfb1

astachowiczhabana pushed a commit that referenced this pull request Apr 19, 2024

enforce recompute flag on fsdpa quantization (#133)

b8c073a

astachowiczhabana pushed a commit that referenced this pull request Apr 22, 2024

enforce recompute flag on fsdpa quantization (#133)

0b2e152

astachowiczhabana pushed a commit that referenced this pull request Apr 24, 2024

enforce recompute flag on fsdpa quantization (#133)

0261e77

astachowiczhabana pushed a commit that referenced this pull request Apr 24, 2024

enforce recompute flag on fsdpa quantization (#133)

f40ec4e

dudilester added a commit that referenced this pull request May 7, 2024

enforce recompute flag on fsdpa quantization (#133)

105f12b

dudilester added a commit that referenced this pull request May 8, 2024

enforce recompute flag on fsdpa quantization (#133)

86fa5b6

dudilester added a commit that referenced this pull request May 13, 2024

enforce recompute flag on fsdpa quantization (#133)

a963932

Provide feedback