
Disable non-functioning torch.compile #17

Merged: 4 commits, Mar 27, 2023
Conversation

carmocca (Contributor)

No description provided.

@carmocca carmocca marked this pull request as ready for review March 27, 2023 13:53
Comment on lines +19 to +20
# compilation fails as it does not support torch.complex64 for RoPE
# compile = False
Contributor
Ah, this sucks :((

We could explore this in the future: add a flag to our nano model that lets us switch between the complex implementation and the previous real one. We would use the flag for comparison and for inference with the Meta checkpoints, but use the real implementation when training from scratch.
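A minimal sketch of what such a flag could look like. This is not code from the PR; the names `build_rope_cache` and `apply_rope` and the exact shapes are assumptions for illustration. The complex path multiplies interleaved feature pairs, viewed as complex numbers, by e^{i*theta} in `torch.complex64` (the dtype `torch.compile` rejected at the time); the real path writes the identical rotation with `cos`/`sin` only:

```python
import torch

def build_rope_cache(seq_len: int, n_elem: int, base: int = 10000):
    # Standard RoPE frequencies: theta_i = base^(-2i / n_elem).
    theta = 1.0 / (base ** (torch.arange(0, n_elem, 2).float() / n_elem))
    idx_theta = torch.outer(torch.arange(seq_len).float(), theta)
    # Each cache tensor has shape (seq_len, n_elem // 2).
    return torch.cos(idx_theta), torch.sin(idx_theta)

def apply_rope(x: torch.Tensor, cos: torch.Tensor, sin: torch.Tensor,
               use_complex: bool = True) -> torch.Tensor:
    """Rotate interleaved feature pairs of x, shape (seq_len, n_elem)."""
    if use_complex:
        # Complex path: (x1 + i*x2) * (cos + i*sin). This is the variant
        # that uses torch.complex64 and fails under torch.compile.
        xc = torch.view_as_complex(x.float().reshape(x.shape[0], -1, 2))
        freqs_cis = torch.complex(cos, sin)
        return torch.view_as_real(xc * freqs_cis).flatten(-2).type_as(x)
    # Real path: the same rotation spelled out with cos/sin,
    # which avoids complex dtypes and is compile-friendly.
    x1, x2 = x[..., 0::2], x[..., 1::2]
    out = torch.stack((x1 * cos - x2 * sin, x1 * sin + x2 * cos), dim=-1)
    return out.flatten(-2).type_as(x)
```

Both branches compute the same rotation, so a checkpoint produced with one should load against the other; the flag would only select which kernel runs.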

Contributor Author

We still want to be able to compile for training, as that's where the speedup becomes more valuable (less $$$ spent).

Maybe we should look into getting the non-complex implementation to work as expected.

Contributor

That's what I'm explaining above. We would compile the non-complex version for training, but we would use the other for inference where we load the meta checkpoint.

> Maybe we should look into getting the non-complex implementation to work as expected.

I spent the whole weekend on this so yeah, it's not like we didn't try already xD

@@ -5,7 +5,7 @@
import lightning as L


-@torch.inference_mode()
+@torch.no_grad()
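The hunk above replaces `@torch.inference_mode()` with `@torch.no_grad()`. The thread does not state the motivation, but a plausible reading (an assumption here) is that inference mode is the stricter of the two and has interacted poorly with compilation: its outputs are permanently barred from autograd. A small sketch of the practical difference, with illustrative function names:

```python
import torch

@torch.no_grad()
def double_no_grad(x: torch.Tensor) -> torch.Tensor:
    # Gradient tracking is disabled inside, but the output is an
    # ordinary tensor that autograd may still consume later.
    return x * 2

@torch.inference_mode()
def double_inference(x: torch.Tensor) -> torch.Tensor:
    # Stricter: the output is an "inference tensor" that can never
    # re-enter autograd, in exchange for lower overhead.
    return x * 2

x = torch.ones(2, requires_grad=True)
y = double_no_grad(x)
z = double_inference(x)
assert not y.requires_grad and not z.requires_grad
assert z.is_inference() and not y.is_inference()
```

For a generation script the numerical results are identical either way; `no_grad` simply keeps the outputs usable in more downstream contexts.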

@carmocca carmocca merged commit 587824c into main Mar 27, 2023
@carmocca carmocca deleted the carmocca/compile-disable branch March 27, 2023 14:02
gkroiz pushed a commit to gkroiz/lit-llama that referenced this pull request May 9, 2023
timothylimyl referenced this pull request in timothylimyl/lit-llama-qa May 21, 2023
@carmocca carmocca self-assigned this Nov 1, 2023