All HF llama models use falcon-style RoPE, and we can convert them to the original llama-style RoPE with a permutation.
That pull request fixes the bug that appeared when converting HF GQA models to gguf format.
I learned the idea from it and fixed the similar bug in llama2.c's export.py.
Now I can successfully convert TinyLlama-1.1B-Chat with llama-style RoPE, so we can remove the falcon RoPE code path.
I have uploaded the new export.py and llama2.mojo.
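For reference, the permutation in question can be sketched as below. This mirrors the `permute` helper used in llama.cpp's convert script; the function name and docstring are mine, and it is applied row-wise, per attention head, to the `wq`/`wk` projection matrices (with `n_heads` set to the number of KV heads for `wk` in the GQA case, which is the bug being fixed):

```python
import numpy as np

def permute_rope(w: np.ndarray, n_heads: int) -> np.ndarray:
    """Reorder the rows of a q/k projection matrix to translate between
    the half-split RoPE layout used by HF checkpoints and the interleaved
    (pairwise) layout expected by the original llama code.

    w       : weight matrix of shape (n_heads * head_dim, ...)
    n_heads : number of attention heads this matrix projects for
    """
    return (
        w.reshape(n_heads, 2, w.shape[0] // n_heads // 2, *w.shape[1:])
         .swapaxes(1, 2)
         .reshape(w.shape)
    )
```

With a single head of dimension 4, the rows `[0, 1, 2, 3]` are reordered to `[0, 2, 1, 3]`, i.e. the two halves of the head are interleaved pairwise.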
Details: run

```
python export.py tl-chat.bin --hf PY007/TinyLlama-1.1B-Chat-v0.2 --version 0
```

to convert the model.