Remove Falcon style ROPE #35
Conversation
I have updated the model on Hugging Face.
Hi @magician-blue, so do you mean the tl-chat model on HF is not compatible with this repo anymore?
@tairov We can still run it with our repo. Change from

```
mojo llama2.mojo tl-chat.bin \
    -r falcon \
    -z tok_tl-chat.bin \
    -n 256 -t 0 -s 100 -i "<|im_start|>user\nGive me a python function to generate Fibonacci sequence<|im_end|>\n<|im_start|>assistant\n"
```

to

```
mojo llama2.mojo tl-chat.bin \
    -r llama \
    -z tok_tl-chat.bin \
    -n 256 -t 0 -s 100 -i "<|im_start|>user\nGive me a python function to generate Fibonacci sequence<|im_end|>\n<|im_start|>assistant\n"
```
If we can convert all HF llama models (they use falcon-style RoPE) to llama-style RoPE, then we only need to implement one type of RoPE in our repo. This is what llama2.c and llama.cpp are doing.
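If I read this right, the conversion itself is just a row permutation of each layer's q/k projection weights. Here is a minimal numpy sketch of the reverse permutation, assuming HF's half-split layout (the function name and exact axis order are illustrative; llama2.c's export.py contains an equivalent helper):

```python
import numpy as np

def permute_reverse(w, n_heads, dim1, dim2):
    # Undo HF's half-split (falcon/NeoX) row layout of a projection matrix
    # so that the original interleaved llama-style RoPE matches the weights.
    # w has shape (dim1, dim2), with dim1 = n_heads * head_dim.
    return (w.reshape(n_heads, 2, dim1 // n_heads // 2, dim2)
             .swapaxes(1, 2)
             .reshape(dim1, dim2))
```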
Looks cool. Could you share some details on where this convert script comes from?
The original convert file comes from llama2.c, and I modified parts of it to support GQA.
The next thing I will do is convert openllama3b (12 GB RAM), llama2-chat-7b (28 GB RAM), and vicuna-7b to test my converter and our llama2.mojo.
In this case I guess convert.py is not needed in the repo; the model could be converted using the script from llama2.c.
Thank you!
All HF llama models use falcon-style RoPE, and we can convert them to the original llama-style RoPE with a permutation of the q/k projection weights.
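For reference, here is a rough numpy sketch of the two conventions (function names and the single-head shapes are mine, not from the repo): falcon/NeoX-style RoPE, as used by HF transformers, pairs element i with element i + d/2, while the original llama style rotates adjacent pairs (2i, 2i+1).

```python
import numpy as np

def rope_llama(x, cos, sin):
    # Original llama style: rotate adjacent pairs (x[2i], x[2i+1]).
    x1, x2 = x[0::2], x[1::2]
    out = np.empty_like(x)
    out[0::2] = x1 * cos - x2 * sin
    out[1::2] = x1 * sin + x2 * cos
    return out

def rope_falcon(x, cos, sin):
    # Falcon/NeoX style: pair x[i] with x[i + d/2].
    half = x.shape[-1] // 2
    x1, x2 = x[:half], x[half:]
    return np.concatenate([x1 * cos - x2 * sin, x1 * sin + x2 * cos])
```

Both apply the same rotations to the same frequency pairs, just with the rows laid out differently, which is why a one-time permutation of the wq/wk rows maps one convention onto the other.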
This pull request solves the bug in converting HF GQA models to gguf format.
I learned the idea from it and fixed the similar bug in llama2.c's export.py.
Now I have successfully converted TinyLlama-1.1B-Chat to llama-style RoPE, so we can remove the falcon RoPE part.
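For GQA checkpoints, the subtlety (the bug class mentioned above) is that k_proj has fewer rows than q_proj, so it must be permuted with the KV head count and KV dimension rather than the query ones. A hedged sketch of the calls, reusing the permute_reverse sketch from earlier in this thread (variable names assumed):

```python
head_dim = dim // n_heads          # per-head dimension
kv_dim = n_kv_heads * head_dim     # K/V projection output dim under GQA

wq = permute_reverse(q_proj_weight, n_heads, dim, dim)
wk = permute_reverse(k_proj_weight, n_kv_heads, kv_dim, dim)  # not n_heads
# V and the remaining weights are unaffected by RoPE layout and copy over as-is.
```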
I have uploaded the new export.py and llama2.mojo.
Details: run

```
python export.py tl-chat.bin --hf PY007/TinyLlama-1.1B-Chat-v0.2 --version 0
```

to convert the model.