No tools work with Llama 1B #195

Closed
PtasiaDupka opened this issue Oct 12, 2024 · 3 comments

Comments

@PtasiaDupka

I did everything as described in the documentation, but tools just don't execute. For example, if I type "write a snake game to snake.py", the LLM does its job and writes the code, but gptme doesn't write it to the file. There are no messages from "System", just the "User:" prompt again.

@PtasiaDupka
Author

It looks like the LLM needs to be taught how to format its output in a way gptme understands, but how? I didn't find any commands for that.

@PtasiaDupka
Author

PtasiaDupka commented Oct 12, 2024

Okay, I looked into the code and figured out that the LLM is given all the information it needs to act as intended at the very beginning of every conversation.

I ran gptme with --show-hidden and, just in case, also checked the file where the conversation history is stored. gptme does instruct the LLM, but the model just ignores it...

I also forgot to mention that I'm using llama3.2:1b just as in the example from the docs.
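
To illustrate why nothing shows up from "System" in this situation, here is a minimal sketch of how a client could pick tool calls out of a model reply. It assumes a hypothetical format in which a tool call is a fenced block whose first line names the tool and its arguments (for example `save snake.py`); the names `extract_tool_calls`, `KNOWN_TOOLS`, and `BLOCK_RE` are made up for this example and this is not gptme's actual parser. If the model answers with an ordinary `python` block instead, the parser finds no tool call and there is nothing to execute.

```python
import re

# Built programmatically so this example nests cleanly inside markdown.
FENCE = "`" * 3

# Assumed format: a fenced block whose info string is "<tool> <args>".
BLOCK_RE = re.compile(
    rf"^{FENCE}(?P<info>[^\n]*)\n(?P<body>.*?)^{FENCE}\s*$",
    re.DOTALL | re.MULTILINE,
)

# Hypothetical tool names, for illustration only.
KNOWN_TOOLS = {"save", "shell"}


def extract_tool_calls(reply: str) -> list[tuple[str, str, str]]:
    """Return (tool, args, body) for each fenced block that names a known tool."""
    calls = []
    for match in BLOCK_RE.finditer(reply):
        parts = match.group("info").strip().split(maxsplit=1)
        if parts and parts[0] in KNOWN_TOOLS:
            args = parts[1] if len(parts) > 1 else ""
            calls.append((parts[0], args, match.group("body")))
    return calls


# A reply that follows the assumed format: one tool call is found.
good = f"Here you go:\n{FENCE}save snake.py\nprint('snake')\n{FENCE}\n"

# A reply that ignores the format: the code is there, but no tool is named.
bad = f"Here you go:\n{FENCE}python\nprint('snake')\n{FENCE}\n"

print(extract_tool_calls(good))
print(extract_tool_calls(bad))
```

Under these assumptions, the second reply contains working code but produces an empty list of tool calls, which matches the symptom of getting the "User:" prompt back with no file written.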

@ErikBjare
Owner

ErikBjare commented Oct 13, 2024

Llama 1B is nowhere near good enough to adhere to the system prompt. It is merely an example in the docs for how to get the most minimal setup done.

Llama 70B and up behave somewhat correctly (with 405B being really good).

Check the evals to see which models actually work: https://gptme.org/docs/evals.html

@ErikBjare ErikBjare changed the title from "No tools work with ollama" to "No tools work with Llama 1B" Oct 13, 2024
@ErikBjare ErikBjare closed this as not planned Oct 13, 2024