fix(ollama_chat.py): use tiktoken as backup for prompt token counting #1495

puffo · 2024-01-18T16:58:03Z

The fix for ollama completions was already applied in a previous commit, but it is also required for ollama_chat.py.

Repeated requests to the ollama chat endpoint produce an issue similar to the earlier one.

This is likely because these keys are optional in the response (see the ollama types definition).
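
For illustration, here is a minimal sketch of the fallback idea, not the actual diff in this PR. It assumes the optional key is `prompt_eval_count` (my reading of the ollama response fields, not something stated above) and that the `cl100k_base` encoding is an acceptable approximation. The helper names `prompt_token_count` and `count_with_tiktoken` are hypothetical.

```python
from typing import Any, Callable, Mapping


def count_with_tiktoken(text: str) -> int:
    """Approximate the token count of `text` with tiktoken."""
    # Imported lazily so the rest of the sketch works without tiktoken installed.
    import tiktoken

    # Assumption: cl100k_base is only an approximation of the model's own tokenizer.
    encoding = tiktoken.get_encoding("cl100k_base")
    return len(encoding.encode(text))


def prompt_token_count(
    response_json: Mapping[str, Any],
    prompt: str,
    fallback: Callable[[str], int] = count_with_tiktoken,
) -> int:
    """Use the count reported by the server if present, else fall back."""
    # Assumption: the key is named "prompt_eval_count" and may be absent.
    reported = response_json.get("prompt_eval_count")
    if reported is not None:
        return reported
    # Key missing from the response: estimate the count locally instead.
    return fallback(prompt)
```

Using `.get()` instead of indexing avoids a `KeyError` when the key is absent, which is the failure mode this PR appears to address.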

krrishdholakia · 2024-01-18T18:02:36Z

Good work!

puffo · 2024-01-18T23:47:45Z

Apologies @krrishdholakia, I've found a problem with this solution after digging deeper into the ollama documentation. I'll push up a fix shortly.

fix(ollama_chat.py): use tiktoken as backup for prompt token counting

becff36

krrishdholakia merged commit 658fd4d into BerriAI:main Jan 18, 2024
2 checks passed

puffo mentioned this pull request Jan 19, 2024

fix(ollama): metrics handling #1514

Open
