llamapilot is very slow #1

madebyollin · 2023-03-12T22:20:58Z

loading the model takes ten thousand years. we should do it once, instead of many times.

madebyollin · 2023-03-12T22:49:02Z

ggml-org/llama.cpp#61 seems like progress (interactive mode now allows multiple queries to the same process). but it looks like interactive mode persists the prompt over time, so might still need to modify llama.cpp to allow issuing multiple independent queries to the same process.

madebyollin changed the title ~~it's so slow~~ llamapilot is very slow Mar 12, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

llamapilot is very slow #1

llamapilot is very slow #1

madebyollin commented Mar 12, 2023

madebyollin commented Mar 12, 2023

llamapilot is very slow #1

llamapilot is very slow #1

Comments

madebyollin commented Mar 12, 2023

madebyollin commented Mar 12, 2023