ggml-org/llama.cpp#61 seems like progress (interactive mode now allows multiple queries to the same process), but it looks like interactive mode persists the prompt over time, so we might still need to modify llama.cpp to allow issuing multiple independent queries to the same process.
loading the model takes ten thousand years. we should do it once, instead of many times.
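a rough sketch of the shape being asked for, in Python rather than llama.cpp's C++: load once, then answer many queries from the same process, with a switch between "context persists" (what interactive mode appears to do) and "each query starts fresh". everything here is hypothetical: `FakeModel`, `load_model`, and `serve` are stand-ins for illustration, not llama.cpp functions, and the "inference" is a placeholder string.

```python
class FakeModel:
    """Hypothetical stand-in for expensively loaded weights (not a llama.cpp API)."""

    def generate(self, context):
        # Placeholder "inference": echo the latest prompt and the context size.
        return f"reply to {context[-1]!r} (context size {len(context)})"


def load_model():
    """Stand-in for the slow load; the point is that it runs once per process."""
    return FakeModel()


def serve(model, prompts, persist_context=False):
    """Answer each prompt with the already-loaded model.

    persist_context=True keeps earlier prompts in the context, like the
    interactive-mode behaviour described above; False resets the context
    so every query is independent.
    """
    context = []
    replies = []
    for prompt in prompts:
        if not persist_context:
            context = []  # independent query: drop everything from earlier prompts
        context.append(prompt)
        replies.append(model.generate(context))
    return replies


if __name__ == "__main__":
    model = load_model()  # paid once, not once per query
    for reply in serve(model, ["first question", "second question"]):
        print(reply)
```

in the real thing the reset step would presumably mean clearing whatever per-conversation state llama.cpp keeps between prompts while leaving the loaded weights alone; how to do that is exactly the open question here.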