
Misc. bug: Some server response JSON still not restored #10728

Closed
CentricStorm opened this issue Dec 9, 2024 · 3 comments · Fixed by #10818
Labels
bug Something isn't working good first issue Good for newcomers

Comments

@CentricStorm
Contributor

Name and Version

version: 4291 (ce8784b)
built with MSVC 19.29.30157.0 for x64

Operating systems

Windows

Which llama.cpp modules do you know to be affected?

llama-server

Problem description & steps to reproduce

After #10722, there are still some issues when using the native /completion endpoint:

  • The has_new_line property is set to 1 instead of true, as it used to be.
  • The model property is set to gpt-3.5-turbo-0613 even though this isn't the OpenAI-compatible API.
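For context, JSON distinguishes the boolean true from the integer 1, so strictly-typed clients parsing the /completion response can break when the field's type changes even though the values compare equal in loosely-typed languages. A minimal Python sketch (illustrative only, not llama.cpp code) of the difference:

```python
import json

# A strictly-typed client may expect a JSON boolean for has_new_line.
old_response = json.dumps({"has_new_line": True})  # boolean, as before
new_response = json.dumps({"has_new_line": 1})     # integer, after the regression

print(old_response)  # {"has_new_line": true}
print(new_response)  # {"has_new_line": 1}

# In Python, True == 1, but the serialized JSON text differs,
# and a type check tells the two apart:
print(isinstance(json.loads(new_response)["has_new_line"], bool))  # False
```

A JSON-schema validator or a statically typed deserializer (e.g. in Rust or Go) would reject the integer form outright, which is why restoring the boolean matters.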

First Bad Commit

No response

Relevant log output

No response

@ggerganov ggerganov added bug Something isn't working good first issue Good for newcomers and removed bug-unconfirmed labels Dec 9, 2024
@ggerganov
Owner

PRs welcome

@MichelleTanPY
Contributor

Hi @ggerganov, I saw the 'good first issue' tag on this. I'm new to this project and thought I'd give it a try. I managed to build llama-server, but I'm getting a strange error when starting the server.
I'm following the README here: https://github.com/ggerganov/llama.cpp/tree/master/examples/server and getting the following error:
llama_model_load: error loading model: missing tensor 'token_embd.weight'
llama_load_model_from_file: failed to load model
common_init_from_params: failed to load model '.\models\ggml-vocab-deepseek-llm.gguf'
srv  load_model: failed to load model, '.\models\ggml-vocab-deepseek-llm.gguf'
main: exiting due to model loading error
Any pointers on how I should proceed? Building with MSVC 19.42.34435.0 on an x64 Windows machine.

@ngxson
Collaborator

ngxson commented Dec 13, 2024

  • The model property is set to gpt-3.5-turbo-0613 even though this isn't the OpenAI-compatible API.

This is to prepare for adding OAI compatibility to the /v1/completions endpoint. If you're looking for the model path, please have a look at the /props endpoint.
