
server : bugfix - stop server from sending empty json during oai chat completions #10694

Closed · wants to merge 2 commits

Conversation

m18coppola (Contributor)

Since #10643, the webui crashes when the model finishes generating a response:

[screenshot: llama_crash]

Upon investigation, I found that the server sends an empty JSON object before the on_complete JSON:

$ curl http://localhost:8080/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "stream": true,
    "messages": [
      {
        "role": "system",
        "content": "You are a helpful assistant."
      },
      {
        "role": "user",
        "content": "Hello!"
      }
    ]
  }'

...

data: {"choices":[{"finish_reason":null,"index":0,"delta":{"content":"?"}}],"created":1733515737,"id":"chatcmpl-LtMeOt2U3SPq40U172bsTkOzMiL8X4xC","model":"gpt-3.5-turbo-0613","object":"chat.completion.chunk"}

data: {}

data: {"choices":[{"finish_reason":"stop","index":0,"delta":{}}],"created":1733515738,"id":"chatcmpl-LtMeOt2U3SPq40U172bsTkOzMiL8X4xC","model":"gpt-3.5-turbo-0613","object":"chat.completion.chunk","timings":{"prompt_n":1,"prompt_ms":67.807,"prompt_per_token_ms":67.807,"prompt_per_second":14.747739908858966,"predicted_n":25,"predicted_ms":1543.024,"predicted_per_token_ms":61.72096,"predicted_per_second":16.20195149265339},"usage":{"completion_tokens":25,"prompt_tokens":23,"total_tokens":48}}

data: [DONE]

I made a change so that the server skips sending any empty JSON objects returned from the completion results stream.
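
For reference, here is a minimal sketch of the skip approach, assuming an nlohmann::json-based streaming handler like the one llama-server uses; the function and variable names are illustrative, not the actual server internals:

#include <nlohmann/json.hpp>
#include <string>
#include <vector>

using json = nlohmann::json;

// Serialize a batch of streamed completion results as SSE lines,
// dropping any result that serializes to an empty object ("{}").
std::string format_sse_chunks(const std::vector<json> & results) {
    std::string out;
    for (const auto & res : results) {
        if (res.empty()) {
            continue; // skip sending "data: {}" -- this is what crashed the webui
        }
        out += "data: " + res.dump() + "\n\n";
    }
    return out;
}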

@ggerganov (Owner)

It might be better to find the root cause of this empty JSON object and prevent it from being created in the first place.

@ggerganov (Owner)

I traced it: the empty object is created when an end-of-turn token is generated and the --special CLI arg is not passed to llama-server. In that case, the special token is not rendered and the content is empty.
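
For illustration, a sketch of where the empty content comes from, assuming llama.cpp's common_token_to_piece() helper (the wrapper function here is hypothetical):

#include "common.h" // common_token_to_piece()
#include <string>

// Without --special, special tokens are not rendered: converting an
// end-of-turn token yields an empty piece, so the delta built from it
// has no content and the chunk serializes to "{}".
std::string token_to_content(llama_context * ctx, llama_token token, bool render_special) {
    return common_token_to_piece(ctx, token, render_special);
}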

I think a better solution would be to send a valid object with empty content instead of skipping this message from the stream.
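
A minimal sketch of that alternative, following the chunk format shown in the curl output above (the helper name is hypothetical):

#include <nlohmann/json.hpp>
#include <ctime>
#include <string>

using json = nlohmann::json;

// Build a well-formed chat.completion.chunk whose delta carries empty
// content, instead of emitting a bare "{}".
json make_empty_delta_chunk(const std::string & id, const std::string & model) {
    return json{
        {"id",      id},
        {"object",  "chat.completion.chunk"},
        {"created", std::time(nullptr)},
        {"model",   model},
        {"choices", json::array({json{
            {"index",         0},
            {"finish_reason", nullptr},
            {"delta",         json{{"content", ""}}},
        }})},
    };
}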
