[Bug]: "Continue generation" request to API is missing a parameter #5654

Originalimoc · 2025-02-05T06:13:47Z

When you click stop generation, there will be an additional continue generation button which won't appear if you let the generation finish itself. That button should trigger a generation request contains the specific parameter in another project's issue closing comment or the HF endpoint will assume it's a new round and model will not behave as what continue generation intended. It's called "Prefilled" generation request. Another use case of this feature is model steering. To prevent this, add add_generation_prompt to false in the request to prevent adding an extra turn:

Relavant:
"https://docs.vllm.ai/en/latest/serving/openai_compatible_server.html?ref=blog.mozilla.ai#:~:text=role.%22)%2C%0A%20%20%20%20)-,add_generation_prompt,-%3A%20bool%20%3D"
&
"huggingface/transformers#33198"

Code of Conduct

I agree to follow this project's Code of Conduct

Originalimoc added the 🐛 bug Something isn't working label Feb 5, 2025

Repository owner locked and limited conversation to collaborators Feb 5, 2025

danny-avila converted this issue into discussion #5659 Feb 5, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

This issue was moved to a discussion.

[Bug]: "Continue generation" request to API is missing a parameter #5654

[Bug]: "Continue generation" request to API is missing a parameter #5654

Originalimoc commented Feb 5, 2025 •

edited

Loading

This issue was moved to a discussion.

This issue was moved to a discussion.

[Bug]: "Continue generation" request to API is missing a parameter #5654

[Bug]: "Continue generation" request to API is missing a parameter #5654

Comments

Originalimoc commented Feb 5, 2025 • edited Loading

Code of Conduct

This issue was moved to a discussion.

Originalimoc commented Feb 5, 2025 •

edited

Loading