"Continue generation" request to API is missing a parameter #5659
Originalimoc
started this conversation in
Feature Requests & Suggestions
Replies: 1 comment 1 reply
-
|
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
When you click stop generation, there will be an additional continue generation button which won't appear if you let the generation finish itself. That button should trigger a generation request contains the specific parameter in another project's issue closing comment or the HF endpoint will assume it's a new round and model will not behave as what continue generation intended. It's called "Prefilled" generation request. Another use case of this feature is model steering. To prevent this, add add_generation_prompt to false in the request to prevent adding an extra turn:
Relavant:
"https://docs.vllm.ai/en/latest/serving/openai_compatible_server.html?ref=blog.mozilla.ai#:~:text=role.%22)%2C%0A%20%20%20%20)-,add_generation_prompt,-%3A%20bool%20%3D"
&
"huggingface/transformers#33198"
Code of Conduct
Beta Was this translation helpful? Give feedback.
All reactions