Is your feature request related to a problem? Please describe.
While generating tokens with `stream=True`, I would like to stop generation when some condition that changes at runtime is met.
For example, I would like to stop after 5 lines of generation.
Describe the solution you'd like
I would like a method on the `llm` object, called `stop()` or `interrupt()`, that forces the model to stop after the next token is generated, similar to pressing Ctrl+C in the regular llama.cpp CLI.
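For illustration, usage might look something like the sketch below. This is purely hypothetical: neither `stop()` nor `interrupt()` exists in llama-cpp-python today, and the model path is a placeholder.

```python
# Hypothetical sketch of the requested API -- interrupt() does NOT exist
# in llama-cpp-python today; this only illustrates the desired behavior.
import threading
from llama_cpp import Llama

llm = Llama(model_path="./model.gguf")  # placeholder model path

def consume():
    # Stream tokens until the model finishes or is interrupted.
    for chunk in llm("Write a story:", max_tokens=512, stream=True):
        print(chunk["choices"][0]["text"], end="", flush=True)

worker = threading.Thread(target=consume)
worker.start()

# Later, when some runtime condition changes, halt generation from
# outside the streaming loop -- similar to Ctrl+C in the llama.cpp CLI:
llm.interrupt()  # requested feature, not a real method
worker.join()
```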
Describe alternatives you've considered
I have considered adding a newline as a stop token and re-invoking generation line by line, but I think this is not performant. Another way I can think of is mutating the `stop` list after passing it to the generation method, but that feels hacky.
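A third option that works today, sketched below under the assumption that `stream=True` returns a lazy generator (so no further tokens are sampled once the loop exits), is to break out of the consuming loop when the condition is met. Unlike the requested `interrupt()`, though, this only helps when the condition can be checked inside the loop that consumes the stream; the model path and prompt here are placeholders.

```python
# A minimal sketch of the loop-break workaround, assuming
# llama-cpp-python's streaming API.
from llama_cpp import Llama

llm = Llama(model_path="./model.gguf")  # placeholder model path

newlines = 0
for chunk in llm("Write a poem:", max_tokens=512, stream=True):
    text = chunk["choices"][0]["text"]
    print(text, end="", flush=True)
    newlines += text.count("\n")
    if newlines >= 5:
        # Exiting the loop stops the underlying generator, so no
        # further tokens are generated after this point.
        break
```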