Allow setting can_stream in extra-openai-models.yaml to allow for o1 over proxy #599

Closed
cmungall opened this issue Oct 31, 2024 · 1 comment · Fixed by #600

cmungall commented Oct 31, 2024

o1 support was added in response to

However, this hard-wires streamability to the named o1 models. I am accessing o1-preview via a (litellm) proxy, so I get:

'message': 'litellm.BadRequestError: AzureException BadRequestError - Error code: 400 - {\'error\': {\'message\': "Unsupported value: \'stream\' does not support true with this model. Only the default (false) value is supported.", \'type\': \'invalid_request_error\'

I believe I need to be able to do this:

- model_name: openai/o1
  model_id: lbl/o1
  api_base: <MY PROXY>
  api_key_name: <MY KEY NAME>
  can_stream: false

However, can_stream is currently ignored.
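
For illustration only, here is a minimal sketch of what honouring that flag could look like when the extra models file is loaded. The function name and the returned structure are hypothetical; this is not the actual llm plugin code:

import yaml

def load_extra_openai_models(path="extra-openai-models.yaml"):
    # Hypothetical sketch: read the YAML entries and honour an explicit
    # can_stream flag instead of inferring streamability from the model name.
    with open(path) as f:
        entries = yaml.safe_load(f) or []
    for entry in entries:
        yield {
            "model_id": entry["model_id"],
            "model_name": entry.get("model_name", entry["model_id"]),
            "api_base": entry.get("api_base"),
            "api_key_name": entry.get("api_key_name"),
            # Default to streaming, but let the YAML turn it off.
            "can_stream": entry.get("can_stream", True),
        }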

cmungall added a commit to cmungall/llm that referenced this issue Oct 31, 2024
Fixes simonw#599 

A longer-term fix would be to use something like Pydantic so we don't repeat ourselves, but that would be a bit of a refactor.
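
As a rough illustration of that idea (a sketch only, not part of the patch), the per-model config could be declared once as a Pydantic model so defaults such as can_stream aren't repeated:

from typing import Optional
from pydantic import BaseModel, ConfigDict

class ExtraOpenAIModel(BaseModel):
    # Allow field names starting with "model_" (Pydantic v2 reserves that prefix).
    model_config = ConfigDict(protected_namespaces=())

    model_id: str
    model_name: Optional[str] = None
    api_base: Optional[str] = None
    api_key_name: Optional[str] = None
    can_stream: bool = True

# Example: ExtraOpenAIModel(model_id="lbl/o1", model_name="openai/o1", can_stream=False)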
simonw closed this as completed in 3b2e526 on Nov 6, 2024

simonw commented Nov 6, 2024

Here's how I tested this: I added this to my extra-openai-models.yaml file:

- model_id: o1-via-proxy
  model_name: o1-preview
  api_base: "http://localhost:8040/v1"
  api_key_name: openai
  can_stream: false

Then I ran a proxy on port 8040 like this:

uv run --with asgi-proxy-lib==0.2a0 \
  python -m asgi_proxy \
  https://api.openai.com -p 8040 -v

And tested it like this:

llm -m o1-via-proxy 'just say hi' 

Output:

Hi there! How can I assist you today?

While my proxy server showed:

INFO:httpx:HTTP Request: POST https://api.openai.com/v1/chat/completions "HTTP/1.1 200 OK"
INFO:root:Request: POST https://api.openai.com/v1/chat/completions
INFO:root:Response: 200 OK
INFO:     127.0.0.1:52683 - "POST /v1/chat/completions HTTP/1.1" 200 OK
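
The same prompt can also be sent through llm's Python API (a sketch; it assumes the o1-via-proxy entry above is present in extra-openai-models.yaml and that any required key is configured):

import llm

# Resolve the model registered via extra-openai-models.yaml and prompt it.
model = llm.get_model("o1-via-proxy")
response = model.prompt("just say hi")
print(response.text())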

I had to fix this issue first though:

simonw added a commit that referenced this issue Nov 14, 2024