Support AsyncPipeline for RESTful API #270

toilaluan · 2023-11-08T02:56:38Z

Are you planning to support this feature?
I'm wanna use FastGen in my app but it's not currently support RESTful API asynchronously
vLLM support it's very well: https://github.com/vllm-project/vllm/blob/main/vllm/entrypoints/api_server.py

I've also use Ray to deploy a server use MIIPipeline with dynamic batching but the performance is far behind vLLM default settings.

mrwyattii · 2023-11-08T17:58:36Z

@toilaluan we actually have RESTful API capabilities, but we have not fully tested them (these pieces were brought over from MII-Legacy). I suspect enabling it will error out currently.

I can bring this feature back to life later this week or early next week when I find time!

toilaluan · 2023-11-09T03:01:14Z

@mrwyattii I've tested it, but currently it serve requests sequentially. Hope you can do soon, thank for your work 🔥

mrwyattii · 2023-11-13T22:21:17Z

Please see the example we have for enabling the REST API: https://github.com/microsoft/DeepSpeed-MII#restful-api

You will need to install from source until we do our next release: pip install git+https://github.com/microsoft/DeepSpeed-MII.git

dongxiaolong · 2023-11-14T07:04:56Z

Please see the example we have for enabling the REST API: https://github.com/microsoft/DeepSpeed-MII#restful-api

You will need to install from source until we do our next release: pip install git+https://github.com/microsoft/DeepSpeed-MII.git

An openai compatible api would be much easier to use.
such as https://github.com/vllm-project/vllm/blob/main/vllm/entrypoints/openai/api_server.py

mrwyattii · 2023-11-17T00:00:57Z

@dongxiaolong are you referring to being able to pass "role" and "content" as the input? Could you please open an issue and add the Enhancement label? Thanks!

mrwyattii self-assigned this Nov 8, 2023

mrwyattii mentioned this issue Nov 13, 2023

Update RESTful API #294

Merged

mrwyattii closed this as completed in #294 Nov 13, 2023

dongxiaolong mentioned this issue Nov 17, 2023

openai compatible api #316

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support AsyncPipeline for RESTful API #270

Support AsyncPipeline for RESTful API #270

toilaluan commented Nov 8, 2023

mrwyattii commented Nov 8, 2023

toilaluan commented Nov 9, 2023

mrwyattii commented Nov 13, 2023

dongxiaolong commented Nov 14, 2023

mrwyattii commented Nov 17, 2023

Support AsyncPipeline for RESTful API #270

Support AsyncPipeline for RESTful API #270

Comments

toilaluan commented Nov 8, 2023

mrwyattii commented Nov 8, 2023

toilaluan commented Nov 9, 2023

mrwyattii commented Nov 13, 2023

dongxiaolong commented Nov 14, 2023

mrwyattii commented Nov 17, 2023