Skip to content

Issues: vllm-project/aibrix

v0.3.0 roadmap
#698 opened Feb 18, 2025 by Jeffwan
Open 8
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

Do LLM Cache Support V100 hardware?
#791 opened Mar 4, 2025 by jlcoo
Does aibrix support to do load balance against managed model endpoints area/gateway triage/needs-information Indicates an issue needs more information in order to work on it.
#784 opened Mar 3, 2025 by Colstuwjx
Failed to run benchmark scripts against the endpoint area/gateway kind/bug Something isn't working priority/critical-urgent Highest priority. Must be actively worked on as someone's top priority right now.
#783 opened Mar 3, 2025 by Jeffwan
Add probe usage practice for super large models, including multi-node case area/performance kind/documentation Improvements or additions to documentation kind/enhancement New feature or request priority/critical-urgent Highest priority. Must be actively worked on as someone's top priority right now.
#782 opened Mar 3, 2025 by Jeffwan
Managing common functionalities of benchmark in a separate utils dir area/benchmark kind/cleanup Categorizes issue or PR as related to cleaning up code, process, or technical debt.
#767 opened Feb 28, 2025 by gangmuk
Mounting s3 hosted model files using s3fs is causing startup issues area/lora kind/bug Something isn't working triage/accepted Indicates an issue or PR is ready to be actively worked on.
#765 opened Feb 28, 2025 by robert-moyai
why it donnot supploy helm deploy? area/installation kind/feature Categorizes issue or PR as related to a new feature. priority/important-soon Must be staffed and worked on either currently, or very soon, ideally in time for the next release.
#762 opened Feb 28, 2025 by ying2025
Stateful information sync for ext-proc Instances area/gateway area/stability kind/enhancement New feature or request priority/important-soon Must be staffed and worked on either currently, or very soon, ideally in time for the next release.
#761 opened Feb 27, 2025 by Jeffwan
Support high availability of gateway server for production users area/gateway area/stability kind/feature Categorizes issue or PR as related to a new feature.
#760 opened Feb 27, 2025 by Jeffwan
Support multi-node & autoscaling & routing together for models like Deepseek-R1 area/autoscaling area/distributed priority/critical-urgent Highest priority. Must be actively worked on as someone's top priority right now.
#758 opened Feb 27, 2025 by Jeffwan
ProTip! Type g i on any issue or pull request to go back to the issue listing page.