Welcome to the code repository for our NeurIPS 2024 paper, "Toward Efficient Inference for Mixture of Experts". We are currently securing approval to release the code publicly. Updates on the release, including any links, will be posted here as they become available.
hyhuang00/moe_inference