diff --git a/README.md b/README.md index dc354a75a..742d043b5 100644 --- a/README.md +++ b/README.md @@ -768,7 +768,7 @@ Notes: - `--fast_moe` is an integer value that configures the amount of expert parallel sharding (ep_degree). - `world_size` must be divisible by the `ep_degree` - Running fast moe modifies the state dict of the model, and must be post-processed using [checkpoint utils](https://github.com/foundation-model-stack/fms-acceleration/blob/main/plugins/accelerated-moe/src/fms_acceleration_moe/utils/checkpoint_utils.py) to run inference (HF, vLLM, etc.). - - The typical uscase for this script is to run: + - The typical usecase for this script is to run: ``` python -m fms_acceleration_moe.utils.checkpoint_utils \ \