
Add documents on how to add new models #65

Closed
zhuohan123 opened this issue May 4, 2023 · 0 comments · Fixed by #138
Assignees: WoosukKwon
Labels: documentation (Improvements or additions to documentation), P0

Comments

@zhuohan123 (Member)

No description provided.

WoosukKwon added the documentation (Improvements or additions to documentation) label on May 4, 2023
WoosukKwon self-assigned this on May 6, 2023
WoosukKwon added the P0 label on May 10, 2023
yukavio pushed a commit to yukavio/vllm that referenced this issue Jul 3, 2024
SUMMARY:
* update "set-env" action to set HF_HOME
* add action to mount HF cache
* pin some tests to devices "0,1" as this enables about 2k more test
points

TEST PLAN:
runs on remote push

---------

Co-authored-by: andy-neuma <andy@neuralmagic.com>
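For context on the CI changes summarized above: HF_HOME controls where Hugging Face downloads are cached, and limiting a run to GPUs 0 and 1 is commonly done through CUDA_VISIBLE_DEVICES. The sketch below is a hypothetical, environment-variable-level illustration of those two settings; it is not the actual "set-env" GitHub Action or cache-mount step from this commit, and the cache path is an assumption.

```python
# Hypothetical sketch of the two environment variables mentioned in the
# commit summary. Not the real CI action; paths are placeholders.
import os

# Point the Hugging Face cache at a persistent mount so model weights are
# reused across test runs instead of being re-downloaded each time.
os.environ["HF_HOME"] = "/mnt/hf_cache"  # assumed path, for illustration only

# Pin the process to GPUs 0 and 1; CUDA-based frameworks that honor
# CUDA_VISIBLE_DEVICES will then only see these two devices.
os.environ["CUDA_VISIBLE_DEVICES"] = "0,1"

print("HF cache:", os.environ["HF_HOME"])
print("Visible GPUs:", os.environ["CUDA_VISIBLE_DEVICES"])
```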
fialhocoelho pushed a commit to fialhocoelho/vllm that referenced this issue Jul 8, 2024
Sync with upstream@v0.5.1-10-g16620f43
dllehr-amd pushed a commit to dllehr-amd/vllm that referenced this issue Jul 22, 2024
pi314ever pushed a commit to pi314ever/vllm that referenced this issue Jan 17, 2025
remove expert_max hard code (vllm-project#47)
vLLM-Ext: Full enabling of ALiBi (vllm-project#34)
Add version inference via setuptools-scm (vllm-project#58)
Revert "vLLM-Ext: Full enabling of ALiBi (vllm-project#34)" (vllm-project#59)
Remove punica_hpu.py from vllm_hpu_extension (vllm-project#66)
Removed previous (not-pipelined) pa implementation (vllm-project#72)
Add flag to enable running softmax in fp32 (vllm-project#71)
Update calibration readme link (vllm-project#73)
allow lm_head quantization in calibration process (vllm-project#65)
Pad to bmin if value is less (vllm-project#67)
Update pyproject.toml (HabanaAI#75)

---------

Co-authored-by: Michał Kuligowski <mkuligowski@habana.ai>