[Doc] [SpecDecode] Update MLPSpeculator documentation #7100

tdoublep · 2024-08-03T06:06:19Z

Adding some documentation about MLPSpeculator including some links to the published draft models.

Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>

github-actions · 2024-08-03T06:06:30Z

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which consists a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of default ones by unblocking the steps in your fast-check build on Buildkite UI.

Once the PR is approved and ready to go, please make sure to run full CI as it is required to merge (or just use auto-merge).

To run full CI, you can do one of these:

Comment /ready on the PR
Add ready label to the PR
Enable auto-merge.

🚀

Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>

LiuXiaoxuanPKU

Thanks! LGTM!

njhill · 2024-08-05T22:25:46Z

docs/source/models/spec_decode.rst

+        model="meta-llama/Meta-Llama-3.1-70B-Instruct",
+        tensor_parallel_size=4,
+        speculative_model="ibm-fms/llama3-70b-accelerator",
+        speculative_draft_tensor_parallel_size=1,


We can remove this line now that #7105 is merged.

) Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>

) Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com> Signed-off-by: Alvant <alvasian@yandex.ru>

) Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>

Update MLPSpeculator documentation

7ab6ca1

Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>

tdoublep added 4 commits August 3, 2024 02:11

Fix list

4c6a669

Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>

Add HF org to docstring

3945cda

Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>

Add note about TP

90b58ed

Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>

fmt

8eb243c

Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>

LiuXiaoxuanPKU approved these changes Aug 5, 2024

View reviewed changes

njhill approved these changes Aug 5, 2024

View reviewed changes

njhill enabled auto-merge (squash) August 5, 2024 22:24

github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Aug 5, 2024

njhill reviewed Aug 5, 2024

View reviewed changes

njhill merged commit 789937a into vllm-project:main Aug 5, 2024
65 checks passed

sfc-gh-mkeralapura pushed a commit to sfc-gh-mkeralapura/vllm that referenced this pull request Aug 12, 2024

[Doc] [SpecDecode] Update MLPSpeculator documentation (vllm-project#7100

291b874

) Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>

kylesayrs pushed a commit to neuralmagic/vllm that referenced this pull request Aug 17, 2024

[Doc] [SpecDecode] Update MLPSpeculator documentation (vllm-project#7100

ed67e94

) Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>

fialhocoelho pushed a commit to opendatahub-io/vllm that referenced this pull request Aug 22, 2024

[Doc] [SpecDecode] Update MLPSpeculator documentation (vllm-project#7100

4241159

) Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>

Alvant pushed a commit to compressa-ai/vllm that referenced this pull request Oct 26, 2024

[Doc] [SpecDecode] Update MLPSpeculator documentation (vllm-project#7100

802c1fb

) Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com> Signed-off-by: Alvant <alvasian@yandex.ru>

KuntaiDu pushed a commit to KuntaiDu/vllm that referenced this pull request Nov 20, 2024

[Doc] [SpecDecode] Update MLPSpeculator documentation (vllm-project#7100

6c8f341

) Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Doc] [SpecDecode] Update MLPSpeculator documentation #7100

[Doc] [SpecDecode] Update MLPSpeculator documentation #7100

tdoublep commented Aug 3, 2024

github-actions bot commented Aug 3, 2024

LiuXiaoxuanPKU left a comment

njhill Aug 5, 2024

[Doc] [SpecDecode] Update MLPSpeculator documentation #7100

[Doc] [SpecDecode] Update MLPSpeculator documentation #7100

Conversation

tdoublep commented Aug 3, 2024

github-actions bot commented Aug 3, 2024

LiuXiaoxuanPKU left a comment

Choose a reason for hiding this comment

njhill Aug 5, 2024

Choose a reason for hiding this comment