Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Disaggregate prefill decode with zmq #1

Draft
wants to merge 20 commits into
base: main
Choose a base branch
from

Conversation

panf2333
Copy link
Collaborator

@panf2333 panf2333 commented Jan 6, 2025

FILL IN THE PR DESCRIPTION HERE

FIX #xxxx (link existing issues this PR will resolve)

**BEFORE SUBMITTING, PLEASE READ https://docs.vllm.ai/en/latest/contributing/overview.html **

@panf2333
Copy link
Collaborator Author

panf2333 commented Jan 6, 2025

  1. 下载后通过vllm connect --prefill-addr localhost:5555 --decode-addr localhost:5556 启动connect 服务
  2. 进入 vllm/benchmarks/disagg_benchmarks/zmq
  3. 执行python test_connect_server1.py 启动模拟的第一个prefill node
  4. 执行python test_connect_server2.py 启动模拟的第一个decode node
  5. 执行python test_request.py 发起测试请求

prefill node 和 decode node 目前都是mock的数据返回流式请求

@panf2333 panf2333 force-pushed the disaggregate_prefill_decode_with_zmq branch from 1bc97ec to 0728a42 Compare January 8, 2025 16:39
@panf2333 panf2333 force-pushed the disaggregate_prefill_decode_with_zmq branch from e0a1b83 to 0986283 Compare February 18, 2025 12:46
Signed-off-by: clark <panf2333@gmail.com>
Signed-off-by: clark <panf2333@gmail.com>
Signed-off-by: clark <panf2333@gmail.com>
Signed-off-by: clark <panf2333@gmail.com>
2.To more accurately reflect its purpose, we will rename connect.py to disagg_connector.py.

Signed-off-by: clark <panf2333@gmail.com>
Signed-off-by: clark <panf2333@gmail.com>
…oy(linger=0) for immediate termination

Signed-off-by: clark <panf2333@gmail.com>
Signed-off-by: clark <panf2333@gmail.com>
Signed-off-by: clark <panf2333@gmail.com>
Signed-off-by: clark <panf2333@gmail.com>
Signed-off-by: clark <panf2333@gmail.com>
Signed-off-by: clark <panf2333@gmail.com>
Signed-off-by: clark <panf2333@gmail.com>
Signed-off-by: clark <panf2333@gmail.com>
Signed-off-by: clark <panf2333@gmail.com>
Signed-off-by: clark <panf2333@gmail.com>
Signed-off-by: clark <panf2333@gmail.com>
Signed-off-by: clark <panf2333@gmail.com>
Signed-off-by: clark <panf2333@gmail.com>
Signed-off-by: clark <panf2333@gmail.com>
@panf2333 panf2333 force-pushed the disaggregate_prefill_decode_with_zmq branch from 0986283 to cc7c582 Compare February 18, 2025 14:45
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant