[Frontend] Factor out chat message parsing #7055

DarkLight1337 · 2024-08-02T01:24:57Z

To further support #5049, I've factored out more code relating to chat message parsing into chat_utils.py

github-actions · 2024-08-02T01:25:11Z

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which consists a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of default ones by unblocking the steps in your fast-check build on Buildkite UI.

Once the PR is approved and ready to go, please make sure to run full CI as it is required to merge (or just use auto-merge).

To run full CI, you can do one of these:

Comment /ready on the PR
Add ready label to the PR
Enable auto-merge.

🚀

rkooo567 · 2024-08-02T18:21:44Z

vllm/entrypoints/openai/serving_tokenization.py

-                result = parse_chat_message_content(message, model_config,
-                                                    tokenizer)
-                conversation.extend(result.messages)
+            if mm_futures:


QQ: should we warn only once? (I think there's a function like warn_once) it is not going to be spammy?

~~Good point! It's unlikely to be an issue since we have explicitly mentioned that we're only supporting single-image input currently, but yea I agree we should use warn_once here!~~

nvm - the warning here is actually different than what I thought, yea we should put warn_once here

Actually, the default behaviour already does this (i.e. only print out the first occurrence of the warning).

https://docs.python.org/3/library/warnings.html#the-warnings-filter

Signed-off-by: Alvant <alvasian@yandex.ru>

Factor out chat message parsing

4a8768c

DarkLight1337 requested a review from ywang96 August 2, 2024 01:24

DarkLight1337 added the ready ONLY add when PR is ready to merge/full CI is needed label Aug 2, 2024

rkooo567 approved these changes Aug 2, 2024

View reviewed changes

Merge branch 'upstream' into chat-utils-parse

25d7209

DarkLight1337 enabled auto-merge (squash) August 3, 2024 02:36

youkaichao disabled auto-merge August 3, 2024 04:31

youkaichao merged commit 8c025fa into vllm-project:main Aug 3, 2024
64 of 67 checks passed

DarkLight1337 deleted the chat-utils-parse branch August 3, 2024 04:33

dtrifiro mentioned this pull request Aug 5, 2024

Sync with upstream@v0.5.4-7-g9118217f opendatahub-io/vllm#120

Closed

sfc-gh-mkeralapura pushed a commit to sfc-gh-mkeralapura/vllm that referenced this pull request Aug 12, 2024

[Frontend] Factor out chat message parsing (vllm-project#7055)

7c27b0f

kylesayrs pushed a commit to neuralmagic/vllm that referenced this pull request Aug 17, 2024

[Frontend] Factor out chat message parsing (vllm-project#7055)

b0c2444

Alvant pushed a commit to compressa-ai/vllm that referenced this pull request Oct 26, 2024

[Frontend] Factor out chat message parsing (vllm-project#7055)

6a44991

Signed-off-by: Alvant <alvasian@yandex.ru>

KuntaiDu pushed a commit to KuntaiDu/vllm that referenced this pull request Nov 20, 2024

[Frontend] Factor out chat message parsing (vllm-project#7055)

389faab

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Frontend] Factor out chat message parsing #7055

[Frontend] Factor out chat message parsing #7055

DarkLight1337 commented Aug 2, 2024

github-actions bot commented Aug 2, 2024

rkooo567 Aug 2, 2024

ywang96 Aug 2, 2024 •

edited

Loading

DarkLight1337 Aug 3, 2024

[Frontend] Factor out chat message parsing #7055

[Frontend] Factor out chat message parsing #7055

Conversation

DarkLight1337 commented Aug 2, 2024

github-actions bot commented Aug 2, 2024

rkooo567 Aug 2, 2024

Choose a reason for hiding this comment

ywang96 Aug 2, 2024 • edited Loading

Choose a reason for hiding this comment

DarkLight1337 Aug 3, 2024

Choose a reason for hiding this comment

ywang96 Aug 2, 2024 •

edited

Loading