[Misc] Validate grammar and fail early #11119

comaniac · 2024-12-11T23:03:19Z

If the grammar is invalid, we now let xGrammar throw RuntimeError when compiling it. However, this happens in the logits processor, so the exception is raised from the model executor. Since we currently don't expect the model executor to throw any exceptions, the exception crashes the engine and kills the worker process.

This PR adds validation when constructing the GrammarConfig to make sure the grammar is valid, which solves this issue.
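The fail-early idea described above can be sketched roughly as follows. This is a minimal illustration, not the code in this PR: `fake_compile` and `GrammarRequest` are hypothetical stand-ins for the backend compiler and for GrammarConfig, and the "must contain `::=`" check exists only so the example has something to reject.

```python
from dataclasses import dataclass
from typing import Callable


def fake_compile(grammar: str) -> str:
    """Hypothetical stand-in for the backend's grammar compiler."""
    if "::=" not in grammar:
        # Mimics the backend raising RuntimeError on an invalid grammar.
        raise RuntimeError("invalid grammar: no rule definitions found")
    return f"compiled({len(grammar)} chars)"


@dataclass
class GrammarRequest:
    """Hypothetical request config; not vLLM's GrammarConfig."""
    grammar: str

    @classmethod
    def from_user_input(
        cls,
        grammar: str,
        compile_fn: Callable[[str], str] = fake_compile,
    ) -> "GrammarRequest":
        # Validate while the request is being constructed, so a bad grammar
        # is reported to the caller instead of surfacing later, deep inside
        # the component that is not expected to raise.
        try:
            compile_fn(grammar)
        except RuntimeError as err:
            raise ValueError(f"Invalid grammar specification: {err}") from err
        return cls(grammar=grammar)
```

With this shape, the caller sees a ValueError at request-construction time, which a server layer could map to a bad-request response rather than letting the failure reach the worker.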

~~Note that there is another issue with the xgrammar backend that isn't addressed by this PR #11118~~

cc @mgoin

github-actions · 2024-12-11T23:03:30Z

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs do not trigger a full CI run by default. Instead, only the fastcheck CI runs, which covers a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on the Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can do one of these:

- Add the ready label to the PR
- Enable auto-merge

🚀

Signed-off-by: Cody Yu <hao.yu.cody@gmail.com>

kouroshHakha · 2024-12-11T23:16:39Z

Hey @comaniac,

1/ Let's add unit tests for this, so that the behavior doesn't regress later on.
2/ We should do a similar thing for JSON schema as well. The following extra_body still kills the engine:
extra_body={"guided_json": {"type": "str"}}
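The schema above is invalid because "str" is not one of the JSON Schema primitive type names (the valid name is "string"). A minimal sketch of an up-front check is shown below. It is only an illustration: `find_invalid_types` is a hypothetical helper, it looks only at "type", "properties", and "items", and it is not how vLLM or xgrammar validate schemas.

```python
from typing import Any, List

# The primitive type names defined by JSON Schema.
JSON_SCHEMA_TYPES = {
    "null", "boolean", "object", "array", "number", "string", "integer",
}


def find_invalid_types(schema: Any, path: str = "$") -> List[str]:
    """Return a message for every unknown "type" value found in the schema."""
    problems: List[str] = []
    if not isinstance(schema, dict):
        return problems

    declared = schema.get("type")
    names = declared if isinstance(declared, list) else [declared]
    for name in names:
        if name is None:
            continue
        if not isinstance(name, str) or name not in JSON_SCHEMA_TYPES:
            problems.append(f"{path}: unknown type {name!r}")

    # Recurse only into the places where sub-schemas are expected here.
    props = schema.get("properties")
    if isinstance(props, dict):
        for prop, sub in props.items():
            problems.extend(find_invalid_types(sub, f"{path}.properties.{prop}"))
    if isinstance(schema.get("items"), dict):
        problems.extend(find_invalid_types(schema["items"], f"{path}.items"))
    return problems


if __name__ == "__main__":
    print(find_invalid_types({"type": "str"}))
```

A non-empty result could be turned into a bad-request error before the schema ever reaches the guided decoding backend.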

comaniac · 2024-12-12T00:36:53Z

I don't know where to add unit tests for the xgrammar backend. It seems not many unit tests have been added for this component. @mgoin do you have any pointers, or should we merge this PR first and add unit tests later?

Per offline discussion with @mgoin, this PR also fixes the Lark-like issue. I also found another issue that crashes the engine with my fix:

1/ Send an invalid grammar. The request fails with a bad request error, but by this point the tokenizer data has already been cached.
2/ Send a valid grammar. Since the tokenizer data is cached, we have encoded_vocab = None, which causes a crash in get_compiler. This is because there is a hidden assumption that when encoded_vocab = None, the compiler must already be initialized, but this is no longer guaranteed.

The fix in this PR is to make sure encoded_vocab is never None. This shouldn't hurt performance because the tokenizer data is cached anyway.
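The two-step failure described above can be reproduced with a toy model. Everything here is hypothetical (the module-level dict caches, `handle_request`, and the `always_pass_vocab` switch); it only mirrors the described control flow and is not the vLLM or xgrammar code.

```python
from typing import Dict, List, Optional

TOKENIZER_CACHE: Dict[str, List[str]] = {}  # toy tokenizer-data cache
COMPILER_CACHE: Dict[str, str] = {}         # toy compiler cache


def get_tokenizer_data(tok_hash: str, vocab: List[str]) -> List[str]:
    # Cache the vocab on first use and return the cached copy afterwards.
    return TOKENIZER_CACHE.setdefault(tok_hash, vocab)


def get_compiler(tok_hash: str, encoded_vocab: Optional[List[str]]) -> str:
    if tok_hash not in COMPILER_CACHE:
        if encoded_vocab is None:
            # Hidden assumption violated: no compiler yet and no vocab either.
            raise RuntimeError("compiler missing and no vocab to build it")
        COMPILER_CACHE[tok_hash] = f"compiler({len(encoded_vocab)} tokens)"
    return COMPILER_CACHE[tok_hash]


def handle_request(tok_hash: str, vocab: List[str], grammar_valid: bool,
                   always_pass_vocab: bool) -> str:
    if tok_hash in TOKENIZER_CACHE and not always_pass_vocab:
        encoded_vocab = None  # old behavior: assume the compiler exists
    else:
        encoded_vocab = get_tokenizer_data(tok_hash, vocab)
    if not grammar_valid:
        # Rejected early: tokenizer data is cached, but no compiler was built.
        raise ValueError("invalid grammar")
    return get_compiler(tok_hash, encoded_vocab)
```

In this toy, an invalid request followed by a valid one raises RuntimeError with `always_pass_vocab=False`, while `always_pass_vocab=True` (the analogue of never letting encoded_vocab be None) builds the compiler on the second request.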

vllm/model_executor/guided_decoding/xgrammar_utils.py

comaniac · 2024-12-12T00:39:16Z

vllm/model_executor/guided_decoding/xgrammar_decoding.py

-        if tokenizer_hash in TokenizerDataCache._cache:
-            encoded_vocab = None
-            stop_token_ids = None
-            backend_str = None
-        else:
-            tokenizer_data = TokenizerDataCache.get_tokenizer_data(tokenizer)
-            encoded_vocab = tokenizer_data.encoded_vocab
-            stop_token_ids = tokenizer_data.stop_token_ids
-            backend_str = tokenizer_data.backend_str


encoded_vocab can no longer be None: if the grammar is invalid, the compiler may not be initialized even when the tokenizer data is cached. This change shouldn't hurt performance because the tokenizer data is cached anyway.

AlbertoCastelo · 2024-12-12T09:17:34Z

vllm/model_executor/guided_decoding/xgrammar_utils.py

+        # Look for GBNF rule definition
+        if '::=' in line:
+            return False

-        # Look for Lark-specific features
-        if any(pattern in line for pattern in ['?start:', '|', '~']):
-            return True
-
-    return False
+    return True


This would fix my issue!
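For context, the removed check treated any line containing `|` as Lark, but GBNF also uses `|` for alternation, so GBNF grammars could be misclassified. A minimal sketch of the heuristic after this diff is shown below. The function name, the empty-input guard, and the line-splitting loop are assumptions, since only part of the function appears in the diff, and comment handling is omitted.

```python
def grammar_is_likely_lark_sketch(grammar_str: str) -> bool:
    """Sketch: treat a grammar as Lark unless a GBNF rule definition is found."""
    # Assumption: empty or non-string input is not treated as Lark.
    if not grammar_str or not isinstance(grammar_str, str):
        return False

    for line in grammar_str.split("\n"):
        line = line.strip()
        if not line:
            continue
        # Look for GBNF rule definition
        if "::=" in line:
            return False

    return True


if __name__ == "__main__":
    gbnf = 'root ::= "yes" | "no"'
    lark = 'start: "yes" | "no"'
    print(grammar_is_likely_lark_sketch(gbnf), grammar_is_likely_lark_sketch(lark))
```

Both sample grammars contain `|`, so only the presence of `::=` separates them in this sketch.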

mgoin

Thanks!

[Misc] Validate grammar and fail early

4baf2cb

Signed-off-by: Cody Yu <hao.yu.cody@gmail.com>

comaniac force-pushed the validate_grammar branch from c5c5aa8 to 4baf2cb Compare December 11, 2024 23:07

mgoin self-requested a review December 12, 2024 00:19

comaniac linked an issue Dec 12, 2024 that may be closed by this pull request

[Bug]: grammar_is_likely_lark doesn't work correctly #11118

Closed

1 task

mgoin reviewed Dec 12, 2024

comaniac commented Dec 12, 2024

comaniac added 2 commits December 12, 2024 00:43

fix

0633583

Signed-off-by: Cody Yu <hao.yu.cody@gmail.com>

comment

4f003b3

Signed-off-by: Cody Yu <hao.yu.cody@gmail.com>

comaniac force-pushed the validate_grammar branch from 83a32cb to 4f003b3 Compare December 12, 2024 00:43

comaniac added 4 commits December 12, 2024 00:45

fix lark

06b63ea

Signed-off-by: Cody Yu <hao.yu.cody@gmail.com>

fix lark

ef80b01

Signed-off-by: Cody Yu <hao.yu.cody@gmail.com>

fix lark

5848882

Signed-off-by: Cody Yu <hao.yu.cody@gmail.com>

fix typo

61c1d49

Signed-off-by: Cody Yu <hao.yu.cody@gmail.com>

AlbertoCastelo reviewed Dec 12, 2024

AlbertoCastelo mentioned this pull request Dec 12, 2024

[Bug]: grammar_is_likely_lark doesn't work correctly #11118

Closed

1 task

mgoin approved these changes Dec 12, 2024

mgoin added the ready label Dec 12, 2024

mgoin enabled auto-merge (squash) December 12, 2024 17:05

mgoin merged commit 2c97eca into vllm-project:main Dec 12, 2024
63 checks passed

sleepwalker2017 pushed a commit to sleepwalker2017/vllm that referenced this pull request Dec 13, 2024

[Misc] Validate grammar and fail early (vllm-project#11119)

003f4f2

comaniac deleted the validate_grammar branch December 20, 2024 22:02

ZenPuzzle pushed a commit to ZenPuzzle/vllm that referenced this pull request Dec 24, 2024

[Misc] Validate grammar and fail early (vllm-project#11119)

712a419

BKitor pushed a commit to BKitor/vllm that referenced this pull request Dec 30, 2024

[Misc] Validate grammar and fail early (vllm-project#11119)

5289193
