
[Feature]: Only apply Guided/Structured grammar after reasoning steps in Reasoning models #12619

Open
1 task done
cksac opened this issue Jan 31, 2025 · 3 comments
@cksac

cksac commented Jan 31, 2025

🚀 The feature, motivation and pitch

Apply Guided/Structured grammar only to the answer portion of a reasoning model's output, i.e. for DeepSeek R1, enforce the grammar only inside <answer></answer> or after </think>.
This would make reasoning models more useful in agent workflows that expect structured output.

Alternatives

No response

Additional context

No response

Before submitting a new issue...

  • Make sure you have already searched for relevant issues and asked the chatbot at the bottom-right corner of the documentation page, which can answer many frequently asked questions.
@gaocegege
Contributor

Related to #11908

@gaocegege
Contributor

gaocegege commented Feb 8, 2025

I have a PoC tested with deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B, which introduces a check in the logits processor to skip the xgrammar schema enforcement until the end-of-reasoning token has been generated:

@dataclass
class XGrammarLogitsProcessor:
    """Wrapper class to support pickle protocol"""
    config: GrammarConfig

    ctx: xgr.CompiledGrammar | None = None
    token_bitmask: torch.Tensor = None  # type: ignore[assignment]
    matchers: list[xgr.GrammarMatcher] = field(default_factory=list)
    batch_size: int = field(default=1)
    prefilled: bool = field(default=False)

    def __call__(self, input_ids: list[int],
                 scores: torch.Tensor) -> torch.Tensor:
        # New: skip grammar enforcement while the model is still reasoning.
        if not reasoning_end(input_ids):
            return scores
        if self.ctx is None:
            self._ensure_ctx()
        ...


def reasoning_end(input_ids: list[int]) -> bool:
    """Check if input_ids contain the end-of-reasoning token."""
    # Hard-coded </think> token id for DeepSeek-R1-Distill-Qwen-1.5B
    endthink_token_id = 151649
    return endthink_token_id in input_ids
I can generalize it to support all R1 models and structured engines if you think this approach is effective.
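To illustrate what that generalization might look like: a minimal sketch, assuming the end-of-reasoning token can be resolved from the model's tokenizer at construction time rather than hard-coded. `ReasoningEndChecker` and `from_token` are hypothetical names, not part of vLLM's API; `tokenizer` is assumed to be a HuggingFace-style tokenizer exposing `encode()`.

```python
from dataclasses import dataclass


@dataclass
class ReasoningEndChecker:
    """Tracks whether the end-of-reasoning token has been generated.

    Hypothetical sketch: resolve the end token per model instead of
    hard-coding a single token id, so any R1-style reasoning model
    (or a different structured-output engine) can reuse the check.
    """
    end_token_id: int
    _seen: bool = False

    @classmethod
    def from_token(cls, tokenizer, end_token: str = "</think>"):
        # Resolve the special token to its id once, at construction time.
        ids = tokenizer.encode(end_token, add_special_tokens=False)
        assert len(ids) == 1, f"{end_token!r} must encode to a single token"
        return cls(end_token_id=ids[0])

    def __call__(self, input_ids: list[int]) -> bool:
        # Assumes the checker is called once per generated token, so it
        # only needs to inspect the newest token; the flag is cached to
        # avoid rescanning the whole sequence on every decoding step.
        if not self._seen and input_ids and input_ids[-1] == self.end_token_id:
            self._seen = True
        return self._seen
```

The caching also gives the desired one-way behavior: once `</think>` has been seen, grammar enforcement stays on for the rest of the generation.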

full code:

gaocegege@9134347

@gaocegege
Contributor

I have a draft PR: #12955

Please let me know if the approach works for you.
