Support for context-free-grammars (CFG) to constrain model output #25778

jvhoffbauer · 2023-08-26T17:52:02Z

Feature request

It would be nice to constrain the model output with a CFG directly when calling model.generate.

This is already done by llama.cpp grammars

An example is in this repo.

prompt = "ReLLM, the best way to get structured data out of LLMs, is an acronym for "
pattern = regex.compile(r'Re[a-z]+ L[a-z]+ L[a-z]+ M[a-z]+')
output = complete_re(model=model, 
                     prompt=prompt,
                     pattern=pattern)

> Realized Logistic Logistics Model

Is such a parameter on the roadmap for transformers?

Motivation

This can be super useful to make model output parseable within architectures that process the output of an LLM using classical methods. E.g. it can be used to make a model generate valid JSON in every case.

Your contribution

Happy to build this with CFGs if it helps! 😄

The text was updated successfully, but these errors were encountered:

ArthurZucker · 2023-08-28T13:07:40Z

I think something like this is planned cc @gante 🤗

jvhoffbauer · 2023-08-28T18:34:51Z

@gante @ArthurZucker can I help with this somehow? Happy to set up a PR over the weekend!

gante · 2023-08-29T15:24:39Z

Hey @jvhoffbauer 👋

This feature seems very similar to Microsoft's guidance project, which is compatible with transformers.

Is there some use case that you see guidance not solving that this one would solve? :)

jvhoffbauer · 2023-08-31T16:52:30Z

Hey @gante

I think guidance is a very feature-rich framework to query LLMs. It, however, does not provide

support for context-free grammars, only regex (that is my understanding so far!)
beam search
a lightweight approach to perform inference that can potentially be embedded in training pipelines

Using transformers would be more convenient for my specific use case (generating markdown). Do you think that this justifies integrating it? I also would be curious if others need such a feature.

gante · 2023-08-31T19:37:12Z

@jvhoffbauer you're the first one requesting it :D

Since this requires non-trivial code (that we have to maintain in the future) and our bandwidth is quite limited at the moment, I'll do my usual pact: if this comment reaches 10 reactions, I'll greenlight its inclusion in transformers :) (Whoever does the 10th react, please tag me in a comment!)

That way, we know for sure that there is demand for the feature, and that our team's bandwidth is being put to the best use in favor of the community 🤗

jvhoffbauer · 2023-09-05T11:13:28Z

Makes sense!

jvhoffbauer · 2023-09-23T09:53:19Z

@gante It's even 11 now!

I am super happy to prepare a PR. Can you provide guidance on how to go about discussions on the interface and architecture? Should I just draft something out or is there a better way?

oobabooga · 2023-09-24T19:32:28Z

+1 for this. It would be very interesting to use BNF as a built-in LogitsProcessor in transformers.

LysandreJik · 2023-09-25T10:14:58Z

Thanks all for your interest! @gante, leading generation, is on leave for the coming few weeks, but we'll make sure to attend to this issue when he's back.

@jvhoffbauer, if you're motivated to open a PR with a draft of what you have in mind, please go ahead!

jvhoffbauer · 2023-09-25T10:32:48Z

Thanks all for your interest! @gante, leading generation, is on leave for the coming few weeks, but we'll make sure to attend to this issue when he's back.

@jvhoffbauer, if you're motivated to open a PR with a draft of what you have in mind, please go ahead!

Super cool! Yes, I will create a draft this week!

jvhoffbauer · 2023-11-17T14:38:04Z

I see that @Saibo-creator already created a draft in #27557 which is exactly what was discussed!

In addition to that, I am starting a research project in Uni working on syntax-error-free text generation which will explore applications of CFG-based text generation. Potentially describing further use-cases in that area in a community blog post might be interesting!

Saibo-creator · 2023-11-17T15:27:13Z

@jvhoffbauer Happy to see that you are also working on research project related to grammar-constrained decoding! I'm also working on a research project related to GCD, would you mind us having a zoom chat at some time? It may spark new ideas! :)
here is my email saibo.geng@epfl.ch

shermansiu · 2023-11-21T04:41:01Z

By the way, Microsoft's guidance repo has CFG decoding now, although it doesn't seem like you can easily define the CFG as a text file (i.e. not defining the grammar itself programmatically).

shermansiu · 2023-11-21T04:58:35Z

@jvhoffbauer @Saibo-creator: By the way, you might want to review Picard and Synchromesh, as they both use CFG decoding to improve the generation of code.

Saibo-creator · 2023-11-24T08:26:04Z

@shermansiu
Thanks for pointing out the two papers, yes I know both papers. They are important works in this technique

shawnz · 2024-02-09T21:36:29Z

While this is being worked on, you might also consider using https://github.com/r2d4/parserllm (thank @elo-siema for finding it)

AlbertMarashi · 2024-04-15T16:04:19Z

+1 on this, would really love to use it on hugging face models

Saibo-creator · 2024-04-18T19:56:52Z

Hello @AlbertMarashi, the transformers team mentioned they lack the capacity to support this feature, so I've transferred it here https://github.com/epfl-dlab/transformers-CFG

It's functioning quite effectively :)

jvhoffbauer linked a pull request Oct 1, 2023 that will close this issue

[WIP] Add Support for masking output using a Context-Free-Grammar #26520

Open

huggingface deleted a comment from github-actions bot Oct 27, 2023

Saibo-creator mentioned this issue Nov 17, 2023

Context Free Grammar Constrained Decoding (ebnf interface, compatible with llama-cpp) #27557

Open

19 tasks

huggingface deleted a comment from github-actions bot Dec 19, 2023

Saibo-creator mentioned this issue Dec 22, 2023

add context-free grammar constrained decoding(ebnf interface) into reserach project directory #28210

Closed

5 tasks

huggingface deleted a comment from github-actions bot Jan 15, 2024

ArthurZucker added the Feature request Request for a new feature label Jan 15, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support for context-free-grammars (CFG) to constrain model output #25778

Support for context-free-grammars (CFG) to constrain model output #25778

jvhoffbauer commented Aug 26, 2023 •

edited

Loading

ArthurZucker commented Aug 28, 2023

jvhoffbauer commented Aug 28, 2023

gante commented Aug 29, 2023

jvhoffbauer commented Aug 31, 2023 •

edited

Loading

gante commented Aug 31, 2023

jvhoffbauer commented Sep 5, 2023

jvhoffbauer commented Sep 23, 2023

oobabooga commented Sep 24, 2023

LysandreJik commented Sep 25, 2023

jvhoffbauer commented Sep 25, 2023 •

edited

Loading

jvhoffbauer commented Nov 17, 2023

Saibo-creator commented Nov 17, 2023 •

edited

Loading

shermansiu commented Nov 21, 2023

shermansiu commented Nov 21, 2023

Saibo-creator commented Nov 24, 2023

shawnz commented Feb 9, 2024 •

edited

Loading

AlbertMarashi commented Apr 15, 2024

Saibo-creator commented Apr 18, 2024

Support for context-free-grammars (CFG) to constrain model output #25778

Support for context-free-grammars (CFG) to constrain model output #25778

Comments

jvhoffbauer commented Aug 26, 2023 • edited Loading

Feature request

Motivation

Your contribution

ArthurZucker commented Aug 28, 2023

jvhoffbauer commented Aug 28, 2023

gante commented Aug 29, 2023

jvhoffbauer commented Aug 31, 2023 • edited Loading

gante commented Aug 31, 2023

jvhoffbauer commented Sep 5, 2023

jvhoffbauer commented Sep 23, 2023

oobabooga commented Sep 24, 2023

LysandreJik commented Sep 25, 2023

jvhoffbauer commented Sep 25, 2023 • edited Loading

jvhoffbauer commented Nov 17, 2023

Saibo-creator commented Nov 17, 2023 • edited Loading

shermansiu commented Nov 21, 2023

shermansiu commented Nov 21, 2023

Saibo-creator commented Nov 24, 2023

shawnz commented Feb 9, 2024 • edited Loading

AlbertMarashi commented Apr 15, 2024

Saibo-creator commented Apr 18, 2024

jvhoffbauer commented Aug 26, 2023 •

edited

Loading

jvhoffbauer commented Aug 31, 2023 •

edited

Loading

jvhoffbauer commented Sep 25, 2023 •

edited

Loading

Saibo-creator commented Nov 17, 2023 •

edited

Loading

shawnz commented Feb 9, 2024 •

edited

Loading