llama: preserve field order in user-defined JSON schemas #8002

bmizerany · 2024-12-09T01:14:51Z

llama: preserve field order in user-defined JSON schemas

Previously we decoded and re-encoded JSON schemas during validation,
which served no purpose since json.RawMessage already validates JSON
syntax. Worse, the re-encoding lost field ordering from the original
schema, which affects inference quality during step-by-step reasoning.

While fixing this ordering issue by using json.RawMessage directly,
testing revealed that schema_to_grammar (from llama.cpp) also fails to
preserve field order during grammar generation. This appears to be the
root cause of inference degradation.

This change prevents us from mangling the user's original schema order,
but we still need to address the ordering issue in schema_to_grammar.
That will be a separate change.

Updates #7978

iscy · 2024-12-09T15:15:07Z

@bmizerany, llama.cpp's json-schema-to-grammar already ensures that the order is being preserved by using nlohmann::ordered_json. These PRs were opened in the past and kept the original key ordering while still relying on that facility:
#7588
#6658

bmizerany · 2024-12-11T21:06:05Z

Skipping schema_to_grammer tests in prep for @ParthSareen's work on patching. Ready to merge.

bmizerany · 2024-12-11T21:07:43Z

@iscy Thank you for pointing that out. This gets us closer. More updates incoming for a complete fix.

Previously we decoded and re-encoded JSON schemas during validation, which served no purpose since json.RawMessage already validates JSON syntax. Worse, the re-encoding lost field ordering from the original schema, which affects inference quality during step-by-step reasoning. While fixing this ordering issue by using json.RawMessage directly, testing revealed that schema_to_grammar (from llama.cpp) also fails to preserve field order during grammar generation. This appears to be the root cause of inference degradation. This change prevents us from mangling the user's original schema order, but we still need to address the ordering issue in schema_to_grammar. That will be a separate change. Updates #7978

llama/grammar_test.go

llm/server.go

The change in #8002 introduced a regression where the server rejected request with the format was set and empty. This was a regression from the previous version of the code it was solving other problems for. Then, in #8127, the server was updated to allow the empty format, but also reintroduced the regression where the server would silently fail when the format was set, but invalid. This commit fixes both regressions. The server does not reject the empty format, but it does reject invalid formats. It also adds tests to help us catch regressions in the future. Also, the updated code provides a more detailed error message when a client sends a non-empty, but invalid format, echoing the invalid format in the response.

Changes in #8002 introduced fixes for bugs with mangling JSON Schemas. It also fixed a bug where the server would silently fail when clients requested invalid formats. It also, unfortunately, introduced a bug where the server would reject requests with an empty format, which should be allowed. The change in #8127 updated the code to allow the empty format, but also reintroduced the regression where the server would silently fail when the format was set, but invalid. This commit fixes both regressions. The server does not reject the empty format, but it does reject invalid formats. It also adds tests to help us catch regressions in the future. Also, the updated code provides a more detailed error message when a client sends a non-empty, but invalid format, echoing the invalid format in the response.

Changes in #8002 introduced fixes for bugs with mangling JSON Schemas. It also fixed a bug where the server would silently fail when clients requested invalid formats. It also, unfortunately, introduced a bug where the server would reject requests with an empty format, which should be allowed. The change in #8127 updated the code to allow the empty format, but also reintroduced the regression where the server would silently fail when the format was set, but invalid. This commit fixes both regressions. The server does not reject the empty format, but it does reject invalid formats. It also adds tests to help us catch regressions in the future. Also, the updated code provides a more detailed error message when a client sends a non-empty, but invalid format, echoing the invalid format in the response. This commits also takes the opportunity to remove superfluous linter checks.

bmizerany force-pushed the bmizerany/issue7978 branch 3 times, most recently from 0d216f7 to 4b4fe47 Compare December 9, 2024 01:25

bmizerany force-pushed the bmizerany/issue7978 branch from 4b4fe47 to d3ca863 Compare December 11, 2024 21:05

bmizerany changed the title ~~DO NOT MERGE: llama: preserve field order in user-defined JSON schemas~~ llama: preserve field order in user-defined JSON schemas Dec 11, 2024

bmizerany force-pushed the bmizerany/issue7978 branch from d3ca863 to 17baf5f Compare December 11, 2024 21:08

ParthSareen approved these changes Dec 11, 2024

View reviewed changes

llama/grammar_test.go Show resolved Hide resolved

llama/grammar_test.go Show resolved Hide resolved

llm/server.go Show resolved Hide resolved

bmizerany merged commit 9039c82 into main Dec 11, 2024
16 checks passed

bmizerany deleted the bmizerany/issue7978 branch December 11, 2024 22:07

iscy mentioned this pull request Dec 12, 2024

llama: parse JSON schema using nlohmann::ordered_json #8071

Merged

bmizerany mentioned this pull request Dec 17, 2024

llm: do not silently fail for supplied, but invalid formats #8130

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

llama: preserve field order in user-defined JSON schemas #8002

llama: preserve field order in user-defined JSON schemas #8002

bmizerany commented Dec 9, 2024 •

edited

Loading

iscy commented Dec 9, 2024

bmizerany commented Dec 11, 2024 •

edited

Loading

bmizerany commented Dec 11, 2024

llama: preserve field order in user-defined JSON schemas #8002

llama: preserve field order in user-defined JSON schemas #8002

Conversation

bmizerany commented Dec 9, 2024 • edited Loading

iscy commented Dec 9, 2024

bmizerany commented Dec 11, 2024 • edited Loading

bmizerany commented Dec 11, 2024

bmizerany commented Dec 9, 2024 •

edited

Loading

bmizerany commented Dec 11, 2024 •

edited

Loading