Clarify string escaping #160

ndmitchell · 2021-02-05T12:53:15Z

The spec defines string escapes like \a as \x07 - but doesn't say if \x07 is a legitimate escape. The octal escapes list \119 as \t9, but don't say what rules are used to determined it's a single character escape. There is no notation for specifying Unicode values. Should we just copy the spec from https://python-reference.readthedocs.io/en/latest/docs/str/escapes.html?

The text was updated successfully, but these errors were encountered:

alandonovan · 2021-02-05T14:45:42Z

See #112, which I think covers this issue. (I've been hoping to fix #112 today, though time keeps slipping away.) In short: text strings will permit \xXX escapes for values in the range 0-127, and byte strings will permit them for values in the range 0-255.

Unicode code points will be denoted as \uXXXX or \UXXXXXXXX, which may appear in text or byte strings. In a text string, the escape denotes the UTF-k encoding of the 16- or 32-bit Unicode code point. In a byte string, the escape denotes its UTF-8 encoding. Text and byte string literals may also contain unescaped non-ASCII code points, such as "Ω" or b"Ω" in the source file, which is assumed to be encoded as UTF-8. (Bazel has a bug in which its source files are currently assumed to be Latin1, so we may need to temporarily disallow non-ASCII in literals in Bazel, to avoid confusion.)

Summary: Our escape characters are not implemented as per the Starlark spec, and the Starlark spec isn't very complete. I've raised bazelbuild/starlark#160 to get more details in the spec, but the temptation is to just follow the Python character escaping spec. Reviewed By: bobyangyf Differential Revision: D26276328 fbshipit-source-id: c50a2a677707257a9b481a45a65343cd109cf715

brandjon · 2025-01-08T20:40:27Z

The current spec lists all valid escapes and says escapes not listed are an error. #112 should cover the need for unicode escapes. Closing.

brandjon closed this as completed Jan 8, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Clarify string escaping #160

Clarify string escaping #160

ndmitchell commented Feb 5, 2021

alandonovan commented Feb 5, 2021 •

edited

Loading

brandjon commented Jan 8, 2025

Clarify string escaping #160

Clarify string escaping #160

Comments

ndmitchell commented Feb 5, 2021

alandonovan commented Feb 5, 2021 • edited Loading

brandjon commented Jan 8, 2025

alandonovan commented Feb 5, 2021 •

edited

Loading