Improve escaping in unittest failure message #320

sfreilich · 2021-09-29T23:00:50Z

daf5137 fixed a bug where the failure message would not be printed
correctly if it included values subject to shell variable expansion
(e.g. "$FOO") by quoting the limit string for the heredoc with the message
("cat <<'EOF'"). However, it still left an issue if that limit string
appeared on its own line in a message, which would happen if a compared
value included "\nEOF\n". (Plausible if the function under test
generated shell script code.) This could be worked around by choosing
a more unusual string, but that remains an odd implementation detail.
Instead, the shell implementation now takes the same path as the Windows
one, splitting the output into lines and echoing each line, and
additionally backslash-escapes each character.
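
As a rough illustration of the line-by-line approach described above, here is a minimal Python sketch (Starlark is a Python dialect, so the logic carries over). The function names and the script layout (`#!/bin/sh`, `echo`, `exit 1`) are assumptions made for this example, not the actual contents of lib/unittest.bzl.

```python
def escape_line_for_sh(line):
    """Prefix every character with a backslash so sh takes it literally."""
    return "".join("\\" + ch for ch in line)


def render_sh_failure_script(failures):
    """Build a hypothetical failing test script that echoes each message line."""
    # Join first, then split, so newlines embedded in a single failure
    # message also start a new echo command.
    lines = "\n".join(failures).split("\n")
    echo_lines = ["echo " + escape_line_for_sh(line) for line in lines]
    return "#!/bin/sh\n" + "\n".join(echo_lines) + "\nexit 1\n"


if __name__ == "__main__":
    # "$FOO" and a bare "EOF" line are the two cases a heredoc handles badly.
    print(render_sh_failure_script(["expected $FOO", "got:\nEOF"]))
```

Because no heredoc is involved, there is no limit string for the message to collide with, and the backslash before `$` keeps the shell from expanding it.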

The previous code also did not split the error output into lines
correctly when individual failure messages contained newlines. That
could happen when comparing values where at least one was a string
containing a newline, since the code that generates the error message
formats values with "%s" (str). This matters if those lines are joined
with something other than "\n", which was the case in the Windows
implementation (and is now the case in both).
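
To make the splitting problem concrete, the following Python sketch compares joining the raw failure messages with joining the individual lines. The joiner `"\necho "` and both function names are hypothetical, chosen only to illustrate the point.

```python
JOIN_ON = "\necho "  # hypothetical joiner: each message line gets its own echo


def join_without_split(failures):
    """Treats each failure as one line, which is wrong if it contains newlines."""
    return "echo " + JOIN_ON.join(failures)


def join_with_split(failures):
    """Splits on newlines first, so every output line starts with echo."""
    lines = "\n".join(failures).split("\n")
    return "echo " + JOIN_ON.join(lines)


if __name__ == "__main__":
    failures = ["expected first\nsecond", "another failure"]
    print(join_without_split(failures))
    print("---")
    print(join_with_split(failures))
```

In the first form, the text after the embedded newline ends up on a script line of its own with no `echo` in front of it, so the shell would try to run it as a command.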

The Windows implementation also did not escape "%" as "%%", so error
messages containing "%" were subject to variable expansion.
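
The Windows-side escaping amounts to doubling percent signs. A minimal Python sketch, with a hypothetical function name:

```python
def escape_line_for_cmd(line):
    """Double each percent sign so a batch file does not expand it as a variable."""
    return line.replace("%", "%%")


if __name__ == "__main__":
    print(escape_line_for_cmd("expected %PATH% to be 100% set"))
```

Only `%` is handled here, matching the description above; other characters that are special to cmd.exe are left untouched in this sketch.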

Currently, the failure message does not output correctly if it contains sequences subject to shell variable expansion (e.g. `$1`) or the limit string used to denote the end of the message (in this case `\nEOF\n`). Those are quite plausible edge cases, since analysistest targets might assert about the command line of actions generated with `ctx.actions.run_shell` or the contents of shell files generated with `ctx.actions.write`.

This uses the approach of echoing by line and escaping characters as necessary for both versions. In the sh case, we can backslash-escape every character. For Windows, `%` is escaped as `%%`. (Other special characters don't seem to cause trouble, though I'm probably missing relevant cases.)

The heredoc style `cat<<EOF\n[...]\nEOF\n` can have shell-escaping disabled by quoting the limit string (`cat<<'EOF'`) or preceding it with a backslash (`cat<<\EOF`). But that still relies on choosing a limit string that doesn't occur in the contents, and there's no way (whether or not shell expansion is done) of escaping the limit string if it does. While one could choose a more obscure limit string, that would add another hidden implementation detail. Alternatively, this could add logic to choose a limit string that doesn't occur in the message, but that seems unnecessarily complex.

Another approach would be to write the message to a file, then read it with cat/type. However, that requires either splitting the change between Starlark and Java code with separate release cycles (awkward), or doing something to insert the message file in the test's runfiles at the last minute (more complicated).

Adds some new test cases which fail without this change. Removes an unused return value to make the usage of this function a bit clearer.

See bazelbuild/bazel-skylib#320 for the corresponding change for unittest.

RELNOTES: None.
PiperOrigin-RevId: 400309738

tetromino

Thanks for the fix! LGTM modulo lack of docs.

lib/unittest.bzl

tetromino

Thanks! I've regenerated the markdown docs with the new docstring.

alexeagle · 2021-10-09T16:16:07Z

Hey @tetromino I've noticed the same thing, that you have to notice when a PR changes something which causes docs to be out-of-date. I bet there's already skew between docs and the docstrings.

I think we should just have a diff_test that always asserts docs are up-to-date, and prints the command to run to update them, so contributors naturally keep the two in sync.

I'll send a PR

alexeagle · 2021-10-09T16:21:37Z

Oh, sorry I forgot we already discussed that PR in #297

sfreilich requested review from brandjon and tetromino as code owners September 29, 2021 23:00

google-cla bot added the cla: yes label Sep 29, 2021

tetromino reviewed Oct 4, 2021

sfreilich and others added 2 commits October 4, 2021 11:45

Add doc for unittest_toolchain attrs

e6f188c

Update generated docs

929c20d

tetromino approved these changes Oct 4, 2021

tetromino merged commit 506c172 into bazelbuild:main Oct 4, 2021

alexeagle added a commit to alexeagle/bazel-skylib that referenced this pull request Oct 9, 2021

Add diff_test asserting that docs are up-to-date

cd1dfb9

it prints a convenient 'bazel run' command to update them, replacing the shell script Follow-up to bazelbuild#320 (comment)

alexeagle added a commit to alexeagle/bazel-skylib that referenced this pull request Oct 9, 2021

Add diff_test asserting that docs are up-to-date

047e374

it prints a convenient 'bazel run' command to update them, replacing the shell script Follow-up to bazelbuild#320 (comment)
