Add "modes" to compiletest, for running all tests with NLL enabled and comparing with master #48879

pnkfelix · 2018-03-09T13:52:31Z

The NLL team has been trying to find a good workflow for evaluating the discrepancies between AST borrowck and NLL-based borrowck.

Past workflows have developed and used things like -Z borrowck=compare and the https://github.com/pnkfelix/nll-probe tool, but here I'm going to try to avoid a deep dive into how those have failed to satisfy our needs.

Our goal here: We want a way to track the current state of NLL, including the cases where there are known deviations, in a manner that allows immediate analysis of that state to determine things like:

"how many test cases does NLL deviate from AST borrowck",
"what do the deviations look like", or
"does this deviation represent a bug that needs to be fixed? Or is it an improvement on the AST borrowck?"

Here's a new proposal for a (hopefully) relatively small change to compiletest that should yield an easier workflow to answer questions like those above, at least for the ui/ subset of our tests.

Context: The ui/ tests are set up so that each test consists of an input $file.rs, and a set of expected outputs in $file.stderr (compilation error messages) and $file.stdout (non erroneous compiler output; rarely used). (For more info, see this chapter in the rustc-guide.)
Add a "mode" argument to compiletest, encoded either as a command line parameter or as an environment variable, or both. (The first mode we'll support will be "nll", which tells compiletest to pass -Z nll in addition to any other flags when invoking rustc, at least for the ui/ tests).
When running under a particular mode, if there is a $file.$mode.stderr file, then this file will be used as the source of "acceptable output". If there is no such file, then compiletest will fall back to the regular filename $file.stderr.
One complication: UI tests also include "inline" comments of the form //~ ERROR that indicate what error is expected on which line (these are mildly redundant with the stderr files above). Because messages may differ in the mode M, but we don't want to edit the sources too much, compiletest should just ignore the //~ ERROR annotations when running in a particular mode. We will still see the errors that occur from the stderr output, it's just less convenient.
To ensure that we are tracking discrepancies somewhere, whenever there is a $file.$mode.stderr, some tool (probably compiletest, but maybe tidy?) will be responsible for checking that $file.rs somewhere contains a comment that explains the source of the discrepancy. This could be a specially formatted FIXME, if the new behavior seems worse than before:

// $mode FIXME(#123) -- summary of discrepancy caused by NLL bug with corresponding issue num

or a "YAYME" comment for when the new behavior is an improvement:

// $mode YAYME(#123) -- summary of a beneficial discrepancy

(presumably the ticket linked by YAYME would just be the tracking issue for NLL or whatever other mode is being tested).
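As a rough illustration of the check described above, here is a minimal sketch of how a tool might look for the `// $mode FIXME(#N)` or `// $mode YAYME(#N)` comment in a test source. The names `NoteKind` and `find_discrepancy_note` are hypothetical and are not part of compiletest or tidy; the exact comment grammar is also an assumption based on the two examples above.

```rust
/// Kind of discrepancy note found in a test source (hypothetical type).
#[derive(Debug, PartialEq, Eq)]
pub enum NoteKind {
    /// The mode-specific output is worse than the baseline (a bug).
    Fixme,
    /// The mode-specific output is an improvement.
    Yayme,
}

/// Scans `source` for a line comment of the form
/// `// <mode> FIXME(#N) ...` or `// <mode> YAYME(#N) ...`
/// and returns the kind and issue number of the first match.
pub fn find_discrepancy_note(source: &str, mode: &str) -> Option<(NoteKind, u32)> {
    for line in source.lines() {
        // Only look at the text after the first `//` on the line.
        let comment = match line.find("//") {
            Some(idx) => line[idx + 2..].trim_start(),
            None => continue,
        };
        // The comment must start with the mode name followed by whitespace.
        let rest = match comment.strip_prefix(mode) {
            Some(rest) if rest.starts_with(char::is_whitespace) => rest.trim_start(),
            _ => continue,
        };
        let (kind, rest) = if let Some(r) = rest.strip_prefix("FIXME(#") {
            (NoteKind::Fixme, r)
        } else if let Some(r) = rest.strip_prefix("YAYME(#") {
            (NoteKind::Yayme, r)
        } else {
            continue;
        };
        // Parse the digits up to the closing parenthesis.
        if let Some(end) = rest.find(')') {
            if let Ok(issue) = rest[..end].parse::<u32>() {
                return Some((kind, issue));
            }
        }
    }
    None
}
```

A checker could call this for every `$file.rs` that has a sibling `$file.$mode.stderr` and report an error when it returns `None`.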

Benefits of the proposed system:

To find the cases that have discrepancies for nll, one can use ls *.nll.stderr
To find what a given discrepancy looks like, one can use diff onetest.stderr onetest.nll.stderr
To see if a discrepancy is a bug or not, grep for FIXME or YAYME in the .rs files.
- This workflow is perhaps not ideal; @nikomatsakis has pointed out that it might be nicer if these comments somehow lived in the *.$mode.stderr files.

Open Questions (to be resolved by implementor)

What should compiletest do about occurrences of //~ ERROR in the source text? In particular, should it check that the error output still has those cases, even when running under a given mode?
- The current inclination of @pnkfelix and @nikomatsakis is that it is actually okay for compiletest to ignore the //~ ERROR annotations when running under a given mode. The reasoning here is this: the //~ ERROR annotations will already get checked by compiletest runs that don't have a mode. We probably don't want to force an error when there's a discrepancy when running under a given mode; any discrepancies, including any of those errors disappearing, should be accounted for in the linked // $mode FIXME/YAYME issue, and we want to allow them to disappear or differ.

nikomatsakis · 2018-03-09T14:17:48Z

I'm marking this as belonging to both @rust-lang/wg-compiler-nll and @rust-lang/wg-traits -- this is because we are both going to need this tool! In general, having a mode like this will be useful anytime that we plan to make major alterations to some core component of the system.

nikomatsakis · 2018-03-09T14:34:53Z

Mentoring instructions

I'm going to assume we will use a command line option to drive this new mode. Let's call it "compare-mode". Then we can run the tests with ./x.py test foo --test-args --compare-mode nll or whatever. The compiletest harness has a configuration that is loaded here:

rust/src/tools/compiletest/src/main.rs

Line 58 in fedce67

let config = parse_config(env::args().collect());

So we have to extend parse_config with some new option --compare-mode or whatever that takes a string.

rust/src/tools/compiletest/src/main.rs

Line 68 in fedce67

pub fn parse_config(args: Vec<String>) -> Config {
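To make the "takes a string" part concrete, here is a minimal, standard-library-only sketch of turning the option value into a typed mode. It is an assumption-laden illustration: `CompareMode`, `CompareMode::parse`, and `compare_mode_from_args` are hypothetical names, and the real parse_config may well use an option-parsing library rather than scanning the arguments by hand as done here.

```rust
/// Comparison modes understood by this sketch (hypothetical type).
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub enum CompareMode {
    Nll,
}

impl CompareMode {
    /// Maps the user-supplied string to a mode, rejecting unknown names.
    pub fn parse(name: &str) -> Result<CompareMode, String> {
        match name {
            "nll" => Ok(CompareMode::Nll),
            other => Err(format!("unknown compare mode: {}", other)),
        }
    }

    /// Suffix used in file names such as `foo.nll.stderr`.
    pub fn as_str(self) -> &'static str {
        match self {
            CompareMode::Nll => "nll",
        }
    }
}

/// Looks for `--compare-mode <name>` or `--compare-mode=<name>` in `args`.
/// Returns `Ok(None)` when the option is absent.
pub fn compare_mode_from_args(args: &[String]) -> Result<Option<CompareMode>, String> {
    let mut iter = args.iter();
    while let Some(arg) = iter.next() {
        if arg.as_str() == "--compare-mode" {
            // Space-separated form: the value is the next argument.
            let value = iter
                .next()
                .ok_or_else(|| "--compare-mode requires a value".to_string())?;
            return CompareMode::parse(value).map(Some);
        }
        if let Some(value) = arg.strip_prefix("--compare-mode=") {
            // `=`-separated form.
            return CompareMode::parse(value).map(Some);
        }
    }
    Ok(None)
}
```

Rejecting unknown names up front means a typo in the mode name fails loudly instead of silently running the tests with no mode at all.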

That comparison mode will be translated to some arguments to rustc. Those arguments are arranged in this function:

rust/src/tools/compiletest/src/runtest.rs

Line 1584 in fedce67

fn make_compile_args(&self, input_file: &Path, output_file: TargetLocation) -> Command {

Probably it's best we allow a mapping between the comparison mode and the arguments (that mapping can just be hardcoded into this function though). For example, NLL at least currently really wants three flags added:

match self.config.comparison_mode {
    Some(ComparisonMode::NLL) => {
        rustc.args(&["-Znll", "-Zborrowck=mir", "-Ztwo-phase-borrows"]);
    }
    None => { }
}

This is where the code currently finds the expected files. We'll want to alter these functions to take the comparison mode into account. That's probably a new option to be added, along with revision.

rust/src/tools/compiletest/src/runtest.rs

Lines 2504 to 2508 in fedce67

let expected_stderr_path = self.expected_output_path(UI_STDERR);
let expected_stderr = self.load_expected_output(&expected_stderr_path);
let expected_stdout_path = self.expected_output_path(UI_STDOUT);
let expected_stdout = self.load_expected_output(&expected_stdout_path);

Right now, if the revision is Some, then we look for $test.$rev.stderr. I think we want to look for $test.$rev.$mode.stderr in that case, so that we can always do ls *.nll.stderr to get all things with a given mode. (Or maybe we should make it .mode_$mode.stderr or something more distinctive?)

Anyway, we'll have to alter this to search for the expected output with the given mode and -- if not found -- to then try with a mode of None.
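The lookup-with-fallback described here can be sketched as follows. This is only an illustration under stated assumptions: `expected_output_path` in this sketch is a free function with a made-up signature (not the method quoted above), `select_expected_output` is a hypothetical helper, and file existence is passed in as a closure so the logic can be exercised without touching the file system.

```rust
use std::path::{Path, PathBuf};

/// Builds `<test>[.<revision>][.<mode>].<kind>` next to the test file,
/// e.g. `foo.rs` + mode `nll` + kind `stderr` gives `foo.nll.stderr`.
pub fn expected_output_path(
    test_file: &Path,
    revision: Option<&str>,
    mode: Option<&str>,
    kind: &str,
) -> PathBuf {
    let mut parts: Vec<&str> = Vec::new();
    if let Some(rev) = revision {
        parts.push(rev);
    }
    if let Some(mode) = mode {
        parts.push(mode);
    }
    parts.push(kind);
    // `with_extension` replaces the `.rs` extension with the joined suffix.
    test_file.with_extension(parts.join("."))
}

/// Prefers the mode-specific expected output when `exists` says it is
/// present, and otherwise falls back to the path with a mode of `None`.
pub fn select_expected_output(
    test_file: &Path,
    revision: Option<&str>,
    mode: Option<&str>,
    kind: &str,
    exists: impl Fn(&Path) -> bool,
) -> PathBuf {
    if mode.is_some() {
        let with_mode = expected_output_path(test_file, revision, mode, kind);
        if exists(&with_mode) {
            return with_mode;
        }
    }
    expected_output_path(test_file, revision, None, kind)
}
```

In real use the closure would be something like `|p| p.exists()`. Putting the mode after the revision keeps the `*.nll.stderr` glob working for revisioned tests too.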

Finally, to skip checking for //~ ERROR comments, if a comparison mode is set, we just don't do any of this stuff:

rust/src/tools/compiletest/src/runtest.rs

Lines 2548 to 2554 in fedce67

if !expected_errors.is_empty() || !proc_res.status.success() {
    // "// error-pattern" comments
    self.check_expected_errors(expected_errors, &proc_res);
} else if !self.props.error_patterns.is_empty() || !proc_res.status.success() {
    // "//~ERROR comments"
    self.check_error_patterns(&proc_res.stderr, &proc_res);
}

Uuuuh at that point we are basically done with the compiletest changes, right?

Then the last step is to alter tidy to enforce the FIXME convention, but I don't know much about that. We'll figure that out when we get there.

kennytm · 2018-03-09T14:40:24Z

In compiletest there is already the concept of "revisions" which y'all are already using in compile-fail tests. I think it's better we extend this to support UI tests than layering an NLL-specific "modes" on top of it.

Edit: Disregard, that was #48878.

pnkfelix · 2018-03-09T14:44:05Z

@kennytm this is a different interface from revisions. Revisions are encoded with flags on a test-by-test basis. A mode is given more privileged status within compiletest. We explicitly do not want to add a revision for every test to represent running with and without NLL.

(There's some chance that code might be shared within compiletest between the support for revisions and that for modes, but in terms of UI, the interface is different here.)

nikomatsakis · 2018-03-09T14:51:51Z

Also, @kennytm, I opened this issue regarding extending revisions to UI tests (#48878). That said, I think they already work, and we're just not using them. I've got to do some local tests.

nikomatsakis · 2018-03-09T14:52:43Z

I think though we will eventually want to specify the intersection of the "comparison mode" with a revision. There are a lot of tests that are already encoded with lexical and nll modes, for example -- in those cases, if we have a revision named nll and we are in comparison mode nll, it'd be sort of nice to just ignore the test, or do something smart. But I guess we can leave that for later.

memoryleak47 · 2018-03-22T16:16:28Z

I'd like to try this one. :)

nikomatsakis · 2018-03-22T16:28:55Z

@memoryleak47 that would be super duper. @pnkfelix and I were just saying that this remains pretty high priority. Let me know if I can help in any way. (Of course there are already mentoring instructions.)

@nikomatsakis

… r=pnkfelix Add compiletest `--compare-mode nll` option

Before implementing the tidy stuff, I'd appreciate it if someone reviews the changes so far. This is my first non-trivial pull request, so I could really use some feedback. :)

closes #48879. r? @nikomatsakis

pnkfelix · 2018-04-09T14:49:52Z

I'm going to actually reopen this issue as a way to track when we've gotten this compare-mode into a state where we actually have files checked in that we are referencing against.

(I just discovered that tidy complains about an augmented-assignments.nll.stderr file that I had lying around, so we clearly cannot yet reach that point without some more work.)

It might be good to even keep this issue open until we have bors set up to be running compare-mode=nll against the ui tests on some dedicated target like linux.

pnkfelix · 2018-04-10T14:19:49Z

Tidy fix is #49844

…tsakis Blindly checkpoint status of NLL mode ui tests

This takes the next (and potentially final?) step with #48879. Namely, this PR got things to the point where I can successfully run `compiletest` on `src/test/ui` with `--compare-mode=nll`. Here are the main pieces of it:

1. To figure out how to even run `compiletest` normally on the ui directory, I ran `x.py test -vv`, and then looked for the `compiletest` invocation that mentioned `src/test/ui`.
2. I took the aforementioned `compiletest` invocation and used it, adding `--compare-mode=nll` to the end. It had 170 failing cases.
3. Due to #49855, I had to edit some of the tests so that they fail even under NLL, via `#[rustc_error]`. That's the first commit. (Then goto 2 to double-check no such tests remain.)
4. I took the generated `build/target/test/foo.stderr` file for every case that failed, and blindly copied it to `src/test/foo.nll.stderr`. That's the second commit.
5. Goto 2 until there were no failing cases.
6. Remove any stamp files, and re-run `x.py test` to make sure that the edits and new `.nll.stderr` files haven't broken the pre-existing test suite.

pnkfelix · 2018-04-12T10:40:40Z

Once #49900 lands, this issue will be almost completely resolved; I think the only task I could imagine beyond that would be to expand #49900 (and also compare-mode?) to work on all the tests, not just the ones in src/test/ui.

…tsakis Add src/test/ui regression testing for NLL

This PR changes `x.py test` so that when you are running the `ui` test suite, it will also always run `compiletest` in the new `--compare-mode=nll`, which just double-checks that when running under the experimental NLL mode, the output matches the `<source-name>.nll.stderr` file, if present.

In order to reduce the chance of a developer revolt in response to this change, this PR also includes some changes to make the `--compare-mode=nll` more user-friendly:

1. It now generates nll-specific .stamp files, and uses them (so that repeated runs can reuse previously cached results).
2. Each line of terminal output distinguishes whether we are running under `--compare-mode=nll` by printing with the prefix `[ui (nll)]` instead of just the prefix `[ui]`.

Subtask of #48879

nikomatsakis · 2018-05-29T13:12:22Z

I think we can call this done for now until we have a concrete extension in mind.

pnkfelix · 2018-05-29T13:16:02Z

namely @nikomatsakis and @pnkfelix agreed that any remaining work here is best spent on resolving #44844 (plus porting the run-pass/ tests to ui/, which should probably be filed as another issue).

nikomatsakis · 2018-05-29T13:16:17Z

In particular closed in favor of #44844

nikomatsakis changed the title ~~Add rustc NLL mode to compiletest, for driving comparisons between borrowck modes~~ Add "modes" to compiletest, for running all tests with NLL enabled and comparing with master Mar 9, 2018

pnkfelix added the E-mentor Call for participation: This issue has a mentor. Use #t-compiler/help on Zulip for discussion. label Mar 9, 2018

cuviper added the C-enhancement Category: An issue proposing an enhancement or a PR with one. label Mar 10, 2018

nikomatsakis added this to the NLL: diagnostic parity milestone Mar 14, 2018

nikomatsakis added the NLL-diagnostics Working towards the "diagnostic parity" goal label Mar 14, 2018

memoryleak47 mentioned this issue Mar 23, 2018

Add compiletest --compare-mode nll option #49293

Merged

bors closed this as completed in #49293 Apr 6, 2018

pnkfelix reopened this Apr 9, 2018

pnkfelix mentioned this issue Apr 12, 2018

Add src/test/ui regression testing for NLL #49900

Merged

nikomatsakis closed this as completed May 29, 2018

pnkfelix mentioned this issue Aug 2, 2018

Need compare-mode=2018 (or similar) to run test suite under multiple editions #52979

Closed

jieyouxu added A-compiletest Area: The compiletest test runner A-compiletest-compare-modes Area: compiletest compare-modes labels Feb 3, 2025

jieyouxu mentioned this issue Feb 3, 2025

compiletest: compare-mode false negatives w.r.t. long type file path #136510

Closed
