Support line filter / diff input #1931

xmo-odoo · 2023-01-17T10:29:01Z

This is probably a use case which is more relevant for large (and old) codebases, when trying to "ratchet" rules progressively: in CI, it's useful to only check / fail on failures of the code which has been modified, as it allows progressively improving code-state without needing massive one-shot rewrites which can make history traversal and blames a lot more complicated.

This can be done by post-filtering finds emitted by ruff, but it would be useful as a pre / builtin feature, as it would avoid having to provide both input (--include/--exclude) and output (lines) filters.

The text was updated successfully, but these errors were encountered:

not-my-profile · 2023-01-17T10:41:22Z

I think it would make more sense to have a dedicated tool for that, which supports one of the output formats of ruff so then you could just pipe the output of ruff to that filter tool e.g.

ruff . --format=junit | junit_filter_git

I see no reason why this should be implemented in ruff directly.

xmo-odoo · 2023-01-17T10:57:09Z

I see no reason why this should be implemented in ruff directly.

This objection is literally covered by the second paragraph.

not-my-profile · 2023-01-17T11:24:09Z

Ruff is fast enough that you could just run it on the whole codebase and have the filter program filter out violations for all other files. If you care about wasted CPU cycles you could just invoke ruff like:

ruff  --format=junit -- ${{needs.changedfiles.outputs.all}} | junit_filter_git

where ${{needs.changedfiles.outputs.all}} would be some variable listing all the changed files, see for example this explanation for GitHub.

Something like this could very well be provided by a GitHub action so you wouldn't even have to configure that yourself. So yes I still see no reason why this should be implemented in ruff directly.

charliermarsh · 2023-01-17T12:29:54Z

Thanks for filing :) I see the value in what you're describing, and -- since it's come up here -- in general, I'm open to implementing things directly in Ruff that could be accomplished by chaining together other tools. Not always, since everything we implement comes with some upfront and ongoing cost, but on a case-by-case basis. (One way I think about this: Black has Jupyter support built-in, even though the same behavior can be accomplished with nbQA, and that's useful.)

I think in this case, I'm unlikely to prioritize it if there's a reasonable workaround so I'd love to better understand the use-case. Are you imaging that this would only run over changed files? Or over changed lines? How do you think Ruff should determine the changed files?

xmo-odoo · 2023-01-17T12:44:34Z

Not always, since everything we implement comes with some upfront and ongoing cost, but on a case-by-case basis. [...] I think in this case, I'm unlikely to prioritize it if there's a reasonable workaround

Certainly fair enough.

so I'd love to better understand the use-case. Are you imaging that this would only run over changed files? Or over changed lines?

I'd assume ruff would need to run over the entire file in order to correctly analyze the contents. Especially as the modified lines might not even provide a valid AST subtree. This'd be more of a "one-stop" shop to filter (include/exclude) and output (lines filter).

How do you think Ruff should determine the changed files?

I know that some tools can take a unified diff as input and extract the filtering from that, but I don't know if there's a ready-made crate for this, so as described above an alternative could be to extend the include/exclude arguments to allow file subranges.

charliermarsh · 2023-01-17T12:45:56Z

I'd assume ruff would need to run over the entire file in order to correctly analyze the contents. Especially as the modified lines might not even provide a valid AST subtree. This'd be more of a "one-stop" shop to filter (include/exclude) and output (lines filter).

Ah sorry, what I meant to ask was: would you expect to see errors reported only from changed files (more errors), or from changed files (stricter)?

xmo-odoo · 2023-01-17T13:40:59Z

Ah sorry, what I meant to ask was: would you expect to see errors reported only from changed files (more errors), or from changed files (stricter)?

Assuming the second occurrence of files should be line, then that.

And yes I do realise that a change in one line can cause an error in an other (which was not touched), but I think getting this perfect (or as close to as possible) would be complicated as it would be necessary to start from a snapshot of the full output, take a new full output (or at least file-wise in both cases since I assume ruff currently mostly operate on a per-file basis much as flake does), then use the diff's information to adjust all the messages in order to match pre- and post- content to determine whether an error is truly novel or is just a pre-existing error moved a few lines because code was inserted somewhere above it.

charliermarsh · 2023-01-17T14:22:55Z

Uhh, yes, second occurrence should be line! Sorry about that.

Ok noted, let's keep this open! (There's also #1149 which accomplishes the same thing IIUC.)

xmo-odoo · 2023-01-17T14:30:05Z

Uhh, yes, second occurrence should be line! Sorry about that.

No worries.

Ok noted, let's keep this open! (There's also #1149 which accomplishes the same thing IIUC.)

Oh I missed it sorry, didn't know about the term "baseline" (and almost certainly made my search using keywords like "diff", "ci", "git", ...).

If that's an option / consideration then it would be a lot better than this here proposal (which is a lot more simplistic).

charliermarsh · 2023-01-17T15:40:04Z

Oh I missed it sorry, didn't know about the term "baseline" (and almost certainly made my search using keywords like "diff", "ci", "git", ...).

Me neither, before that Issue :)

If that's an option / consideration then it would be a lot better than this here proposal (which is a lot more simplistic).

It is! I'll close this for now in favor of that issue then.

charliermarsh added question Asking for support or clarification core Related to core functionality labels Jan 17, 2023

charliermarsh closed this as completed Jan 17, 2023

fschulze mentioned this issue Jan 26, 2023

[question] only report lines from diff #2189

Closed

charliermarsh mentioned this issue Feb 2, 2023

[Question] is it possible to run ruff only on edited lines of a branch/commit? #2472

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support line filter / diff input #1931

Support line filter / diff input #1931

xmo-odoo commented Jan 17, 2023

not-my-profile commented Jan 17, 2023

xmo-odoo commented Jan 17, 2023

not-my-profile commented Jan 17, 2023 •

edited

Loading

charliermarsh commented Jan 17, 2023

xmo-odoo commented Jan 17, 2023

charliermarsh commented Jan 17, 2023

xmo-odoo commented Jan 17, 2023

charliermarsh commented Jan 17, 2023

xmo-odoo commented Jan 17, 2023

charliermarsh commented Jan 17, 2023

Support line filter / diff input #1931

Support line filter / diff input #1931

Comments

xmo-odoo commented Jan 17, 2023

not-my-profile commented Jan 17, 2023

xmo-odoo commented Jan 17, 2023

not-my-profile commented Jan 17, 2023 • edited Loading

charliermarsh commented Jan 17, 2023

xmo-odoo commented Jan 17, 2023

charliermarsh commented Jan 17, 2023

xmo-odoo commented Jan 17, 2023

charliermarsh commented Jan 17, 2023

xmo-odoo commented Jan 17, 2023

charliermarsh commented Jan 17, 2023

not-my-profile commented Jan 17, 2023 •

edited

Loading