New: Processor improvements #3

nzakas · 2018-11-20T17:34:30Z

Summary

This proposal provides a way to explicitly define which processor(s) to use for different files inside of configuration. It also allows the chaining of multiple processors to fully process a file.

Related Issues

Better support for multiple processors eslint#11035

platinumazure

One typo, and raised a few questions.

designs/2018-processors-improvements/README.md

mysticatea · 2018-11-20T19:03:18Z

Thank you for this proposal :)

Sounds nice to me.
I think that plugins for other kinds than JavaScript may not work without processors. For example, eslint-plugin-vue uses the postprocess API to provide -like directive comment functionality. If the processor is missing, the users will see cryptic error messages. But plugins can contain configuration presets, so it will be no problem even if the end-users need additional setting.

But I have a concern.

how do users configure processors to extract a specific kind of blocks?

Some kind of files can contain various kinds of code blocks. such as markdown.
eslint-plugin-markdown wants to extract only JS blocks. But in the original issue, users want to configure to extract other blocks than JS. This proposal doesn't look to include the way to configure it.

For example:

// .eslintrc.js
module.exports = {
    overrides: [
        {
            files: "*.md",
            processors: ["markdown/markdown"], // want markdown processor to extract JS blocks
        },
        {
            files: "*.md",
            processors: ["markdown/markdown"], // want markdown processor to extract TS blocks
            parser: "typescript-eslint-parser", 
        },
        {
            files: "*.md",
            processors: ["markdown/markdown", "html/html"], // want markdown processor to extract HTML blocks
        },
        {
            files: "*.md",
            processors: ["markdown/markdown", "vue/vue"], // want markdown processor to extract Vue blocks
            parser: "vue-eslint-parser", 
        },
    ]
}

May the eslint-plugin-markdown author has to define processors by the number of block kinds?

mysticatea · 2018-11-20T19:09:05Z

After I re-looked my comment, I'm not sure that setting works well because those processors conflict. But I think, that is the issue we want to solve.

mysticatea · 2018-11-20T20:09:28Z

Ah, mmm, #1 also cannot solve that problem properly, because #1 will implement recursive calls in Linter#verify, but resolving overrides setting does in CLIEngine. I'm thinking...

ilyavolodin

Proposal sounds good in general, but I would like an additional paragraph explaining how the new configuration with overrides will affect having to pass -ext flag to the command line. Based on the context, it looks like the status quo remains. We've had number of requests to change that behavior, but due to the design limitation of file traversal - we can't.

nzakas · 2018-11-21T19:58:38Z

@mysticatea

how do users configure processors to extract a specific kind of blocks?

It's not clear from the proposal, but this intentionally does not solve that problem because I don't believe that's what eslint/eslint#11035 is asking for or needs. (I will update the proposal to make it this clear.)

So far, the only plausible use case would be Markdown with different embedded code fragments inside, maybe some that are JS and some that are TypeScript, etc. I think that this is a corner case to the larger issue of making processors end-user configurable. I would suggest we don't focus on solving that case right now.

That said, I don't believe that this proposal prevents a solution like #1 from being implemented at some point in the future. As you stated, the processing would have to move to outside of Linter into CLIEngine so that patterns could be match for the virtual filenames. If we find that we really do need this capability, then we can always go down that route later (and use overrides to match the virtual filenames to appropriate configs).

nzakas · 2018-11-21T19:59:14Z

@ilyavolodin you are correct, this does not change how --ext works. I'll add a note to make that clear.

nzakas · 2018-11-21T20:01:31Z

@mysticatea I'm sorry, I completely misunderstood our earlier discussion. You can pretty much ignore everything in my previous comment. Let me get in the latest updates and I'll revisit.

nzakas · 2018-11-21T20:10:17Z

Okay, after looking this over again, I realize I completely missed the point that we need to be able to derive virtual filenames for extracted code blocks in this proposal. My apologies for creating any confusion.

I'm going to update this proposal to include @mysticatea's approach of providing a virtual filename for extracted code blocks as I think that's clearly the best way to solve that part of the problem. I think I can add that feature into this proposal without any significant changes, and if @mysticatea likes the updated proposal, I'd suggest we work together on a single proposal moving forward.

nzakas · 2018-11-21T20:42:03Z

Okay, I've updated the proposal to take everyone's feedback into account. I added a bit about virtual filenames from @mysticatea's proposal and how the current implementation of processors would have to change. I think I now properly understand what we're going for, thanks for your patience and help.

mysticatea · 2018-11-21T21:21:22Z

@nzakas Merging is a good idea. I have been thinking #1 should use overrides[].processor instead of processor API's extensions.

designs/2018-processors-improvements/README.md

nzakas · 2018-11-22T17:57:14Z

I've incorporated the latest round of feedback including:

Change from processors to processor, as multiple processors will now be applied by matching the file extension of code blocks.
Clarified that Linter functionality will not be removed, only worked around in the CLI.
Fixed typos and clarified some points.

Outstanding issues:

It seems like we don't want to be able to specify a full file path for code blocks. The question is, what do we want in its place? Should we only allow processors to specify extensions? Should we allow processors to specify full filenames and disallow \ characters?

btmills

Unrelated to this specific RFC, I’m really liking the process. This is so much more efficient than iterating on an issue - nice work @nzakas!

designs/2018-processors-improvements/README.md

btmills · 2018-11-22T22:02:16Z

designs/2018-processors-improvements/README.md

+
+This processor returns two code blocks containing JavaScript code. Each code block is given a virtual filename ending with `.js`.
+
+When a `preprocess()` method returns an object with a `filePath` property, `CLIEngine` will call `getConfigForFile()` on the `filePath` property to determine the correct configuration for the code block, which includes whether another processor should be run on the code block (matched by file extension).


At least in the case of the Markdown processor, since it can’t know which extensions have processors configured, I understand it will want to return every code block it encounters? For example, if original.md contains js, html, and vue code blocks, it would return original.md.js, original.md.html, and original.md.vue. Do we then need to run those through the --ext filter as well? Perhaps I invoke with --ext .html, so I’d expect the html code block to be linted, but not the vue code block.

At initial, I had considered the same thing: we should filter extracted blocks with the --ext or glob patterns. However, I think currently, if a specific file specified (E.g., eslint docs/README.md), I'm not sure what the proper behavior is. I will be confused if ESLint changes the target code blocks by globs. So I want options for processors in config files.

Hmm, I don't think we want every processor to have to implement its own version of --ext. Could we encourage processors to return all code blocks it thinks can be linted with ESLint, and then filter the results in CLIEngine based on the value of --ext before applying glob matching?

I think that makes the most sense. The Markdown processor would return every fenced code block, and the HTML processor would return every <script> tag, and ESLint core would be responsible for deciding which of those get linted.

How that interacts with --ext I’m not sure. I suppose you could argue that if I configure an override for *.ts, then I shouldn’t also have to specify --ext .ts. When plugins provided their own extensions, the --ext flag was necessary, but flipping the relationship around so that the extensions are configured in overrides might make it redundant.

Is appending an extension to the existing filename the best way? I think so, but I wanted to point out that it’s trivial for the Markdown processor to pull the tag from the fenced code block, but an HTML processor would need to map from the <script>’s MIME type to the appropriate extension.

I don't oppose the way.

My small concern is, some users use glob patterns rather than --ext option to specify other files than JS (E.g., src/**/*.{js,ts,tsx}). But we can say that use --ext option to specify embedded codes other than JS.

Okay, it seems like we are in agreement that:

Processors can return all file types it thinks ESLint can handle.

ESLint will filter out any file types that aren't specified by --ext to avoid errors.

Users must use --ext even if they have overrides in their config; overrides is not a replacement for --ext (in the context of this proposal).

@btmills I think appending the file extension makes sense. My hunch is that most processors will only be returning .js files any way. As @mysticatea has pointed out a couple of times, we are really talking about an edge case with non-.js code blocks being returned from processors.

nzakas · 2018-11-23T18:33:33Z

I've incorporated the latest round of feedback including:

Change filePath to filename for code blocks.
filename in code blocks cannot contain slash characters
Clarified how we will treat code blocks without a filename specified
Fixed typos.

Outstanding issues:

How does this proposal interact with the --ext flag?
Where should the processing take place in order to expose this functionality to browsers?

mysticatea · 2018-11-24T04:43:35Z

How does this proposal handle autofix?
If some of the processors don't support autofix, broken fix properties will mix in the result. #1 mentioned it and removing the broken fix properties.

nzakas · 2018-11-26T17:22:35Z

@mysticatea oh, I guess I thought autofixing would "just work." :) Let me read back over that part of your proposal and see what I missed.

nzakas · 2018-11-26T17:28:23Z

Okay, I don't think I understand what's going on with processor autofixing. It's the processor that's responsible for applying fixes? What would change about that with this proposal?

(Sorry, processor autofixing happened when I was away so I just have no context for how that interacts with anything.)

mysticatea · 2018-12-04T05:01:19Z

@nzakas The processor is the pair of preprocess and postprocess, so it will be:

function verify(filename, code) {
    //....(resolve config)....
    let shouldAutofix = true
    const messagesList = preprocess(code, filename).map(item => {
        if (typeof item === "string") {
            return linter.verify(code, config, { ...options, filePath: filename })
        }

        // Recursive call.
        const retv = verify(item.filename, item.code)

        // Take the flag.
        if (!retv.shouldAutofix) {
            shouldAutofix = false
        }
        return retv.messages
    })

    return {
        shouldAutofix,
        messages: postprocess(messagesList, filename)
    }
}

As a reference, stopping creating fix:

function verify(filename, code) {
    //....(resolve config)....
    return postprocess(
        preprocess(code, filename).map(item => {
            if (typeof item === "string") {
                return linter.verify(code, config, {
                    ...options,
                    filePath: filename,
                    disableConstructFix: !supportsAutofix
                })
            }
            // Recursive call.
            return verify(item.filename, item.code)
        }),
        filename,
    )
}

Actually, the disableConstructFix flag is unnecessary if the overrides resolving logic exists in Linter because Linter knows when it should not construct the fix property.

nzakas · 2018-12-04T17:43:59Z

Ah ok, I think I see what you're getting at. I (reluctantly) agree that adding the flag to Linter#verify() to disable creating of fix makes sense.

So that verify() function in your example would end up in CLIEngine?

mysticatea · 2018-12-04T22:23:35Z

So that verify() function in your example would end up in CLIEngine?

Yes 😃

nzakas · 2018-12-05T22:40:04Z

Okay, I'll update the design within the next couple of days.

…

On Tue, Dec 4, 2018 at 2:23 PM Toru Nagashima ***@***.***> wrote: So that verify() function in your example would end up in CLIEngine? Yes 😃 — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#3 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AACWkuDo1YDzKaMJ-qDiGHzhZH4QDt31ks5u1vXogaJpZM4Yrmbj> .

--

______________________________ Nicholas C. Zakas @SlickNet Author, Principles of Object-Oriented JavaScript <http://amzn.to/29Pmfrm> Author, Understanding ECMAScript 6 <http://amzn.to/29K1mIy>

nzakas · 2018-12-06T18:28:16Z

I've updated the proposal with the latest implementation changes that @mysticatea and I discussed.

I think the last open issue is whether the vue-eslint-parser case (#3 (comment)) is breaking or not. @mysticatea can you clarify your thinking on that?

nzakas · 2018-12-11T18:06:22Z

I've updated the design to address the edge cases @mysticatea pointed out. I believe that this design is now ready for a final review.

btmills · 2018-12-14T22:21:57Z

designs/2018-processors-improvements/README.md

+
+When a `preprocess()` method returns an object with a `filename` property, a new filename is constructed in this format:
+
+> parentFilename + path.sep + index + filenameFromPreprocess


Based on this, if I wanted to configure globals or rules just for code found in Markdown files, I would be able to do that by using the *.md/* pattern?

module.exports = { plugins: ["markdown"], overrides: [ { files: ["*.md"], processor: "markdown/markdown" }, { files: ["*.md/*"], globals: { foo: true }, rules: { strict: "off" } } ] }

To further confirm my understanding, any configuration defined in the *.md override would not cascade down to the code blocks because it lacks a trailing *? I think that's the correct behavior, though it might seem unintuitive at first if someone is used to .eslintrc configurations cascading. overrides just work differently than config cascades.

That's correct.

That lets us give the ability to configure rules for each kind of code blocks.

designs/2018-processors-improvements/README.md

btmills · 2018-12-14T23:06:05Z

designs/2018-processors-improvements/README.md

+
+## Backwards Compatibility Analysis
+
+This proposal is 100% backwards compatible until we remove the old way of defining processors. Both named and extension-based processors can be defined in the same plugin, such as:


I've spent a while trying to figure this out, so please correct me if I've missed something. I think that this will require a major version bump when plugins start to support this API.

Currently, as well as after this change in the case of extension-named processors, code blocks inherit their parent file's filename. For example, project/README.md would contain blocks named project/README.md. In the Markdown plugin's readme right now, we recommend configuring rules etc. using *.md in overrides.

When a processor is upgraded to support the named processor API, those code blocks would become project/README.md/0.js, and any rules configured for *.md files would no longer be applied to the code blocks inside of them.

Individually, the eslint adding support for this API won't break the existing (now called extension-named) processors, and eslint-plugin-markdown can start exporting both extension-named and named processors without breaking versions of ESLint that don't support the new API. But taken together, a version of ESLint that supports extension-named processors and an extension-named processor could require me to modify my configuration if I'm specifically targeting code blocks returned by a processor.

Am I lost in the trees here?

That is true, but only if the processor is returning filenames with the code blocks. If the processor is still returning strings, then the code blocks get the same name as the parent file. And when ESLint is given string code blocks, it would stop matching against the code block filenames because that would be an infinite loop.

So, it would be a breaking change for the plugin if it changes from returning strings to returning filenames, but not a breaking change for ESLint.

That makes sense. I’m happy to use the Markdown plugin to test the recommended upgrade path for processors once this is available in ESLint.

Co-Authored-By: nzakas <nicholas@nczconsulting.com>

nzakas · 2018-12-27T20:13:06Z

@ilyavolodin do you have any further concerns? At this point, you're the only one that has suggested changes outstanding.

ilyavolodin · 2018-12-27T23:26:07Z

Ah, sorry about that. I lost track of this proposal and didn't have much time to recheck it. Changes looks good to me. Thanks.

nzakas · 2019-01-02T15:37:11Z

It looks like we have our first merged RFC! 🎉 Thanks everyone!

not-an-aardvark · 2019-01-02T20:13:41Z

For future reference, do we need TSC approval to merge RFCs like this? I have no problem with it being merged either way (and I see that a majority of TSC members approved the RFC anyway), but I was a bit confused because I thought it would need to be approved in a TSC meeting.

nzakas · 2019-01-03T15:04:21Z

Sorry, I guess that wasn't clear. My thinking was that we don't need a TSC meeting to approve an RFC when we have enough PR approvals. In this case, we had five approvals on the PR and no outstanding questions or comments. If we can't get a consensus on the PR, then I do think we need to discuss in a TSC meeting.

…

On Wed, Jan 2, 2019 at 12:13 PM Teddy Katz ***@***.***> wrote: For future reference, do we need TSC approval to merge RFCs like this? I have no problem with it being merged either way (and I see that a majority of TSC members approved the RFC anyway), but I was a bit confused because I thought it would need to be approved in a TSC meeting. — You are receiving this because you modified the open/close state. Reply to this email directly, view it on GitHub <#3 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AACWkmavb7s-GCTa2b8WwCxq5joGk9uIks5u_RL1gaJpZM4Yrmbj> .

--

______________________________ Nicholas C. Zakas @SlickNet Author, Principles of Object-Oriented JavaScript <http://amzn.to/29Pmfrm> Author, Understanding ECMAScript 6 <http://amzn.to/29K1mIy>

New: Processor improvements

5f03f55

nzakas added the enhancement New feature or request label Nov 20, 2018

nzakas mentioned this pull request Nov 20, 2018

Better support for multiple processors eslint/eslint#11035

Closed

platinumazure suggested changes Nov 20, 2018

View reviewed changes

ilyavolodin suggested changes Nov 21, 2018

View reviewed changes

Update proposal with virtual filenames

80cdb0f

mysticatea reviewed Nov 21, 2018

View reviewed changes

designs/2018-processors-improvements/README.md Outdated Show resolved Hide resolved

mysticatea reviewed Nov 21, 2018

View reviewed changes

designs/2018-processors-improvements/README.md Outdated Show resolved Hide resolved

mysticatea reviewed Nov 21, 2018

View reviewed changes

designs/2018-processors-improvements/README.md Outdated Show resolved Hide resolved

mysticatea reviewed Nov 21, 2018

View reviewed changes

designs/2018-processors-improvements/README.md Outdated Show resolved Hide resolved

mysticatea mentioned this pull request Nov 21, 2018

New: making Plugin's Processor API chainable #1

Closed

not-an-aardvark reviewed Nov 22, 2018

View reviewed changes

Update with latest feedback

d953bf3

btmills reviewed Nov 22, 2018

View reviewed changes

filePath -> filename, typo fixes

8068d1a

Clarify using --ext and virtual filename construction

6cb7ddd

kaicataldo approved these changes Dec 5, 2018

View reviewed changes

Update implementation details

937144d

nzakas added 2 commits December 10, 2018 09:18

Update construction of code block filenames algorithm

2d5b8c4

Improve description of processor algorithms

d8a14d4

btmills reviewed Dec 14, 2018

View reviewed changes

Fix typo

acc9dd5

Co-Authored-By: nzakas <nicholas@nczconsulting.com>

mysticatea approved these changes Dec 20, 2018

View reviewed changes

btmills approved these changes Dec 20, 2018

View reviewed changes

ilyavolodin approved these changes Dec 27, 2018

View reviewed changes

nzakas merged commit 0f6b17a into master Jan 2, 2019

nzakas deleted the 2018-processors branch January 2, 2019 15:37

not-an-aardvark mentioned this pull request Jan 10, 2019

Docs: Clarify when an RFC can be merged #8

Merged

mysticatea mentioned this pull request Feb 24, 2019

Update: Config File Improvements #13

Closed

mysticatea mentioned this pull request Mar 26, 2019

New: multiple processors support (fixes #11035, fixes #11725) eslint/eslint#11552

Merged

mysticatea mentioned this pull request May 11, 2019

New: Configuring Additional Lint Targets with .eslintrc #20

Merged

mysticatea added the implemented This RFC has been implemented label Oct 14, 2019

btmills mentioned this pull request Mar 18, 2021

Update: compatibility with eslint-mdx eslint/markdown#178

Closed

mdjermanovic mentioned this pull request May 9, 2022

fix: Processor blocks respect ignores eslint/eslint#15813

Closed

1 task

mmkal mentioned this pull request Nov 6, 2023

Change Request: per-rule processors eslint/eslint#17724

Closed

1 task


		This processor returns two code blocks containing JavaScript code. Each code block is given a virtual filename ending with `.js`.

		When a `preprocess()` method returns an object with a `filePath` property, `CLIEngine` will call `getConfigForFile()` on the `filePath` property to determine the correct configuration for the code block, which includes whether another processor should be run on the code block (matched by file extension).


		When a `preprocess()` method returns an object with a `filename` property, a new filename is constructed in this format:

		> parentFilename + path.sep + index + filenameFromPreprocess


		## Backwards Compatibility Analysis

		This proposal is 100% backwards compatible until we remove the old way of defining processors. Both named and extension-based processors can be defined in the same plugin, such as:

New: Processor improvements #3

New: Processor improvements #3

Conversation

nzakas commented Nov 20, 2018

Summary

Related Issues

platinumazure left a comment

Choose a reason for hiding this comment

mysticatea commented Nov 20, 2018

mysticatea commented Nov 20, 2018 • edited Loading

mysticatea commented Nov 20, 2018

ilyavolodin left a comment

Choose a reason for hiding this comment

nzakas commented Nov 21, 2018

nzakas commented Nov 21, 2018

nzakas commented Nov 21, 2018

nzakas commented Nov 21, 2018

nzakas commented Nov 21, 2018

mysticatea commented Nov 21, 2018

nzakas commented Nov 22, 2018

btmills left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

nzakas commented Nov 23, 2018

mysticatea commented Nov 24, 2018

nzakas commented Nov 26, 2018

nzakas commented Nov 26, 2018

mysticatea commented Dec 4, 2018

nzakas commented Dec 4, 2018

mysticatea commented Dec 4, 2018

nzakas commented Dec 5, 2018 via email

nzakas commented Dec 6, 2018

nzakas commented Dec 11, 2018

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

nzakas commented Dec 27, 2018

ilyavolodin commented Dec 27, 2018

nzakas commented Jan 2, 2019

not-an-aardvark commented Jan 2, 2019

nzakas commented Jan 3, 2019 via email

mysticatea commented Nov 20, 2018 •

edited

Loading