erlfmt_format: preserve empty lines between values in containers #83

awalterschulze · 2020-07-31T09:53:39Z

Fixes #72

This preserves empty lines between values in containers, like exports and records.

This required some refactoring, since the simple fold_doc now needed to not only be aware of the break, but also needed to have the original value, to be able to check for empty lines to preserve.

Really struggled to make this refactoring more readable and I guess it is debatable, whether I achieved that, so looking forward to ideas in the review.

Also needed to preserve the end line for better empty line detection.
Should probably do the same for inner end line, so maybe we can think of a better function name.

michalmuskala · 2020-08-03T07:24:54Z

src/erlfmt_format.erl

@@ -421,6 +519,8 @@ fa_groups([{op, Meta, '/', {atom, _, HeadFunction}, _} = Head0 | Tail]) ->
            [Head0 | fa_groups(Rest)];
        {Group, Rest} ->
            Head = erlfmt_scan:delete_annos([pre_comments, post_comments], Head0),
+            EndLoc = erlfmt_scan:get_end_location(lists:last(Group)),
+            Meta = erlfmt_scan:set_end_location(EndLoc, Meta0),


In the parser we have a ?range_anno macro that does something similar. What do you think about having a similar function in erlfmt_scan? Something like:

range_anno(First, Last) when is_tuple(First), is_tuple(Last) -> (element(2, First))#{end_location := get_end_location(Last)}.

We could call it here as:

Meta = erlfmt_scan:range_anno(Head, lists:last(Group))

That sounds great :)

We should probably also set the inner_end_location, right?

Hmm... potentially. Do we really need it? It would need some caseing to only add it if it was there in Last in the first place.

I added range_anno, but I needed to pass in Meta, otherwise, we lose the comments.

michalmuskala · 2020-08-03T07:33:35Z

Looking at all the complexity trailing comments introduce, I started thinking if maybe we should change how we represent them in the AST. My proposal would be to use the post_comment anno of the last element if the container has elements. If the container is empty with only the comment inside, we would still have the standalone comment node. What do you think? I believe it should make a lot of code simpler, we have to special-case those standalone comments a lot and effectively apply some manual processing that would be done for post_comment of last node anyway.

awalterschulze · 2020-08-03T08:00:08Z

Looking at all the complexity trailing comments introduce, I started thinking if maybe we should change how we represent them in the AST. My proposal would be to use the post_comment anno of the last element if the container has elements. If the container is empty with only the comment inside, we would still have the standalone comment node. What do you think? I believe it should make a lot of code simpler, we have to special-case those standalone comments a lot and effectively apply some manual processing that would be done for post_comment of last node anyway.

I guess I have always thought the pre_comments were too attached. I would expect to have more standalone comments in the ast. Basically if there is an empty line between this and the next element, then the comment is intuitively not attached to the next element.
So that would be one way of refactoring, is to embrace the detached comments.

But to be pragmatic we could also embrace the aggressive attachment of comments and consistently say that all comments are attached. This would attach these trailing comments as post comments.

Which ever way we go, I think we can do this in a follow up pull request, and possibly pair on it together.

michalmuskala · 2020-08-03T08:03:14Z

The problem of standalone comments is that they don't behave syntactically like other elements, for example inside lists they don't have , in-between them. Attaching comments to elements (what your code effectively does in here) bypasses this issue.

awalterschulze · 2020-08-03T08:19:39Z

Yes so I am happy to go with the pragmatic route, in a follow up commit.

src/erlfmt_format.erl

michalmuskala · 2020-08-03T08:34:50Z

What happens if the way the container was initially laid out didn't include the "force breaks" space between [ and first element? In particular, if it was written down in a "comma-first" style like below:

[ 1
, 2

, 3
]

Do we collapse it into a single line [1, 2, 3] or do we expand it into:

Either choice is fine by me, I think we should have a test that explores this, though.

awalterschulze · 2020-08-03T08:52:06Z

a test that explores this, though.

Great catch :)
This seems to be the current passing test.

?assertFormat(
        "[ 1\n"
        ", 2\n"
        "\n"
        ", 3\n"
        "]\n",
        "[1, 2,\n"
        "\n"
        "3]\n"
    ).

I also agree, that either case would be great, except the one that is happening now.

awalterschulze · 2020-08-03T12:18:09Z

Have to say it is very tough to get this and other containers like tuples, to place nice together

What happens if the way the container was initially laid out didn't include the "force breaks" space between [ and first element? In particular, if it was written down in a "comma-first" style like below:
[ 1
, 2

, 3
]
Do we collapse it into a single line [1, 2, 3] or do we expand it into:
[
    1,
    2,

    3
]
Either choice is fine by me, I think we should have a test that explores this, though.

This is now fixed, it was quite tricky :)

michalmuskala · 2020-08-03T12:52:06Z

README.md

+[
+    Foo,
+    Bar
+]


Hmm.... We should probably re-write the guide because it no longer works as advertised. It's not enough to just remove the newline between [ and first element.

This is so much complexity in here 😢

Good catch.
I fixed the guide.

I think it makes sense to not only take a hint from the first newline, but rather from any newline in the container.

michalmuskala · 2020-08-03T12:54:21Z

Have to say it is very tough to get this and other containers like tuples, to place nice together

Yeah, there's a lot of complexity in the rules for different containers. We probably need to rethink the general approach of keeping everything together after merging this code.

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jul 31, 2020

erlfmt_format: preserve empty lines between values in containers

fd045d4

awalterschulze force-pushed the preserveexport branch 2 times, most recently from 8e9e582 to 3c070a4 Compare July 31, 2020 14:10

fix dialyzer

10f2661

awalterschulze force-pushed the preserveexport branch from 3c070a4 to 10f2661 Compare July 31, 2020 14:15

awalterschulze added 3 commits July 31, 2020 15:20

erlfmt parts of erlfmt_format.erl

dd3dae5

fix fa_groups, where newlines were added between groups

255d53c

simpler break_fun

500f830

michalmuskala reviewed Aug 3, 2020

View reviewed changes

add range_anno

cee0f50

michalmuskala reviewed Aug 3, 2020

View reviewed changes

src/erlfmt_format.erl Outdated Show resolved Hide resolved

awalterschulze added 2 commits August 3, 2020 09:57

add surround fun

001ac08

consider any line break, not just the meta line break

131be49

michalmuskala reviewed Aug 3, 2020

View reviewed changes

fix readme

08e0d6b

michalmuskala approved these changes Aug 4, 2020

View reviewed changes

awalterschulze mentioned this pull request Aug 4, 2020

Consider rolling trailing comments into post comments #87

Closed

awalterschulze merged commit b65c0a0 into master Aug 4, 2020

michalmuskala deleted the preserveexport branch August 4, 2020 12:53

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

erlfmt_format: preserve empty lines between values in containers #83

erlfmt_format: preserve empty lines between values in containers #83

awalterschulze commented Jul 31, 2020 •

edited

Loading

michalmuskala Aug 3, 2020

awalterschulze Aug 3, 2020

awalterschulze Aug 3, 2020

michalmuskala Aug 3, 2020

awalterschulze Aug 3, 2020

michalmuskala commented Aug 3, 2020

awalterschulze commented Aug 3, 2020

michalmuskala commented Aug 3, 2020

awalterschulze commented Aug 3, 2020 •

edited

Loading

michalmuskala commented Aug 3, 2020 •

edited

Loading

awalterschulze commented Aug 3, 2020

awalterschulze commented Aug 3, 2020

michalmuskala Aug 3, 2020

awalterschulze Aug 3, 2020

michalmuskala commented Aug 3, 2020

erlfmt_format: preserve empty lines between values in containers #83

erlfmt_format: preserve empty lines between values in containers #83

Conversation

awalterschulze commented Jul 31, 2020 • edited Loading

michalmuskala Aug 3, 2020

Choose a reason for hiding this comment

awalterschulze Aug 3, 2020

Choose a reason for hiding this comment

awalterschulze Aug 3, 2020

Choose a reason for hiding this comment

michalmuskala Aug 3, 2020

Choose a reason for hiding this comment

awalterschulze Aug 3, 2020

Choose a reason for hiding this comment

michalmuskala commented Aug 3, 2020

awalterschulze commented Aug 3, 2020

michalmuskala commented Aug 3, 2020

awalterschulze commented Aug 3, 2020 • edited Loading

michalmuskala commented Aug 3, 2020 • edited Loading

awalterschulze commented Aug 3, 2020

awalterschulze commented Aug 3, 2020

michalmuskala Aug 3, 2020

Choose a reason for hiding this comment

awalterschulze Aug 3, 2020

Choose a reason for hiding this comment

michalmuskala commented Aug 3, 2020

awalterschulze commented Jul 31, 2020 •

edited

Loading

awalterschulze commented Aug 3, 2020 •

edited

Loading

michalmuskala commented Aug 3, 2020 •

edited

Loading