gh-104683: Argument clinic: make `format_docstring()` a method on `Function` objects #107840

AlexWaygood · 2023-08-10T11:41:59Z

format_docstring() only ever modifies the internal state of the Function object that is self.function on the DSLParser. It makes much more sense for it to be an instance method on Function objects rather than DSLParser objects.

Issue: Modernise code in Tools/clinic/ #104683

…ects

…objects" This reverts commit 4edaf41.

erlend-aasland · 2023-08-10T12:28:48Z

Could you add tests for @text_signature first? There are a lot of untested paths here.

serhiy-storchaka · 2023-08-10T12:39:49Z

I am not sure that it is a good idea. Function is a small data class. Its function is to store the data related to a function. It has few simple properties which represent that data in different forms. format_docstring() is very different. It is not idempotent. It can (and should) only be called once after processing the clinic input for the function. It can be a part of the parser, or the part of the renderer, or an independent utility, but it does not fit as a Function method.

I do not see benefits from this change.

erlend-aasland · 2023-08-10T13:03:09Z

It can be a part of the parser, or the part of the renderer, or an independent utility, but it does not fit as a Function method.

IMO, it makes most sense to render (and format) the docstring in render_function(). Currently, format_docstring() is pretty complex. It may make more sense to try and clean it up first.

Regarding where the docstring render code should live: perhaps a DocstringRenderer class that can take care of this is the best option. I see the value of keeping the Function class small.

No matter where it ends up, I think it makes sense to tear this functionality out of the parser.

AlexWaygood · 2023-08-10T14:03:02Z

I do not see benefits from this change.

Since the format_docstring() method solely exists to modify the internal state of Function objects, and since it nearly entirely only used the internal state of Function objects, I felt like it would better fit the OOP principle of encapsulation to have it as a method on Function objects rather than as a method on DSLParser objects. The benefit I was trying to achieve was improved code readability and maintainability; the PR achieves small simplifications of the code in several places.

However, it is true that this PR does not fix any known bugs, so if we disagree that this improves readability, then it should be abandoned. It's also true that this would be a significant increase in the complexity of the Function class, which does currently mostly serve as a simple "data holder" class.

It is not idempotent. It can (and should) only be called once after processing the clinic input for the function.

It's true that these facts about format_docstring() differ from the other methods on Function objects currently. Other than that, however, I'm not sure I agree that they're good reasons, in and of themselves, for format_docstring() not to be a method on Function. The "footgun" whereby format_docstring could accidentally be called at any point during the state machine, but should only be called in one specific place during the state machine, already exists in the code's current form, where the method exists on the DSLParser class.

serhiy-storchaka · 2023-08-10T20:21:08Z

The problem is that Function.docstring serves different functions during its lifetime. It is a buffer to accumulate docstring lines while processing the clinic input. It is an input to format_docstring() used as a template for generating docstring. It is a generated docstring after calling format_docstring().

format_docstring() itself has two functions: parse and validate input docstring lines and generate a docstring. The former part should stay in the parser, the later part should be moved to the renderer.

The Function object passed from the parser to the renderer should have the following attributes: summary -- one line, prologue -- the part which will be inserted between the summary and the parameters list (it is empty in all current applications), epilogue -- the part which will be inserted between the summary and the parameters list. Then the renderer will combine a docstring from signature, summary, prologue, parameters and epilogue, in this order.

AlexWaygood · 2023-08-10T20:44:18Z

Thanks @serhiy-storchaka, that seems like a solid analysis. Let's leave this for now, then, and save it for a more principled refactor.

Argument clinic: make format_docstring() a method on Function obj…

2a26605

…ects

AlexWaygood added the skip news label Aug 10, 2023

AlexWaygood requested a review from serhiy-storchaka August 10, 2023 11:41

AlexWaygood requested a review from erlend-aasland as a code owner August 10, 2023 11:41

bedevere-bot added the awaiting core review label Aug 10, 2023

bedevere-bot mentioned this pull request Aug 10, 2023

Modernise code in Tools/clinic/ #104683

Closed

7 tasks

AlexWaygood added 2 commits August 10, 2023 13:08

Also make forced_text_signature an attribute on Function objects

4edaf41

Revert "Also make forced_text_signature an attribute on Function …

421013d

…objects" This reverts commit 4edaf41.

AlexWaygood closed this Aug 10, 2023

AlexWaygood deleted the refactor-format-docstring branch August 10, 2023 20:44

AlexWaygood mentioned this pull request Aug 10, 2023

[alt] Refactor DSLParser.format_docstring() Argument-Clinic/cpython#18

Closed

erlend-aasland mentioned this pull request Dec 20, 2023

Argument clinic: add support for creating method aliases #113270

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

gh-104683: Argument clinic: make `format_docstring()` a method on `Function` objects #107840

gh-104683: Argument clinic: make `format_docstring()` a method on `Function` objects #107840

AlexWaygood commented Aug 10, 2023 •

edited

Loading

erlend-aasland commented Aug 10, 2023

serhiy-storchaka commented Aug 10, 2023

erlend-aasland commented Aug 10, 2023

AlexWaygood commented Aug 10, 2023

serhiy-storchaka commented Aug 10, 2023

AlexWaygood commented Aug 10, 2023

gh-104683: Argument clinic: make format_docstring() a method on Function objects #107840

gh-104683: Argument clinic: make format_docstring() a method on Function objects #107840

Conversation

AlexWaygood commented Aug 10, 2023 • edited Loading

erlend-aasland commented Aug 10, 2023

serhiy-storchaka commented Aug 10, 2023

erlend-aasland commented Aug 10, 2023

AlexWaygood commented Aug 10, 2023

serhiy-storchaka commented Aug 10, 2023

AlexWaygood commented Aug 10, 2023

gh-104683: Argument clinic: make `format_docstring()` a method on `Function` objects #107840

gh-104683: Argument clinic: make `format_docstring()` a method on `Function` objects #107840

AlexWaygood commented Aug 10, 2023 •

edited

Loading