`Requirement('installer')` errors after 2bd5da391c302f2f5a18fd3e2bd9fb3b75c02e34 #618

KOLANICH · 2022-11-22T17:47:38Z

from packaging.requirements import Requirement
Requirement('installer')

The regression has been introduced in 2bd5da391c302f2f5a18fd3e2bd9fb3b75c02e34.

pradyunsg · 2022-12-03T12:47:47Z

OK, I spent a bit of time looking at trying to fuzz our tests for the tokeniser with hypothesis, but I've hit multiple issues with trying to use hypothesis/hypofuzz at this point.

It's really cool tho, especially with the pytrace integration. :)

pradyunsg · 2022-12-03T12:48:18Z

FWIW, #619 has been significantly increased in size and scope.

pradyunsg · 2022-12-03T18:20:17Z

Well, here's another regression:

❯ python -c "import packaging; print(packaging.__version__); from packaging.requirements import Requirement; print(vars(Requirement(\"  name == 3.0.*  \")))"
21.4.dev0
IDENTIFIER   name
OP  ==
VERSION  3.0.*
Traceback (most recent call last):
  File "/Users/pradyunsg/Developer/github/packaging/packaging/requirements.py", line 35, in __init__
    req = parse_named_requirement(requirement_string)
          ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/pradyunsg/Developer/github/packaging/packaging/_parser.py", line 79, in parse_named_requirement
    specifier = parse_specifier(tokens)
                ^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/pradyunsg/Developer/github/packaging/packaging/_parser.py", line 119, in parse_specifier
    parsed_specifiers = parse_version_many(tokens)
                        ^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/pradyunsg/Developer/github/packaging/packaging/_parser.py", line 136, in parse_version_many
    if not tokens.match("COMMA"):
           ^^^^^^^^^^^^^^^^^^^^^
  File "/Users/pradyunsg/Developer/github/packaging/packaging/_tokenizer.py", line 102, in match
    token = self.peek()
            ^^^^^^^^^^^
  File "/Users/pradyunsg/Developer/github/packaging/packaging/_tokenizer.py", line 95, in peek
    self.next_token = next(self.generator)
                      ^^^^^^^^^^^^^^^^^^^^
  File "/Users/pradyunsg/Developer/github/packaging/packaging/_tokenizer.py", line 163, in _tokenize
    raise self.raise_syntax_error(message="Unrecognized token")
          ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/pradyunsg/Developer/github/packaging/packaging/_tokenizer.py", line 138, in raise_syntax_error
    raise ParseExceptionError(
packaging._tokenizer.ParseExceptionError: Unrecognized token
at position 15:
      name == 3.0.*  
                   ^

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "<string>", line 1, in <module>
  File "/Users/pradyunsg/Developer/github/packaging/packaging/requirements.py", line 37, in __init__
    raise InvalidRequirement(str(e))
packaging.requirements.InvalidRequirement: Unrecognized token
at position 15:
      name == 3.0.*  
                   ^
❯ python -c "import packaging; print(packaging.__version__); from packaging.requirements import Requirement; print(vars(Requirement(\"  name == 3.0.*  \")))"
21.3
{'name': 'name', 'url': None, 'extras': set(), 'specifier': <SpecifierSet('==3.0.*')>, 'marker': None}

pradyunsg · 2022-12-03T18:46:51Z

I've started a parser rewrite, to couple the parsing and tokenising such that tokenising is context-sensitive.

hrnciar · 2022-12-05T15:08:23Z

I apologise for the issues that caused my contribution. I'd be happy to address them had I known about them. Since nobody mentioned me sooner, I learned about the problems during the weekend after @pradyunsg already spend time rewriting it and opened #624.

pradyunsg · 2022-12-05T21:22:03Z

@hrnciar There's no need to apologise. My work in that PR is very-much based off of your work, and I certainly wouldn't have gotten as far as I have without the existing base that you built! :)

I was already looking improving the error messages around requirement parsing back (partly what motivated this parser rewrite for me), so it was fairly organic to dive into this for me.

pradyunsg · 2022-12-05T21:28:48Z

On a different note... what do we want to do with strings like:

package(===arbitrarystring)

The behaviours we have:

(21.3)

❯ echo "package(===arbitrarystring)" | python -c "import packaging; from packaging.requirements import Requirement; print(packaging.__version__); print(vars(Requirement(input())))"
21.3
Traceback (most recent call last):
  [snip]
packaging.requirements.InvalidRequirement: Parse error at "'(===arbi'": Expected string_end

(main)

❯ echo "package(===arbitrarystring)" | python -c "import packaging; from packaging.requirements import Requirement; print(packaging.__version__); print(vars(Requirement(input())))"
21.4.dev0
Traceback (most recent call last):
  [snip]
packaging.requirements.InvalidRequirement: Closing right parenthesis is missing
at position 27:
    package(===arbitrarystring)
                               ^

And... I have two options:

(a) Error out:

❯ echo "package(===arbitrarystring)" | python -c "import packaging; from packaging.requirements import Requirement; print(packaging.__version__); print(vars(Requirement(input())))"
21.4.dev0
Traceback (most recent call last):
  [snip]
packaging.requirements.InvalidRequirement: Expected closing RIGHT_PARENTHESIS
    package(===arbitrarystring)
           ~~~~~~~~~~~~~~~~~~~~^

(b) Modify how Specifier parses arbitrary versions strings to parse this (and package ( === arbitrarystring )) as package===arbitrarystring

❯ echo "package(===arbitrarystring)" | python -c "import packaging; from packaging.requirements import Requirement; print(packaging.__version__); print(vars(Requirement(input())))"
21.4.dev0
{'name': 'package', 'url': None, 'extras': set(), 'specifier': <SpecifierSet('===arbitrarystring')>, 'marker': None}

pradyunsg · 2022-12-05T21:41:33Z

I'm inclined to go with option (b).

pradyunsg · 2022-12-05T22:06:08Z

The same situation as ) happens with ; for markers.

brettcannon · 2022-12-05T23:28:44Z

If someone messes up their specifier so it isn't valid, how will they know that with option (b)? Will it be based on the fact that the specifier will never be true?

pradyunsg · 2022-12-05T23:41:10Z

This is only an issue with ===, along with ) or ;. Currently, they both error out if you don't have whitespace after the arbitrary string; even though version strings are unlikely to contain ; or ).

I can't imagine someone messing up their version specifier in a manner that makes it difficult to understand what's happening TBH.

pradyunsg · 2022-12-05T23:43:13Z

FWIW, 39ae524 (#624) is the relevant change for fixing how this parses things.

This comment was marked as resolved.

Sign in to view

KOLANICH mentioned this issue Nov 22, 2022

Fixed the parser regression for the package names starting with the the same substrings as keywords. #619

Closed

brettcannon added bug packaging.requirements labels Nov 23, 2022

This was referenced Dec 3, 2022

Release 22.0 #569

Closed

raise NotImplementedError("unreachable") when a falsifying case is found Zac-HD/hypofuzz#15

Closed

example.via is not correctly type-annotated HypothesisWorks/hypothesis#3520

Closed

pradyunsg mentioned this issue Dec 3, 2022

Improve Requirement/Marker parser with context-sensitive tokenisation #624

Merged

pradyunsg closed this as completed in #624 Dec 7, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`Requirement('installer')` errors after 2bd5da391c302f2f5a18fd3e2bd9fb3b75c02e34 #618

`Requirement('installer')` errors after 2bd5da391c302f2f5a18fd3e2bd9fb3b75c02e34 #618

KOLANICH commented Nov 22, 2022 •

edited by brettcannon

Loading

This comment was marked as resolved.

pradyunsg commented Dec 3, 2022

pradyunsg commented Dec 3, 2022

pradyunsg commented Dec 3, 2022

pradyunsg commented Dec 3, 2022

hrnciar commented Dec 5, 2022

pradyunsg commented Dec 5, 2022 •

edited

Loading

pradyunsg commented Dec 5, 2022 •

edited

Loading

pradyunsg commented Dec 5, 2022

pradyunsg commented Dec 5, 2022

brettcannon commented Dec 5, 2022

pradyunsg commented Dec 5, 2022

pradyunsg commented Dec 5, 2022

Requirement('installer') errors after 2bd5da391c302f2f5a18fd3e2bd9fb3b75c02e34 #618

Requirement('installer') errors after 2bd5da391c302f2f5a18fd3e2bd9fb3b75c02e34 #618

Comments

KOLANICH commented Nov 22, 2022 • edited by brettcannon Loading

This comment was marked as resolved.

pradyunsg commented Dec 3, 2022

pradyunsg commented Dec 3, 2022

pradyunsg commented Dec 3, 2022

pradyunsg commented Dec 3, 2022

hrnciar commented Dec 5, 2022

pradyunsg commented Dec 5, 2022 • edited Loading

pradyunsg commented Dec 5, 2022 • edited Loading

pradyunsg commented Dec 5, 2022

pradyunsg commented Dec 5, 2022

brettcannon commented Dec 5, 2022

pradyunsg commented Dec 5, 2022

pradyunsg commented Dec 5, 2022

`Requirement('installer')` errors after 2bd5da391c302f2f5a18fd3e2bd9fb3b75c02e34 #618

`Requirement('installer')` errors after 2bd5da391c302f2f5a18fd3e2bd9fb3b75c02e34 #618

KOLANICH commented Nov 22, 2022 •

edited by brettcannon

Loading

pradyunsg commented Dec 5, 2022 •

edited

Loading

pradyunsg commented Dec 5, 2022 •

edited

Loading