Python 3 fixes - fix contrib/python checkstyle #6274

Eric-Arellano · 2018-07-30T02:08:21Z

Beyond the usual unicode vs bytes issues, the AST grammar changed substantially between Python 2 and 3. For example, ast.TryExcept (along with ast.TryFinally) was replaced by ast.Try. See http://joao.npimentel.net/2015/07/23/python-2-vs-python-3-ast-differences/ for all the changes.
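
As a rough illustration of the rename (a sketch, not code from this PR), a check can look up the try/except node classes by name so the same module works on either interpreter. The helper names `try_node_types` and `count_try_blocks` are hypothetical.

```python
import ast


def try_node_types():
  """Return the AST classes that represent try/except on the running interpreter."""
  # Python 2 exposes ast.TryExcept; Python 3 exposes ast.Try instead.
  names = ('Try', 'TryExcept')
  return tuple(getattr(ast, name) for name in names if hasattr(ast, name))


def count_try_blocks(source):
  """Count try/except blocks in `source` using whichever node class exists."""
  tree = ast.parse(source)
  return sum(isinstance(node, try_node_types()) for node in ast.walk(tree))
```

Looking the classes up with `hasattr` avoids an `AttributeError` at import time on the interpreter that lacks one of the names.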

Eric-Arellano · 2018-07-30T02:09:31Z

contrib/python/src/python/pants/contrib/python/checks/tasks/checkstyle/common.py

@@ -91,15 +91,15 @@ def _remove_coding_header(cls, blob):
    """
    # Remove the # coding=utf-8 to avoid AST erroneous parse errors
    #   https://bugs.python.org/issue22221
-    lines = blob.split('\n')
+    lines = blob.decode('utf-8').split('\n')


I kept blob as a binary string to stay closer to the original semantics, even though I don't think it's necessary to read these source files as binary.

lines is always unicode now. Earlier its type was ambiguous.
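
A minimal sketch of the bytes-in, unicode-lines-out contract described here. The helper name `split_source_lines` is hypothetical, and blanking the header (rather than deleting the line) is an assumption made so that line numbers stay aligned; the regex is the one given in PEP 263.

```python
import re

# PEP 263 pattern for an encoding declaration.
CODING_RE = re.compile(r'^[ \t\f]*#.*?coding[:=][ \t]*[-_.a-zA-Z0-9]+')


def split_source_lines(blob):
  """Decode a bytes blob and return its lines as text, blanking any coding header."""
  lines = blob.decode('utf-8').split('\n')
  # Only the first two lines may carry the declaration. Blank it rather than
  # drop it so that line numbers reported later still match the file.
  for index in range(min(2, len(lines))):
    if CODING_RE.match(lines[index]):
      lines[index] = ''
  return lines
```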

Eric-Arellano · 2018-07-30T02:10:21Z

contrib/python/src/python/pants/contrib/python/checks/tasks/checkstyle/common.py

@@ -273,7 +274,7 @@ def __init__(self, code, severity, filename, message, line_range=None, lines=Non
  def __str__(self):
    """convert ascii for safe terminal output"""
    flat = list(self.flatten_lines([self.message], self.lines))
-    return '\n     |'.join(flat).encode('ascii', errors='replace')
+    return '\n     |'.join(flat).encode('ascii', errors='replace').decode('ascii')


__str__ must return unicode in Py3 no matter what.

This function still replaces any non-ASCII character with '?', though, and then returns the result as unicode.

That's probably comment worthy.
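
The encode/decode round trip can be tried in isolation. This is only a sketch: the helper name `to_ascii_text` and the sample message are hypothetical.

```python
def to_ascii_text(text):
  """Replace every non-ASCII character with '?' and return text, not bytes."""
  # encode() yields bytes; decode() turns them back into text so the value
  # is a valid return type for __str__ on Python 3.
  return text.encode('ascii', errors='replace').decode('ascii')


safe = to_ascii_text(u'caf\u00e9 \u2713')
```

Here `safe` is `'caf? ?'`: each character outside ASCII becomes a single `?`.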

Eric-Arellano · 2018-07-30T02:15:53Z

contrib/python/src/python/pants/contrib/python/checks/tasks/checkstyle/print_statements.py

+    if PY3:
+      # Python 3 interpreter will raise SyntaxError upon reading a print statement.
+      # So, this module cannot be meaningfully used when ran with a Python 3 interpreter.
+      return


I'm not sure what to do about this. There is no way for a Python 3 interpreter to parse a print statement, even when the file being checked is Python 2 code.

To reproduce, open Python 3 console:

    import ast
    ast.dump(ast.parse('print "hi"'))

I'm concerned this implies that once we switch Pants to Python 3 under the hood, it will start to complain about Python 2-only syntax such as print statements and the except Error, e form. If so, users still on Python 2 will have to convert all Py2-only syntax to Py2/3-compatible syntax to use the newest version of Pants. I see no way around this, and I think it is fairly acceptable because they do not have to fully migrate to Python 3.

Still, we should communicate well with users about what must be changed and how to use the futurize script to do it easily.
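
To make the failure mode concrete, here is a small sketch that asks whether the running interpreter can parse a snippet at all. The helper name `parses_here` is hypothetical and is not part of the checkstyle code.

```python
import ast


def parses_here(source):
  """Return True if the running interpreter can parse `source`."""
  try:
    ast.parse(source)
  except SyntaxError:
    return False
  return True
```

Under a Python 3 interpreter, `parses_here('print "hi"')` is False while `parses_here('print("hi")')` is True, so an AST-based check never gets a tree to inspect for the statement form.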

cc @CMLivingston has had some thoughts about this, which @jsirois recently incorporated: see how #6182 dynamically creates a pex to run a linter.

Interesting. So before we release a stable version of Pants running Python 3, we could create a Pex to run this contrib code using a Py2 interpreter?

There's a counterpoint: the Python 2 interpreter will complain about Python 3-only syntax, like type hints.

    # Python 2
    ast.dump(ast.parse('x: int = 4'))

So, I don't think there's a way for a single interpreter to lint both Python 2-only syntax and Python 3-only syntax. We could get fancy and use the compatibility flag to trigger the corresponding linter for that module, perhaps. The common subset of Python 2 and Python 3 will of course work either way, like running a Python 3 linter over Pants in its current Python 2 state.
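
For the reverse direction, this sketch shows what a Python 3.6+ interpreter produces for the annotated assignment. It assumes it is run under Python 3; under Python 2 the `ast.parse` call raises SyntaxError before any node exists.

```python
import ast

tree = ast.parse('x: int = 4')
node = tree.body[0]
# On Python 3.6+ the statement becomes an annotated-assignment node.
is_annotated = isinstance(node, ast.AnnAssign)
target_name = node.target.id
```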

It seems to me that unless we had two PEXes and dynamically chose which to use depending on the file, we'll have to choose between supporting Py 2 only + Py 2/3 subset, or Py3 only + Py 2/3 subset.

Dynamically creating the linter pex means you don't need to create "N" static pexes for N python versions... just linter code that is compatible with all of the (major) python versions you want to lint. You'd then dynamically create a pex for python 3 if you needed to lint 3, etc.

Eric-Arellano · 2018-07-30T02:16:38Z

...hon/tests/python/pants_test/contrib/python/checks/tasks/checkstyle/test_except_statements.py

+          'except KeyError, e :',
+          'except (KeyError, ValueError), e\t:'):
+        self.assertNit(EXCEPT_TEMPLATE.format(clause), 'T601')
+    except CheckSyntaxError:  # Fix Python 3 raising SyntaxError


See above discussion about print statements raising syntax error. Feeding the above input into assertNit raises a SyntaxError.

Eric-Arellano · 2018-07-30T02:18:31Z

...b/python/tests/python/pants_test/contrib/python/checks/tasks/checkstyle/test_import_order.py

-    self.assertItemsEqual(['T403', 'T405', 'T402'],
-                          [chunk_error.code for chunk_error in chunk_errors])
-    self.assertItemsEqual([ImportType.STDLIB, ImportType.THIRD_PARTY], module_types)
+    self.assertEqual(sorted(['T403', 'T405', 'T402']),


assertItemsEqual was renamed to assertCountEqual in Python 3. Sorting both sides and calling assertEqual has the same effect for orderable items such as these strings, and is more explicit and cleaner than choosing the test function according to interpreter version.

Refer to https://docs.python.org/2/library/unittest.html#unittest.TestCase.assertItemsEqual
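
A minimal sketch of the interpreter-agnostic comparison. The test class and the `actual` list are hypothetical stand-ins for the real checker output.

```python
import unittest


class OrderInsensitiveTest(unittest.TestCase):
  def test_codes_match_in_any_order(self):
    expected = ['T403', 'T405', 'T402']
    actual = ['T402', 'T403', 'T405']  # hypothetical checker output
    # Sorting both sides removes the dependence on order and works on
    # Python 2 and Python 3 without picking an assertion by version.
    self.assertEqual(sorted(expected), sorted(actual))
```

This relies on the items being orderable; for unorderable items, assertCountEqual (Python 3) or assertItemsEqual (Python 2) would still be needed.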

stuhood

Thanks!

stuhood · 2018-07-30T17:42:07Z

contrib/python/src/python/pants/contrib/python/checks/tasks/checkstyle/common.py

@@ -273,7 +274,7 @@ def __init__(self, code, severity, filename, message, line_range=None, lines=Non
  def __str__(self):
    """convert ascii for safe terminal output"""
    flat = list(self.flatten_lines([self.message], self.lines))
-    return '\n     |'.join(flat).encode('ascii', errors='replace')
+    return '\n     |'.join(flat).encode('ascii', errors='replace').decode('ascii')


That's probably comment worthy.

stuhood · 2018-07-30T17:52:33Z

contrib/python/src/python/pants/contrib/python/checks/tasks/checkstyle/print_statements.py

+    if PY3:
+      # Python 3 interpreter will raise SyntaxError upon reading a print statement.
+      # So, this module cannot be meaningfully used when ran with a Python 3 interpreter.
+      return


cc @CMLivingston has had some thoughts about this, which @jsirois recently incorporated: see how #6182 dynamically creates a pex to run a linter.

stuhood · 2018-07-30T17:54:07Z

It looks like there is 1 substantive travis failure.

Eric Arellano added 6 commits July 29, 2018 22:01

Fix contrib/python unicode vs bytes issues

97cf359

Fix except statement AST changes

8dc7bbf

Disable print statements check on Py3

881f569

Fix changes to with statement AST grammar

694e3aa

Fix test no longer having assertItemsEqual()

8df5cd1

Fix extra test byes vs unicode I missed

c825bb5

Eric-Arellano commented Jul 30, 2018

stuhood approved these changes Jul 30, 2018

Fix python check style encoding issue

651fa32

stuhood merged commit 4a4e481 into pantsbuild:master Jul 31, 2018

Eric-Arellano deleted the py3-fixes_contrib-python branch August 11, 2018 14:59

jsirois mentioned this pull request Oct 11, 2018

Run pythonstyle under the appropriate interpreter. #6618

Merged
