Refactor how we analyse classes #460

AlexWaygood · 2024-01-05T15:28:15Z

Currently we analyse ClassDef nodes in many different places across the tree in a somewhat scattershot and ad-hoc way. This means that there's a fair amount of duplication currently, and that it can be somewhat awkward to query information about the enclosing class definition when we're visiting a method.

This PR thoroughly refactors the way we analyse classes:

It's now all done in one place, the EnclosingClassContext class. As well as reducing duplication, this also reduces potential for bugs: e.g. we currently don't detect a protocol as being unused if the protocol is generic (because that's an ast.Subscript node in the class's bases tuple, rather than an ast.Name or ast.Attribute node).
Get rid of the _get_collections_abc_obj_id function. It's not really necessary; the name is really unclear; it's an over-abstraction that obfuscates rather than clarifies.
Get rid of the in_class nesting counter on the PyiVisitor class. This works well to keep track of other contexts when visiting subnodes, but doesn't work well for class definitions: it's no more code to just check whether self.enclosing_class_ctx is None or not.

AlexWaygood · 2024-01-05T15:55:34Z

Deleted a comment I made that I don't think made any sense — I don't think this is urgent :)

srittau

Good refactoring. I just have two questions below.

srittau · 2024-01-05T16:08:55Z

pyi.py

+            return node.id
+        if isinstance(node, ast.Attribute):
+            return f"{_unravel(node.value)}.{node.attr}"
+        return None


Can this happen with the current ast? Wouldn't it be safer to raise an exception here, or use an assertion?

It can definitely happen; the .value of an Attribute node can be any number of things:

>>> class EvenStranger: ... def __getitem__(self, key): ... return self ... def __getattr__(self, attr): ... return int ... >>> class Foo(EvenStranger()[1][2].foo): ... >>> tree = ast.parse("EvenStranger()[1][2].foo") >>> print(ast.dump(tree, indent=2)) Module( body=[ Expr( value=Attribute( value=Subscript( value=Subscript( value=Call( func=Name(id='EvenStranger', ctx=Load()), args=[], keywords=[]), slice=Constant(value=1), ctx=Load()), slice=Constant(value=2), ctx=Load()), attr='foo', ctx=Load()))], type_ignores=[])

This is obviously pretty unlikely to happen in a stub, but I don't think we should be raising an exception if we come across something like this

I'll add some tests for these edge cases; I think you're correct that the handling isn't quite right for them at present.

srittau · 2024-01-05T16:10:37Z

pyi.py

+            return ClassBase(_unravel(base_node.value), base_node.attr)
+        if isinstance(base_node, ast.Subscript) and top_level:
+            return _analyze_base_node(base_node.value, top_level=False)
+        return None


Same question here. (A subscript can't be a Subscript's value, I think.)

Well, it is possible...

(I'll admit I had to try quite hard here!)

Python 3.12.1 (tags/v3.12.1:2305ca5, Dec 7 2023, 22:03:25) [MSC v.1937 64 bit (AMD64)] on win32 Type "help", "copyright", "credits" or "license" for more information. >>> class Strange: ... def __class_getitem__(cls, item): ... return Strange ... >>> class Foo(Strange[Strange][Strange]): ... ... >>> import ast >>> tree = ast.parse("class Foo(Strange[Strange][Strange]): ...") >>> tree.body[0].bases[0] <ast.Subscript object at 0x000001E44FC43FD0> >>> _.value <ast.Subscript object at 0x000001E44FC43F90>

github-actions · 2024-01-05T17:54:18Z

⚠ Flake8 diff showing the effect of this PR on typeshed:

> ./stubs/openpyxl/openpyxl/xml/_functions_overloads.pyi:32:1: Y046 Protocol "_HasTagAndGet" is not used
> ./stubs/protobuf/google/protobuf/internal/type_checkers.pyi:5:1: Y046 Protocol "_ValueChecker" is not used

AlexWaygood · 2024-01-05T18:04:25Z

Thanks for the quick review!

…ols (#9405) I just fixed this false negative in flake8-pyi (PyCQA/flake8-pyi#460), and then realised ruff has the exact same bug! Luckily it's a very easy fix. (The bug is that unused protocols go undetected if they're generic.)

AlexWaygood added 2 commits January 5, 2024 15:10

Refactor how we analyse classes

36fc18b

changelog

f7d979e

This comment has been minimized.

Sign in to view

srittau reviewed Jan 5, 2024

View reviewed changes

Be more principled, and add tests for edge cases

401cf04

AlexWaygood requested a review from srittau January 5, 2024 17:55

srittau approved these changes Jan 5, 2024

View reviewed changes

AlexWaygood merged commit 8fba6ed into PyCQA:main Jan 5, 2024
20 checks passed

AlexWaygood deleted the class-context branch January 5, 2024 18:04

AlexWaygood mentioned this pull request Jan 5, 2024

[flake8-pyi] Fix false negative for PYI046 with unused generic protocols astral-sh/ruff#9405

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor how we analyse classes #460

Refactor how we analyse classes #460

AlexWaygood commented Jan 5, 2024

This comment has been minimized.

This comment has been minimized.

AlexWaygood commented Jan 5, 2024 •

edited

Loading

srittau left a comment

srittau Jan 5, 2024

AlexWaygood Jan 5, 2024

AlexWaygood Jan 5, 2024

srittau Jan 5, 2024

AlexWaygood Jan 5, 2024

github-actions bot commented Jan 5, 2024

AlexWaygood commented Jan 5, 2024

Refactor how we analyse classes #460

Refactor how we analyse classes #460

Conversation

AlexWaygood commented Jan 5, 2024

This comment has been minimized.

This comment has been minimized.

AlexWaygood commented Jan 5, 2024 • edited Loading

srittau left a comment

Choose a reason for hiding this comment

srittau Jan 5, 2024

Choose a reason for hiding this comment

AlexWaygood Jan 5, 2024

Choose a reason for hiding this comment

AlexWaygood Jan 5, 2024

Choose a reason for hiding this comment

srittau Jan 5, 2024

Choose a reason for hiding this comment

AlexWaygood Jan 5, 2024

Choose a reason for hiding this comment

github-actions bot commented Jan 5, 2024

AlexWaygood commented Jan 5, 2024

AlexWaygood commented Jan 5, 2024 •

edited

Loading