Should Python linters/checkers stream their partitioned results? #13380

Eric-Arellano · 2021-10-27T22:08:53Z

For Python, a linter or checker may return more than one result. For example, MyPy and Flake8 can partition by interpreter constraints. When that happens, we don't dump any results until all partitions have run. Instead, we could do something like #13379 (review) to stream results as each partition finishes.

However, I expect >90% of users don't use this partition feature. It would be confusing to render the per-partition result when there is only one partition, given that we will also still render CheckResults and LintResults due to the code in check.py and lint.py. So, we should probably only stream "partitions" when they are actually used.
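A minimal sketch of "only stream partitions when they are actually used", in plain Python. Everything here is hypothetical and for illustration only: `PartitionResult`, `stream_partitions`, and `render` are made-up names, not the Pants `LintResult`/`CheckResult` API, and the partitions run sequentially rather than through the engine.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, Iterator, List


@dataclass(frozen=True)
class PartitionResult:
    # Hypothetical stand-in for a per-partition result (not a Pants type).
    description: str
    exit_code: int
    stdout: str = ""


def stream_partitions(
    partitions: Iterable[str],
    run_partition: Callable[[str], PartitionResult],
) -> Iterator[PartitionResult]:
    """Yield each partition's result as soon as it finishes."""
    for partition in partitions:
        yield run_partition(partition)


def render(
    tool: str,
    partitions: List[str],
    run_partition: Callable[[str], PartitionResult],
    emit: Callable[[str], None] = print,
) -> int:
    """Emit per-partition lines only when there is more than one partition."""
    show_partitions = len(partitions) > 1
    exit_code = 0
    for result in stream_partitions(partitions, run_partition):
        exit_code = max(exit_code, result.exit_code)
        if show_partitions:
            emit(f"{tool} ({result.description}): exit {result.exit_code}")
    # The combined per-tool summary is always rendered.
    emit(f"{tool}: {'failed' if exit_code else 'succeeded'}")
    return exit_code
```

With a single partition, only the summary line is emitted, so users who never partition see no extra output.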

This question also applies to the option --lint-per-file-caching.

--

Concretely, we might want to remove the CheckResults and LintResults feature in favor of having to return a single CheckResult/LintResult a la #13379 (review). Let each plugin determine if it wants to stream. (Although, that would make support for --lint-per-file-caching harder to implement)


stuhood · 2021-10-27T23:03:54Z

> Concretely, we might want to remove the CheckResults and LintResults feature in favor of having to return a single CheckResult/LintResult a la #13379 (review). Let each plugin determine if it wants to stream. (Although, that would make support for --lint-per-file-caching harder to implement)

Ideally the various linters could be oblivious to the number of partitions that are created, and instead just operate on whatever they're given... that would hopefully mean they wouldn't need to care about how many instances there were. But I think that that would require making the partitioning constraints more declarative, so that lint/check could execute the partitioning before calling the tools.
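One way to picture "more declarative" partitioning, as a rough sketch only: the tool supplies a key function, and the core goal code does the grouping before calling the tool once per group. The names `SourceFile`, `partition_by`, and `run_lint` are hypothetical and do not correspond to existing Pants rules or types.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Callable, Dict, Hashable, List, Sequence, TypeVar

R = TypeVar("R")


@dataclass(frozen=True)
class SourceFile:
    # Hypothetical stand-in for a target/field set.
    path: str
    interpreter_constraints: str


def partition_by(
    files: Sequence[SourceFile],
    key: Callable[[SourceFile], Hashable],
) -> Dict[Hashable, List[SourceFile]]:
    """Group files by a declared key; the tool never sees other groups."""
    groups: Dict[Hashable, List[SourceFile]] = defaultdict(list)
    for f in files:
        groups[key(f)].append(f)
    return dict(groups)


def run_lint(
    files: Sequence[SourceFile],
    key: Callable[[SourceFile], Hashable],
    run_tool: Callable[[List[SourceFile]], R],
) -> List[R]:
    """Core code partitions first, then invokes the tool once per partition."""
    return [run_tool(group) for group in partition_by(files, key).values()]
```

Here the tool only declares `key` (for example, `lambda f: f.interpreter_constraints`) and operates on whatever group it is handed.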

This relates to a conversation from the other day: it would be great to be able to optimize partition sizes independently of the constraints (one instance of yapf is too few on a 64 core machine, while one per file is too many: https://pantsbuild.slack.com/archives/C046T6T9U/p1635271709151400), and removing most of that logic from individual linters would help enable that. Then we could play with partition sizes independently of constraints.
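As a rough illustration of tuning partition sizes independently of constraints, each constraint-based partition could be split further into batches based on the number of cores. This is a sketch under assumptions: `batch`, `max_batches`, and `min_batch_size` are hypothetical names and parameters, not existing Pants options.

```python
import math
from typing import List, Sequence, TypeVar

T = TypeVar("T")


def batch(items: Sequence[T], max_batches: int, min_batch_size: int = 1) -> List[List[T]]:
    """Split items into at most `max_batches` roughly equal batches.

    `min_batch_size` keeps the split from degenerating into one process per file.
    """
    if not items:
        return []
    # Number of batches is capped by both the core count and the minimum size.
    n = max(1, min(max_batches, len(items) // min_batch_size))
    size = math.ceil(len(items) / n)
    return [list(items[i : i + size]) for i in range(0, len(items), size)]
```

For instance, `batch(files, max_batches=os.cpu_count() or 1, min_batch_size=16)` would sit between "one instance of yapf" and "one per file", and could be adjusted without touching the constraint logic.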

stuhood · 2022-01-19T00:06:02Z

Relates to #13462.

But yes, we should probably stream the results of individual partitions, in addition to cleaning them up and making them quieter by default, à la #14129.

cognifloyd added the "backend: Python" (Python backend-related issues) label · Mar 13, 2023
