Avoid multiple iterations over `nodes` and `symbols` in linter #5204

overlookmotel · 2024-08-25T17:15:22Z

Various linter rules are implemented as a Rule::run_once method which iterates over nodes or symbols.

Usually this is because they work in 2 passes. The first pass collects info from the AST/symbols and then the 2nd processes that info.

This results in nodes or symbols being unnecessarily iterated over multiple times, when run and run_on_symbol methods are already provided for doing this.

I imagine it'd be faster to:

Add a Rule::run_once_at_end method which executes after run and run_on_symbol have executed on all nodes/symbols.
Make Rule::run and Rule::run_on_symbol take a &mut self instead of &self.
Reimplement these rules:
- To store their intermediate data in self.
- Do the main loop over nodes or symbols as a run / run_on_symbol method.
- Do the 2nd pass in run_once_at_end.

It's iterating over nodes repeatedly which would likely have the largest perf impact. But there are so many rules which iterate over symbols in run_once that it might also be having a noticeable impact in aggregate.

Which rules?

Rules which use run_once in this way are:

Iterating over `nodes`

eslint/func_names
eslint/no_this_before_super
import/no_named_as_default_member
jest/no_large_snapshots
jest/prefer_hooks_in_order
jsdoc/require_returns
tree_shaking/no_side_effects_in_initialization (which does a partial tree traversal)

Iterating over `symbols`

eslint/no_shadow_restricted_name
jest/consistent_test_it
jest/expect_expect
jest/max_expects
jest/max_nested_describes
jest/no_alias_methods
jest/no_conditional_expect
jest/no_confusing_set_timeout
jest/no_disabled_tests
jest/no_done_callback
Loads more jest rules + vitest rules via collect_possible_jest_call_node

@DonIsaac What do you think?

The text was updated successfully, but these errors were encountered:

DonIsaac · 2024-08-25T21:16:51Z

I like where this is going. I need to think on this for a bit before I give an answer.

Boshen · 2024-08-26T01:58:30Z

This is not a priority, none of the rules are enabled by default.

overlookmotel · 2024-08-26T08:18:42Z

#5201 was an example of fixing a rule which was unnecessarily iterating over symbols.

Boshen · 2024-09-07T10:55:03Z

cc @mysteryven if you're intrested.

shulaoda · 2024-09-12T09:12:36Z

Is there any progress? I think this is a good idea. 🤔

Boshen · 2024-09-12T09:37:54Z

Is there any progress? I think this is a good idea. 🤔

I marked this as good first issue since no one is working on this 😅

shulaoda · 2024-09-12T09:40:05Z

Let me implement it. 👀

overlookmotel · 2024-09-12T19:32:32Z

Thank you @shulaoda! That'd be great.

The ones which iterate over nodes should be first priority, as they'll be the most expensive.

Please keep PRs small so we can review them more easily. 1 PR for the changes to Rule (run_once_at_end etc). And then please update each rule in a separate PR.

Updating Rule methods to use a mutable &mut self may be more complicated than it sounds, because need a nice API for setting up internal state. Feel free to ask for advice if you run into any snags.

Boshen · 2024-09-13T04:10:50Z

Make Rule::run and Rule::run_on_symbol take a &mut self instead of &self.

This is not going to work. The rules are singletons, immutable by design and cannot contain state. They are shared across threads for all files run in parallel.

If we really need state, we need to add the states to context, something like FxHashMap<Rule, Box<dyn RuleState>>

overlookmotel · 2024-09-13T17:31:28Z

Ah I didn't realize that rules were shared across threads.

As far as I understand, each rule gets its own LinterCtx object. Are they also shared across threads?

shulaoda · 2024-09-25T10:16:55Z

We seem to be able to close this issue now.

overlookmotel added A-linter Area - Linter C-performance Category - Solution not expected to change functional behavior, only performance labels Aug 25, 2024

Boshen added the good first issue Experience Level - Good for newcomers label Sep 7, 2024

Boshen assigned shulaoda Sep 12, 2024

shulaoda mentioned this issue Sep 13, 2024

refactor(linter): move run_once to the end #5741

Closed

shulaoda mentioned this issue Sep 14, 2024

feat(linter): support shared rule state in ctx #5770

Closed

overlookmotel mentioned this issue Sep 16, 2024

[RFC] feat(linter): support persisted rule state #5799

Closed

Boshen closed this as completed Sep 25, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Avoid multiple iterations over `nodes` and `symbols` in linter #5204

Avoid multiple iterations over `nodes` and `symbols` in linter #5204

overlookmotel commented Aug 25, 2024

DonIsaac commented Aug 25, 2024

Boshen commented Aug 26, 2024

overlookmotel commented Aug 26, 2024

Boshen commented Sep 7, 2024

shulaoda commented Sep 12, 2024

Boshen commented Sep 12, 2024

shulaoda commented Sep 12, 2024

overlookmotel commented Sep 12, 2024 •

edited

Loading

Boshen commented Sep 13, 2024 •

edited

Loading

overlookmotel commented Sep 13, 2024

shulaoda commented Sep 25, 2024

Avoid multiple iterations over nodes and symbols in linter #5204

Avoid multiple iterations over nodes and symbols in linter #5204

Comments

overlookmotel commented Aug 25, 2024

Which rules?

Iterating over nodes

Iterating over symbols

DonIsaac commented Aug 25, 2024

Boshen commented Aug 26, 2024

overlookmotel commented Aug 26, 2024

Boshen commented Sep 7, 2024

shulaoda commented Sep 12, 2024

Boshen commented Sep 12, 2024

shulaoda commented Sep 12, 2024

overlookmotel commented Sep 12, 2024 • edited Loading

Boshen commented Sep 13, 2024 • edited Loading

overlookmotel commented Sep 13, 2024

shulaoda commented Sep 25, 2024

Avoid multiple iterations over `nodes` and `symbols` in linter #5204

Avoid multiple iterations over `nodes` and `symbols` in linter #5204

Iterating over `nodes`

Iterating over `symbols`

overlookmotel commented Sep 12, 2024 •

edited

Loading

Boshen commented Sep 13, 2024 •

edited

Loading