fix: faster workspace mapping #138

thecodrr · 2023-12-22T17:56:06Z

This PR changes the workspace finding algorithm to be around 2x faster by:

globing only once instead of for each pattern
Using ignore in glob for negated patterns

The results on my machine look quite good:

┌─────────┬───────────────┬──────────┬────────────────────┬───────────┬─────────┐
│ (index) │   Task Name   │ ops/sec  │ Average Time (ns)  │  Margin   │ Samples │
├─────────┼───────────────┼──────────┼────────────────────┼───────────┼─────────┤
│    0    │     'old'     │   '97'   │ 10300189.995765686 │ '±58.58%' │   10    │
│    1    │     'new'     │  '195'   │ 5119704.985618591  │ '±6.59%'  │   20    │
│    2    │ 'new virtual' │ '24,096' │  41499.5856304881  │ '±3.61%'  │  2410   │
│    3    │ 'old virtual' │ '54,503' │ 18347.38597894367  │ '±1.64%'  │  5451   │
└─────────┴───────────────┴──────────┴────────────────────┴───────────┴─────────┘

This PR doesn't break any test or introduce any new package.

References

tap-snapshots/test/test.js.test.cjs

lib/index.js

thecodrr · 2024-01-07T16:43:31Z

lib/index.js

+  // we must preserve the order of results according to the given list of
+  // workspace patterns
+  const orderedMatches = []
+  for (const pattern of patterns) {
+    orderedMatches.push(...matches.filter((m) => {
+      return minimatch(m, pattern, { partial: true, windowsPathsNoEscape: true })
+    }))
+  }


I am not a fan of this approach because there can be edge cases where the reverse matching fails. We will have to account for every case that is handled by glob. Another problem could be a single pattern matching multiple matches causing unnecessary duplicates. We can sift them out but it seems unnecessary to me.

The only reason we have to do this is due to this line:

matches = matches.sort((a, b) => a.localeCompare(b, 'en'))

which sorts all matches alphabetically. glob maintains the given order of patterns automatically so we take this line out, everything works as intended. However, what's "intended" is vague. For well-defined patterns like docs, smoke-tests, there's no problem but if the user adds a wildcard like workspaces/* then we have to decide on how to order the results.

If we take out the above line, the order for wildcard matches depend on glob and I have found no way to change this behavior.

@wraithgar what do you think?

Additionally, not sorting the results alphabetically gives us an extra 5% performance boost.

Sorry I didn't see this notification and it got lost in the avalanche of cli work.

I didn't quite follow what you were meaning about the "reverse matching".

As long as the results from something like workspaces/* comes back sorted, and those results are in the same order as the entry for workspaces/* is in the package.json, we are fine.

As far as I can tell, glob doesn't have a guaranteed sort order so that is why the sort was added here.

Sorry if this doesn't clarify, I would really like to see this land and again apologize that it slipped through the cracks.

wraithgar · 2024-04-10T17:02:59Z

lib/index.js

+      seenPackagePathnames = new Set()
+      seen.set(name, seenPackagePathnames)
+    }
+    seenPackagePathnames.add(packagePathname)


Because seenPackagePathnames is always added to at least once for every name, line 167 (the continue below) is now unreachable:

for (const [packageName, seenPackagePathnames] of seen) { if (seenPackagePathnames.size === 0) { continue }

I think at if statement can be removed.

wraithgar · 2024-04-10T17:03:54Z

I have a local copy of this branch that removes the now unreachable if condition, and adds a test for the last line of coverage missing. I will make a new PR for it and we can land that if we're ready. I think this works as-is, even if there may be more we can do in the future.

#143

@thecodrr

This is a cleanup of #138 to get code coverage finished in the tests. Credit: @thecodrr --------- Co-authored-by: Abdullah Atta <abdullahatta@streetwriters.co>

thecodrr added 2 commits December 22, 2023 22:51

2x faster workspace mapping

420ce56

update snapshots

657037d

thecodrr requested a review from a team as a code owner December 22, 2023 17:56

thecodrr commented Dec 22, 2023

View reviewed changes

tap-snapshots/test/test.js.test.cjs Outdated Show resolved Hide resolved

sort and fix the order

e4a04dd

wraithgar reviewed Jan 4, 2024

View reviewed changes

lib/index.js Outdated Show resolved Hide resolved

preserve order of workspaces

a7c5781

thecodrr commented Jan 7, 2024

View reviewed changes

wraithgar self-assigned this Jan 8, 2024

wraithgar changed the title ~~2x faster workspace mapping~~ fix: faster workspace mapping Jan 31, 2024

wraithgar reviewed Apr 10, 2024

View reviewed changes

wraithgar mentioned this pull request Apr 10, 2024

fix: faster workspace mapping #143

Merged

wraithgar added a commit that referenced this pull request Apr 10, 2024

fix: faster workspace mapping (#143)

c89a529

This is a cleanup of #138 to get code coverage finished in the tests. Credit: @thecodrr --------- Co-authored-by: Abdullah Atta <abdullahatta@streetwriters.co>

wraithgar closed this Apr 10, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: faster workspace mapping #138

fix: faster workspace mapping #138

thecodrr commented Dec 22, 2023

thecodrr Jan 7, 2024 •

edited

Loading

thecodrr Jan 7, 2024 •

edited

Loading

wraithgar Jan 31, 2024

wraithgar Apr 10, 2024

wraithgar commented Apr 10, 2024 •

edited

Loading

fix: faster workspace mapping #138

fix: faster workspace mapping #138

Conversation

thecodrr commented Dec 22, 2023

References

thecodrr Jan 7, 2024 • edited Loading

Choose a reason for hiding this comment

thecodrr Jan 7, 2024 • edited Loading

Choose a reason for hiding this comment

wraithgar Jan 31, 2024

Choose a reason for hiding this comment

wraithgar Apr 10, 2024

Choose a reason for hiding this comment

wraithgar commented Apr 10, 2024 • edited Loading

thecodrr Jan 7, 2024 •

edited

Loading

thecodrr Jan 7, 2024 •

edited

Loading

wraithgar commented Apr 10, 2024 •

edited

Loading