Cache manifest everywhere #14537

Hexcles · 2018-12-14T22:16:05Z

Move the cache layer of manifest objects from wpt.testfiles to
manifest.manifest so that all users of the manifest module can benefit.

This prevents us from having two in-memory copies of the manifest when
running affected tests (one loaded by wpt.testfiles and the other loaded
by wptrunner.testloader). Partly fixes #14421 (the memory usage part of
it, but not the code health part).

Hexcles · 2018-12-14T22:17:09Z

tools/wpt/testfiles.py

-
-load_manifest = _init_manifest_cache()
+def load_manifest(manifest_path=None, manifest_update=True):
+    from manifest import manifest


Switch to a delayed import here because otherwise Python will think there are two different manifest modules (because of different import paths) and we don't get the singleton.

What do you say we document your rationale with a comment? I'm afraid of your
optimization being accidentally subverted by some future refactoring (the
corresponding performance degradation might not be noticed immediately).

This is beyond my experience with Python, though. Can you advise on any
longer-term goals that would obviate this kind of consideration?

I added a comment here, but I don't really have great ideas how to make sure we always get a singleton.

Did you verify that the previous code was broken? I know that foo.bar and bar are different even if they're the same bar.py file, but I'm slightly surprised if that also affects relative imports.

Yes I did. It took me a while to realize what was going on.

Hexcles · 2018-12-14T22:18:22Z

On my machine, this reduces 2GB of memory usage (peak memory 5GB -> 3GB) when running

./wpt run --affected HEAD^1 --log-mach - --binary ~/tools/firefox-nightly/firefox firefox

(with a dummy local commit)

tools/manifest/manifest.py

Hexcles · 2018-12-18T23:09:53Z

I addressed the review comments, but am having trouble debugging the tools/wpt failure on macOS -- I can't reproduce the error locally on my Macbook.

I'll try to dig deeper.

Hexcles · 2018-12-19T04:17:05Z

So my working theory is that test_wpt.py had a module-level fixture to create a temporary manifest, which means the manifest path was the same for all test cases, but some test cases need to modify the manifest, so caching by path failed. The workaround is to use a different temporary manifest for each test case (and the temporary manifest is copied from a "persistent" one, which is initialized in a module-level setup function to avoid having to generate the manifest every time). See my second commit. This seems to work. However, I don't really understand why the problem only demonstrated on macOS.

@jgraham PTAL again.

foolip · 2018-12-19T15:34:18Z

FYI, I've rebased this branch to trigger "Travis CI - Pull Request", manually setting things right after the travis-ci.com transition in #14499 changed the name of the required check, which applies retroactively.

Move the cache layer of manifest objects from wpt.testfiles to manifest.manifest so that all users of the manifest module can benefit. This prevents us from having two in-memory copies of the manifest when running affected tests (one loaded by wpt.testfiles and the other loaded by wptrunner.testloader). Partly fixes #14421 (the memory usage part of it, but not the code health part).

The test module uses the same temporary manifest in all test cases, but some test cases modify the manifest, which causes inconsistency in the cache. This commit changes the test setup to initialize a persistent manifest on the module level, and each test case will then make a temporary copy of it. Also speed up tools/wpt tests on Azure by updating the manifest first.

wpt-pr-bot added infra manifest wpt labels Dec 14, 2018

wpt-pr-bot assigned jugglinmike Dec 14, 2018

wpt-pr-bot requested review from gsnedders, jgraham and jugglinmike December 14, 2018 22:16

Hexcles commented Dec 14, 2018

View reviewed changes

Hexcles force-pushed the cache-manifest branch from 45679cc to 681f95b Compare December 14, 2018 22:46

jugglinmike reviewed Dec 15, 2018

View reviewed changes

tools/manifest/manifest.py Outdated Show resolved Hide resolved

jgraham requested changes Dec 17, 2018

View reviewed changes

tools/manifest/manifest.py Outdated Show resolved Hide resolved

tools/manifest/manifest.py Outdated Show resolved Hide resolved

Hexcles force-pushed the cache-manifest branch from 681f95b to 5680792 Compare December 18, 2018 22:47

Hexcles force-pushed the cache-manifest branch from 3bbb06e to 4225541 Compare December 19, 2018 04:05

foolip force-pushed the cache-manifest branch from 4225541 to 754cf11 Compare December 19, 2018 15:33

Hexcles added 2 commits December 19, 2018 12:15

Hexcles force-pushed the cache-manifest branch from 754cf11 to 9cedce9 Compare December 19, 2018 17:19

jgraham approved these changes Dec 20, 2018

View reviewed changes

Hexcles merged commit 5d72055 into master Dec 20, 2018

Hexcles deleted the cache-manifest branch December 20, 2018 18:29

Hexcles mentioned this pull request Jan 16, 2019

Azure Pipelines affected tests jobs are being canceled #14860

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cache manifest everywhere #14537

Cache manifest everywhere #14537

Hexcles commented Dec 14, 2018

Hexcles Dec 14, 2018

jugglinmike Dec 15, 2018

Hexcles Dec 19, 2018

jgraham Dec 20, 2018

Hexcles Dec 20, 2018

Hexcles commented Dec 14, 2018

Hexcles commented Dec 18, 2018

Hexcles commented Dec 19, 2018

foolip commented Dec 19, 2018

Cache manifest everywhere #14537

Cache manifest everywhere #14537

Conversation

Hexcles commented Dec 14, 2018

Hexcles Dec 14, 2018

Choose a reason for hiding this comment

jugglinmike Dec 15, 2018

Choose a reason for hiding this comment

Hexcles Dec 19, 2018

Choose a reason for hiding this comment

jgraham Dec 20, 2018

Choose a reason for hiding this comment

Hexcles Dec 20, 2018

Choose a reason for hiding this comment

Hexcles commented Dec 14, 2018

Hexcles commented Dec 18, 2018

Hexcles commented Dec 19, 2018

foolip commented Dec 19, 2018