[internal] Add a benchmark for Snapshot capture #12569

stuhood · 2021-08-13T23:53:55Z

Refactor the test harness to allow for creating larger files (without allocating their full contents in memory), and add benchmarks for a series of file counts, sizes, and repetitions.

Preparation for #12563.

[ci skip-build-wheels]

stuhood · 2021-08-13T23:55:02Z

src/rust/engine/fs/store/benches/store.rs

+    (20, 10_000_000, true, 10),
+    (1, 200_000_000, true, 10),
+  ] {
+    let (count, size, _immutable, captures) = params;


The _immutable flag becomes relevant in #12563.

I'm not sure I understand the point of captures > 1? If we expect them to be no-ops, why are we benchmarking them?

Anytime we Snapshot the filesystem, if we discover that we've already captured a file in the LMDB store, we'll skip re-storing it. But that still involves hashing it to determine that we don't need to store it again. So it's a noop in terms of storage, but not in terms of hashing: LMDB::put will skip storing the value, and in #12563, LMDB::reserve will fail with an "exists" error and prevent the second pass over the data.

The captures ratio here is supposed to align with usecases (I'll add a comment in #12563):

(100, 100, false, 100) is supposed to represent Snapshotting the workspace: lots of small hand written files that change relatively infrequently (100 captures is 1 write and 99 noops)

(20, 10_000_000, true, 10) and (1, 200_000_000, true, 10) both represent capturing out of local process execution sandboxes: immutable data that is more unique, so more writes and fewer noops.

illicitonion

LGTM - one question

illicitonion · 2021-08-14T10:48:14Z

src/rust/engine/fs/store/benches/store.rs

+    (20, 10_000_000, true, 10),
+    (1, 200_000_000, true, 10),
+  ] {
+    let (count, size, _immutable, captures) = params;


I'm not sure I understand the point of captures > 1? If we expect them to be no-ops, why are we benchmarking them?

A private repository experienced OOM issues in CI with a large number of tests. Allocation profiling highlighted that a large batch of `32MB`/`64MB` allocations was occurring around the same time, all caused by the fact that `PosixFS::read_file` slurps an entire file into memory as `FileContent`. Because local process execution uses `PosixFS::read_file` to capture sandbox outputs, a bunch of processes finishing at once could lead to memory usage spikes. To fix that, this change removes `PosixFS::read_file` (which was only ever used to digest files), and replaces it with `{Store,local::ByteStore,ShardedLmdb}::store*` methods which make two-ish passes over a `Read` instance in order to digest and capture it. The methods take an `immutable_data` parameter so that callers can indicate that they know that data will not change, which allows for avoiding hashing it on the second pass. Captures from process sandboxes are treated as immutable, while memoized captures from the workspace are not. ---- #12569 adds relevant benchmarks. Profiling allocation during those benchmarks shows peak memory usage of `945MB` on `main`, and `362MB` on the branch. Additionally, the benchmarks all run faster (likely due to reduced allocation): ``` snapshot_capture/snapshot_capture((100, 100, false, 100)) time: [1.8909 s 1.9000 s 1.9109 s] change: [-27.459% -25.841% -24.085%] (p = 0.00 < 0.05) Performance has improved. snapshot_capture/snapshot_capture((20, 10000000, true, 10)) time: [1.2539 s 1.2655 s 1.2782 s] change: [-62.138% -61.318% -60.505%] (p = 0.00 < 0.05) Performance has improved. snapshot_capture/snapshot_capture((1, 200000000, true, 10)) time: [3.5281 s 3.5773 s 3.6299 s] change: [-13.420% -11.985% -10.434%] (p = 0.00 < 0.05) Performance has improved. ```

stuhood added 3 commits August 13, 2021 16:47

Refactor benchmark setup to allow larger files.

e4e11ff

[ci skip-build-wheels]

Add a snapshot_capture benchmark.

a2d1065

[ci skip-build-wheels]

Check rather than building.

765c5cc

stuhood requested review from jsirois, tdyas and illicitonion August 13, 2021 23:53

stuhood commented Aug 13, 2021

View reviewed changes

stuhood mentioned this pull request Aug 14, 2021

Stream files into the local store while capturing them #12563

Merged

illicitonion approved these changes Aug 14, 2021

View reviewed changes

stuhood merged commit 180ed91 into pantsbuild:main Aug 14, 2021

stuhood deleted the stuhood/snapshot-capture-benchmark branch August 14, 2021 16:35

stuhood mentioned this pull request Aug 14, 2021

Prepare 2.7.0.dev4 #12570

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[internal] Add a benchmark for Snapshot capture #12569

[internal] Add a benchmark for Snapshot capture #12569

stuhood commented Aug 13, 2021

stuhood Aug 13, 2021

illicitonion Aug 14, 2021

stuhood Aug 14, 2021

illicitonion left a comment

illicitonion Aug 14, 2021

[internal] Add a benchmark for Snapshot capture #12569

[internal] Add a benchmark for Snapshot capture #12569

Conversation

stuhood commented Aug 13, 2021

stuhood Aug 13, 2021

Choose a reason for hiding this comment

illicitonion Aug 14, 2021

Choose a reason for hiding this comment

stuhood Aug 14, 2021

Choose a reason for hiding this comment

illicitonion left a comment

Choose a reason for hiding this comment

illicitonion Aug 14, 2021

Choose a reason for hiding this comment