parallelize batch flushing #4296

Merged
merged 3 commits into master from feat/parallel-batch on Oct 14, 2017
Conversation

Stebalien
Member

  1. Modern storage devices (e.g., SSDs) tend to be highly parallel.
  2. Allows us to read and write at the same time (avoids pausing while flushing).

This makes ipfs add --local ~3.5x faster with the flatfs datastore (untested
with badger).

fixes #898 (comment)

License: MIT
Signed-off-by: Steven Allen <steven@stebalien.com>

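A minimal Go sketch of the approach described above, assuming hypothetical Block and Datastore types and field names (not the actual go-ipld-format code; the real code tracks an activeCommits counter, visible in the review excerpts below, while this sketch uses a buffered channel as the semaphore):

package batch

import (
	"runtime"
	"sync"
)

// Block and Datastore are illustrative stand-ins, not the real interfaces.
type Block []byte

type Datastore interface {
	PutMany(blocks []Block) error
}

// ParallelBatchCommits is the number of batch commits that can be in-flight
// before blocking (the constant discussed in the review below).
var ParallelBatchCommits = runtime.NumCPU() * 2

type Batch struct {
	ds      Datastore
	blocks  []Block
	size    int
	maxSize int

	sema chan struct{} // bounds the number of in-flight commits
	wg   sync.WaitGroup

	mu  sync.Mutex
	err error // first error reported by a background commit
}

func NewBatch(ds Datastore, maxSize int) *Batch {
	return &Batch{
		ds:      ds,
		maxSize: maxSize,
		sema:    make(chan struct{}, ParallelBatchCommits),
	}
}

// Put buffers a block and triggers an asynchronous flush once the buffer
// fills, so reading and hashing continue while earlier batches are written.
func (b *Batch) Put(blk Block) error {
	b.blocks = append(b.blocks, blk)
	b.size += len(blk)
	if b.size >= b.maxSize {
		return b.flush()
	}
	return nil
}

// flush hands the current buffer to a goroutine and starts a fresh one.
func (b *Batch) flush() error {
	b.sema <- struct{}{} // blocks if ParallelBatchCommits are already in flight
	b.wg.Add(1)
	go func(blocks []Block) {
		defer b.wg.Done()
		defer func() { <-b.sema }()
		if err := b.ds.PutMany(blocks); err != nil {
			b.mu.Lock()
			b.err = err
			b.mu.Unlock()
		}
	}(b.blocks)

	// Preallocate the next buffer to the size of the one just flushed
	// (the preallocation settled on in the review thread below).
	b.blocks = make([]Block, 0, len(b.blocks))
	b.size = 0

	b.mu.Lock()
	defer b.mu.Unlock()
	return b.err
}

// Commit flushes any remaining blocks and waits for all in-flight commits.
func (b *Batch) Commit() error {
	if err := b.flush(); err != nil {
		return err
	}
	b.wg.Wait()
	b.mu.Lock()
	defer b.mu.Unlock()
	return b.err
}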
@Stebalien
Member Author

We may want to reduce the parallelism. 2*NumCpu is slightly faster than NumCpu but will allocate 2x the memory (on a 4 thread CPU, it will allocate 64MiB instead of 32MiB).

However, we should probably test badger first (it may work better with increased parallelism).
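To make the memory math concrete: the quoted figures imply roughly 8 MiB buffered per in-flight batch (an assumption inferred from the numbers above, not read from the code):

package main

import "fmt"

func main() {
	const perBatch = 8 << 20 // 8 MiB per in-flight batch (assumed)
	const threads = 4        // the 4-thread CPU from the example
	fmt.Printf("NumCpu:   %d MiB\n", threads*perBatch>>20)   // 4 * 8 MiB = 32 MiB
	fmt.Printf("2*NumCpu: %d MiB\n", 2*threads*perBatch>>20) // 8 * 8 MiB = 64 MiB
}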

@Stebalien
Member Author

This makes adds with sync enabled almost as fast as adds with sync disabled with the badger datastore (and the same 2*NumCpu constant seems to work).

@whyrusleeping
Member

This is really nice, great find here :)


// ParallelBatchCommits is the number of batch commits that can be in-flight before blocking.
// TODO: Experiment with multiple datastores, storage devices, and CPUs to find
Member

Can you create issues for this instead of in-code TODOs? (They just get forgotten.)

Member Author

Fine! (...grumble... we'll never get to it anyways)

Member

That might be true, but it is still better than having a TODO in code.
As you said in your issue, someone might just take a stab at it out of pure boredom.

Member Author

You're right, I was just being lazy 🙂.

}(t.blocks)

t.activeCommits++
t.blocks = nil
@Kubuxu
Member
Oct 11, 2017

I would preallocate a buffer of MaxBlocks here, as appending will expand the buffer and cause more allocations and copies.

Member Author

Technically, the max size of this array is 128 pointers to blocks (2KiB). However, it will likely never be greater than 32 pointers (0.5KiB) assuming that we have 256KiB blocks. Does 32 sound like a reasonable default size?

Personally, I don't think that will make much of a difference. We already do 1 allocation per block so this will only add another log(n) allocations.

Member Author

Actually, I just preallocated a blocks array of the same size as the one we just filled. That should be a reasonable guess.

Member

As Go allocates the next power of 2, getting to 128 would take 9 reallocations and copies. IMO it is worth it.

Member Author

You're right; my log(n) estimate was incorrect anyway. It's log(n) per batch but still O(n) overall (7-15% allocation overhead, depending on the block sizes).
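A small self-contained check of the growth behavior being described (the exact capacity sequence is a Go implementation detail, but small slices grow by roughly doubling):

package main

import "fmt"

func main() {
	// Append one element at a time from a nil slice: the runtime roughly
	// doubles capacity on each growth, so every power of two up to 128
	// costs a fresh allocation plus a copy.
	var grown []int
	prev := 0
	for i := 0; i < 128; i++ {
		grown = append(grown, i)
		if cap(grown) != prev {
			prev = cap(grown)
			fmt.Println("grew to cap", prev) // typically 1, 2, 4, ..., 128
		}
	}

	// Preallocating does it in a single allocation, as suggested above.
	prealloc := make([]int, 0, 128)
	fmt.Println("preallocated cap", cap(prealloc)) // 128
}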

License: MIT
Signed-off-by: Steven Allen <steven@stebalien.com>
It's probably safe to assume that this buffer will be about the same size each flush.

This could cause 1 extra allocation (if this is the last commit) but that's
unlikely to be an issue.

License: MIT
Signed-off-by: Steven Allen <steven@stebalien.com>
@Stebalien
Member Author

Stebalien commented Oct 11, 2017

After further testing, the effect isn't nearly so pronounced for medium-size files (the tests above were on single large files) and is probably non-existent for small files as we create a new batch per file. We should consider using the same batch when adding multiple small files.
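A sketch of that suggestion, continuing the Batch sketch near the top of this thread (same hypothetical package and types); chunkFile is a made-up stand-in for the real chunker:

// addFiles shares one Batch across many small files instead of creating a
// batch per file, so small adds can still amortize and overlap flushes.
func addFiles(ds Datastore, files [][]byte) error {
	b := NewBatch(ds, 8<<20) // one shared batch for the whole add
	for _, f := range files {
		for _, blk := range chunkFile(f) {
			if err := b.Put(blk); err != nil {
				return err
			}
		}
	}
	return b.Commit() // flush the remainder and wait for in-flight commits
}

// chunkFile splits a file into 256 KiB blocks (a stand-in for the real chunker).
func chunkFile(data []byte) []Block {
	const chunkSize = 256 << 10
	var blocks []Block
	for len(data) > 0 {
		n := chunkSize
		if n > len(data) {
			n = len(data)
		}
		blocks = append(blocks, Block(data[:n]))
		data = data[n:]
	}
	return blocks
}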

@whyrusleeping whyrusleeping merged commit 9c06a0e into master Oct 14, 2017
@whyrusleeping whyrusleeping deleted the feat/parallel-batch branch October 14, 2017 12:34
Stebalien added a commit to Stebalien/go-ipld-format that referenced this pull request Oct 16, 2017
(ipfs/kubo#4296)

1. Modern storage devices (e.g., SSDs) tend to be highly parallel.
2. Allows us to read and write at the same time (avoids pausing while flushing).

fixes ipfs/kubo#898 (comment)
Labels
topic/merkledag, topic/perf
Development

Successfully merging this pull request may close these issues.

ipfs add horrendously slow
4 participants