Wow, this is really amazing! Really great analysis.
LGTM
wow! :D
I'm trying to dig deeper into what is the actual bit of work that was getting repeated or backed up here. Fundamentally I don't believe it. I've seen it with my own eyes but I still don't believe it.
I was expecting something like this. You might see some more improvement once ipfs/js-datastore-fs#21 lands. We should likely do a call to understand what is happening and why.
The only significant difference I can see in the pull stream code between …
Wow, this is absolutely fantastic!!! Amazing work @achingbrain!! 👏🏽👏🏽👏🏽 @achingbrain would you be down to record a 5 minute screencast (maybe even just 2) to share with the community? Also, would love to have a session on how you approach these performance problems and generate these graphs so that everyone can learn from it.
I was just checking this out with this script:

```js
const Ipfs = require('ipfs')
const Os = require('os')
const Path = require('path')
const Crypto = require('crypto')

const KB = 1024
const MB = KB * 1024

console.log('starting...')

const ipfs = new Ipfs({ repo: Path.join(Os.tmpdir(), `${Date.now()}`) })

ipfs.on('ready', async () => {
  console.log('ready...')

  const data = Crypto.randomBytes(240 * MB)

  console.log('adding...')
  console.time('added')

  const res = await ipfs.add(data)

  console.log(res[0].hash)
  console.timeEnd('added')

  await ipfs.stop()
  process.exit()
})

ipfs.on('error', console.error)

console.log('waiting for ready...')
```

I get the output:

```
starting...
waiting for ready...
Swarm listening on /ip4/127.0.0.1/tcp/4002/ipfs/QmcNzyg3kVu7n1Gmwc1am8BU1kqVh158oLUP1JGmxubstt
Swarm listening on /ip4/192.168.1.68/tcp/4002/ipfs/QmcNzyg3kVu7n1Gmwc1am8BU1kqVh158oLUP1JGmxubstt
Swarm listening on /ip4/127.0.0.1/tcp/4003/ws/ipfs/QmcNzyg3kVu7n1Gmwc1am8BU1kqVh158oLUP1JGmxubstt
ready...
adding...
QmXWAhTKcYSJtSgVoEuPAZsc77pH8aG4HpsYgtBoM5uG7z
added: 6986.134ms
```

Am I doing this right? With 0.34.0-rc.1 I'm getting ~6s minimum, which is definitely way faster than with 0.33 (confirmed ~13s), but it's a way off from <2s.
On our benchmark server we are measuring 2.07s for Go and 2.19s for Node (it dropped from 3s to 2s) to add a 64MB file.
Fixes #9
Converts the chunk-by-chunk passing of blocks to IPLD to be done in parallel instead, using `pull-paramap`.

You can limit the concurrency in `pull-paramap` - I wasn't sure which value to use, so I did a comparison using this test, which uses `pull-buffer-stream` to generate streams of buffers of random bytes. The results were (lower is better):
So there's a slight overhead with 100x concurrency, but unbounded is about the same as 50x on my laptop so in this PR I leave it unbounded.
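For what it's worth, here's a minimal sketch of what bounding the write concurrency with `pull-paramap` looks like - this is illustrative only, not the actual importer code, and `putBlock` is a hypothetical stand-in for the real block store write:

```js
const pull = require('pull-stream')
const paramap = require('pull-paramap')
const crypto = require('crypto')

// Hypothetical stand-in for persisting a block and calling back when done
const putBlock = (block, cb) => setImmediate(() => cb(null, block.length))

pull(
  // Ten 256KB chunks of random bytes standing in for file chunks
  pull.values(Array.from({ length: 10 }, () => crypto.randomBytes(256 * 1024))),
  // Write blocks in parallel; the second argument caps concurrency (e.g. 50).
  // Omit it to leave the parallelism unbounded, as this PR does.
  paramap(putBlock, 50),
  pull.collect((err, sizes) => {
    if (err) throw err
    console.log(`wrote ${sizes.length} blocks`)
  })
)
```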
This is only 0-20% of the file being added. I was intrigued by the jump, so I left it running in unbounded parallel mode to see if there were any other bottlenecks:

The line is reasonably straight, so at least it's constant and that's ok, right? Wrong. Each value on the x axis is 1%, but the y axis is how long it took to ingest that 1%, so it's getting slower over time. Adding a file to IPFS is basically going in O(n log n) time, which is kind of bad for large files.
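As an aside, per-1% timings like those in the graph can be gathered with a simple observer dropped into the pipeline; this is just a sketch and the `timeEachPercent` helper is hypothetical, not something from this PR:

```js
const pull = require('pull-stream')

// Hypothetical helper: logs how long each 1% of the file takes to flow through the pipeline
const timeEachPercent = (totalBytes) => {
  const onePercent = Math.ceil(totalBytes / 100)
  let bytesInBucket = 0
  let bucketStart = Date.now()

  return pull.through((chunk) => {
    bytesInBucket += chunk.length

    while (bytesInBucket >= onePercent) {
      bytesInBucket -= onePercent
      const now = Date.now()
      console.log(`1% ingested in ${now - bucketStart}ms`)
      bucketStart = now
    }
  })
}
```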
What's causing this? It turns out using `pull-stream/throughs/map` to calculate file ingest progress is the culprit, and it allocates a new buffer for each file chunk to boot, so it's slow and memory inefficient.

Switching that out for a `pull-stream/throughs/through` results in the following graph:

Or about 2300x faster.
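Roughly, the difference looks like this - an illustrative sketch, not the actual importer code, where `onProgress` and the copy via `Buffer.from` just stand in for the behaviour described above:

```js
const pull = require('pull-stream')

let bytesRead = 0
const onProgress = (bytes) => { /* e.g. emit a progress event */ }

// map() produces a value for every chunk, so a progress hook written this way
// ends up allocating a copy of each chunk as it passes through
const progressWithMap = pull.map((chunk) => {
  bytesRead += chunk.length
  onProgress(bytesRead)
  return Buffer.from(chunk) // extra allocation per chunk
})

// through() just observes each chunk and passes it along untouched
const progressWithThrough = pull.through((chunk) => {
  bytesRead += chunk.length
  onProgress(bytesRead)
})
```

Both are transforms you would place in the middle of a `pull(...)` pipeline; only the second leaves the chunks alone.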
Writing the file chunks in series results in:
Or about 2200x faster.
So switching from series to parallel writes gets you a modest speed increase, but not changing pull stream data in-flight gets you an enormous boost.
In real-world use, this changed the time it takes to `jsipfs add` a 260MB file to a fresh repo from 13.7s to 1.95s. By comparison, `go-ipfs` takes 1.58s to add the same file to a fresh repo.