Improve InfluxDB through-put performance #2251

sparrc · 2017-01-11T13:40:19Z

This changes the current use of the InfluxDB client to instead use a
baked-in client that uses the fasthttp library.

This allows for significantly smaller allocations, the re-use of http
body buffers, and the re-use of the actual bytes of the line-protocol
metric representations.

Required for all PRs:

CHANGELOG.md updated (we recommend not updating this until the PR has been approved by a maintainer)
Sign CLA (if not already signed)
README.md updated (if adding a new plugin)

sebito91 · 2017-01-11T14:06:22Z

This looks awesome! By chance did you note what kind of improvement this library gave you? Is it really worthwhile to replace net/http request with fasthttp?

sparrc · 2017-01-11T14:13:13Z

Yes, I'm working on getting together some performance benchmarks right now :)

sparrc · 2017-01-11T14:51:18Z

Ah, I misread your comment, I have not directly compared using net/http vs. fasthttp when writing this influxdb client. I was largely relying on ad-hoc testing done while we made the write client for https://github.com/influxdata/influx-stress.

Also, the first question in the fasthttp faq is very relevant. The 1st and 3rd bullet points create significant allocation overhead, whereas fasthttp solves this overhead easily by providing buffer re-use functions, see here and here.

That being said, it's the request body that has the memory overhead and I'm not sure how much each "request" structure itself really matters. I'm going to try to refactor to use net/http to see what the exact difference would be in performance, as I don't really like adding another third-party dependency if we don't have to.

sparrc · 2017-01-11T14:52:52Z

This change currently doubles the performance compared with master (writing to a single influxdb instance):

fasthttp:

$ influx-stress insert --fast --batch-size 1000 --points 10000000 --host http://localhost:8186
...
Write Throughput: 242383
Points Written: 10000000

master:

$ influx-stress insert --fast --batch-size 1000 --points 10000000 --host http://localhost:8186
...
Write Throughput: 122554
Points Written: 10000000

sparrc · 2017-01-11T16:09:43Z

Surprisingly I'm actually getting only very slightly improved performance using the fasthttp library, but much worse GC performance.

With fasthttp telegraf needs to run ~105 GC cycles to handle 4,000,000 metrics.

with net/http telegraf only needs to run ~31

Not sure why that is exactly, but I suspect that the standard library might have a more efficient way of reusing buffers when handling io.Reader bodies.

sebito91 · 2017-01-11T16:23:39Z

Hmm, thanks for the detailed feedback. Increased GC seems somewhat counterintuitive using fasthttp since it's meant to reuse buffers all day long. The doubled throughput is incredibly interesting and would warrant (IMHO) discussion with valyala.

If you run a flamegraph (thanks @jwilder) with fasthttp, do you see lots of activity in (*Response).AppendBodyString like in valyala/fasthttp#155

sparrc · 2017-01-11T16:26:01Z

@sebito91 that was comparing it with current master, which uses a different client and spends a lot of CPU time marshalling points from bytes to the influxdb point object and back to bytes.

This changes the current use of the InfluxDB client to instead use a baked-in client that uses the fasthttp library. This allows for significantly smaller allocations, the re-use of http body buffers, and the re-use of the actual bytes of the line-protocol metric representations.

sparrc force-pushed the influxdb-fasthttp-client branch from ff1153b to 9f46a27 Compare January 11, 2017 14:05

sparrc changed the title ~~Improve the InfluxDB through-put performance~~ Improve InfluxDB through-put performance Jan 11, 2017

sparrc force-pushed the influxdb-fasthttp-client branch from d5d0c87 to 8a22a12 Compare January 11, 2017 16:39

sparrc added this to the 1.3.0 milestone Jan 11, 2017

sparrc force-pushed the influxdb-fasthttp-client branch 2 times, most recently from 87d58b9 to 1b17842 Compare January 25, 2017 22:51

Revert using fasthttp library to net/http

c9e87a3

sparrc force-pushed the influxdb-fasthttp-client branch from 1b17842 to c9e87a3 Compare January 26, 2017 00:42

sparrc merged commit c9e87a3 into master Jan 26, 2017

sparrc deleted the influxdb-fasthttp-client branch January 27, 2017 22:59

francois2metz mentioned this pull request Feb 13, 2017

Fix setting the username and the password to the influxdb output. #2401

Merged

rlex mentioned this pull request May 25, 2017

Metric corruption when using http2 nginx reverse proxy with influxdb output #2854

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve InfluxDB through-put performance #2251

Improve InfluxDB through-put performance #2251

sparrc commented Jan 11, 2017 •

edited

Loading

sebito91 commented Jan 11, 2017

sparrc commented Jan 11, 2017

sparrc commented Jan 11, 2017

sparrc commented Jan 11, 2017 •

edited

Loading

sparrc commented Jan 11, 2017

sebito91 commented Jan 11, 2017

sparrc commented Jan 11, 2017

Improve InfluxDB through-put performance #2251

Improve InfluxDB through-put performance #2251

Conversation

sparrc commented Jan 11, 2017 • edited Loading

Required for all PRs:

sebito91 commented Jan 11, 2017

sparrc commented Jan 11, 2017

sparrc commented Jan 11, 2017

sparrc commented Jan 11, 2017 • edited Loading

sparrc commented Jan 11, 2017

sebito91 commented Jan 11, 2017

sparrc commented Jan 11, 2017

sparrc commented Jan 11, 2017 •

edited

Loading

sparrc commented Jan 11, 2017 •

edited

Loading