Speed up JSON output #1114

mstoykov · 2019-08-14T09:25:50Z

The commit includes three main changes:

1. Don't write on every Collect call. Instead, as all the other outputs do, append to a buffer and write that buffer from a separate goroutine every 100 milliseconds.
2. Use json.Encoder to speed up the encoding.
3. If the output file name ends in `.gz`, gzip the output, which in my testing decreases the file size by 30x. I couldn't measure any performance degradation from the compression; if anything, there is some performance gain because of the smaller I/O writes.

I was testing with a simple script doing batch requests for a local
zero-length file with 40 VUs, which makes around 8k RPS using around
600-700 MB of memory. Before these changes I was getting 5.8-6k RPS; after
them I am getting 6.5-6.9k RPS with around 3 GB of memory usage.

At this point my CPU and memory profiling shows that the bottleneck is the
JSON encoding, at upwards of 10 seconds, so the next possible
optimizations would come from using some kind of faster JSON encoding.


stats/json/collector.go

mstoykov · 2019-08-14T09:33:24Z

I would like to add that the test here and in #1113 is basically using all available CPU either way, so there is essentially no headroom and adding an output WILL decrease performance in all cases.

In both PRs I am basically making that hit smaller and hopefully less noticeable in the cases where the CPU usage is at 100%, while still saving all of the metric data.

In this PR, for example, the fact that gzip decreases the JSON output by 30x makes the JSON output viable as a metric output even for large and long tests. The example above with 6.5k+ RPS for 4 minutes was producing 3 GB files, which got reduced to 100-120 MB with gzip.
While in the example above we will probably fill our memory with unwritten samples before a 2-hour test ends, it will probably work with a somewhat lower RPS.

I did some testing, and at around 5.5-5.6RPS all the JSON writes finish within 100 ms or less, which makes it viable for that load IMO.

codecov-io · 2019-08-14T09:37:57Z

Codecov Report

Merging #1114 into master will decrease coverage by 0.15%.
The diff coverage is 13.46%.

@@            Coverage Diff             @@
##           master    #1114      +/-   ##
==========================================
- Coverage   73.25%   73.09%   -0.16%     
==========================================
  Files         141      141              
  Lines       10260    10282      +22     
==========================================
  Hits         7516     7516              
- Misses       2302     2322      +20     
- Partials      442      444       +2

| Impacted Files | Coverage Δ | |
|---|---|---|
| stats/json/collector.go | `12.34% <13.46%> (-1.22%)` | ⬇️ |
| core/engine.go | `92.99% <0%> (-0.94%)` | ⬇️ |

Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update e3c279f...67cede7. Read the comment docs.

stats/json/collector.go

imiric

👍

mstoykov requested a review from na-- August 14, 2019 09:25

golangcibot reviewed Aug 14, 2019

stats/json/collector.go (outdated, resolved)

fix golangci warning

7a3d0d8

na-- requested changes Aug 14, 2019

stats/json/collector.go (outdated, resolved)

stats/json/collector.go (resolved)

stats/json/collector.go (resolved)

stats/json/collector.go (resolved)

na-- changed the title ~~perf(jsonOutput): speed up json output~~ Speed up json output Aug 14, 2019

na-- changed the title ~~Speed up json output~~ Speed up JSON output Aug 14, 2019

cuonglm suggested changes Aug 23, 2019

stats/json/collector.go (outdated, resolved)

stats/json/collector.go (outdated, resolved)

stats/json/collector.go (resolved)

stats/json/collector.go (resolved)

mstoykov mentioned this pull request Aug 26, 2019

Output check/threshold results in a machine-readable unit test format to publish test results in CI #1120

Closed

na-- mentioned this pull request Aug 28, 2019

Investigate telegraf integration in k6 #1064

Closed

na-- added this to the v0.26.0 milestone Aug 29, 2019

na-- requested a review from imiric August 29, 2019 08:50

mstoykov added 2 commits August 29, 2019 12:31

fix copy paste error json output log

f26c24f

json output: use logrus.WithError instead of WithField

67cede7

na-- approved these changes Aug 29, 2019

mstoykov requested a review from cuonglm August 29, 2019 11:14

mstoykov mentioned this pull request Aug 29, 2019

stats/cloud: stop sending cloud metrics when limit reached #1130

Merged

cuonglm approved these changes Aug 29, 2019

imiric approved these changes Aug 30, 2019

mstoykov merged commit 2dbf695 into master Sep 2, 2019

mstoykov deleted the fix/jsonOutputBeingSlow branch September 2, 2019 05:39

mstoykov mentioned this pull request Jul 12, 2020

Add option to csv output to gzip the file #1550

Closed

na-- mentioned this pull request Jan 4, 2021

Document that the JSON output can produce gzipped files grafana/k6-docs#180

Closed
