
Reduce memory consumption when parsing large objects and arrays. #101

Open · wants to merge 7 commits into master
Conversation

iv-menshenin

We use fastjson in our project, and I recently found an unpleasant issue when processing huge JSON files (ours are 25 megabytes, with more than 300 thousand elements in an array):
when cache.vs reaches its limit and reallocates memory, and the GC fails to clean up the old buffer in time, we see a sharp spike on our memory usage graphs.
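The reallocation spike described above can be sketched with a stdlib-only Go program. The slice here is just a stand-in for the parser's value cache; the element type and sizes are illustrative, not fastjson's actual internals. The point is that every capacity jump briefly keeps both the old and the new backing array alive, which is what the GC has to race against:

```go
package main

import "fmt"

// countReallocs appends n elements one at a time and counts how many
// times the backing array is reallocated (a capacity jump). During
// each reallocation the old and the new arrays coexist until the GC
// frees the old one, which is what produces the memory spike.
func countReallocs(n int) int {
	var vs []int64
	reallocs := 0
	prevCap := cap(vs)
	for i := 0; i < n; i++ {
		vs = append(vs, int64(i))
		if cap(vs) != prevCap {
			reallocs++
			prevCap = cap(vs)
		}
	}
	return reallocs
}

func main() {
	// 300k elements, similar to the arrays described in the issue.
	fmt.Println(countReallocs(300_000))
}
```

The exact count depends on the Go version's growth policy, but for 300k elements it is a few dozen reallocations, and the last ones each copy hundreds of thousands of values while both buffers are live.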

The funniest part is that we barely noticed this at first, because our application kept crashing with OOM, so we had to temporarily disable the memory limits to investigate.

After working with pprof for a while, I found where most of the memory allocations were happening and worked on improvements. I tried several options, and this one turned out to be the most efficient, as shown by tests in our environment.

If you don't accept this pull request, that's okay; we'll just continue using our fork.

Thank you.

@ysawa0

ysawa0 commented Mar 6, 2024

This patch helped mitigate memory spikes in our application; the memory usage chart clearly improved. Since this seems to be a positive change, could the maintainers (cc @valyala) consider it? Thank you.

@StarpTech

StarpTech commented Nov 3, 2024

Hi @iv-menshenin, could you provide a way to validate your improvements?

@iv-menshenin
Author

iv-menshenin commented Nov 3, 2024

Hi @iv-menshenin, could you provide a way to validate your improvements?

It has been working in our production environment for as long as my PR has lived (≈500 RPS).
Or do you need some unit tests?

@StarpTech

StarpTech commented Nov 3, 2024

No, I'm asking for a simple reproduction. How did you test your improvements? Is it possible to verify this with the benchmarks in this repo? If not, can you provide one?

@iv-menshenin
Author

No, I'm asking for a simple reproduction. How did you test your improvements? Is it possible to verify this with the benchmarks in this repo? If not, can you provide one?

I'll see what I can do.

@iv-menshenin
Author

@StarpTech
Here's what I found.
As I said before, my suggestion addresses GC load peaks when processing really big data.
I multiplied the contents of the ./testdata/canada.json file until it reached 18 megabytes and ran the benchmarks.
Here is what I saw on the upstream master:

BenchmarkParse/canada/fastjson-8 123 9405854 ns/op 1914.52 MB/s 37 B/op 0 allocs/op
BenchmarkParse/canada/fastjson-get-8 49 24405223 ns/op 737.86 MB/s 99548908 B/op 130551 allocs/op

Here's what happens after the improvement:

BenchmarkParse/canada/fastjson-8 136 8963098 ns/op 2009.09 MB/s 12 B/op 0 allocs/op
BenchmarkParse/canada/fastjson-get-8 111 9018742 ns/op 1996.69 MB/s 15 B/op 0 allocs/op

If required, I can commit this test file to demonstrate what I described.

@StarpTech

@iv-menshenin that looks promising. Is there a big performance drop for smaller files? A test file would be great. We have a fork that we actively maintain, and we have already incorporated a lot of other changes that didn't get merged into this repository. Would you be interested in contributing to it?

@iv-menshenin
Author

iv-menshenin commented Nov 8, 2024

Is there a big performance drop for smaller files?

No, there isn't.
There is an upfront memory allocation of 32 kilobytes, but in practice, thanks to parser reuse, this has only a positive impact.
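The reuse pattern at work can be sketched in stdlib-only Go. The Parser type, buffer size, and Parse behavior below are illustrative, not fastjson's actual internals (fastjson itself exposes a ParserPool with Get/Put for exactly this kind of reuse):

```go
package main

import "fmt"

// Parser keeps a preallocated scratch buffer that survives across
// Parse calls, so steady-state parsing does not allocate.
type Parser struct {
	buf []byte
}

func NewParser() *Parser {
	// One upfront 32 KiB allocation (illustrative size).
	return &Parser{buf: make([]byte, 0, 32<<10)}
}

// Parse reuses the buffer; it only reallocates when the input is
// larger than anything seen before.
func (p *Parser) Parse(input []byte) []byte {
	p.buf = p.buf[:0]
	p.buf = append(p.buf, input...)
	return p.buf
}

func main() {
	p := NewParser()
	small := make([]byte, 1024)
	before := cap(p.buf)
	p.Parse(small)
	p.Parse(small)
	fmt.Println(cap(p.buf) == before) // buffer was reused, not regrown
	// → true
}
```

So the 32 KiB is paid once per parser, and a pooled parser amortizes it across every subsequent parse.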

A test file would be great.

I will add a new case

Would you be interested in contributing to it?

I would like to.

@iv-menshenin
Author

@StarpTech
Digging deeper into the problem again, I remembered that the issue is not the number of allocations but the huge amount of memory being consumed. After checking the benchmarks, I realized things are bad in the tests too.
Look at this var benchPool ParserPool that is shared between all the benchmarks! It makes all the results non-deterministic.
When I run the benchmarks over and over, I get a different allocation count each time: sometimes zero, sometimes not. So there is no allocation-count benefit in my PR, sorry (those were bad tests).
But there is a memory consumption benefit in my PR that I can show you, just give me time =)
I will do all this in your fork.
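The nondeterminism of a shared, package-level pool is worth contrasting with measurements that control their own state. In a stdlib-only sketch (buffer sizes illustrative), testing.AllocsPerRun averages allocations over many runs of a function, and with isolated state its result is stable from run to run:

```go
package main

import (
	"fmt"
	"testing"
)

// sink forces the buffers to escape to the heap so the allocation
// counter actually sees them.
var sink []byte

func main() {
	// Fresh allocation per run: a deterministic, non-zero count.
	fresh := testing.AllocsPerRun(100, func() {
		sink = make([]byte, 128<<10)
	})

	// Reused buffer: a deterministic zero count. A shared pool, by
	// contrast, returns recycled or fresh buffers depending on what
	// earlier benchmarks and the GC left behind, so its allocs/op
	// varies between runs.
	reused := make([]byte, 0, 128<<10)
	zero := testing.AllocsPerRun(100, func() {
		sink = append(reused[:0], 'x')
	})

	fmt.Println(fresh >= 1, zero == 0)
	// → true true
}
```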

@iv-menshenin
Author

In the meantime, here is some validation in this PR.

You can get this memory consumption graph via pprof if you run the tests with the following parameters:

go test --run=None --bench="BenchmarkParse/.*/fastjson" --memprofile=mem.out

[pprof memory profile graph before the change]

No matter how many times I run the benchmarks, the result shows 0 memory allocations, but profiling gives a clearer picture of what is going on: more than 1 gigabyte of memory is being consumed.

And this is what the graph looks like after my improvements

[pprof memory profile graph after the change]

I think it's pretty clear.

@iv-menshenin changed the title from "Reduced slice allocations in parser cache" to "Reduce memory consumption when parsing large objects and arrays." on Nov 12, 2024