Streaming postings decoding leads to OOMs #6434

Closed
fpetkovski opened this issue Jun 9, 2023 · 7 comments · Fixed by #6475

Comments

fpetkovski (Contributor) commented Jun 9, 2023

Object Storage Provider: GCS

What happened:

We have certain queries that have very high postings cardinality, like:

sum(container_memory_working_set_bytes{namespace=~"<namespace>", container!="", pod!=""})

The streaming postings decoding seems to hold on to one snappy decoder for each posting, so for the container and pod labels we end up with a huge number of open decoders even before the merge starts.

This leads to stores OOMing instantly, because each snappy reader uses a 64KB buffer.
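
To illustrate the shape of the problem, here is a simplified sketch (using github.com/golang/snappy for brevity, not the exact store-gateway code path): every matched key gets its own streaming reader, and each reader allocates its decode buffers up front.

```go
package main

import (
	"bytes"
	"fmt"
	"io"

	"github.com/golang/snappy"
)

func main() {
	// Pretend every label value matched by container!="" / pod!="" has its
	// own snappy-compressed postings list.
	compressed := make([][]byte, 100_000)
	for i := range compressed {
		var buf bytes.Buffer
		w := snappy.NewBufferedWriter(&buf)
		fmt.Fprintf(w, "postings-%d", i)
		w.Close()
		compressed[i] = buf.Bytes()
	}

	// One streaming reader per key. golang/snappy's NewReader allocates its
	// decode buffers eagerly (the stream block size is 64KiB), so all of
	// these buffers stay alive until the merge has consumed every reader.
	readers := make([]io.Reader, 0, len(compressed))
	for _, c := range compressed {
		readers = append(readers, snappy.NewReader(bytes.NewReader(c)))
	}
	// Gigabytes of buffers are pinned before any merging happens.
	fmt.Println("open decoders:", len(readers))
}
```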

What you expected to happen:

I would like some mechanism for more controlled memory management. Maybe we should choose the decoding strategy based on the size of the compressed data, or alternatively based on the number of keys.
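
For example, something along these lines (the names and thresholds below are hypothetical, not an existing Thanos API):

```go
package main

import "fmt"

// Hypothetical thresholds, not existing Thanos settings.
const (
	maxStreamingKeys  = 1024     // assumption: tune per deployment
	maxStreamingBytes = 64 << 20 // assumption: 64 MiB of compressed postings
)

// chooseStrategy picks between streaming and eager decoding based on how
// many postings keys matched and how much compressed data they hold.
func chooseStrategy(numKeys int, compressedBytes int64) string {
	if numKeys > maxStreamingKeys || compressedBytes > maxStreamingBytes {
		// Too many keys: decode each posting fully with a recycled decoder
		// instead of keeping one open reader (and buffer) per key.
		return "eager"
	}
	// Few keys: per-reader buffers are affordable and we can stream-merge
	// without materializing whole postings lists.
	return "streaming"
}

func main() {
	fmt.Println(chooseStrategy(50_000, 8<<20)) // eager
	fmt.Println(chooseStrategy(12, 1<<20))     // streaming
}
```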

How to reproduce it (as minimally and precisely as possible):

The query above could be sufficient.

Anything else we need to know:

Screenshot from a small staging environment when running the query
[screenshot]

yeya24 (Contributor) commented Jun 9, 2023

Is this caused by #6303? Wouldn't the old format have the same problem?
If we could include postings as part of the index header, would that solve this problem, and would it increase disk space a lot? At least we wouldn't have to deal with the postings cache.

fpetkovski (Contributor, Author) commented Jun 9, 2023

I believe it was introduced by #6303. With the old format we decode one posting fully at a time, and we recycle the snappy decoder. With the streaming decoding we keep all decoders open until everything is merged, or at least that's how I understand it. Since we open one decoding reader per posting, for a matcher like container!="" we end up with many open decoders, and each one has a footprint of 64KB.

I think that with the query above, decoding postings fully has a lower footprint than allocating buffers for decoders, so we end up being worse off.
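
For illustration, the old pattern roughly looks like this (a minimal sketch assuming block-format snappy, not the actual store code):

```go
package main

import (
	"fmt"

	"github.com/golang/snappy"
)

// decodeAll decodes one postings list at a time into a single recycled
// buffer, so the peak footprint is roughly one decoded list instead of one
// open streaming reader per key.
func decodeAll(compressed [][]byte) error {
	var dst []byte // reused across keys; grows to the largest decoded list
	for _, c := range compressed {
		n, err := snappy.DecodedLen(c)
		if err != nil {
			return err
		}
		if cap(dst) < n {
			dst = make([]byte, n)
		}
		decoded, err := snappy.Decode(dst[:n], c)
		if err != nil {
			return err
		}
		// ... merge/consume `decoded` here before moving to the next key ...
		_ = decoded
	}
	return nil
}

func main() {
	sample := [][]byte{snappy.Encode(nil, []byte("postings for one label value"))}
	fmt.Println(decodeAll(sample))
}
```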

GiedriusS (Member) commented

Thanks for the report. Let me try to write an optimized routine. It should be pretty straightforward given that we have preallocated []byte slices.

yeya24 (Contributor) commented Jun 15, 2023

I think we saw the same thing: 40GB when streaming-decoding postings.

[screenshot]

GiedriusS (Member) commented Jun 15, 2023

I think a simple fix would be to try to estimate the block size instead of always using the default block size, which is 64KiB (65536 bytes). The only problem is that the reader size needs to be passed to NewReader, so we would have to use a different sync.Pool than the one used for gRPC calls. Perhaps it would make sense to keep a running average of postings list sizes and then use that as the block size when creating a new reader?

Creating a new input buffer seems to be unavoidable, because in Snappy there can be references to earlier uncompressed data, so we need to keep the last bytes of uncompressed output.
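
Something roughly like this, perhaps (a hypothetical sketch, not the merged fix):

```go
package main

import (
	"sync"
	"sync/atomic"
)

// bufSizeEstimator tracks a running average of decoded postings-list sizes
// and hands out buffers of roughly that size from a dedicated sync.Pool,
// instead of always paying the 64KiB default block size.
type bufSizeEstimator struct {
	totalBytes atomic.Int64
	samples    atomic.Int64
	pool       sync.Pool
}

func newBufSizeEstimator() *bufSizeEstimator {
	e := &bufSizeEstimator{}
	e.pool.New = func() any {
		b := make([]byte, e.estimate())
		return &b
	}
	return e
}

// observe records the decoded size of one postings list.
func (e *bufSizeEstimator) observe(decodedBytes int) {
	e.totalBytes.Add(int64(decodedBytes))
	e.samples.Add(1)
}

// estimate returns the running average, clamped to a sane range.
func (e *bufSizeEstimator) estimate() int {
	n := e.samples.Load()
	if n == 0 {
		return 4 << 10 // assumption: 4 KiB starting point
	}
	avg := int(e.totalBytes.Load() / n)
	const minSize, maxSize = 1 << 10, 64 << 10
	if avg < minSize {
		return minSize
	}
	if avg > maxSize {
		return maxSize
	}
	return avg
}

func (e *bufSizeEstimator) get() *[]byte  { return e.pool.Get().(*[]byte) }
func (e *bufSizeEstimator) put(b *[]byte) { e.pool.Put(b) }

func main() {
	est := newBufSizeEstimator()
	est.observe(8 << 10)
	buf := est.get()
	defer est.put(buf)
	_ = buf
}
```

One caveat with pooling buffers like this is that the pool hands back whatever size was current when each buffer was allocated, which lines up with the point above about needing a pool separate from the one used for gRPC.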

yeya24 (Contributor) commented Jul 12, 2023

Hi @fpetkovski, have you tested the latest Thanos with #6475? Does that PR fix the OOM kill issue?

fpetkovski (Contributor, Author) commented Jul 12, 2023

We haven't had a chance to test this yet, but I will report back if we see more OOMs once we upgrade.
