Use a single index Results when querying across blocks #1469

richardartoul · 2019-03-17T14:34:25Z

Right now, when we query across many index blocks we allocate a results map for each one, run all the queries in parallel for each block filling up the results map, and then we merge all the results maps together at the end. This makes parallelization easy and straightforward, but since many workloads contain many of the same documents in each block, it causes memory utilization to scale linearly with the number of index blocks. This is problematic for queries that return a large number of documents.

To address this issue, we will use a single index Results which we will push down into all the block queries. That way, each document will only get allocated in memory once regardless of the number of blocks that are being queried. This will significantly improve memory utilization but will make the results map very heavily contended. To address that, instead of acquiring a lock each time to add a single document, we'll use pooled slices to temporarily gather a configurable number of documents together within the block, then acquire the lock once and add them all together as a batch.

This should give us significantly reduced memory utilization without causing too much contention on the lock.

Fixes #1469

richardartoul assigned prateek, robskillington, richardartoul and arnikola Mar 17, 2019

richardartoul added T: Perf P: High T: Memory Utilization T: Optimization area:db All issues pertaining to dbnode area:index All issues pertaining to m3ninx and m3db's index labels Mar 17, 2019

robskillington mentioned this issue Mar 19, 2019

Use a single index Results when querying across blocks #1474

Merged

robskillington closed this as completed in #1474 Mar 21, 2019

robskillington added a commit that referenced this issue Mar 21, 2019

Use a single index Results when querying across blocks (#1474)

b9205e8

Fixes #1469

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use a single index Results when querying across blocks #1469

Use a single index Results when querying across blocks #1469

richardartoul commented Mar 17, 2019

Use a single index Results when querying across blocks #1469

Use a single index Results when querying across blocks #1469

Comments

richardartoul commented Mar 17, 2019