Replace AggregatorTestCase#search with AggregatorTestCase#searchAndReduce #60683

jimczi · 2020-08-04T19:26:31Z

This commit removes the ability to test the top level result of an aggregator
before it runs the final reduce. All aggregator tests that use AggregatorTestCase#search
are rewritten with AggregatorTestCase#searchAndReduce in order to ensure that we test
the final output (the one sent to the end user) rather than an intermediary result
that could be different.
This change also removes spurious commits triggered on top of a random index writer.
These commits slow down the tests and are redundant with the commits that the
random index writer performs.

…duce This commit removes the ability to test the top level result of an aggregator before it runs the final reduce. All aggregator tests that use AggregatorTestCase#search are rewritten with AggregatorTestCase#searchAndReduce in order to ensure that we test the final output (the one sent to the end user) rather than an intermediary result that could be different. This change also removes spurious commits triggered on top of a random index writer. These commits slow down the tests and are redundant with the commits that the random index writer performs.

elasticmachine · 2020-08-04T19:26:33Z

Pinging @elastic/es-analytics-geo (:Analytics/Aggregations)

not-napoleon

Looks good. Please clean up that one for loop that got missed, otherwise 👍

not-napoleon · 2020-08-04T20:20:10Z

...a/org/elasticsearch/search/aggregations/bucket/histogram/InternalVariableWidthHistogram.java

@@ -451,7 +450,8 @@ private void mergeBucketsWithPlan(List<Bucket> buckets, List<BucketRange> plan,
            }
            toMerge.add(buckets.get(startIdx)); // Don't remove the startIdx bucket because it will be replaced by the merged bucket

-            reduceContext.consumeBucketsAndMaybeBreak(- (toMerge.size() - 1));
+            int toRemove = toMerge.stream().mapToInt(b -> countInnerBucket(b)+1).sum();


Is this an actual bug you found while making this change?

A minor one, yes. The max bucket count is not accurate when buckets are auto-merged.

...src/test/java/org/elasticsearch/search/aggregations/bucket/filter/FilterAggregatorTests.java

...va/org/elasticsearch/search/aggregations/bucket/histogram/RangeHistogramAggregatorTests.java

nik9000

I think for the auto date histo and variable width histo I'll miss being able to assert on things that aren't yet finally reduced, but it is worth it.

This commit fixes the computation of the subset size on empty buckets (doc count of 0). The aggregator test refactoring in elastic#60683 revealed this bug.

This commit fixes the computation of the subset size on empty buckets (doc count of 0). The aggregator test refactoring in #60683 revealed this bug.

With #60683 we stopped forcing aggregating all docs using a single Aggregator which made some of our accuracy assumptions about the stats aggregator incorrect. This adds a test that does the forcing and asserts the old accuracy and adds a test without the forcing with much looser accuracy guarantees. Closes #61132

jimczi added >non-issue >test Issues or PRs that are addressing/adding tests :Analytics/Aggregations Aggregations v8.0.0 v7.10.0 labels Aug 4, 2020

jimczi requested review from nik9000 and not-napoleon August 4, 2020 19:26

elasticmachine added the Team:Analytics Meta label for analytical engine team (ESQL/Aggs/Geo) label Aug 4, 2020

fix additional tests

5453d7e

not-napoleon approved these changes Aug 4, 2020

View reviewed changes

jimczi added 2 commits August 5, 2020 00:53

address review and fix another test

1dd98d1

Merge branch 'master' into aggregator_tests_search_and_reduce

f11a8cf

nik9000 approved these changes Aug 5, 2020

View reviewed changes

jimczi merged commit 5de0ed9 into elastic:master Aug 6, 2020

jimczi deleted the aggregator_tests_search_and_reduce branch August 6, 2020 12:08

jimczi mentioned this pull request Aug 6, 2020

Replace AggregatorTestCase#search with AggregatorTestCase#searchAndReduce #60816

Merged

jimczi mentioned this pull request Aug 6, 2020

Fix AOOBE when setting min_doc_count to 0 in significant_terms #60823

Merged

nik9000 mentioned this pull request Aug 6, 2020

CI: VariableWidthHistogramAggregatorTests#testMultipleSegments #60673

Closed

nik9000 mentioned this pull request Aug 11, 2020

Reproducible failure in ChildrenToParentAggregatorTests.testParentChildTerms #60980

Closed

Mpdreamz mentioned this pull request Nov 16, 2020

7.10.1 Meta Ticket elastic/elasticsearch-net#5096

Closed

61 tasks

stevejgordon mentioned this pull request Dec 17, 2020

7.11.0 Meta Ticket elastic/elasticsearch-net#5198

Closed

jakelandis removed the v8.0.0 label Jul 26, 2021

jakelandis added the v8.0.0-alpha1 label Jul 26, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Replace AggregatorTestCase#search with AggregatorTestCase#searchAndReduce #60683

Replace AggregatorTestCase#search with AggregatorTestCase#searchAndReduce #60683

jimczi commented Aug 4, 2020

elasticmachine commented Aug 4, 2020

not-napoleon left a comment

not-napoleon Aug 4, 2020

jimczi Aug 4, 2020

nik9000 left a comment

Replace AggregatorTestCase#search with AggregatorTestCase#searchAndReduce #60683

Replace AggregatorTestCase#search with AggregatorTestCase#searchAndReduce #60683

Conversation

jimczi commented Aug 4, 2020

elasticmachine commented Aug 4, 2020

not-napoleon left a comment

Choose a reason for hiding this comment

not-napoleon Aug 4, 2020

Choose a reason for hiding this comment

jimczi Aug 4, 2020

Choose a reason for hiding this comment

nik9000 left a comment

Choose a reason for hiding this comment