Increase search.max_buckets to 65,535 #57042
Conversation
Increases the default search.max_buckets limit to 65,535, and only counts buckets during reduce phase. Closes elastic#51731
Pinging @elastic/es-analytics-geo (:Analytics/Aggregations)
@elasticmachine run elasticsearch-ci/bwc
Changes look good (yay deletions). I think there are a few more things we could clean up too, e.g. some (all?) of the Internal* bucketing agg classes have extra stuff to add/remove buckets to the breaker while reducing; DateHisto is an example. We could probably purge all of these types of accounting and just do a single call at the end of reduce(). I don't think it functionally matters, since Nik's recent change means we only count buckets on the final reduce (and not the partial reductions), but it would tidy up some of the Internal* classes. Haven't looked too closely though, so I might be misremembering :)
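A rough, hypothetical sketch of that cleanup idea (the class and method names below are illustrative stand-ins, not the actual Internal* or reduce APIs): merge first, then make one accounting call for the merged bucket count instead of per-bucket add/remove against the breaker.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.IntConsumer;

// Hypothetical model of the suggested cleanup: account for the merged buckets
// once when reduce() is done, rather than adding/removing them while merging.
class ReduceAccountingSketch {
    static List<Long> reduce(List<List<Long>> shardBuckets, IntConsumer bucketConsumer) {
        List<Long> merged = new ArrayList<>();
        for (List<Long> shard : shardBuckets) {
            merged.addAll(shard); // stands in for the real per-agg merge logic
        }
        bucketConsumer.accept(merged.size()); // single call at the end of reduce()
        return merged;
    }
}
```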
Skimmed the new deletions and they look good (will look closer before merge), but left a comment about BucketsAggregator... think we need to tweak it a bit more
```diff
@@ -91,6 +87,9 @@ public final void collectBucket(LeafBucketCollector subCollector, int doc, long
     * Same as {@link #collectBucket(LeafBucketCollector, int, long)}, but doesn't check if the docCounts needs to be re-sized.
     */
    public final void collectExistingBucket(LeafBucketCollector subCollector, int doc, long bucketOrd) throws IOException {
+       if (doc == 1) {
```
Hm, I think this isn't quite right. `doc` is the doc ID, so that could be all over the place, and also all going into the same bucket. I think there are two options here (a rough sketch of both follows below):

- Down below, we do `if (docCounts.increment(bucketOrd, 1) == 1) { <breaker stuff> }`, which I think will work because the increment method returns the count after incrementing. So if we have a doc count of 1, it's the first doc and a new bucket, and we can account it.
- Alternatively, we could just account for it up in `collectBucket` without a conditional, since theoretically that should only be called on new buckets. It's not guaranteed by the API, but in practice that's how aggs use it.
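A minimal sketch of the two options, using hypothetical stand-ins for `docCounts` and the consumer (not the real Elasticsearch classes):

```java
import java.util.HashMap;
import java.util.Map;
import java.util.function.IntConsumer;

class BucketAccountingSketch {
    // Stand-ins: docCounts mimics the per-ordinal doc counter, and the consumer
    // mimics a MultiBucketConsumer-style accept(int) breaker hook.
    private final Map<Long, Long> docCounts = new HashMap<>();
    private final IntConsumer multiBucketConsumer;

    BucketAccountingSketch(IntConsumer multiBucketConsumer) {
        this.multiBucketConsumer = multiBucketConsumer;
    }

    // Option 1: account in collectExistingBucket when the post-increment count
    // is 1, i.e. this is the first doc ever collected into this bucket.
    void collectExistingBucket(int doc, long bucketOrd) {
        long countAfterIncrement = docCounts.merge(bucketOrd, 1L, Long::sum);
        if (countAfterIncrement == 1) {
            multiBucketConsumer.accept(0); // new bucket: run the breaker accounting
        }
    }

    // Option 2: account unconditionally in collectBucket, relying on the
    // convention that it is only called for buckets that do not exist yet.
    void collectBucket(int doc, long bucketOrd) {
        multiBucketConsumer.accept(0);
        collectExistingBucket(doc, bucketOrd);
    }
}
```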
There are two other issues we need to address though:

- The old breaker logic only checked every 1024 buckets, since checking the real-memory breaker has a certain amount of overhead. So we should re-implement that somehow.
- A trickier situation which I didn't think about when suggesting `BucketsAggregator`: if we add the 1024 threshold back, it's only a local count, so aggs with 1023 buckets will never trigger the breaker even if the overall query has millions of buckets.

Perhaps we continue to use the MultiBucketConsumer service thing, but move the breaker accounting to a different method? That way it could maintain the global count and `BucketsAggregator` just calls a method on it or something? (A rough sketch of that idea follows below.) Not sure, we can discuss more offline.
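Here is such a sketch, with hypothetical names; the real MultiBucketConsumer API may well look different. One consumer shared per request keeps the global count, and the expensive breaker check is amortized to every 1024 calls.

```java
// Hypothetical sketch: a single per-request consumer tracks the global bucket
// count, so even an agg that creates only a handful of buckets contributes to
// the request-wide total, and the breaker check runs only every 1024 calls.
class GlobalBucketConsumerSketch {
    // Stand-in for the parent/real-memory circuit breaker check.
    interface BreakerCheck {
        void check(long totalBuckets);
    }

    private final BreakerCheck breaker;
    private long totalBuckets = 0;
    private long callCount = 0;

    GlobalBucketConsumerSketch(BreakerCheck breaker) {
        this.breaker = breaker;
    }

    synchronized void accept(int newBuckets) {
        totalBuckets += newBuckets;
        callCount++;
        if ((callCount & 0x3FF) == 0) { // 0x3FF == 1023, so every 1024th call
            breaker.check(totalBuckets);
        }
    }
}
```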
Left a comment about comments :), but otherwise I'm happy with it. Re-using the MultiBucketConsumer feels like an OK tradeoff in this case, since we'll need something global to track usage. And this is simpler/less invasive than trying to bolt on a new thing.
👍
```diff
@@ -91,7 +91,9 @@ public final void collectBucket(LeafBucketCollector subCollector, int doc, long
     * Same as {@link #collectBucket(LeafBucketCollector, int, long)}, but doesn't check if the docCounts needs to be re-sized.
     */
    public final void collectExistingBucket(LeafBucketCollector subCollector, int doc, long bucketOrd) throws IOException {
-       docCounts.increment(bucketOrd, 1);
+       if (docCounts.increment(bucketOrd, 1) == 1) {
+           multiBucketConsumer.accept(0);
```
Let's add a comment here about why we're using `0`, just in case a lost soul stumbles over this and is confused :)
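For instance, the requested comment might read something like this (the wording is illustrative; the rationale follows from the PR description, which moves bucket counting to the reduce phase):

```java
if (docCounts.increment(bucketOrd, 1) == 1) {
    // Pass 0: buckets are only counted against search.max_buckets during the
    // reduce phase, so here we just want the consumer's periodic real-memory
    // circuit breaker check to run when a new bucket shows up.
    multiBucketConsumer.accept(0);
}
```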
```diff
-       if (value > 0 && (count & 0x3FF) == 0) {
+       // check parent circuit breaker every 1024 calls
+       callCount++;
+       if ((callCount & 0x3FF) == 0) {
```
👍 for fixing this behavior
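As a quick illustration of the cadence (a standalone demo, not Elasticsearch code): `0x3FF` is 1023, so masking the call counter makes the check fire exactly once per 1024 calls.

```java
public class BreakerCheckCadence {
    public static void main(String[] args) {
        for (int callCount = 1; callCount <= 4096; callCount++) {
            if ((callCount & 0x3FF) == 0) {
                // Prints 1024, 2048, 3072, 4096: the breaker check is amortized
                // rather than paid on every single bucket.
                System.out.println("breaker check at call " + callCount);
            }
        }
    }
}
```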
Before #57042 the max_buckets test would consistently pass because the request would consistently fail; in particular, the request would fail on the data node. After #57042 it only fails on the coordinating node. When the max_buckets test is run in a mixed version cluster it consistently fails on *either* the data node or the coordinating node, except when the coordinating node is missing #43095. In that case, if one data node has #57042 and one does not, *and* the one that doesn't gets the request first, it fails the request as expected, and then the coordinating node retries the request on the node with #57042. When that happens the request fails mysteriously with "partial shard failures" as the error message but no partial failures reported. This is *exactly* the bug fixed in #43095.

This updates the test to be skipped in mixed version clusters without #43095 because they *sometimes* fail the test spuriously. The request fails in those cases, just like we expect, but with a mysterious error message.

Closes #57657