You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Is your feature request related to a problem? Please describe. #8618 adds in a heuristic to do a sort based agg in some cases. As a part of my testing I found a number of cases (as small as 16 decimal sum aggregations) where the sort based aggregation was faster than the hash based version. I really would like to understand why this is happening and ideally once we know that design/extend the heuristic in #8618 to take advantage of this so that we can speed up large numbers of aggregations.
We don't necessarily have to sort all of the input data. We could sort each batch individually instead. Would be good to see if there is a big improvement in having the full sort or not.
Is your feature request related to a problem? Please describe.
#8618 adds in a heuristic to do a sort based agg in some cases. As a part of my testing I found a number of cases (as small as 16 decimal sum aggregations) where the sort based aggregation was faster than the hash based version. I really would like to understand why this is happening and ideally once we know that design/extend the heuristic in #8618 to take advantage of this so that we can speed up large numbers of aggregations.
A few things to note.
The text was updated successfully, but these errors were encountered: