Adaptive radix tree aggregate matching #2

tonyhb · 2023-12-23T21:29:34Z

This PR introduces aggregate adaptive radix tree matching for expressions. The general idea here is to ignore expressions that don't matter given input data. For example:

If you have 100M expressions matching on account IDs (event.data.account_id == $foo)
And you process 10 events
We want to evaluate about 10 expressions (those whose account ID matches an event) instead of 1B (100M expressions × 10 events).

In order to do that, we must parse the CEL expression into its AST, normalize into a canonical representation, then parse each operand into an aggregate tree for matching.

Each expression is broken down into groups. For example:

a == "yes" && b == "no"

This is broken down into a single matching group containing two predicates, one for a and one for b. For each ident compared against a string literal ("a" and "b"), we create a new adaptive radix tree (ART) keyed on the given literal value.
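A minimal sketch of this breakdown, in Go. The Predicate, Group, and Index types and the splitGroup helper are hypothetical names used for illustration, not this PR's actual API. The parser is a naive string split rather than a walk over the CEL AST, and a plain map stands in for the adaptive radix tree.

```go
package main

import (
	"fmt"
	"strings"
)

// Predicate is a hypothetical representation of one `ident == "literal"` comparison.
type Predicate struct {
	Ident   string
	Literal string
}

// Group is a hypothetical matching group: all predicates must hold (logical AND).
type Group struct {
	Expr       string
	Predicates []Predicate
}

// splitGroup naively splits an expression on "&&" and "==".
// Assumption: a real implementation would walk the parsed CEL AST instead,
// and would handle literals that themselves contain these tokens.
func splitGroup(expr string) (Group, error) {
	g := Group{Expr: expr}
	for _, part := range strings.Split(expr, "&&") {
		sides := strings.SplitN(part, "==", 2)
		if len(sides) != 2 {
			return Group{}, fmt.Errorf("unsupported predicate: %q", part)
		}
		ident := strings.TrimSpace(sides[0])
		lit := strings.Trim(strings.TrimSpace(sides[1]), `"`)
		g.Predicates = append(g.Predicates, Predicate{Ident: ident, Literal: lit})
	}
	return g, nil
}

// Index maps ident -> literal -> groups. The inner map is a stand-in for
// the per-ident adaptive radix tree described above.
type Index map[string]map[string][]*Group

// Add registers every predicate of the group under its ident.
func (idx Index) Add(g *Group) {
	for _, p := range g.Predicates {
		if idx[p.Ident] == nil {
			idx[p.Ident] = map[string][]*Group{}
		}
		idx[p.Ident][p.Literal] = append(idx[p.Ident][p.Literal], g)
	}
}

func main() {
	g, err := splitGroup(`a == "yes" && b == "no"`)
	if err != nil {
		panic(err)
	}
	idx := Index{}
	idx.Add(&g)
	// One group, two predicates, one entry under each ident.
	fmt.Println(len(g.Predicates), len(idx["a"]["yes"]), len(idx["b"]["no"]))
}
```

Both index entries point at the same group, so a hit on either ident can lead back to the full expression.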

When an event is received, we iterate through the keys in the given event. If there's an ART for the key in the event, we match the event's data against the tree. This returns potential expressions to evaluate.

From there, we evaluate only the matched expressions (plus any expressions that cannot be handled by aggregate trees). This cuts down the number of expressions evaluated by orders of magnitude.
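The matching flow can be sketched roughly as follows. This is an illustration under stated assumptions, not the code in this PR: Matcher, Expr, and Candidates are hypothetical names, events are flattened to string key/value pairs, and a map stands in for each per-key ART. In this sketch an expression becomes a candidate only when every one of its indexed predicates is hit, and expressions that cannot be indexed are always returned for full evaluation.

```go
package main

import (
	"fmt"
	"sort"
)

// Expr is a hypothetical expression: a set of ident -> string-literal
// equality predicates. An empty set means it cannot be indexed.
type Expr struct {
	ID         string
	Predicates map[string]string
}

// Matcher holds one lookup structure per ident (a map here, standing in
// for an ART) plus the expressions that must always be evaluated.
type Matcher struct {
	trees     map[string]map[string][]*Expr
	unindexed []*Expr
}

func NewMatcher() *Matcher {
	return &Matcher{trees: map[string]map[string][]*Expr{}}
}

// Add indexes the expression, or records it as unindexable.
func (m *Matcher) Add(e *Expr) {
	if len(e.Predicates) == 0 {
		m.unindexed = append(m.unindexed, e)
		return
	}
	for ident, lit := range e.Predicates {
		if m.trees[ident] == nil {
			m.trees[ident] = map[string][]*Expr{}
		}
		m.trees[ident][lit] = append(m.trees[ident][lit], e)
	}
}

// Candidates returns the sorted IDs of expressions worth fully evaluating.
func (m *Matcher) Candidates(event map[string]string) []string {
	hits := map[*Expr]int{}
	for key, val := range event {
		tree, ok := m.trees[key]
		if !ok {
			continue // no expression references this key
		}
		for _, e := range tree[val] {
			hits[e]++
		}
	}
	var ids []string
	for e, n := range hits {
		if n == len(e.Predicates) { // every predicate in the group matched
			ids = append(ids, e.ID)
		}
	}
	for _, e := range m.unindexed {
		ids = append(ids, e.ID)
	}
	sort.Strings(ids)
	return ids
}

func main() {
	m := NewMatcher()
	m.Add(&Expr{ID: "acct-1", Predicates: map[string]string{"account_id": "1"}})
	m.Add(&Expr{ID: "acct-2", Predicates: map[string]string{"account_id": "2"}})
	m.Add(&Expr{ID: "both", Predicates: map[string]string{"a": "yes", "b": "no"}})
	m.Add(&Expr{ID: "foo-eq-bar"}) // e.g. an ident-to-ident comparison
	fmt.Println(m.Candidates(map[string]string{"account_id": "1", "a": "yes"}))
	// prints [acct-1 foo-eq-bar]
}
```

The cost of a lookup depends on the number of keys in the event rather than on the number of registered expressions, which is where the savings come from.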


tonyhb force-pushed the feature/art-prefix-searching branch from 1a8cd3d to 89d7643 on December 23, 2023 21:42

tonyhb force-pushed the feature/art-prefix-searching branch from ba05941 to c958f69 on January 4, 2024 14:36

tonyhb added 4 commits January 4, 2024 06:37

Add aggregate evaluations for strings using adaptive radix trees

1702030

Add lifted expression parsing (naive), support for ident matching

63e3953

Handle comparing idents with each other, eg. event.data.foo == event.data.bar. This is necessary for proper parsing.

ea03432

Add CI, various minor comment changes

5693949

tonyhb force-pushed the feature/art-prefix-searching branch from c958f69 to 5693949 on January 4, 2024 14:37

tonyhb merged commit 49e3d60 into main Jan 4, 2024
2 checks passed

tonyhb deleted the feature/art-prefix-searching branch January 5, 2024 00:32
