feat(discover): Use SnQL for some of event-stats #29471

wmak · 2021-10-21T01:20:05Z

This enables the use of a new TimeseriesQueryBuilder on event-stats
queries that don't include comparsion, and aren't top events queries
Had to refactor a lot of the tests to use the same do_request
pattern from eventsv2 to make testing easier
Needed to add a 4th list to resolve_equation_list so we could
differentiate easily between equations that are on functions or not
- Didn't go with a class or namedtuple in the response since we'll
  likely be removing some of this once we're done migrating
Only adding project_threshold_config when auto_fields are on, which
means that the table response is unchanged, but now we no longer need
to do the averaging for graphing threshold based functions

- This enables the use of a new TimeseriesQueryBuilder on event-stats queries that don't include comparsion, and aren't top events queries - Had to refactor a lot of the tests to use the same `do_request` pattern from eventsv2 to make testing easier - Needed to add a 4th list to resolve_equation_list so we could differentiate easily between equations that are on functions or not - Didn't go with a class or namedtuple in the response since we'll likely be removing some of this once we're done migrating - Only adding project_threshold_config when auto_fields are on, which means that the table response is unchanged, but now we no longer need to do the averaging for graphing threshold based functions

Zylphrex · 2021-10-21T14:40:14Z

src/sentry/api/serializers/rest_framework/dashboard.py

@@ -71,7 +71,7 @@ def validate(self, data):

        if equations is not None:
            try:
-                resolved_equations, _, _ = resolve_equation_list(equations, fields)
+                resolved_equations, _, _, _ = resolve_equation_list(equations, fields)


Not sure of a better suggestion but destructuring a 4-tuple is getting a little unwieldy. Perhaps consider returning an object with named attributes?

I considered that, but decided since this is only temporary (first two go away once we no longer need to return JSON) to leave this as-is for now

Zylphrex · 2021-10-21T14:48:28Z

src/sentry/search/events/builder.py

+            select=self.select,
+            where=self.where,
+            having=self.having,
+            # This is a timeseries, the groupby will always be time


This assumption breaks with top 5 charts right? Did you already have an idea for that?

Dang I considered commenting this but thought it was overly verbose for this PR

Yeah I think for top events we should have just introduce a different timeseries builder that inherits from this one and changes the groupby etc. as needed

Zylphrex · 2021-10-21T14:51:10Z

src/sentry/search/events/fields.py

-            ):
-                stripped_columns.append(PROJECT_THRESHOLD_CONFIG_ALIAS)
-                break
+        if self.auto_fields:


This auto_fields condition did not exist previously, what is the motivation behind this?

Ah hm leftover from earlier iteration where i slammed all of self.select into the query and not just the aggregates. I'll undo this change 👍

Zylphrex · 2021-10-21T14:52:51Z

src/sentry/snuba/discover.py

+        if isinstance(obj["time"], str):
+            obj["time"] = int(to_timestamp(parse_datetime(obj["time"])))


Why is this needed? Does snql return a str now instead of a int like it did previously?

so the query has always returned the date as a str, but there was a reverse processor here
Which we're no longer using.

- Adding a comment explaining why we need to parse the times now - Removing the auto fields that we no longer need - Fixing tests that broke cause I removed the select property

Zylphrex · 2021-10-21T20:31:44Z

src/sentry/search/events/fields.py

+            for index, (equation, is_function) in enumerate(
+                zip(parsed_equations, contains_function)
+            ):


This is now dependent on the fact that parsed_equations and contains_function are the same lengths. Does this need a comment in resolve_equation_list to make it clear that this must be the case? Or even add a test?

dangit this comment convinced me, going to have resolve_equation_list return a list of objects instead of two lists for the snql syntax, which should mean after the migration we only need the single list in the response.

wmak requested a review from a team October 21, 2021 01:20

wmak requested a review from a team as a code owner October 21, 2021 01:20

vercel bot deployed to Preview – storybook October 21, 2021 01:20 View deployment

Zylphrex reviewed Oct 21, 2021

View reviewed changes

wmak mentioned this pull request Oct 21, 2021

feat(discover): Update timeseries to support comparison #29492

Merged

ref: Addressing PR comments

1208edc

- Adding a comment explaining why we need to parse the times now - Removing the auto fields that we no longer need - Fixing tests that broke cause I removed the select property

vercel bot deployed to Preview – sentry October 21, 2021 19:17 View deployment

vercel bot deployed to Preview – storybook October 21, 2021 19:17 View deployment

wmak requested a review from Zylphrex October 21, 2021 19:21

Merge branch 'master' into wmak/feat/snql-event-stats-simple

c2fd3b6

vercel bot deployed to Preview – sentry October 22, 2021 20:11 View deployment

vercel bot deployed to Preview – storybook October 22, 2021 20:11 View deployment

Zylphrex approved these changes Oct 22, 2021

View reviewed changes

ref: Return a list of ParsedEquation instead of 2 lists

1fe826b

vercel bot deployed to Preview – sentry October 22, 2021 20:23 View deployment

vercel bot deployed to Preview – storybook October 22, 2021 20:23 View deployment

wmak merged commit 84e73fe into master Oct 25, 2021

wmak deleted the wmak/feat/snql-event-stats-simple branch October 25, 2021 15:18

github-actions bot locked and limited conversation to collaborators Nov 10, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(discover): Use SnQL for some of event-stats #29471

feat(discover): Use SnQL for some of event-stats #29471

wmak commented Oct 21, 2021

Zylphrex Oct 21, 2021

wmak Oct 21, 2021

Zylphrex Oct 21, 2021

wmak Oct 21, 2021 •

edited

Loading

Zylphrex Oct 21, 2021

wmak Oct 21, 2021

Zylphrex Oct 21, 2021

wmak Oct 21, 2021

Zylphrex Oct 21, 2021

wmak Oct 22, 2021

		if isinstance(obj["time"], str):
		obj["time"] = int(to_timestamp(parse_datetime(obj["time"])))

feat(discover): Use SnQL for some of event-stats #29471

feat(discover): Use SnQL for some of event-stats #29471

Conversation

wmak commented Oct 21, 2021

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

wmak Oct 21, 2021 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

wmak Oct 21, 2021 •

edited

Loading