
fix(experiments): apply new count method and fix continuous #27639

Merged

andehen merged 13 commits into master from experiment-fix-count-stats-method on Jan 23, 2025

Conversation

@andehen (Contributor) commented Jan 17, 2025

Problem

  • We are currently not applying the new count method as intended because of the check `if self.query.count_query.series[0].math:`. The `math` field is also set for count metrics; for example, when selecting "Total count", its value is `total`.
  • The input to the credible-interval and significance calculations for the continuous method is currently the total sum, but those functions expect the mean. This is the same problem as the one addressed in a previous PR, but these two call sites were missed then.

Changes

  • Apply the new count stats method for count metrics by modifying the condition.
  • Adjust the continuous methodology to work with the total sum as input.

Note: I introduced a new `ExperimentMetricType`. It is only used in the `posthog.hogql_queries.experiments` module at the moment, so it lives there for now, but it can easily be pulled out into e.g. `posthog.schema` if/when we want to use it more broadly, e.g. in the frontend. A rough sketch of the idea follows below.
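For illustration, here is a minimal, self-contained sketch of the idea. The enum values, the `CONTINUOUS_MATH_TYPES` set, and the standalone `classify_series` helper are assumptions made for this example, not the actual implementation:

```python
from enum import Enum


class ExperimentMetricType(Enum):
    COUNT = "count"
    CONTINUOUS = "continuous"


# Hypothetical set of "math" values that describe a continuous (sum/mean-style) metric.
CONTINUOUS_MATH_TYPES = {"sum", "avg", "min", "max"}


def classify_series(math: str | None) -> ExperimentMetricType:
    # The old condition only checked that "math" was set, but count metrics
    # also set it (e.g. "total" for Total count), so they were misclassified.
    if math in CONTINUOUS_MATH_TYPES:
        return ExperimentMetricType.CONTINUOUS
    # Default to count; this also covers math == "total" and math is None.
    return ExperimentMetricType.COUNT


print(classify_series("total"))  # ExperimentMetricType.COUNT
print(classify_series("sum"))    # ExperimentMetricType.CONTINUOUS
```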

How did you test this code?

  • tested locally by simulating many experiments
  • added more tests
  • tests pass

github-actions bot commented Jan 17, 2025

Size Change: 0 B

Total Size: 1.16 MB

| Filename                 | Size    |
| ------------------------ | ------- |
| frontend/dist/toolbar.js | 1.16 MB |

compressed-size-action

@andehen andehen changed the title fix(experiments): use correct stats method for count metrics fix(experiments): apply new count methodology for count metrics Jan 20, 2025
@andehen andehen force-pushed the experiment-fix-count-stats-method branch from 45355e5 to ddfb8cf Compare January 20, 2025 17:12
@posthog-bot (Contributor) commented
📸 UI snapshots have been updated

1 snapshot change in total. 0 added, 1 modified, 0 deleted:

  • chromium: 0 added, 1 modified, 0 deleted (diff for shard 1)
  • webkit: 0 added, 0 modified, 0 deleted

Triggered by this commit.

👉 Review this PR's diff of snapshots.


@andehen andehen marked this pull request as ready for review January 21, 2025 07:58
@andehen andehen force-pushed the experiment-fix-count-stats-method branch from e32516c to 1000c41 Compare January 21, 2025 07:58
@andehen andehen changed the title fix(experiments): apply new count methodology for count metrics fix(experiments): apply new count method and fix continous Jan 21, 2025
@andehen andehen changed the title fix(experiments): apply new count method and fix continous fix(experiments): apply new count method and fix continuous Jan 21, 2025
@andehen andehen requested a review from a team January 21, 2025 08:20
```python
            )
            credible_intervals = calculate_credible_intervals_v2_count([control_variant, *test_variants])
        case _:
            raise ValueError(f"Unsupported metric type: {self._get_metric_type()}")
```
Contributor

I agree that we shouldn't return results for unsupported metric types. However, there are likely some experiments with unsupported metric types that are currently returning results but will start throwing errors after this PR is merged. Have you thought about how to handle any complaints from these users?

Contributor Author

_get_metric_type() defaults to count, so we won't throw errors. This is a safeguard for future work, to make sure all metric types are handled. Does that make sense?
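To make that concrete, here is a minimal sketch of the dispatch-plus-safeguard pattern being described. Only `ExperimentMetricType` and the `case _` guard come from the PR; the function name, return values, and the trivial bodies are placeholders for this example:

```python
from enum import Enum


class ExperimentMetricType(Enum):
    COUNT = "count"
    CONTINUOUS = "continuous"


def run_stats(metric_type: ExperimentMetricType) -> str:
    match metric_type:
        case ExperimentMetricType.COUNT:
            return "count methodology"
        case ExperimentMetricType.CONTINUOUS:
            return "continuous methodology"
        case _:
            # Unreachable today because the metric type defaults to COUNT,
            # but it guards against future metric types that are not handled yet.
            raise ValueError(f"Unsupported metric type: {metric_type}")


print(run_stats(ExperimentMetricType.COUNT))  # count methodology
```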

@jurajmajerik (Contributor) commented

Nice work 🙏


Meta feedback on this PR: I think it would benefit from a clearer problem description.

Now:

Apply the new count stats method for count metrics

Better:

Problem

Currently, we're applying the continuous calculation for any Trend metric that contains the "math" field. This is incorrect; we should only apply it to the valid continuous math types and throw an error for the rest.

Changes

...

```python
@@ -2363,3 +2366,45 @@ def test_validate_event_variants_no_exposure(self):
            }
        )
        self.assertEqual(cast(list, context.exception.detail)[0], expected_errors)

    def test_get_metric_type(self):
```
Contributor

Thanks for adding this


```python
        # Test: ~$105 mean with narrow interval due to old implementation
        self.assertAlmostEqual(intervals["test"][0], 103, delta=3)
        self.assertAlmostEqual(intervals["test"][1], 107, delta=3)
```
Contributor

Oh, I intentionally didn't change the v1 values previously.

Contributor Author

Yeah, I was a little confused here too. But since the old implementation also gets the total as input (that is what it receives from the query runner), not the mean, the test cases should be updated to reflect that, and hence the assertions had to be updated.

Does that make sense? I think the values in the assertions make more sense now as well.

Contributor

Yep, the explanation makes sense. Just flagging that I intentionally didn't change the behavior / return values for v1. I'm not strongly opposed to doing so, but the original intent was to keep v1 exactly how it was.

Contributor Author

I see. To be clear, the implementation for v1 has not changed here, only the input for the test cases that were added in 1979d74. The reason is to reflect what the behavior is and has been in production: the queries do not return the mean, but the total.
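As a hedged illustration of that point (the dataclass, field names, and numbers below are assumptions for this example, not the actual query-runner output): if the query reports the total sum per variant, the mean that the credible-interval and significance functions expect can be derived from the exposure count before doing the calculation.

```python
from dataclasses import dataclass


@dataclass
class VariantResult:
    key: str
    total_sum: float        # sum of the metric over exposed users (what the query returns)
    absolute_exposure: int  # number of exposed users


def mean_from_total(variant: VariantResult) -> float:
    # The credible-interval / significance functions expect the per-user mean,
    # so convert the total sum reported by the query into a mean first.
    return variant.total_sum / variant.absolute_exposure


control = VariantResult(key="control", total_sum=10_500.0, absolute_exposure=100)
print(mean_from_total(control))  # 105.0, in the same ballpark as the ~$105 mean in the test above
```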

@andehen andehen force-pushed the experiment-fix-count-stats-method branch from 1000c41 to 4df1c30 Compare January 22, 2025 05:41
@andehen andehen force-pushed the experiment-fix-count-stats-method branch from 4df1c30 to 0e4ce77 Compare January 22, 2025 16:46
@andehen andehen merged commit 8e3b930 into master Jan 23, 2025
99 checks passed
@andehen andehen deleted the experiment-fix-count-stats-method branch January 23, 2025 07:50
timgl pushed a commit that referenced this pull request Jan 28, 2025
Co-authored-by: github-actions <41898282+github-actions[bot]@users.noreply.github.com>