refactor(plugin-server): refactor the event pipeline #9829

macobo · 2022-05-18T07:35:08Z

Problem

This PR refactors how the event pipeline works. It's a follow-up to #9738

The problem being solved by these PRs is that the event processing pipeline was really hard to follow due to confusing terminology, worker/main thread split, return value dependencies and more.

This in turn is needed to split plugin-server processing in a nice way.

Changes

Event pipeline is now a pipeline of steps where every step can call a subsequent one or stop processing.

Note that the problem isn't fully solved - process-event.ts probably does too much right now, but we're in a better position than before.

Also note that this way of structuring things also lent itself really well for unit testing and caught some bugs that previously slipped through.

macobo · 2022-05-18T07:41:18Z

plugin-server/src/worker/ingestion/event-pipeline/determineShouldBufferStep.ts

+    const isAnonymousEvent =
+        event.properties && event.properties['$device_id'] && event.distinctId === event.properties['$device_id']
+    const isRecentPerson =
+        !person || DateTime.now().diff(person.created_at).as('seconds') < hub.BUFFER_CONVERSION_SECONDS


Note: This was previously bugged, would return 0 if e.g. a user was created 1 day ago. Only caught this thanks to unit tests - very hard to catch this sort of bug otherwise.

Let's write more unit tests!

yakkomajuri

Awesome! This monster was not as bad as it seemed given all the tests.

Very close to approving ✅

Left some comments but didn't find any major gaps, love the test coverage. Let's maybe get this in on Monday though and I'll make sure to also keep an eye out

yakkomajuri · 2022-05-18T13:26:28Z

plugin-server/src/worker/ingestion/event-pipeline/runner.ts

+    error?: string
+}
+
+const STEPS_TO_EMIT_TO_DLQ_ON_FAILURE: Array<StepType> = [


nit: could be a set

plugin-server/src/worker/ingestion/event-pipeline/determineShouldBufferStep.ts

yakkomajuri · 2022-05-19T12:35:27Z

plugin-server/src/worker/ingestion/event-pipeline/prepareEventStep.ts

+
+    if (preIngestionEvent && preIngestionEvent.event !== '$snapshot') {
+        return runner.nextStep('determineShouldBufferStep', preIngestionEvent)
+    } else if (preIngestionEvent && preIngestionEvent.event === '$snapshot') {


Suggested change

} else if (preIngestionEvent && preIngestionEvent.event === '$snapshot') {

} else if (preIngestionEvent) {

Intentional - I was trying to make it clearer to the reader it's dealing with a snapshot here.

plugin-server/src/worker/ingestion/event-pipeline/runner.ts

yakkomajuri · 2022-05-19T14:13:31Z

plugin-server/src/worker/ingestion/event-pipeline/runAsyncHandlersStep.ts

+    }
+
+    const processedPluginEvent = convertToProcessedPluginEvent(event)
+    const isSnapshot = event.event === '$snapshot'


we could move this up given we have a isSnapshot test on line 15

yakkomajuri · 2022-05-19T14:18:51Z

plugin-server/src/worker/ingestion/event-pipeline/runAsyncHandlersStep.ts

+    const promises = []
+    let actionMatches: Action[] = []
+    if (event.event !== '$snapshot') {
+        actionMatches = await runner.hub.actionMatcher.match(event, person, elements)


can we actually run the onEvent / onSnapshot flow earlier? I'd like to make sure that runs if the action path is broken. Also diff is probably minimal but I'd like to trigger exporting an event before action webhooks etc.

will anyway group code together that's used together

I'm not sure what you mean. Do you want to ignore errors from action matching when deciding whether to run onAction? That's a new requirement if so.

No - let me submit a suggestion as to what I want

I mistyped my question - onEvent calling should not be affected by action-related errors? If so, I think re-ordering is too implicit about that and we should make that obvious in the code.

Let's resolve this in a follow-up PR - I'm in merge conflict hell until this is in.

plugin-server/src/worker/ingestion/event-pipeline/runAsyncHandlersStep.ts

yakkomajuri · 2022-05-19T15:45:18Z

plugin-server/src/worker/ingestion/event-pipeline/runAsyncHandlersStep.ts

+    const promises = []
+    let actionMatches: Action[] = []
+    if (event.event !== '$snapshot') {
+        actionMatches = await runner.hub.actionMatcher.match(event, person, elements)


No - let me submit a suggestion as to what I want

plugin-server/tests/postgres/teardown.test.ts

* master: chore: start stack once in cloud tests (#9879) feat(apps): frontend apps (#9831) chore: Fix snapshots on master (#9885) chore(apps): rename plugins to apps (#9755) refactor: Remove constance library dependency, use json-encoded model (#9852) chore(clickhouse): avoid creating kafka_events, events_mv (#9863) fix(insights): Fix timezone date issues (#9678) refactor(plugin-server): refactor the event pipeline (#9829) feat(object storage): add unused object storage (#9846) fix: make kafka health check timeout test reliable (#9857) fix: query elements from start of day (#9827)

* Start refactoring event pipeline * Add some initial metrics * Handle DLQ error messages in pipeline runner * Add public functions for the pipeline * Tests for runner.ts * Tests for every step in event pipeline * yeet some now-unneeded worker code * Add timeoutGuard * Emit to DLQ from buffer * Move some tests to a separate file * fix internal metrics * Refactor method location, WIP * Fix code determining if user is a recent person or not * Update tests to deal with new pipeline * Rename methods for consistency * Remove now-dead test * Update process-event.test.ts * Update DLQ test * Ignore test under yeet * Remove mocked * Remove dead code * Update naming

macobo added 15 commits May 18, 2022 10:34

Start refactoring event pipeline

20dc8b8

Add some initial metrics

7e26d78

Handle DLQ error messages in pipeline runner

6988d0c

Add public functions for the pipeline

9d28061

Tests for runner.ts

8bafa40

Tests for every step in event pipeline

f5c88cd

yeet some now-unneeded worker code

9f5f764

Add timeoutGuard

05901a1

Emit to DLQ from buffer

14c6a7a

Move some tests to a separate file

d897eaf

fix internal metrics

40a0aa7

Refactor method location, WIP

2cac16c

Fix code determining if user is a recent person or not

6766a5d

Update tests to deal with new pipeline

bfbbc84

Rename methods for consistency

c8a1537

macobo requested a review from yakkomajuri May 18, 2022 07:36

macobo commented May 18, 2022

View reviewed changes

macobo added 3 commits May 18, 2022 10:43

Remove now-dead test

f3b81e9

Update process-event.test.ts

1bc00f5

Update DLQ test

a29ab5f

macobo marked this pull request as ready for review May 18, 2022 08:39

macobo requested a review from tiina303 May 18, 2022 08:39

Ignore test under yeet

1752b6f

macobo mentioned this pull request May 18, 2022

feat(plugin-server): use swc for running jest tests #9832

Merged

yakkomajuri suggested changes May 19, 2022

View reviewed changes

yakkomajuri reviewed May 19, 2022

View reviewed changes

Twixes reviewed May 19, 2022

View reviewed changes

plugin-server/tests/postgres/teardown.test.ts Outdated Show resolved Hide resolved

macobo added 3 commits May 20, 2022 10:17

Merge remote-tracking branch 'origin/master' into eventPipeline

ccdb921

Remove mocked

464f17b

Remove dead code

e679d72

Update naming

5b29842

yakkomajuri approved these changes May 20, 2022

View reviewed changes

macobo merged commit 18535fa into master May 20, 2022

macobo deleted the eventPipeline branch May 20, 2022 10:05

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor(plugin-server): refactor the event pipeline #9829

refactor(plugin-server): refactor the event pipeline #9829

macobo commented May 18, 2022 •

edited

Loading

macobo May 18, 2022

yakkomajuri left a comment

yakkomajuri May 18, 2022

yakkomajuri May 19, 2022

macobo May 20, 2022

yakkomajuri May 20, 2022

yakkomajuri May 19, 2022

yakkomajuri May 19, 2022

macobo May 19, 2022

yakkomajuri May 19, 2022

macobo May 20, 2022

macobo May 20, 2022

yakkomajuri May 19, 2022

	} else if (preIngestionEvent && preIngestionEvent.event === '$snapshot') {
	} else if (preIngestionEvent) {

refactor(plugin-server): refactor the event pipeline #9829

refactor(plugin-server): refactor the event pipeline #9829

Conversation

macobo commented May 18, 2022 • edited Loading

Problem

Changes

Choose a reason for hiding this comment

yakkomajuri left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

macobo commented May 18, 2022 •

edited

Loading