batch plugin logs on postgres #486

yakkomajuri · 2021-06-24T19:00:51Z

Changes

Addresses #424 and supersedes #431.

This makes log inserts to Postgres from plugins happen in batches every second. Didn't implement batching for other logs but should be easy to do so later if we feel there's a need. Big improvements from this locally. No slow query warnings, and despite the buffer, the logs actually appear faster (since they're not "stuck in traffic").

Checklist

Updated Settings section in README.md, if settings are affected
Jest tests

neilkakkar

LGTM.

Surprised though that this solves the underlying issue.

In effect, were we making too many queries in succession to postgres? We're still making about 1/second. So does this imply earlier, we were even faster than that?

mariusandra

Very nice! Just two thoughts:

even though we flush every second, perhaps for posterity it makes sense to also flush this buffer in teardown/shutdown mode? it's yet another stream/buffer/service/etc, so might make sense to control in a similar way...
it looks like you're always inserting to postgres, even if kafka is enabled.

src/worker/vm/extensions/console.ts

src/utils/db/db.ts

yakkomajuri · 2021-06-25T12:54:38Z

@mariusandra re

even though we flush every second, perhaps for posterity it makes sense to also flush this buffer in teardown/shutdown mode? it's yet another stream/buffer/service/etc, so might make sense to control in a similar way...

yeah that'd make sense

yakkomajuri · 2021-07-02T21:31:06Z

@Twixes thread-level batching now

yakkomajuri · 2021-07-05T14:32:27Z

friendly ping @Twixes

Twixes

Looks like this works so if unbatched Postgres log insertion is a hair-on-fire problem, it's okay, but still has this thing where Kafka logs are now batched two-fold – first via addLog and then via queueMessage. Not ideal since that only increases the time in which a log entry gets into ClickHouse, without benefits. Basically ideally createPluginLogEntries should queue a Kafka message (which it already does) or queue a Postgres row, instead of LogsBuffer creating an additional buffer layer above createPluginLogEntries.

tests/postgres/e2e.test.ts

tests/postgres/vm.test.ts

yakkomajuri · 2021-07-06T10:52:13Z

That's a good point - addLog can just directly throw it in Kafka if that's available

yakkomajuri · 2021-07-06T13:48:01Z

done @Twixes

Twixes

Sorry for dragging this on a bit, but my initial point still stands unfortunately 😅

src/utils/logs-buffer.ts

src/utils/db/db.ts

yakkomajuri · 2021-07-07T18:35:21Z

apologies for the lazy updates on the existing system instead of just rearchitecting in the first place :D

@Twixes

Twixes · 2021-07-13T16:11:14Z

src/worker/plugins/teardown.ts

-                                server.instanceId
-                            )
+                                source: PluginLogEntrySource.System,
+                                type: PluginLogEntryType.Info,


Why the change to Info?

great catch! probably lingered from a copy paste

Twixes

Benissimo

) * batch plugin logs on postgres * remove comment * batch at the thread level * fixes * await logs flushing in e2e tests * update tests * flush immediately on tests * fix vm tests * update setupPlugin tests * immediately add to kafka queue if available * rearchitect * Make code subjectively a bit cleaner in a few places * info -> error Co-authored-by: Michael Matloka <dev@twixes.com>

yakkomajuri added 2 commits June 24, 2021 16:00

batch plugin logs on postgres

8624d71

remove comment

ac200a0

This was referenced Jun 24, 2021

Capture db query errors to sentry #437

Closed

Buffer console writes every 100ms #431

Closed

yakkomajuri requested a review from mariusandra June 24, 2021 19:35

neilkakkar approved these changes Jun 25, 2021

View reviewed changes

mariusandra suggested changes Jun 25, 2021

View reviewed changes

Twixes suggested changes Jun 25, 2021

View reviewed changes

src/worker/vm/extensions/console.ts Outdated Show resolved Hide resolved

yakkomajuri commented Jun 25, 2021

View reviewed changes

src/utils/db/db.ts Outdated Show resolved Hide resolved

yakkomajuri added 7 commits July 1, 2021 15:59

batch at the thread level

eee2dbb

fixes

feb7ebe

await logs flushing in e2e tests

8e9e463

update tests

a5e040e

flush immediately on tests

73cf442

fix vm tests

71e4a07

update setupPlugin tests

cee010b

Twixes self-requested a review July 3, 2021 18:07

Twixes reviewed Jul 6, 2021

View reviewed changes

tests/postgres/e2e.test.ts Outdated Show resolved Hide resolved

tests/postgres/vm.test.ts Outdated Show resolved Hide resolved

immediately add to kafka queue if available

15b4628

yakkomajuri added the bump patch Bump patch version when this PR gets merged label Jul 6, 2021

Twixes suggested changes Jul 7, 2021

View reviewed changes

src/utils/logs-buffer.ts Outdated Show resolved Hide resolved

src/utils/db/db.ts Outdated Show resolved Hide resolved

yakkomajuri mentioned this pull request Jul 7, 2021

Team Extensibility Planning (1.28.0 1/2) #492

Closed

38 tasks

rearchitect

96c6faa

Merge branch 'master' into batch-logs

30e02ea

Make code subjectively a bit cleaner in a few places

b4965fc

Twixes reviewed Jul 13, 2021

View reviewed changes

Twixes and others added 2 commits July 13, 2021 18:40

Merge branch 'master' into batch-logs

a67a2a5

info -> error

9d1ac0a

Twixes approved these changes Jul 13, 2021

View reviewed changes

Twixes merged commit d1a43fe into master Jul 13, 2021

Twixes deleted the batch-logs branch July 13, 2021 17:17

posthog-bot mentioned this pull request Jul 13, 2021

Update plugin server to 1.1.5 PostHog/posthog#5109

Merged

yakkomajuri mentioned this pull request Jul 16, 2021

Sprint 1.27.0 3/2 - Jul 5 to Jul 16 (Funnels #2) PostHog/posthog#4968

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

batch plugin logs on postgres #486

batch plugin logs on postgres #486

yakkomajuri commented Jun 24, 2021 •

edited

Loading

neilkakkar left a comment

mariusandra left a comment

yakkomajuri commented Jun 25, 2021

yakkomajuri commented Jul 2, 2021

yakkomajuri commented Jul 5, 2021

Twixes left a comment

yakkomajuri commented Jul 6, 2021

yakkomajuri commented Jul 6, 2021

Twixes left a comment

yakkomajuri commented Jul 7, 2021

Twixes Jul 13, 2021

yakkomajuri Jul 13, 2021

Twixes left a comment

batch plugin logs on postgres #486

batch plugin logs on postgres #486

Conversation

yakkomajuri commented Jun 24, 2021 • edited Loading

Changes

Checklist

neilkakkar left a comment

Choose a reason for hiding this comment

mariusandra left a comment

Choose a reason for hiding this comment

yakkomajuri commented Jun 25, 2021

yakkomajuri commented Jul 2, 2021

yakkomajuri commented Jul 5, 2021

Twixes left a comment

Choose a reason for hiding this comment

yakkomajuri commented Jul 6, 2021

yakkomajuri commented Jul 6, 2021

Twixes left a comment

Choose a reason for hiding this comment

yakkomajuri commented Jul 7, 2021

Twixes Jul 13, 2021

Choose a reason for hiding this comment

yakkomajuri Jul 13, 2021

Choose a reason for hiding this comment

Twixes left a comment

Choose a reason for hiding this comment

yakkomajuri commented Jun 24, 2021 •

edited

Loading