Fix aggregator size estimation #155

jamiees2 · 2021-08-14T19:50:06Z

Description of changes:

The aggregator tries to keep track of the size of the serialised message, so that it can know if it should flush before adding a new record, and not overflow Kinesis' 1MB limit. The current implementation keeps track of this by adding seemingly random constants, which break when e.g an integer is passed that does not fit in a 1-byte varint.

This caused issues for us in production, where fluent bit would get stuck on a record that Kinesis refused to accept because the plugin would keep trying to submit a record larger than 1MB that it thought was going to be smaller than 1MB.

This changes the code to map to protobuf's actual size calculation, keeping track of the sizes of varints, buffer lengths, etc. This change fixed the bug in our production environment, and caused the plugin to correctly keep the record data below 1MB.

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

PettitWesley · 2021-08-16T03:16:20Z

@zackwine FYI

PettitWesley

The code looks alright to me, and it sounds like you have thoroughly tested it in your production to verify it fixed the issue and didn't add any new regressions?

jamiees2 · 2021-08-16T08:35:30Z

Yeah, to be really clear on what I did: I deployed the new plugin to a machine that was stuck on a record like this with debug logging enabled, and saw it get past the bad records,and still compute correct sizes for everything else. I watched it for ~5m before opening the PR and didn't see any size mismatches.

PettitWesley · 2021-08-16T16:54:01Z

@jamiees2 Ok, we will merge soon and then release it. I want to give @zackwine a chance to review since he originally built this feature and understands the code a lot better than I do. If he doesn't review after a few days, we'll just merge it and release.

jamiees2 · 2021-08-16T17:21:05Z

Cool, we have already deployed the fix to our production environments, so that will just let this bake for a bit 🙂

aggregate/aggregator.go

hossain-rayhan · 2021-08-18T20:21:17Z

@jamiees2 I think it would be nice to have some positive and negative unit tests.

Signed-off-by: James Elias Sigurdarson <jamiees2@gmail.com>

…-for-fluent-bit into mainline

jamiees2 · 2021-08-21T11:43:52Z

Added tests, and realized protowire which is included in google.golang.org/protobuf exposes the sizeof functions we need, so I don't need to inline them here. This cleans up the code somewhat :)

fix aggregator

98935b2

jamiees2 requested a review from a team as a code owner August 14, 2021 19:50

PettitWesley approved these changes Aug 16, 2021

View reviewed changes

hossain-rayhan approved these changes Aug 18, 2021

View reviewed changes

aggregate/aggregator.go Outdated Show resolved Hide resolved

jamiees2 added 4 commits August 21, 2021 09:39

Update aggregate/aggregator.go

d80558e

Merge branch 'mainline' into mainline

eb50236

add tests and use protowire

bec13b9

Signed-off-by: James Elias Sigurdarson <jamiees2@gmail.com>

Merge branch 'mainline' of github.com:jamiees2/amazon-kinesis-streams…

112701d

…-for-fluent-bit into mainline

hossain-rayhan merged commit bc02b2e into aws:mainline Aug 23, 2021

hossain-rayhan mentioned this pull request Aug 23, 2021

Fix partition key computation for aggregation #158

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix aggregator size estimation #155

Fix aggregator size estimation #155

jamiees2 commented Aug 14, 2021

PettitWesley commented Aug 16, 2021

PettitWesley left a comment •

edited

Loading

jamiees2 commented Aug 16, 2021

PettitWesley commented Aug 16, 2021

jamiees2 commented Aug 16, 2021

hossain-rayhan commented Aug 18, 2021

jamiees2 commented Aug 21, 2021

Fix aggregator size estimation #155

Fix aggregator size estimation #155

Conversation

jamiees2 commented Aug 14, 2021

PettitWesley commented Aug 16, 2021

PettitWesley left a comment • edited Loading

Choose a reason for hiding this comment

jamiees2 commented Aug 16, 2021

PettitWesley commented Aug 16, 2021

jamiees2 commented Aug 16, 2021

hossain-rayhan commented Aug 18, 2021

jamiees2 commented Aug 21, 2021

PettitWesley left a comment •

edited

Loading