Move ingester.spread-flushes from flush time to sample add time #1578

bboreham · 2019-08-14T09:38:12Z

So that all ingesters make the same decision, and hence the chunks they flush to the DB are more likely to overlap. This is a replacement for #1414, keeping the option name but implementing it differently.
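To illustrate the idea, here is a minimal sketch, not the code in this PR. It assumes each series has a numeric fingerprint and that the spreading period is a fixed number of milliseconds; the names `slotMillis` and `shouldCloseHead` are hypothetical. Because the decision depends only on the fingerprint and the sample timestamps, every ingester that receives the same samples closes the chunk at the same point.

```go
package main

import "fmt"

// slotMillis maps a series fingerprint to a fixed point inside the cycle
// (of length periodMs) that contains sampleMs. Hypothetical helper; assumes
// sampleMs >= 0 and periodMs > 0.
func slotMillis(fingerprint uint64, sampleMs, periodMs int64) int64 {
	startOfCycle := sampleMs - sampleMs%periodMs
	offset := int64(fingerprint % uint64(periodMs))
	return startOfCycle + offset
}

// shouldCloseHead reports whether adding a sample at sampleMs would make the
// head chunk (whose first sample is at headFirstMs) span the series' slot.
// It only checks the slot in the cycle that contains sampleMs.
func shouldCloseHead(fingerprint uint64, headFirstMs, sampleMs, periodMs int64) bool {
	slot := slotMillis(fingerprint, sampleMs, periodMs)
	return headFirstMs < slot && sampleMs >= slot
}

func main() {
	const periodMs = 1000
	const fp = 1234 // offset within each cycle is 1234 % 1000 = 234

	// Head chunk started at 5100; the slot for this cycle is 5234.
	fmt.Println(shouldCloseHead(fp, 5100, 5300, periodMs)) // sample crosses the slot
	fmt.Println(shouldCloseHead(fp, 5100, 5200, periodMs)) // sample is still before the slot
}
```

The decision uses sample time rather than wall-clock time at flush, so it does not depend on when a particular ingester happens to run its flush loop.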

Also add a test and fix an incorrect comment on memorySeries.add().

Trying this out in a staging environment, I see that the 'reason' stats are less helpful, because chunks that hit their timeslot are reported as "multiple chunks in series". We could add more metadata to report the different cases accurately.

EDIT: I blogged about the efficiency gain: https://www.weave.works/blog/how-i-halved-the-storage-of-cortex

Signed-off-by: Bryan Boreham <bryan@weave.works>


csmarchbanks

👍

bboreham added 3 commits August 14, 2019 08:13

Fix comment

1b67f8a

Signed-off-by: Bryan Boreham <bryan@weave.works>

Extend test utility with timestamp offset

39d3111

Signed-off-by: Bryan Boreham <bryan@weave.works>

Move ingester.spread-flushes from flush time to sample add time

941c906

So that all ingesters will make the same decision, hence chunks flushed to the DB are more likely to overlap. Also add a test. Signed-off-by: Bryan Boreham <bryan@weave.works>

csmarchbanks approved these changes Aug 15, 2019


bboreham merged commit 6beeeb2 into master Aug 16, 2019

bboreham deleted the spread-flushes-2 branch August 16, 2019 07:48

bboreham mentioned this pull request Aug 16, 2019

Reduce duplication when writing #607

Closed

csmarchbanks mentioned this pull request Aug 23, 2019

Update the changelog to reflect recent PRs #1605

Merged

bboreham mentioned this pull request Dec 17, 2019

Distributors OOM on a single slow ingester in the cluster #1895

Closed

bboreham mentioned this pull request Jan 14, 2020

Report chunks flushed by spread-flushes option under separate label #1978

Merged

