fix: Add time slicing to splitstore purging to reduce lock congestion #11269
Conversation
We likely either need to:
That's not to say that the current patch won't help. Even if we only pause for 1s, that means we'll be able to spend 50% of the time computing the tipset, which means it'll take ~6s on average instead of 3. But I'd still consider pausing for longer.
I.e., I'd consider pausing for 4s every 1s, or something like that. That gives us 20% utilization, which means it should only make blocks take 20% longer. Ideally we'd look at datastore metrics and determine that the datastore is "busy". But that's more complicated.
LGTM but this one needs an approval from @arajasek.
I don't see any harm in doing this, but I'm not sure I fully believe the logic here.
We're saying that the purging step takes 15 seconds, but operates in batches. The batch size is 16k, which seems relatively small to me. We take the lock for each batch.
In that case, we shouldn't be hogging the lock for the entire 15s, no? We're aggressively trying to grab it between batches, but we should still be releasing the lock for other operations in between batches. Is it the case that a single batch is taking 15s?
See comment about the premise itself. If we're landing, let's make the "work" time and "sleep" time constants (no need to be configurable yet).
```go
elapsed := time.Since(now)
if elapsed > time.Second {
	// work 1 second, sleep 4, or 20% utilization
	time.Sleep(4 * elapsed)
```
`4 * elapsed`? Do we not want `4 * time.Second` here?
If we get scheduled slightly over a second, this means we sleep a bit longer, which keeps utilization at 20%. For example, if the work slice ends up taking 1.2s we sleep 4.8s, so the purge uses 1.2s out of every 6s (20%), whereas sleeping a fixed 4s would give 1.2s out of 5.2s (~23%).
Yup, that makes sense.
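To tie the thread together, here is a minimal, self-contained sketch of the time-sliced loop being discussed. This is not the actual splitstore code: `processBatch` and `batches` are placeholders, and only the elapsed-time check mirrors the hunk above.

```go
package main

import "time"

// processBatch stands in for the real per-batch purge work plus the exclusive
// lock it takes; it is a placeholder for illustration only.
func processBatch(batch []string) {
	_ = batch
}

// purgeTimeSliced shows the shape of the time-sliced loop: do batched work
// until roughly one second has elapsed, then sleep for four times the elapsed
// time. Because the sleep scales with the measured elapsed time, the work
// fraction stays at or below 1/(1+4) = 20% even when a batch overruns the
// one-second slice.
func purgeTimeSliced(batches [][]string) {
	now := time.Now()
	for _, batch := range batches {
		processBatch(batch)

		elapsed := time.Since(now)
		if elapsed > time.Second {
			// work ~1 second, sleep 4x that: at most 20% utilization
			time.Sleep(4 * elapsed)
			now = time.Now()
		}
	}
}

func main() {
	purgeTimeSliced([][]string{{"cid1", "cid2"}, {"cid3"}})
}
```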
Yes, we release the lock between batches, but when applying messages we contend for that lock independently for each message, so if purging takes 15s we end up fighting to acquire the lock for those messages until the whole purging step is done.
@fridrik01 Aaah, the lock contention is for every single message application. Got it, that makes complete sense. Thank you!
Make `time.Second` a constant, but LGTM!
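As a sketch of the requested change, the work and sleep durations could be pulled out into named constants. The names and helper below are hypothetical, not necessarily what the patch uses:

```go
package splitstore // hypothetical package name, for illustration only

import "time"

// Hypothetical constant names; the review only asks that the "work" and
// "sleep" durations stop being inline literals.
const (
	purgeWorkSlice       = time.Second // work roughly this long before yielding
	purgeSleepMultiplier = 4           // sleep this many times the elapsed work time (~20% utilization)
)

// maybeYield sleeps when the current work slice has been exceeded and returns
// the start time of the next slice.
func maybeYield(sliceStart time.Time) time.Time {
	elapsed := time.Since(sliceStart)
	if elapsed > purgeWorkSlice {
		time.Sleep(purgeSleepMultiplier * elapsed)
		return time.Now()
	}
	return sliceStart
}
```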
Related Issues
See #11251
Context
When splitstore is enabled we perform compaction every 7h30m. During this compaction (which takes several minutes on my fast SSD) there is a relatively small purging step which takes approx. 15s. The purging step chunks the cids that should be purged and operates on them in batches. For each batch it acquires an exclusive lock which it shares with other I/O operations (for example the FVM when applying messages).
So while purging happens, other operations like MpoolSelect block for the whole 15s whenever they need to compute tipset state (TipSetState).
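Below is a toy model of that contention, assuming a single exclusive mutex shared by the purge batches and per-message state computation; the real splitstore lock, batch sizes, and timings differ. It shows why message application is slowed for roughly the whole purge duration even though the lock is released between batches.

```go
package main

import (
	"fmt"
	"sync"
	"time"
)

func main() {
	var mu sync.Mutex
	var wg sync.WaitGroup

	// "Purge": many short batches, each taking the exclusive lock briefly.
	wg.Add(1)
	go func() {
		defer wg.Done()
		for i := 0; i < 200; i++ {
			mu.Lock()
			time.Sleep(5 * time.Millisecond) // one batch of deletes
			mu.Unlock()
		}
	}()

	// "Message application": each message also needs the lock, so computing a
	// tipset state that applies many messages crawls for as long as the purge
	// keeps re-acquiring the lock, even though no single batch holds it long.
	wg.Add(1)
	go func() {
		defer wg.Done()
		start := time.Now()
		for i := 0; i < 50; i++ {
			mu.Lock()
			time.Sleep(time.Millisecond) // apply one message
			mu.Unlock()
		}
		fmt.Println("applied 50 messages in", time.Since(start))
	}()

	wg.Wait()
}
```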
Proposed Changes
I added simple time slicing to the purging step: it purges for 1s, then pauses for 1s, and repeats until it is done. This effectively makes the purge 2x slower, but I don't think we care much since it only happens every 7h30m and purging is already relatively fast compared to the whole splitstore compaction.
Test plan
Using the same testing method as described in this comment, I observed that with this fix TipSetState during splitstore compaction went down from 15s to ~2s.
Additional Info
See #11251 (comment)
Checklist
Before you mark the PR ready for review, please make sure that:
- The PR title is in the form of `<PR type>: <area>: <change being made>`
  - example: `fix: mempool: Introduce a cache for valid signatures`
  - `PR type`: fix, feat, build, chore, ci, docs, perf, refactor, revert, style, test
  - `area`, e.g. api, chain, state, market, mempool, multisig, networking, paych, proving, sealing, wallet, deps