feat(sync): add `BufferStage` #2066

Mirko-von-Leipzig · 2024-06-10T09:32:52Z

This PR adds the BufferStage trait to the sync stages framework.

This trait and its associated SyncReceiver::buffer method abstracts over the common pattern of taking a p2p element stream and transforming it into a stream of block data.

The abstraction is not entirely perfect; but I think this will help simplify things.

As a demonstration I've implemented this (somewhat hackily) for the transaction checkpoint sync (without connecting it to anything yet). For tracking sync, we would implement something similar that simply receives these counts from the header fanout.

Transaction source from p2p then becomes a normal transaction stream, which is then connected to one the above stages.

Mirko-von-Leipzig · 2024-06-10T09:38:08Z

This also doesn't really account for something weird like StateDiff counting - though I think one can handle this with an additional method in the trait.

crates/pathfinder/src/sync/stream.rs

sistemd · 2024-06-10T12:07:46Z

It remains to be seen how well this will fit with all of the use cases, but looks promising 👍

Abstracts over the common scenario of buffering a stream of items into a stream of blocks.

This stage is used to split a stream of transactions into a stream of block's of transactions.

Remove BufferStage::T since it wasn't linked to usage directly. Reword BufferStage::Meta -> AdditionalData Co-authored-by: Nikša Sporin <niksa@equilibrium.co>

Mirko-von-Leipzig · 2024-06-14T09:58:56Z

Sounded great; has a pretty fatal flaw in practice.

A source cannot be separate from the process of transforming the input item stream into a stream of block data (which is what BufferStage was doing).

As it stands, if the origin p2p stream fails, or ends early there is no way for the source to know at which block number it should re-attempt from - only the buffer stage knows how many blocks of data were actually processed.

This implies that the source stream and the transformation stage should instead be a single unified system.

Mirko-von-Leipzig requested a review from a team as a code owner June 10, 2024 09:32

Mirko-von-Leipzig mentioned this pull request Jun 10, 2024

Tracking sync sources shouldn't wait for headers #2067

Open

sistemd reviewed Jun 10, 2024

View reviewed changes

crates/pathfinder/src/sync/stream.rs Outdated Show resolved Hide resolved

sistemd reviewed Jun 10, 2024

View reviewed changes

crates/pathfinder/src/sync/stream.rs Outdated Show resolved Hide resolved

sistemd approved these changes Jun 10, 2024

View reviewed changes

Mirko-von-Leipzig and others added 4 commits June 10, 2024 16:07

feat(sync): buffer stage

ad710e3

Abstracts over the common scenario of buffering a stream of items into a stream of blocks.

feat(sync): DatabaseBlockBuffer for transactions

91607a9

This stage is used to split a stream of transactions into a stream of block's of transactions.

chore(sync): simplify BufferStage

c68d79b

Remove BufferStage::T since it wasn't linked to usage directly. Reword BufferStage::Meta -> AdditionalData Co-authored-by: Nikša Sporin <niksa@equilibrium.co>

chore: rebase fixups

bd9dab1

Mirko-von-Leipzig force-pushed the mirko/buffer_stage branch from 7c374ef to bd9dab1 Compare June 10, 2024 14:13

Mirko-von-Leipzig closed this Jun 14, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(sync): add `BufferStage` #2066

feat(sync): add `BufferStage` #2066

Mirko-von-Leipzig commented Jun 10, 2024

Mirko-von-Leipzig commented Jun 10, 2024

sistemd commented Jun 10, 2024

Mirko-von-Leipzig commented Jun 14, 2024

feat(sync): add BufferStage #2066

feat(sync): add BufferStage #2066

Conversation

Mirko-von-Leipzig commented Jun 10, 2024

Mirko-von-Leipzig commented Jun 10, 2024

sistemd commented Jun 10, 2024

Mirko-von-Leipzig commented Jun 14, 2024

feat(sync): add `BufferStage` #2066

feat(sync): add `BufferStage` #2066