ref impl: driver agent: respond to head events and stay in sync #131

protolambda · 2022-01-10T23:31:58Z

Part of #119: staging -> main migration

This:

EngineDriver is like a binding to a remote engine, tracking to where it's synced.
Drive is a sync process that can be started/shutdown per remote engine, and responds to head events from L1, and syncs (happy extend-case, or worst-case sync with reorg) the L2 chain by running a driver-step on the right starting-point. There is only a single active driving process at a time per engine.

Depends on #130

Review: any team, systems team preferred.

Spec: Do we want to go-indepth, like back-off strategy and failure-cases of sync in the spec? Or maybe just the "if out of sync, run a driver-step on with L1 block X_n and L2 block parent Y_{n-1}" summary of driver routine behavior? (we already more or less have the latter)

Testing: this is the most time/event/integration heavy of all of the reference implementation. Only a single event hub, but still challenging to unit-test. It works e2e though. Any suggestions/help how to best test this?

opnode/l2/driver.go

norswap · 2022-01-18T13:40:20Z

Quick question: this is divergent from staging, I assume this is the canonical version?

norswap

Nice! I think the architecture that we conceptually have three concurrent loops for checking l1 head, l2 head and taking a sync step (with the sync loop slowing down each time it doesn't move closer to sync) is pretty neat.

Also something that could use a description. I'm thinking we probably need some markdown implementation notes.

opnode/l2/driver.go

norswap · 2022-01-18T14:02:54Z

opnode/l2/driver.go

+					e.Log.Error("failed to fetch L2 head info", "err", err)
+				}
+				cancel()
+				continue


Don't we want to verify that the L2 head of the execution engine is either:

something we're already aware of

congruent with the referenced L1 block (i.e. valid) (in the case this is a head that was synced by the execution engine that we didn't set ourselves)

We only need to track what the engine says it has to determine if we can go through the happy sync path. We trust our engine to have L2 blocks with correct L1 information (since we inserted it and set it with forkchoice updates). FindSyncStart can handle any out-of-sync cases.

But if we're making the assumption that we are the one inserting the L2 blocks, what's the purpose of getting the execution engine to tell us about new heads (which we already know about & track).

I guess the point of this is being a stub for later, when we do have L2-level sync, at which point we will need to validate the block, right?

I guess the point of this is being a stub for later, when we do have L2-level sync, at which point we will need to validate the block, right?

We don't need to validate, state-sync via engine is always towards a block-hash we already trust (i.e. we messaged the engine with a forkchoice update, finalized or head, depending on trust assumptions). And even when the engine has the wrong head, we can just reorg away if it's not canonical.

opnode/l2/driver.go

protolambda · 2022-01-19T16:21:42Z

Rebased on base branch

protolambda · 2022-01-19T16:30:29Z

Went back and forth whether or not we need refL1 in the FindSyncStart return params, but we just need a name update to nextRefL1 in the driver.

…date

protolambda · 2022-01-19T18:25:32Z

Rebased on the updated impl-sync-start PR (rebased that on main)

trianglesphere · 2022-01-20T16:59:26Z

Testing: this is the most time/event/integration heavy of all of the reference implementation. Only a single event hub, but still challenging to unit-test. It works e2e though. Any suggestions/help how to best test this?

My recommendation for this is to make the Drive loop as small and self contained as possible. With explicit inputs, it's easy too set up a unit test. Unfortunately having timers does make this harder, but not impossible.

protolambda · 2022-01-20T21:40:05Z

@trianglesphere implemented your suggestion (large refactor, no functional changes, but should pay off in test-ability), can you review?

trianglesphere

Nice work on the refactor. I think pulling it apart like this teases out the fundamental structure and makes it a bit easier to understand.

One comment was for more comments, and the other is about what I think's a bug. Feel free to merge this once you think you've got it resolved.

opnode/l2/driver_state.go

codecov-commenter · 2022-01-21T15:53:42Z

Codecov Report

Merging #131 (94dba5a) into main (f5d7631) will decrease coverage by 6.29%.
The diff coverage is 8.84%.

@@            Coverage Diff             @@
##             main     #131      +/-   ##
==========================================
- Coverage   40.56%   34.26%   -6.30%     
==========================================
  Files           8       11       +3     
  Lines         604      750     +146     
==========================================
+ Hits          245      257      +12     
- Misses        339      468     +129     
- Partials       20       25       +5

Impacted Files	Coverage Δ
opnode/l2/driver.go	`0.00% <0.00%> (ø)`
opnode/l2/driver_loop.go	`0.00% <0.00%> (ø)`
opnode/l2/driver_state.go	`14.47% <14.47%> (ø)`
opnode/l2/sync_start.go	`68.57% <100.00%> (+0.45%)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update f5d7631...94dba5a. Read the comment docs.

protolambda added the C-ref-impl label Jan 10, 2022

protolambda added this to the Deposit Rollup Release Candidate milestone Jan 10, 2022

protolambda self-assigned this Jan 10, 2022

This was referenced Jan 10, 2022

EPIC CHECKLIST: Review/Integrate staged deposit-only ref impl work #119

Closed

ref impl: main node process (connect with RPC, start drivers, handle shutdown) #132

Merged

trianglesphere reviewed Jan 12, 2022

View reviewed changes

opnode/l2/driver.go Outdated Show resolved Hide resolved

norswap approved these changes Jan 18, 2022

View reviewed changes

protolambda force-pushed the impl-driver branch from 153473a to 538412b Compare January 19, 2022 16:21

protolambda force-pushed the impl-driver branch from 3415946 to 8c74a7a Compare January 19, 2022 16:27

protolambda force-pushed the impl-sync-start branch 2 times, most recently from ad0b2c8 to f5d7631 Compare January 19, 2022 18:17

protolambda added 4 commits January 19, 2022 19:23

ref impl: driver agent: respond to head events and stay in sync

acf6a15

ref impl: no ctx in driver struct

26d812d

ref impl: driver - fix naming of nextRefL1 in driver, clarify head up…

e3b8a00

…date

ref impl: undo extra refL1 return param

6314c80

protolambda force-pushed the impl-driver branch from 8c74a7a to 6314c80 Compare January 19, 2022 18:24

protolambda mentioned this pull request Jan 20, 2022

[Do not merge] Merge staging to main #98

Closed

Base automatically changed from impl-sync-start to main January 20, 2022 18:42

protolambda added 4 commits January 20, 2022 21:54

ref impl: refactor driver into separate engine, loop and state

a18d078

ref impl: include testlog for testing

092adcd

ref impl: test requestSync of driver state

b3b11d1

specs/rollup-node: fix typo

3b27b60

protolambda requested a review from trianglesphere January 20, 2022 22:07

trianglesphere approved these changes Jan 21, 2022

View reviewed changes

opnode/l2/driver_state.go Show resolved Hide resolved

opnode/l2/driver_state.go Show resolved Hide resolved

ref impl: document the l2 StateMachine and Driver interfaces

94dba5a

protolambda merged commit 4493ff4 into main Jan 21, 2022

protolambda deleted the impl-driver branch January 21, 2022 15:54

trianglesphere mentioned this pull request Jan 25, 2022

Tighten up the driver and drive_step code #150

Closed

trianglesphere added implementation and removed C-ref-impl labels Mar 9, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ref impl: driver agent: respond to head events and stay in sync #131

ref impl: driver agent: respond to head events and stay in sync #131

protolambda commented Jan 10, 2022 •

edited

Loading

norswap commented Jan 18, 2022

norswap left a comment

norswap Jan 18, 2022

protolambda Jan 19, 2022

norswap Jan 20, 2022

protolambda Jan 20, 2022

protolambda commented Jan 19, 2022

protolambda commented Jan 19, 2022

protolambda commented Jan 19, 2022

trianglesphere commented Jan 20, 2022

protolambda commented Jan 20, 2022

trianglesphere left a comment

codecov-commenter commented Jan 21, 2022 •

edited

Loading

ref impl: driver agent: respond to head events and stay in sync #131

ref impl: driver agent: respond to head events and stay in sync #131

Conversation

protolambda commented Jan 10, 2022 • edited Loading

norswap commented Jan 18, 2022

norswap left a comment

Choose a reason for hiding this comment

norswap Jan 18, 2022

Choose a reason for hiding this comment

protolambda Jan 19, 2022

Choose a reason for hiding this comment

norswap Jan 20, 2022

Choose a reason for hiding this comment

protolambda Jan 20, 2022

Choose a reason for hiding this comment

protolambda commented Jan 19, 2022

protolambda commented Jan 19, 2022

protolambda commented Jan 19, 2022

trianglesphere commented Jan 20, 2022

protolambda commented Jan 20, 2022

trianglesphere left a comment

Choose a reason for hiding this comment

codecov-commenter commented Jan 21, 2022 • edited Loading

Codecov Report

protolambda commented Jan 10, 2022 •

edited

Loading

codecov-commenter commented Jan 21, 2022 •

edited

Loading