PoC: FVM Debug Dual Execution #8841

vyzo · 2022-06-10T14:12:01Z

Depends on:

How it works

When LOTUS_FVM_DEBUG=1, dual execution is triggered with actor debugging for side effect. If also LOTUS_FVM_DEBUG_BUNDLE_V8 is also specified, then the bundle is loaded and execution is redirected from the canonical (consensus) actors to the actors in thr bundle during debug execution.

Some niceties to consider

Cache at init the debug bundle in an in memory overlay blockstore and reuse it in each debug execution. This will eliminate load iverhead.
Add an output syscall to fvm that captures output during debug execution and is noop otherwise.

vyzo · 2022-06-10T14:15:10Z

CI will have to rerun once the ffi helper publishes the relevant build artifacts (not there yet, have been testing locally with a devnet)

arajasek

Thank you for taking this on! As a PoC, I think this looks good. I think using envvars to dictate where to look for potential overrides is good enough.

The biggest thing I'd like to see is some way to automatically make this happen for select state computations (eg. when estimating gas, or when StateReplay is called). That is not directly relevant to this work, but really needs to happen soon.

chain/vm/fvm.go

jennijuju · 2022-06-22T03:20:48Z

@arajasek id prefer this PR to be base off master, let me know your thoughts!

arajasek · 2022-06-22T14:41:15Z

@arajasek id prefer this PR to be base off master, let me know your thoughts!

I'm not sure -- there are users who want to use them, and ideally they would be able to do so in the minimum release.

But it is ultimately nice-to-have, and any change late in the release process is undesirable, so happy to go into master instead.

vyzo · 2022-06-22T14:42:38Z

I'll leave the rebase up to you, updating the code now; I just have to move the util functions and it's done.

arajasek · 2022-06-28T23:19:01Z

chain/vm/vmi.go

 		return NewFVM(ctx, opts)
 	}

 	// Remove after v16 upgrade, this is only to support testing and validation of the FVM
 	if useFvmForMainnetV15 && opts.NetworkVersion >= network.Version15 {
+		if os.Getenv("LOTUS_FVM_DEBUG") == "1" {


Okay with this duplication cuz it's getting dropped soon

codecov · 2022-06-28T23:41:38Z

Codecov Report

Merging #8841 (f9cf25f) into master (cafb110) will decrease coverage by 0.11%.
The diff coverage is 12.33%.

❗ Current head f9cf25f differs from pull request most recent head 906463b. Consider uploading reports for the commit 906463b to get more accurate results

@@            Coverage Diff             @@
##           master    #8841      +/-   ##
==========================================
- Coverage   40.73%   40.62%   -0.12%     
==========================================
  Files         705      705              
  Lines       78574    78692     +118     
==========================================
- Hits        32009    31965      -44     
- Misses      41101    41239     +138     
- Partials     5464     5488      +24

Impacted Files	Coverage Δ
chain/actors/manifest.go	`82.35% <0.00%> (-9.46%)`	⬇️
chain/gen/genesis/genesis.go	`46.29% <0.00%> (ø)`
chain/vm/vmi.go	`40.00% <0.00%> (-26.67%)`	⬇️
chain/vm/fvm.go	`29.61% <7.08%> (-14.93%)`	⬇️
chain/consensus/filcns/upgrades.go	`33.68% <66.66%> (-0.07%)`	⬇️
chain/stmgr/utils.go	`25.78% <66.66%> (+0.24%)`	⬆️
storage/wdpost/wdpost_sched.go	`75.49% <0.00%> (-5.89%)`	⬇️
chain/consensus/filcns/weight.go	`70.58% <0.00%> (-5.89%)`	⬇️
storage/pipeline/currentdealinfo.go	`71.42% <0.00%> (-4.77%)`	⬇️
chain/events/events_called.go	`83.90% <0.00%> (-4.40%)`	⬇️
... and 21 more

chain/consensus/filcns/upgrades.go

itests/lite_migration_test.go

chain/vm/vmi.go

chain/actors/manifest.go

chain/vm/fvm.go

Stebalien · 2022-06-29T04:43:47Z

chain/vm/fvm.go

+	go func() {
+		defer wg.Done()
+		ret, err = vm.main.ApplyMessage(ctx, cmsg)
+	}()
+
+	go func() {
+		defer wg.Done()
+		if _, err := vm.debug.ApplyMessage(ctx, cmsg); err != nil {
+			log.Errorf("debug execution failed: %w", err)
+		}
+	}()


So... this doesn't work as the debug messages will have different gas fees. That means:

The balances will be different.

Some messages may just fail.

This will lead to really annoying and hard to debug issues. We can merge this as a WIP_JUST_TESTING patch, but we need to make that clear.

The right way to do this would be to:

Apply in debug mode (disabling gas accounting?). Balances will be correct because we charge for gas by charging for the gas limit up-front, then refunding any leftovers. So the balance will be correct for the duration of the message execution, just not after the message is done executing.

Revert.

Apply normally.

Go back to 1 for the next message.

But:

We can't currently perform that revert.

That is even more likely to cause problems...

I think the answer here is to leave this as a super experimental feature, but be very clear that this is just for debugging and absolutely not to be relied on.

Yeah, I'll throw on a disclaimer.

Honestly, the biggest use I envision getting out of this is actually intentionally failing messages, and using the actor error string to convey debug information (eg. adding actor_error!("i reached this if branch because the miner power is non-zero"); to the miner_actor).

chain/consensus/filcns/upgrades.go

Stebalien · 2022-06-29T17:51:23Z

chain/consensus/filcns/upgrades.go

+	var newManifestData manifest.ManifestData
 	if err := store.Get(ctx, newManifest.Data, &newManifestData); err != nil {


This will cause us to load the ManifestData twice, right?

Do we even use this?

Yes, solely to check the number of entries, which I can't do from newManifest directly because there's no way (yet) to count the number of entries in a Manifest...

chain/stmgr/utils.go

vyzo requested a review from a team as a code owner June 10, 2022 14:12

vyzo marked this pull request as draft June 10, 2022 14:14

vyzo changed the base branch from master to release/v1.16.0 June 10, 2022 14:14

vyzo marked this pull request as ready for review June 10, 2022 17:42

arajasek reviewed Jun 14, 2022

View reviewed changes

chain/vm/fvm.go Outdated Show resolved Hide resolved

chain/vm/fvm.go Outdated Show resolved Hide resolved

chain/vm/fvm.go Outdated Show resolved Hide resolved

chain/vm/fvm.go Outdated Show resolved Hide resolved

vyzo mentioned this pull request Jun 15, 2022

Canonicalizing Dual Execution for Node/Application-specific Processing #8867

Open

arajasek force-pushed the feat/debug-execution branch from 8c0675d to 19c9e87 Compare June 23, 2022 03:58

arajasek changed the base branch from release/v1.16.0 to master June 23, 2022 15:39

arajasek force-pushed the feat/debug-execution branch 3 times, most recently from 865d5a3 to 330344f Compare June 28, 2022 20:54

feat: FVM Debug Dual Execution

3ad2fc5

arajasek force-pushed the feat/debug-execution branch from 330344f to 8e4d42c Compare June 28, 2022 23:17

arajasek reviewed Jun 28, 2022

View reviewed changes

an attempt at cleanup

a52d584

arajasek force-pushed the feat/debug-execution branch from 8e4d42c to a52d584 Compare June 28, 2022 23:24

Stebalien reviewed Jun 29, 2022

View reviewed changes

address review

f9cf25f

Stebalien reviewed Jun 29, 2022

View reviewed changes

more review

906463b

Stebalien approved these changes Jun 29, 2022

View reviewed changes

arajasek merged commit 709fe5c into master Jun 29, 2022

arajasek deleted the feat/debug-execution branch June 29, 2022 18:35

simlecode mentioned this pull request Sep 23, 2022

feat: fvm增加debug功能 / FVM Debug Dual Execution filecoin-project/venus#5319

Merged

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

PoC: FVM Debug Dual Execution #8841

PoC: FVM Debug Dual Execution #8841

vyzo commented Jun 10, 2022 •

edited by arajasek

Loading

vyzo commented Jun 10, 2022

arajasek left a comment •

edited

Loading

jennijuju commented Jun 22, 2022

arajasek commented Jun 22, 2022

vyzo commented Jun 22, 2022

arajasek Jun 28, 2022

codecov bot commented Jun 28, 2022 •

edited

Loading

Stebalien Jun 29, 2022

arajasek Jun 29, 2022

Stebalien Jun 29, 2022

Stebalien Jun 29, 2022

arajasek Jun 29, 2022

		var newManifestData manifest.ManifestData
		if err := store.Get(ctx, newManifest.Data, &newManifestData); err != nil {

PoC: FVM Debug Dual Execution #8841

PoC: FVM Debug Dual Execution #8841

Conversation

vyzo commented Jun 10, 2022 • edited by arajasek Loading

How it works

Some niceties to consider

vyzo commented Jun 10, 2022

arajasek left a comment • edited Loading

Choose a reason for hiding this comment

jennijuju commented Jun 22, 2022

arajasek commented Jun 22, 2022

vyzo commented Jun 22, 2022

arajasek Jun 28, 2022

Choose a reason for hiding this comment

codecov bot commented Jun 28, 2022 • edited Loading

Codecov Report

Stebalien Jun 29, 2022

Choose a reason for hiding this comment

arajasek Jun 29, 2022

Choose a reason for hiding this comment

Stebalien Jun 29, 2022

Choose a reason for hiding this comment

Stebalien Jun 29, 2022

Choose a reason for hiding this comment

arajasek Jun 29, 2022

Choose a reason for hiding this comment

vyzo commented Jun 10, 2022 •

edited by arajasek

Loading

arajasek left a comment •

edited

Loading

codecov bot commented Jun 28, 2022 •

edited

Loading