ChainIndexer: make epoch validation more comprehensive #12570

rvagg · 2024-10-10T03:09:12Z

Currently we are just comparing counts of things, assuming that if we have the right number of messages, events and event entries, then the epoch is properly indexed. This isn't necessarily true and it would be good to validate in more detail. Events are particularly hard since there is a lot of data to be fetched and compared. We already have the data from the blockstore though, so that's not an additional expense (we need to load an entire AMT just to get its length so we know what size to compare).

SQLite has some sha3 functionality. It's not clear to me if these are available programatically or if they are only available on the sqlite3 cli. If they are available programatically then we could generate hashes of specific columns in the database and compare that digest to a locally generated digest of the same fields. A select over event fields: emitter, event_index, message (cid?), and the associated event_entry fields: indexed, flags, key, codec, and maybe even value to do the whole lot, and then a sha3 of the results. Attempting to reconstruct this locally would be an interesting exercise.

Unfortunately this would tie us to SQLite functionality, so if we made the database pluggable we'd want to abstract this away somehow so that we can do something similar for another database.

Or, alternatively: just do a bulk select and compare everything in one go.

The text was updated successfully, but these errors were encountered:

rvagg · 2024-10-10T03:18:57Z

Another proposal that was raised at one point for this was to recalculate the AMT root of each of the messages' events and compare that to what the receipt says. That way we wouldn't even need to load the AMT from the blockstore and it may end up being more efficient. In theory we should have everything we need to do a lossless reconstruction. The only catch I see is that in ChainIndexer, as in the prior event storage code, we skip over any event where we can't look up the address for the actor ID. I don't know if in practice we have any epochs where this has ever happened, but it's a possibility in the code that such an event won't be stored.

Instead of relying just on entry counts, compare the regenerated AMT root using just what we have in the db with the message receipt event root. This should tell us precisely that we have what we should or not. Ref: #12570

rvagg · 2024-10-10T05:31:09Z

I finally decided to investigate the practicality of AMT comparison: #12571

Still some issues with addresses to resolve, and it's unfortunately slower than just counting, which I didn't expect but I also think some of that may be due to using the RPC from lotus-shed and it may be quicker when done in-process.

Instead of relying just on entry counts, compare the regenerated AMT root using just what we have in the db with the message receipt event root. This should tell us precisely that we have what we should or not. Ref: #12570

#12571) Instead of relying just on entry counts, compare the regenerated AMT root using just what we have in the db with the message receipt event root. This should tell us precisely that we have what we should or not. Ref: #12570

aarshkshah1992 · 2024-10-23T10:35:08Z

Closed by #12632.

rvagg added the area/events label Oct 10, 2024

rvagg added this to FilOz Oct 10, 2024

github-project-automation bot moved this to 📌 Triage in FilOz Oct 10, 2024

rvagg mentioned this issue Oct 10, 2024

feat(events): compare-amt option for lotus-shed indexes inspect-events #12571

Merged

rvagg mentioned this issue Oct 10, 2024

feat: migration("re-indexing"), backfilling and diasgnostics tooling for the ChainIndexer #12450

Merged

1 task

aarshkshah1992 mentioned this issue Oct 14, 2024

ChainValidateIndex should validate the actual content of the events and not just match on the number of events #12594

Closed

rjan90 moved this from 📌 Triage to 🐱 Todo in FilOz Oct 22, 2024

rjan90 added this to the DX-Streamline milestone Oct 22, 2024

aarshkshah1992 mentioned this issue Oct 23, 2024

feat(chainindex): compare events AMT root between Index and chain state for validation #12632

Merged

aarshkshah1992 closed this as completed Oct 23, 2024

github-project-automation bot moved this from 🐱 Todo to 🎉 Done in FilOz Oct 23, 2024

rjan90 moved this from 🎉 Done to ☑️ Done (Archive) in FilOz Oct 29, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ChainIndexer: make epoch validation more comprehensive #12570

ChainIndexer: make epoch validation more comprehensive #12570

rvagg commented Oct 10, 2024

rvagg commented Oct 10, 2024

rvagg commented Oct 10, 2024

aarshkshah1992 commented Oct 23, 2024

ChainIndexer: make epoch validation more comprehensive #12570

ChainIndexer: make epoch validation more comprehensive #12570

Comments

rvagg commented Oct 10, 2024

rvagg commented Oct 10, 2024

rvagg commented Oct 10, 2024

aarshkshah1992 commented Oct 23, 2024