Tracking: Add ETL
to HashingStages
& HistoryStages
#6909
Labels
A-staged-sync
Related to staged sync (pipelines and stages)
C-enhancement
New feature or request
M-prevent-stale
Prevents old inactive issues/PRs from being closed due to inactivity
Feature
We should use the ETL Collector on these stages as to decrease both first sync duration and initial write-amplification. These stages use hashes as keys which lead to performance/storage degradation when using
tx.insert
(current) vstx.append
(with etl). More here.The flow is very straightforward.
Collector
.commit_threshold
now becomes the maximum chunk size to hold in memory. More here.. This also means that we commit all data in one go.tx.append
tx.insert
Example usage on:
TransactionLookupStage (took it from >5h to 20-30min and 157 GiB to 107GiB)
HeaderStage
Stages:
ETL
to Hashing Stages #7030ETL
toHistoryStages
#7249Additional context
No response
The text was updated successfully, but these errors were encountered: