# horizon: `history_claimable_balances` is not cleared out by the reaper #4396
We have three tables which aren't cleaned up when deleting a range. Those are:

- `history_accounts`
- `history_claimable_balances`
- `history_liquidity_pools`

They all pair an entity (claimable balance, liquidity pool, account) with an internal id.

In theory, every time we clear out a range of data (either for reingesting or for reaping) we should also remove any orphan entries in the tables above (an orphan entry is one whose internal id isn't referenced anymore). However, the queries required to do so may be too expensive. As I see it we could either:
I think both (1) and (2) may be too expensive for large enough tables.
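For reference, this kind of cleanup boils down to an anti-join between the lookup table and every table that references it. A minimal sketch for claimable balances, with table and column names assumed from Horizon's schema rather than taken from any actual implementation:

```go
// Hypothetical sketch: remove lookup rows that no other table references.
// The DELETE has to prove a negative for every row, so it walks the
// referencing tables (or their indexes) in full, which is what makes it
// expensive on a large database.
package main

import (
	"database/sql"

	_ "github.com/lib/pq" // assumed Postgres driver
)

func deleteOrphanedClaimableBalances(db *sql.DB) (int64, error) {
	res, err := db.Exec(`
		DELETE FROM history_claimable_balances hcb
		WHERE NOT EXISTS (
			SELECT 1 FROM history_operation_claimable_balances ocb
			WHERE ocb.history_claimable_balance_id = hcb.id
		)
		AND NOT EXISTS (
			SELECT 1 FROM history_transaction_claimable_balances tcb
			WHERE tcb.history_claimable_balance_id = hcb.id
		)`)
	if err != nil {
		return 0, err
	}
	return res.RowsAffected()
}
```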
Maybe @sydneynotthecity has a better querying suggestion?
Do you know what query cost would be too expensive? Another option is that we could create a trigger that tracked any record deletions in the affected tables. Once the records were cleared out, we could update an indicator in the event tables.
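To make the trigger idea concrete, here is a rough sketch under assumed names: an `AFTER DELETE` trigger on one of the referencing tables records which lookup ids lost a reference, so a later cleanup pass only has to re-check those ids instead of scanning the whole table. None of these objects exist in Horizon; this only illustrates the suggestion above:

```go
// Hypothetical sketch of the deletion-tracking trigger suggested above.
// All table, function and trigger names are made up for illustration.
package main

import (
	"database/sql"

	_ "github.com/lib/pq" // assumed Postgres driver
)

func installDeletionTracking(db *sql.DB) error {
	_, err := db.Exec(`
		CREATE TABLE IF NOT EXISTS claimable_balance_deleted_refs (
			history_claimable_balance_id bigint PRIMARY KEY
		);

		CREATE OR REPLACE FUNCTION track_claimable_balance_deletion()
		RETURNS trigger AS $$
		BEGIN
			-- Remember which lookup id lost a reference; a later cleanup
			-- pass only needs to re-check these ids for orphanhood.
			INSERT INTO claimable_balance_deleted_refs (history_claimable_balance_id)
			VALUES (OLD.history_claimable_balance_id)
			ON CONFLICT DO NOTHING;
			RETURN OLD;
		END;
		$$ LANGUAGE plpgsql;

		CREATE TRIGGER track_claimable_balance_deletion_trigger
		AFTER DELETE ON history_operation_claimable_balances
		FOR EACH ROW EXECUTE FUNCTION track_claimable_balance_deletion();`)
	return err
}
```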
@2opremio the other point is the indices. It appears that the equivalent indices needed for this cleanup haven't been added yet.
Let's wait for the indices to be added and retake this after that.
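For context on why those indices matter: the orphan check has to probe each referencing table for a given lookup id, so without an index on the referencing column every probe degenerates into a sequential scan. A sketch of the kind of index involved, with index and column names assumed rather than taken from the actual migration:

```go
// Hypothetical sketch: index the columns that point at the lookup table so
// the NOT EXISTS probes can use an index scan. Names are assumptions.
package main

import (
	"database/sql"

	_ "github.com/lib/pq" // assumed Postgres driver
)

func createLookupIndexes(db *sql.DB) error {
	for _, stmt := range []string{
		`CREATE INDEX IF NOT EXISTS idx_ocb_on_claimable_balance_id
		     ON history_operation_claimable_balances (history_claimable_balance_id)`,
		`CREATE INDEX IF NOT EXISTS idx_tcb_on_claimable_balance_id
		     ON history_transaction_claimable_balances (history_claimable_balance_id)`,
	} {
		if _, err := db.Exec(stmt); err != nil {
			return err
		}
	}
	return nil
}
```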
(#4518) While Horizon removes history data when the `--history-retention-count` flag is set, it doesn't clear the lookup historical tables. Lookup tables are `[id, key name]` pairs that allow setting pointers to keys in historical tables, thus saving disk space. This data can occupy a vast amount of space on disk and is never used once the old historical data is deleted.

This commit adds code responsible for clearing orphaned rows in lookup historical tables. Orphaned rows can appear when old data is removed by the reaper. The new code is separate from the existing reaper code (see "Alternative solutions" below) and activates after each ledger if there are no more ledgers to ingest in the backend. This has two advantages: it does not slow down catchup, and it runs only when ingestion is idle, so it shouldn't affect ingestion at all. To ensure performance is not affected, the `ReapLookupTables` method is called with a context with a 5-second timeout, which means that if it does not finish the work in the specified time it is simply cancelled. The solution here requires the new indexes added in c2d52f0 (without them, finding the rows to delete is slow).

For each lookup table, we check the number of occurrences of a given lookup ID in all the tables in which the lookup table is used. If no occurrences are found, the row is removed from the lookup table. Rows are removed in batches of 10000 rows (this can be modified in the future). The cursor is updated when a table is processed, so after the next ledger is ingested the next chunk of rows is checked. When the cursor reaches the end of the table it is reset back to 0. This ensures that all the orphaned rows are removed eventually (some rows can be skipped, because new rows are added to lookup tables by ingestion and some are removed by the reaper, so the `offset` does not always skip to the place it should to cover the entire table).

#### Alternative solutions

While working on this I tried to implement @fons's idea from #4396, which was removing the rows that are not present in other ranges before clearing historical data. There is a general problem with this solution: the lookup tables are actively used by ingestion, which means that if rows are deleted while ingestion reads a given row it can create inconsistent data. We could modify the reaper to acquire the ingestion lock, but if there are many ledgers to remove it can affect ingestion. We could also write a query that finds and removes all the orphaned rows, but it's too slow to be executed between the ingestion of two consecutive ledgers.
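A simplified, hypothetical sketch of that mechanism for a single lookup table, using made-up function and column names rather than the actual `ReapLookupTables` code: each call works under a 5-second context timeout, scans one batch of ids starting at the cursor, deletes the unreferenced rows, and returns the cursor for the next call, resetting to 0 at the end of the table:

```go
// Hypothetical sketch of batched lookup-table reaping for claimable balances.
// Identifiers are illustrative; this is not the Horizon implementation.
package main

import (
	"context"
	"database/sql"
	"time"

	_ "github.com/lib/pq" // assumed Postgres driver
)

const reapBatchSize = 10000

// reapClaimableBalanceLookup scans one batch of history_claimable_balances
// starting after cursor, deletes rows that no other table references, and
// returns the cursor for the next call (0 once the end of the table is
// reached, so the scan wraps around).
func reapClaimableBalanceLookup(ctx context.Context, db *sql.DB, cursor int64) (int64, error) {
	// Bound the work so a slow batch cannot delay ingestion of the next
	// ledger: cancel everything after 5 seconds.
	ctx, cancel := context.WithTimeout(ctx, 5*time.Second)
	defer cancel()

	// Upper bound of this batch, computed before deleting anything.
	var upper sql.NullInt64
	err := db.QueryRowContext(ctx, `
		SELECT max(id) FROM (
			SELECT id FROM history_claimable_balances
			WHERE id > $1 ORDER BY id LIMIT $2
		) batch`, cursor, reapBatchSize).Scan(&upper)
	if err != nil {
		return 0, err
	}
	if !upper.Valid {
		return 0, nil // past the end of the table: reset the cursor
	}

	// Delete the rows in (cursor, upper] that have no occurrences in the
	// tables referencing the lookup table (column names are assumptions).
	_, err = db.ExecContext(ctx, `
		DELETE FROM history_claimable_balances hcb
		WHERE hcb.id > $1 AND hcb.id <= $2
		AND NOT EXISTS (
			SELECT 1 FROM history_operation_claimable_balances ocb
			WHERE ocb.history_claimable_balance_id = hcb.id
		)
		AND NOT EXISTS (
			SELECT 1 FROM history_transaction_claimable_balances tcb
			WHERE tcb.history_claimable_balance_id = hcb.id
		)`, cursor, upper.Int64)
	if err != nil {
		return 0, err
	}
	return upper.Int64, nil
}
```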
#4518 clears orphans from the lookup tables. We should:

- make sure `history_claimable_balances` is cleared out as well, and
- make sure all `history_*` tables in the DB are cleared out (in order to prevent this problem from happening to tables added in the future).