refactor: forester: add retry logic for epoch registration #1160
Conversation
Further changes are needed so that recovered epochs handle not only the active-phase work but also the subsequent epoch steps correctly.
Consolidate the repeated active-phase work processing into a new function `process_epoch_work`. Introduce `register_for_epoch_with_retry` to handle registration retries with a specified maximum number of attempts and delay duration. Remove an unnecessary info log statement when setting epoch flags.
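As a rough illustration of the retry shape described above (a minimal sketch only: the `try_register` helper, the error type, and the logging are assumptions, not the PR's actual code):

```rust
use std::time::Duration;

// Hypothetical stand-ins for the forester's real types.
struct EpochRegistration {
    epoch: u64,
}

#[derive(Debug)]
struct RegistrationError(String);

async fn try_register(epoch: u64) -> Result<EpochRegistration, RegistrationError> {
    // Placeholder for the real on-chain registration call.
    Err(RegistrationError(format!("epoch {epoch} not ready")))
}

/// Retry registration up to `max_retries` times, sleeping `retry_delay`
/// between attempts, and return the last error if all attempts fail.
async fn register_for_epoch_with_retry(
    epoch: u64,
    max_retries: u32,
    retry_delay: Duration,
) -> Result<EpochRegistration, RegistrationError> {
    let mut last_err = None;
    for attempt in 1..=max_retries {
        match try_register(epoch).await {
            Ok(registration) => return Ok(registration),
            Err(e) => {
                eprintln!("registration attempt {attempt}/{max_retries} failed: {e:?}");
                last_err = Some(e);
                // Back off before the next attempt, but not after the last one.
                if attempt < max_retries {
                    tokio::time::sleep(retry_delay).await;
                }
            }
        }
    }
    Err(last_err.unwrap_or_else(|| RegistrationError("no attempts made".into())))
}
```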
Force-pushed from ec1cd25 to 7213266.
```rust
}

// Attempt to recover registration info
let mut registration_info = match self.recover_registration_info(epoch).await {
```
What does this do if there is no registration that can be recovered?
If there is no registration, we'll try to register here: https://github.com/Lightprotocol/light-protocol/pull/1160/files/7213266f76690b8931a73e01055af9123b07b22a#diff-1e165bd3189acab768bdb1cd6cf4ec33528400511ec68498004b73a25d0646c3R280-R281
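A hedged sketch of the recover-then-register flow discussed here: `recover_registration_info` and `register_for_epoch_with_retry` follow the PR description, but the `Forester` struct, `RegistrationInfo`, the retry defaults, and the combining method name `ensure_registration` are illustrative stand-ins.

```rust
use std::time::Duration;

// Illustrative stand-ins for the forester's real types.
struct RegistrationInfo {
    epoch: u64,
}

#[derive(Debug)]
struct ForesterError(String);

struct Forester;

// Assumed defaults; the real code passes its own max attempts and delay.
const MAX_RETRIES: u32 = 3;
const RETRY_DELAY: Duration = Duration::from_secs(10);

impl Forester {
    async fn recover_registration_info(
        &self,
        epoch: u64,
    ) -> Result<RegistrationInfo, ForesterError> {
        // Placeholder: the real code looks up the existing epoch registration on chain.
        Err(ForesterError(format!("no registration found for epoch {epoch}")))
    }

    async fn register_for_epoch_with_retry(
        &self,
        epoch: u64,
        _max_retries: u32,
        _retry_delay: Duration,
    ) -> Result<RegistrationInfo, ForesterError> {
        // Placeholder for the retry wrapper sketched earlier.
        Ok(RegistrationInfo { epoch })
    }

    /// Try to recover an existing registration first; if nothing can be
    /// recovered, fall back to a fresh registration with retries.
    async fn ensure_registration(&self, epoch: u64) -> Result<RegistrationInfo, ForesterError> {
        match self.recover_registration_info(epoch).await {
            Ok(info) => Ok(info),
            Err(_) => {
                self.register_for_epoch_with_retry(epoch, MAX_RETRIES, RETRY_DELAY)
                    .await
            }
        }
    }
}
```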
```rust
return Err(ForesterError::Custom(format!(
    "Too late to register for epoch {}. Current slot: {}, Registration end: {}",
    epoch, slot, phases.registration.end
)));
```
Do we really want to throw in this case?
An alternative behavior could be to wait for the next registration period.
Or just return if we have logic to wait for the next epoch in a different place.
I think this behaviour is semantically correct because it's part of the `process_epoch` flow: this case really is an error, it shouldn't happen in a normal situation, and we log it as an error in our log files: https://github.com/Lightprotocol/light-protocol/pull/1160/files/7213266f76690b8931a73e01055af9123b07b22a#diff-1e165bd3189acab768bdb1cd6cf4ec33528400511ec68498004b73a25d0646c3R147
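To make the trade-off concrete, here is a small sketch of the guard and of a caller that logs the error and moves on rather than crashing; the `Phases` layout, the `>=` comparison, and the logging call are assumptions, and only the error message comes from the diff above.

```rust
// Illustrative types; the real phases come from the protocol's epoch schedule.
struct Phase {
    end: u64,
}

struct Phases {
    registration: Phase,
}

#[derive(Debug)]
enum ForesterError {
    Custom(String),
}

/// Fail if the registration window for `epoch` has already closed
/// (assuming `registration.end` is the first slot past the window).
fn check_registration_window(epoch: u64, slot: u64, phases: &Phases) -> Result<(), ForesterError> {
    if slot >= phases.registration.end {
        return Err(ForesterError::Custom(format!(
            "Too late to register for epoch {}. Current slot: {}, Registration end: {}",
            epoch, slot, phases.registration.end
        )));
    }
    Ok(())
}

fn process_epoch(epoch: u64, slot: u64, phases: &Phases) {
    // The caller treats a closed window as an error to log, not a reason to crash.
    if let Err(e) = check_registration_window(epoch, slot, phases) {
        eprintln!("error while processing epoch {epoch}: {e:?}");
    }
}
```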
Replaced multiple calls to `slot_tracker.estimated_current_slot` with `sync_slot` to ensure accurate slot synchronization. Updated `sync_slot` to return the current slot and adjusted the calling logic accordingly to maintain correct epoch phase handling.
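A rough sketch of why returning the slot from `sync_slot` helps; the `SlotTracker` shape and the RPC fetch below are assumptions about the surrounding code, not the crate's actual API.

```rust
use std::sync::atomic::{AtomicU64, Ordering};

// Illustrative tracker; the real SlotTracker lives in the forester crate.
struct SlotTracker {
    last_known_slot: AtomicU64,
}

impl SlotTracker {
    fn update(&self, slot: u64) {
        self.last_known_slot.store(slot, Ordering::Relaxed);
    }

    // The read that callers previously combined with a separate sync call.
    fn estimated_current_slot(&self) -> u64 {
        self.last_known_slot.load(Ordering::Relaxed)
    }
}

async fn fetch_slot_from_rpc() -> u64 {
    // Placeholder for the real RPC call that returns the chain's current slot.
    42
}

/// Sync the tracker and hand the freshly fetched slot back to the caller,
/// so callers no longer need a separate `estimated_current_slot()` read
/// that could race with other updates.
async fn sync_slot(tracker: &SlotTracker) -> u64 {
    let slot = fetch_slot_from_rpc().await;
    tracker.update(slot);
    slot
}
```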