BUG: Waiting on wrong key #24

tstone · 2021-06-24T21:39:40Z

Background

A bug was identified earlier today where the wait_for_new_value method is incorrectly waiting on the wrong key. This bugfix updates the code to re-fetch the latest key value whenever it attempts to see if another thread has written the value. This makes sure that re-attempts are always attempting to get the latest value.

Tasks

Code of Conduct reviewed
Specs written and passing
Backwards-incompatible changes called out in this PR
Increment the version file (./lib/atomic_cache/version.rb) when applicable
- See semver.org for what segment to increment

tstone · 2021-06-24T21:42:46Z

lib/atomic_cache/atomic_cache_client.rb

      return value if value.present?

      # wait for the other process if a last known value isn't there
      if key.present?
        return time('wait.run', tags: tags) do
-          wait_for_new_value(key, options, tags)
+          wait_for_new_value(keyspace, options, tags)


quick_retry and wait_for_new_value changed their approach here a bit:

quick_retry now uses the initial key obtained on line 40, as very little time has passed, and it's more efficient for all the locations that have quick_retry disabled.

wait_for_new_value no longer takes the key, as time will have elapsed between attempts, such that it should make sure it's looking at the most recent key for a value.

tstone · 2021-06-24T21:44:30Z

lib/atomic_cache/atomic_cache_client.rb

@@ -162,6 +159,8 @@ def wait_for_new_value(key, options, tags)
        backoff_duration_ms = option(:backoff_duration_ms, options, backoff_duration_ms)
        sleep((backoff_duration_ms.to_f / 1000) * attempt)

+        # re-fetch the key each time, to make sure we're actually getting the latest key with the correct LMT
+        key = @timestamp_manager.current_key(keyspace)


This line here is effectively the bug fix. Each attempt it will make sure to get the most current key.

How this is actually working in a cold start scenario is that because LMT is empty, the initial key is just prefix.foobar. The value that actually gets written by the generating process however is prefix.foobar.timestamp. The old code would continue to try and read prefix.foobar then exhaust it's attempts. This fix causes it to always grab the most current key, which will cause it to see prefix.foobar.timestamp as written by another generating process, and use that key for a value.

tstone · 2021-06-24T21:45:08Z

spec/atomic_cache/integration/waiting_spec.rb

+  end
+end
+
+


be warned: there be dragons below 🐉

🙈

lebeerman

The crux of this update generally makes sense - the integration test is really cool, but noting I'm not 100% about threadsafe-ness and how all the process queue management is working.

tstone added 2 commits June 24, 2021 15:35

don’t use (incorrect) initial key to wait for new value

43c5d5f

fixed reference to non-existing key value in log

8fc829c

tstone added the bug Something isn't working label Jun 24, 2021

Use initial key since nothing has changed

7165757

tstone commented Jun 24, 2021

View reviewed changes

tstone marked this pull request as ready for review June 24, 2021 21:46

lebeerman approved these changes Jun 24, 2021

View reviewed changes

tstone merged commit 6b10412 into main Jun 25, 2021

tstone deleted the bugfix/MTF-559-waiting-different-keys branch June 25, 2021 15:49

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BUG: Waiting on wrong key #24

BUG: Waiting on wrong key #24

tstone commented Jun 24, 2021

tstone Jun 24, 2021

tstone Jun 24, 2021

tstone Jun 24, 2021 •

edited

Loading

lebeerman left a comment

BUG: Waiting on wrong key #24

BUG: Waiting on wrong key #24

Conversation

tstone commented Jun 24, 2021

Background

Tasks

tstone Jun 24, 2021

Choose a reason for hiding this comment

tstone Jun 24, 2021

Choose a reason for hiding this comment

tstone Jun 24, 2021 • edited Loading

Choose a reason for hiding this comment

lebeerman left a comment

Choose a reason for hiding this comment

tstone Jun 24, 2021 •

edited

Loading