Safe(r) minting of `CapabilityRef`s #429

petrosagg · 2021-11-02T22:54:02Z

Background

As part of progress tracking timely keeps track of three kinds of timestamp frequencies for each operator. Timestamps at the inputs (the consumeds part of SharedProgress), internal timestamps (the internals part of SharedProgress) and timestamps at the output (the produceds part of SharedProgress). A timely operator must carefully manipulate those frequencies in order to let timely know at what times data might be produced from the operator. An operator can misuse these frequencies if for example it clears a consumed timestamp t1 without adding it to its internal timestamps and later decides that it wants to produce data at t1.

With the exception of the raw operator builder timely provides a safe API that ensures timestamp frequency counts are correct via the CapabilityRef and Capability types. These act as a witness of the existence of a consumed or internal timestamp respectively.

CapabilityRef misuse

The Capability type which represents an internal timestamp contains within it a shared reference to the internal timestamp frequency counts and it automatically subtracts its timestamp from the counts on Drop. This allows users to keep a Capability around in whatever way they wish and have the internal operator timestamp counts follow accordingly.

However, the same is not true for the CapabilityRef type. When a CapabilityRef is minted its timestamp is immediately subtracted from consumed and provided to the user. This is a problem because if the operator logic held onto the CapabilityRef then it could later produce data that is behind the frontier.

In practice, holding onto a CapabilityRef across operator invocations is quite difficult as the user will be tasked with storing an input handle and its produced CapabilityRef (which is lifetimed) at the same time. Self-referencial structs are notoriously difficult (but not impossible nor unsound) to do in Rust and so synchronous operators don't usually face this problem.

This situation becomes trivially possible with an async operator. In async rust, where the compiler is tasked with generating a struct that captures the stack of a future at any given yield point, making this self-referencial is easy. A user simply needs to obtain a CapabilityRef from an input handle and then await on something. When this yield point is reached the current stack will be preserved, keeping the CapabilityRef alive across timely invocations.

Solution in this PR

The solution this PR implements is to make CapabilityRef behave in the same way as a Capability. Instead of relying on the difficulty of self-referencial structs in Rust this PR adds an extra guard field in CapabilityRef that when dropped it will update the consumed timestamps of the operator accordingly. This allows users to keep CapabilityRefs around for as long as they wish.

petrosagg · 2021-11-10T14:44:36Z

I just built materialize with this change to get further confidence that it is correct. Tests passed :) MaterializeInc/materialize#9023

CapabilityRefs are valid to exist as long as the data in the input are not marked as consumed. This change makes sure that this is the case by including an extra drop guard in the capability ref. Signed-off-by: Petros Angelatos <petrosagg@gmail.com>

frankmcsherry · 2022-09-07T15:12:04Z

Looks good; thanks!

After TimelyDataflow/timely-dataflow#429 holding onto CapabilityRefs across await points is safe Signed-off-by: Petros Angelatos <petrosagg@gmail.com>

Since the merge of TimelyDataflow#429, `CapabilityRef`s have been made safe to hold onto across operator invocations because that PR made sure that they only decremented their progress counts on `Drop`. While this allowed `async`/`await` based operators to freely hold on to them, it was still very difficult for synchronous based operators to do the same thing, due to the lifetime attached to the `CapabilityRef`. Since `CapabilityRef`s can now be held arbitrarily long, the lifetime constraint on them can be lifted and therefore made into a `'static` value. No extra clones of the timestamps were needed for this change. After making this change, the name `CapabilityRef` felt wrong, since there is no reference to anything anymore. Instead, the main distinction between `CapabilityRef`s and `Capabilities` are that the former is associated with an input port and the latter is associated with an output port. As such, I have renamed `CapabilityRef` to `InputCapability` to signal to users that holding onto one of them represents holding onto a timestamp at the input for which we have not yet determined the output port that it should flow to. This nicely ties up the semantics of the `InputCapability::retain_for_output` and `InputCapability::delayed_for_output` methods, which make it clear by their name and signature that this is what "transfers" the capability from input ports to output ports. Signed-off-by: Petros Angelatos <petrosagg@gmail.com>

Since the merge of TimelyDataflow#429, `CapabilityRef`s have been made safe to hold onto across operator invocations because that PR made sure that they only decremented their progress counts on `Drop`. While this allowed `async`/`await` based operators to freely hold on to them, it was still very difficult for synchronous based operators to do the same thing, due to the lifetime attached to the `CapabilityRef`. We can observe that the lifetime no longer provides any benefits, which means it can be removed and turn `CapabilityRef`s into fully owned values. This allows any style of operator to easily hold on to them. The benefit of that isn't just performance (by avoiding the `retain()` dance), but also about deferring the decision of the output port a given input should flow to to a later time. After making this change, the name `CapabilityRef` felt wrong, since there is no reference to anything anymore. Instead, the main distinction between `CapabilityRef`s and `Capabilities` are that the former is associated with an input port and the latter is associated with an output port. As such, I have renamed `CapabilityRef` to `InputCapability` to signal to users that holding onto one of them represents holding onto a timestamp at the input for which we have not yet determined the output port that it should flow to. This nicely ties up the semantics of the `InputCapability::retain_for_output` and `InputCapability::delayed_for_output` methods, which make it clear by their name and signature that this is what "transfers" the capability from input ports to output ports. Signed-off-by: Petros Angelatos <petrosagg@gmail.com>

Since the merge of #429, `CapabilityRef`s have been made safe to hold onto across operator invocations because that PR made sure that they only decremented their progress counts on `Drop`. While this allowed `async`/`await` based operators to freely hold on to them, it was still very difficult for synchronous based operators to do the same thing, due to the lifetime attached to the `CapabilityRef`. We can observe that the lifetime no longer provides any benefits, which means it can be removed and turn `CapabilityRef`s into fully owned values. This allows any style of operator to easily hold on to them. The benefit of that isn't just performance (by avoiding the `retain()` dance), but also about deferring the decision of the output port a given input should flow to to a later time. After making this change, the name `CapabilityRef` felt wrong, since there is no reference to anything anymore. Instead, the main distinction between `CapabilityRef`s and `Capabilities` are that the former is associated with an input port and the latter is associated with an output port. As such, I have renamed `CapabilityRef` to `InputCapability` to signal to users that holding onto one of them represents holding onto a timestamp at the input for which we have not yet determined the output port that it should flow to. This nicely ties up the semantics of the `InputCapability::retain_for_output` and `InputCapability::delayed_for_output` methods, which make it clear by their name and signature that this is what "transfers" the capability from input ports to output ports. Signed-off-by: Petros Angelatos <petrosagg@gmail.com>

petrosagg force-pushed the safe-capabilityref branch 4 times, most recently from 668b490 to 0929592 Compare November 3, 2021 00:01

petrosagg force-pushed the safe-capabilityref branch from 0929592 to 2862f9d Compare November 10, 2021 13:47

petrosagg mentioned this pull request Nov 10, 2021

Test timely capability ref PR MaterializeInc/materialize#9023

Closed

petrosagg force-pushed the safe-capabilityref branch from 2862f9d to a7bf67f Compare December 1, 2021 17:36

petrosagg mentioned this pull request Sep 5, 2022

timely-util: no nonsense timely/async bridge MaterializeInc/materialize#14630

Merged

petrosagg force-pushed the safe-capabilityref branch from a7bf67f to e49dccb Compare September 6, 2022 14:09

frankmcsherry merged commit 9548bf9 into TimelyDataflow:master Sep 7, 2022

petrosagg added a commit to petrosagg/materialize that referenced this pull request Sep 12, 2022

storage: use CapabilityRefs directly

66af74c

After TimelyDataflow/timely-dataflow#429 holding onto CapabilityRefs across await points is safe Signed-off-by: Petros Angelatos <petrosagg@gmail.com>

petrosagg mentioned this pull request Sep 12, 2022

storage: use CapabilityRefs directly MaterializeInc/materialize#14781

Merged

petrosagg added a commit to petrosagg/materialize that referenced this pull request Sep 13, 2022

storage: use CapabilityRefs directly

2d7b161

After TimelyDataflow/timely-dataflow#429 holding onto CapabilityRefs across await points is safe Signed-off-by: Petros Angelatos <petrosagg@gmail.com>

petrosagg mentioned this pull request Dec 22, 2022

timely: unconstrained lifetime for CapabilityRef #491

Merged

github-actions bot mentioned this pull request Oct 29, 2024

chore: release #594

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Safe(r) minting of `CapabilityRef`s #429

Safe(r) minting of `CapabilityRef`s #429

petrosagg commented Nov 2, 2021 •

edited

Loading

petrosagg commented Nov 10, 2021

frankmcsherry commented Sep 7, 2022

Safe(r) minting of CapabilityRefs #429

Safe(r) minting of CapabilityRefs #429

Conversation

petrosagg commented Nov 2, 2021 • edited Loading

Background

CapabilityRef misuse

Solution in this PR

petrosagg commented Nov 10, 2021

frankmcsherry commented Sep 7, 2022

Safe(r) minting of `CapabilityRef`s #429

Safe(r) minting of `CapabilityRef`s #429

petrosagg commented Nov 2, 2021 •

edited

Loading