Race condition in futures::sync::mpsc #909

Closed
simmons opened this issue Mar 25, 2018 · 1 comment · Fixed by #2304
Labels: A-channel (Area: futures::channel), bug

Comments


simmons commented Mar 25, 2018

It looks like there's a race condition in the MPSC implementation in futures 0.1.19, unless my understanding of the intended behavior is incorrect. If a Sender::try_send() happens concurrently with a Receiver close/drop, it's possible for try_send() to return Ok(()) even though the item can never be received and will not be dropped until the Sender is dropped. I expected to receive an error matching Err(ref e) if e.is_disconnected(). The Receiver's Drop implementation closes the channel and drains any items present, but this can apparently happen just before the Sender believes it has successfully enqueued the item.
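
A minimal sketch of the expected behavior (illustrative only; the function and names below are placeholders, not code from the failing program):

```rust
extern crate futures; // futures 0.1

use futures::sync::mpsc::Sender;

// Illustrative sketch: what a caller expects from try_send() when the
// Receiver side has been closed or dropped.
fn send_or_report(tx: &mut Sender<u32>, item: u32) -> Result<(), &'static str> {
    match tx.try_send(item) {
        // Enqueued: the item should be received, or at worst dropped when
        // the Receiver drains the channel on close.
        Ok(()) => Ok(()),
        // Receiver closed or dropped: the error expected in this scenario.
        Err(ref e) if e.is_disconnected() => Err("disconnected"),
        // Otherwise the bounded channel is currently full.
        Err(_) => Err("full"),
    }
}
```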

I wrote a small program to stress-test this scenario and demonstrate the bug:
https://github.com/simmons/mpsc-stress

I discovered this behavior while stress testing a program which implements a synchronous API by sending a command plus a oneshot::Sender over a futures MPSC channel to a task that processes the command and completes the oneshot when done. When the described race occurs, the sending thread deadlocks, waiting forever for the oneshot::Receiver to either yield a value or indicate disconnection.
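
The pattern looks roughly like this (a hedged sketch; Command, call_sync, and the error handling are illustrative names, not the actual program):

```rust
extern crate futures; // futures 0.1

use futures::sync::{mpsc, oneshot};
use futures::Future;

// Illustrative command type: the worker task completes `done` when finished.
struct Command {
    // ... request payload elided ...
    done: oneshot::Sender<()>,
}

// Synchronous front-end: enqueue a command, then block on the oneshot.
fn call_sync(tx: &mut mpsc::Sender<Command>) -> Result<(), ()> {
    let (done_tx, done_rx) = oneshot::channel();
    match tx.try_send(Command { done: done_tx }) {
        // try_send() reported success, so we block until the worker answers.
        // If the Receiver was closed in the race window, the Command is
        // stranded inside the channel: its oneshot::Sender is neither
        // completed nor dropped, so this wait() never returns.
        Ok(()) => done_rx.wait().map_err(|_| ()),
        // Disconnected (or full): report the failure instead of blocking.
        Err(_) => Err(()),
    }
}
```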

If I get a chance in the next few days, I'll see if I can root-cause the problem. (Unless I'm told that my expectation is incorrect.)


simmons commented Mar 27, 2018

I looked through the futures::sync::mpsc code and confirmed what is happening. When a new item is submitted via try_send(), the Sender checks that the channel is open, then enqueues the message -- but the channel can actually be closed between these two steps. Specifically (a simplified sketch of this window follows the list):

  1. On Thread 1, do_send() (via try_send()) is called to submit a message.
  2. inc_num_messages() determines that the channel is open and a new message can be enqueued.
  3. On Thread 2, the Receiver is closed, which marks the channel as closed, and all remaining messages are drained. This could be done explicitly by the application (e.g. the "clean shutdown" described in the documentation), or by way of the Drop implementation.
  4. Back on Thread 1, do_send() adds the new message to the queue and returns Ok(()) to indicate that the message has successfully been sent.
  5. That message is now stuck in a channel with no Receiver. It can never be received and will not be dropped (unless the Sender is dropped, of course). Because try_send() returns Ok(()) instead of Err(TrySendError { kind: TrySendErrorKind::Disconnected(_) }), the caller may reasonably assume that the message is deliverable or will be properly dropped should the Receiver close.
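
A simplified model of that window (this is not the actual futures::sync::mpsc implementation, which is lock-free rather than mutex-based; it only illustrates the check-then-enqueue gap described above):

```rust
use std::sync::atomic::{AtomicBool, Ordering};
use std::sync::Mutex;

// Toy model of the race, not the real channel internals.
struct Channel<T> {
    open: AtomicBool,
    queue: Mutex<Vec<T>>, // stand-in for the real lock-free queue
}

impl<T> Channel<T> {
    fn try_send(&self, msg: T) -> Result<(), T> {
        // Step 2 above: observe that the channel is still open
        // (inc_num_messages() plays this role in the real code).
        if !self.open.load(Ordering::SeqCst) {
            return Err(msg);
        }
        // <-- Step 3 can run right here: close() flips the flag and drains.
        // Step 4: enqueue anyway and report success; the message is stranded.
        self.queue.lock().unwrap().push(msg);
        Ok(())
    }

    fn close(&self) {
        self.open.store(false, Ordering::SeqCst);
        self.queue.lock().unwrap().clear(); // drain remaining messages
    }
}
```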

Just to test, I added a Mutex to prevent Sender::do_send() and Receiver::close() from executing concurrently, and confirmed that this fixes the problem (a sketch of that experiment, applied to the toy model above, is below). Obviously, that runs counter to the lock-free design goal of the MPSC code.
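
For illustration only, here is the same toy model with the send path and the close path serialized by a single lock; this closes the window but gives up the lock-free property:

```rust
use std::sync::Mutex;

// Toy model again, not the real internals: one lock guards both the
// open flag and the queue, so close-and-drain cannot interleave with
// the open-check and the enqueue.
struct LockedChannel<T> {
    inner: Mutex<(bool /* open */, Vec<T> /* queue */)>,
}

impl<T> LockedChannel<T> {
    fn try_send(&self, msg: T) -> Result<(), T> {
        let mut inner = self.inner.lock().unwrap();
        if !inner.0 {
            return Err(msg); // now reliably reports "disconnected"
        }
        inner.1.push(msg);
        Ok(())
    }

    fn close(&self) {
        let mut inner = self.inner.lock().unwrap();
        inner.0 = false;
        inner.1.clear(); // drain; nothing can be enqueued after this point
    }
}
```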
