Don't hold mutex until release cv in cv_wait #519

tuxoko · 2016-01-07T03:16:41Z

If a thread is holding mutex when doing cv_destroy, it might end up waiting a
thread in cv_wait. The waiter would wake up trying to aquire the same mutex
and cause deadlock.

We solve this by move the mutex_enter to the bottom of cv_wait, so that
the waiter will release the cv first, allowing cv_destroy to succeed and have
a chance to free the mutex.

This would create race condition on the cv_mutex. We use xchg to set and check
it to ensure we won't be harmed by the race. This would result in the cv_mutex
debugging becomes best-effort.

Also, the change reveals a race, which was unlikely before, where we call
mutex_destroy while test threads are still holding the mutex. We use
kthread_stop to make sure the threads are exit before mutex_destroy.

Signed-off-by: Chunwei Chen tuxoko@gmail.com

tuxoko · 2016-01-07T03:19:01Z

Fix openzfs/zfs#4166 and second part of openzfs/zfs#4106

dweeezil · 2016-01-07T13:57:48Z

I will try my existing tests with this.

tuxoko · 2016-01-07T19:28:58Z

Update: Change ASSERT to remove if and modify __cv_timedwait_hires.

behlendorf · 2016-01-07T19:41:56Z

@tuxoko nice! This isn't nearly as disruptive as I'd feared. The ref counts nicely ensure the memory remains valid while cv_wait (and friends) finish up. Moving the mutex to the end of the function removes the lock inversion. We've always relied on the caller not destroying the mutex prematurely so there no new concern there. That said, this kind of thing can be subtle so we'll definitely want to stress the new code... and it sounds like @dweeezil is already on that. Awesome, thanks guys!

dweeezil · 2016-01-07T22:36:36Z

Looks good, see openzfs/zfs#4106 (comment).

tuxoko · 2016-01-07T22:38:09Z

The splat complains a lot in openzfs/zfs#4173
I'll need to look into it.

If a thread is holding mutex when doing cv_destroy, it might end up waiting a thread in cv_wait. The waiter would wake up trying to aquire the same mutex and cause deadlock. We solve this by move the mutex_enter to the bottom of cv_wait, so that the waiter will release the cv first, allowing cv_destroy to succeed and have a chance to free the mutex. This would create race condition on the cv_mutex. We use xchg to set and check it to ensure we won't be harmed by the race. This would result in the cv_mutex debugging becomes best-effort. Also, the change reveals a race, which was unlikely before, where we call mutex_destroy while test threads are still holding the mutex. We use kthread_stop to make sure the threads are exit before mutex_destroy. Signed-off-by: Chunwei Chen <tuxoko@gmail.com>

dweeezil · 2016-01-11T14:36:12Z

@tuxoko What are your thoughts on this so far? I didn't look at the buildbot errors but I can easily get an assert from condvar:broadcast1 in debug builds because mutex is not held. Is this what the bots are triggering? So far, I've not seen the assert in testing with zfs, however, it would seem that zio_wait() might be able to trigger it and also possibly other places.

tuxoko · 2016-01-11T17:57:04Z

I've already fixed that. It's a preexisting race in splat. It's just my patch makes it almost always occur. zio_wait is absolutely safe from this. Whether there exist such race, I'm not 100 percent sure.

tuxoko · 2016-01-11T18:55:11Z

@dweeezil
Now I got to my desktop, I can explain more clearly.
The problem in splat code is that the waker (cv_broadcast, cv_signal) doing free (cv_destroy and mutex_destroy). This type of construct is always wrong, because you don't know when the waiter will release the mutex.

behlendorf · 2016-01-12T23:13:31Z

@tuxoko @dweeezil the updated fix to include the test cases LGTM. Either of you have reservations about merging this?

tuxoko · 2016-01-12T23:22:08Z

@behlendorf
I'd say go for it.

dweeezil · 2016-01-12T23:26:21Z

@behlendorf No problems during high stress testing. LGTM.

behlendorf · 2016-01-12T23:34:05Z

Great, thanks for quick reply. Merged as:

e843553 Don't hold mutex until release cv in cv_wait

tuxoko force-pushed the cv_mutex branch from 2d0b95c to 08fbc02 Compare January 7, 2016 19:20

tuxoko force-pushed the cv_mutex branch from 08fbc02 to eb2778e Compare January 7, 2016 23:16

behlendorf closed this Jan 12, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Don't hold mutex until release cv in cv_wait #519

Don't hold mutex until release cv in cv_wait #519

tuxoko commented Jan 7, 2016

tuxoko commented Jan 7, 2016

dweeezil commented Jan 7, 2016

tuxoko commented Jan 7, 2016

behlendorf commented Jan 7, 2016

dweeezil commented Jan 7, 2016

tuxoko commented Jan 7, 2016

dweeezil commented Jan 11, 2016

tuxoko commented Jan 11, 2016 via email

tuxoko commented Jan 11, 2016

behlendorf commented Jan 12, 2016

tuxoko commented Jan 12, 2016

dweeezil commented Jan 12, 2016

behlendorf commented Jan 12, 2016

Don't hold mutex until release cv in cv_wait #519

Don't hold mutex until release cv in cv_wait #519

Conversation

tuxoko commented Jan 7, 2016

tuxoko commented Jan 7, 2016

dweeezil commented Jan 7, 2016

tuxoko commented Jan 7, 2016

behlendorf commented Jan 7, 2016

dweeezil commented Jan 7, 2016

tuxoko commented Jan 7, 2016

dweeezil commented Jan 11, 2016

tuxoko commented Jan 11, 2016 via email

tuxoko commented Jan 11, 2016

behlendorf commented Jan 12, 2016

tuxoko commented Jan 12, 2016

dweeezil commented Jan 12, 2016

behlendorf commented Jan 12, 2016