[SPARK-18406][CORE][Backport-2.0] Race between end-of-task and completion iterator read lock release #18096

jiangxb1987 · 2017-05-24T19:05:33Z

This is a backport PR of #18076 to 2.0 and 2.1.

What changes were proposed in this pull request?

When a TaskContext is not propagated properly to all child threads for the task, as in the cases reported in this issue, we fail to get the TID from the TaskContext. As a result, the read lock cannot be released and assertions fail. To resolve this, we explicitly pass the TID value to the unlock method.

How was this patch tested?

Added a new regression test case in RDDSuite that fails without this patch.

…read lock release

When a TaskContext is not propagated properly to all child threads for the task, as in the cases reported in this issue, we fail to get the TID from the TaskContext. As a result, the read lock cannot be released and assertions fail. To resolve this, we explicitly pass the TID value to the `unlock` method.

Add new failing regression test case in `RDDSuite`.

Author: Xingbo Jiang <xingbo.jiang@databricks.com>

Closes apache#18076 from jiangxb1987/completion-iterator.

jiangxb1987 · 2017-05-24T19:06:27Z

cc @gatorsmile @cloud-fan

SparkQA · 2017-05-24T21:12:39Z

Test build #77308 has finished for PR 18096 at commit c85afb2.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

gatorsmile

LGTM

…tion iterator read lock release

This is a backport PR of #18076 to 2.0 and 2.1.

## What changes were proposed in this pull request?

When a TaskContext is not propagated properly to all child threads for the task, as in the cases reported in this issue, we fail to get the TID from the TaskContext. As a result, the read lock cannot be released and assertions fail. To resolve this, we explicitly pass the TID value to the `unlock` method.

## How was this patch tested?

Add new failing regression test case in `RDDSuite`.

Author: Xingbo Jiang <xingbo.jiang@databricks.com>

Closes #18096 from jiangxb1987/completion-iterator-2.0.

gatorsmile · 2017-05-24T21:35:40Z

Thanks! Merging to 2.0. Could you close it?

gatorsmile approved these changes May 24, 2017

jiangxb1987 closed this May 24, 2017
