[SPARK-23316][SQL] AnalysisException after max iteration reached for IN query #20548

bogdanrdc · 2018-02-08T15:21:54Z

What changes were proposed in this pull request?

Added flag ignoreNullability to DataType.equalsStructurally.
The previous semantic is for ignoreNullability=false.
When ignoreNullability=true equalsStructurally ignores nullability of contained types (map key types, value types, array element types, structure field types).
In.checkInputTypes calls equalsStructurally to check if the children types match. They should match regardless of nullability (which is just a hint), so it is now called with ignoreNullability=true.

How was this patch tested?

New test in SubquerySuite

SparkQA · 2018-02-08T18:30:46Z

Test build #87218 has finished for PR 20548 at commit 367c70b.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

gatorsmile · 2018-02-13T01:00:21Z

This sounds another regression in Spark 2.3. cc @cloud-fan @dilipbiswal

dilipbiswal · 2018-02-13T01:17:11Z

@gatorsmile Yeah. Its due to the changes made for SPARK-21759. The fix looks okay to me. One other aspect would be to stop adding the additional projects we keep adding during in IN Coercion.

cloud-fan · 2018-02-13T03:10:21Z

The fix LGTM. cc @sameeragarwal

gatorsmile · 2018-02-13T05:01:24Z

retest this please

gatorsmile · 2018-02-13T05:03:15Z

sql/catalyst/src/main/scala/org/apache/spark/sql/types/DataType.scala

@@ -298,22 +298,24 @@ object DataType {
   * Returns true if the two data types share the same "shape", i.e. the types (including
   * nullability) are the same, but the field names don't need to be the same.
   */
-  def equalsStructurally(from: DataType, to: DataType): Boolean = {
+  def equalsStructurally(from: DataType, to: DataType,
+      ignoreNullability: Boolean = false): Boolean = {


Nit: the indents.

gatorsmile · 2018-02-13T05:04:43Z

sql/catalyst/src/main/scala/org/apache/spark/sql/types/DataType.scala

@@ -298,22 +298,24 @@ object DataType {
   * Returns true if the two data types share the same "shape", i.e. the types (including
   * nullability) are the same, but the field names don't need to be the same.


This comments need an update too.

SparkQA · 2018-02-13T08:05:01Z

Test build #87364 has finished for PR 20548 at commit 367c70b.

This patch fails due to an unknown error code, -9.
This patch merges cleanly.
This patch adds no public classes.

cloud-fan · 2018-02-13T08:42:55Z

retest this please

cloud-fan · 2018-02-13T08:43:53Z

Anyone knows which commit introduced this bug?

dilipbiswal · 2018-02-13T08:45:12Z

@cloud-fan Its SPARK-21759. Apart from fixing the bug, this PR also refactored the code in checkInputTypes and that seemed to have caused it.

SparkQA · 2018-02-13T11:53:20Z

Test build #87381 has finished for PR 20548 at commit 367c70b.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

…IN query ## What changes were proposed in this pull request? Added flag ignoreNullability to DataType.equalsStructurally. The previous semantic is for ignoreNullability=false. When ignoreNullability=true equalsStructurally ignores nullability of contained types (map key types, value types, array element types, structure field types). In.checkInputTypes calls equalsStructurally to check if the children types match. They should match regardless of nullability (which is just a hint), so it is now called with ignoreNullability=true. ## How was this patch tested? New test in SubquerySuite Author: Bogdan Raducanu <bogdan@databricks.com> Closes #20548 from bogdanrdc/SPARK-23316. (cherry picked from commit 05d0512) Signed-off-by: gatorsmile <gatorsmile@gmail.com>

gatorsmile · 2018-02-13T17:52:34Z

This is a regression introduced by #18968. We have to merge to 2.3. I resolved my comments when I merge the codes.

Thanks! Merged to master/2.3

fix + test

367c70b

gatorsmile reviewed Feb 13, 2018

View reviewed changes

asfgit closed this in 05d0512 Feb 13, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SPARK-23316][SQL] AnalysisException after max iteration reached for IN query #20548

[SPARK-23316][SQL] AnalysisException after max iteration reached for IN query #20548

bogdanrdc commented Feb 8, 2018

SparkQA commented Feb 8, 2018

gatorsmile commented Feb 13, 2018

dilipbiswal commented Feb 13, 2018 •

edited

Loading

cloud-fan commented Feb 13, 2018

gatorsmile commented Feb 13, 2018

gatorsmile Feb 13, 2018

gatorsmile Feb 13, 2018

SparkQA commented Feb 13, 2018

cloud-fan commented Feb 13, 2018

cloud-fan commented Feb 13, 2018

dilipbiswal commented Feb 13, 2018 •

edited

Loading

SparkQA commented Feb 13, 2018

gatorsmile commented Feb 13, 2018

		@@ -298,22 +298,24 @@ object DataType {
		* Returns true if the two data types share the same "shape", i.e. the types (including
		* nullability) are the same, but the field names don't need to be the same.

[SPARK-23316][SQL] AnalysisException after max iteration reached for IN query #20548

[SPARK-23316][SQL] AnalysisException after max iteration reached for IN query #20548

Conversation

bogdanrdc commented Feb 8, 2018

What changes were proposed in this pull request?

How was this patch tested?

SparkQA commented Feb 8, 2018

gatorsmile commented Feb 13, 2018

dilipbiswal commented Feb 13, 2018 • edited Loading

cloud-fan commented Feb 13, 2018

gatorsmile commented Feb 13, 2018

gatorsmile Feb 13, 2018

Choose a reason for hiding this comment

gatorsmile Feb 13, 2018

Choose a reason for hiding this comment

SparkQA commented Feb 13, 2018

cloud-fan commented Feb 13, 2018

cloud-fan commented Feb 13, 2018

dilipbiswal commented Feb 13, 2018 • edited Loading

SparkQA commented Feb 13, 2018

gatorsmile commented Feb 13, 2018

dilipbiswal commented Feb 13, 2018 •

edited

Loading

dilipbiswal commented Feb 13, 2018 •

edited

Loading