[SQL] Decrease partitions when testing #2164

marmbrus · 2014-08-27T19:44:11Z

No description provided.

rxin · 2014-08-27T20:00:05Z

sql/core/src/main/scala/org/apache/spark/sql/test/TestSQLContext.scala

+
+  /** Fewer partitions to speed up testing. */
+  override private[spark] def numShufflePartitions: Int =
+    getConf(SQLConf.SHUFFLE_PARTITIONS, "5").toInt


I don't know... I was thinking a little more parallelism might be more likely to find bugs without incurring too much overhead. I could be convinced otherwise...

There isn't really any parallelism here, since we run with "local", which is single-threaded. Increasing this from 2 to 5 simply breaks each dataset into more chunks, which are then processed sequentially.

I guess parallelism isn't really what I meant, I'm thinking more about bugs that could be related to expecting data to be copartitioned when it actually isn't.

That said, perhaps the tests should also run in local[2] or higher. We have found a couple of bugs after deploying that were the result of concurrency issues (Scala reflection... I'm looking at you :P)

+1 to run this using local[2]
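To make the partition-count point in the thread above concrete, here is a minimal sketch in plain Scala with no Spark dependency. It assumes a simple hash-modulo scheme; `PartitionSketch`, `partitionFor`, `partition`, and `coPartitioned` are hypothetical names used only for illustration and are not Spark APIs. It shows that going from 2 to 5 partitions only splits the same keys into more buckets, and that the same key can land in different bucket indices when two datasets use different partition counts, which is the kind of co-partitioning assumption a test might accidentally depend on.

```scala
// Illustrative sketch only: names here are hypothetical, not Spark APIs.
object PartitionSketch {

  /** Hash-modulo bucket index, kept non-negative for negative hash codes. */
  def partitionFor(key: Any, numPartitions: Int): Int = {
    val raw = key.hashCode % numPartitions
    if (raw < 0) raw + numPartitions else raw
  }

  /** Groups keys by bucket index; more partitions means more, smaller buckets. */
  def partition[K](keys: Seq[K], numPartitions: Int): Map[Int, Seq[K]] =
    keys.groupBy(k => partitionFor(k, numPartitions))

  /** True only if every key maps to the same bucket index on both sides. */
  def coPartitioned[K](keys: Seq[K], left: Int, right: Int): Boolean =
    keys.forall(k => partitionFor(k, left) == partitionFor(k, right))

  def main(args: Array[String]): Unit = {
    val keys = 0 until 10

    // Same data, different chunking: 2 buckets versus 5 buckets.
    println(partition(keys, 2).map { case (i, ks) => i -> ks.size })
    println(partition(keys, 5).map { case (i, ks) => i -> ks.size })

    // Mixed partition counts break the "same key, same bucket" assumption.
    println(coPartitioned(keys, 2, 5))
    println(coPartitioned(keys, 5, 5))
  }
}
```

On a single-threaded "local" master these buckets would still be processed one after another, so the bucket count changes how the data is chunked rather than how much work runs concurrently.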

SparkQA · 2014-08-27T20:36:09Z

QA tests have started for PR 2164 at commit 50aca12.

This patch merges cleanly.

SparkQA · 2014-08-27T21:57:44Z

QA tests have finished for PR 2164 at commit 50aca12.

This patch passes unit tests.
This patch merges cleanly.
This patch adds no public classes.

marmbrus · 2014-08-27T22:15:28Z

ok to test

SparkQA · 2014-08-27T22:21:22Z

QA tests have started for PR 2164 at commit b035325.

This patch merges cleanly.

SparkQA · 2014-08-27T23:25:40Z

QA tests have finished for PR 2164 at commit b035325.

This patch fails unit tests.
This patch merges cleanly.
This patch adds no public classes.

rxin · 2014-08-28T01:43:18Z

Oops some parquet tests failed

SparkQA · 2014-09-05T23:53:38Z

QA tests have started for PR 2164 at commit b035325.

This patch merges cleanly.

SparkQA · 2014-09-06T01:33:45Z

QA tests have finished for PR 2164 at commit b035325.

This patch fails unit tests.
This patch merges cleanly.
This patch adds no public classes.

rxin · 2014-09-06T06:03:26Z

Jenkins, retest this please.

SparkQA · 2014-09-06T06:43:54Z

QA tests have started for PR 2164 at commit b035325.

This patch merges cleanly.

SparkQA · 2014-09-06T07:32:51Z

QA tests have finished for PR 2164 at commit b035325.

This patch fails unit tests.
This patch merges cleanly.
This patch adds no public classes.

marmbrus · 2014-09-10T01:42:53Z

Jenkins, test this please

SparkQA · 2014-09-10T01:51:18Z

QA tests have started for PR 2164 at commit ee687cd.

This patch merges cleanly.

SparkQA · 2014-09-10T03:09:58Z

QA tests have finished for PR 2164 at commit ee687cd.

This patch fails unit tests.
This patch merges cleanly.
This patch adds the following public classes (experimental):
- case class Last(child: Expression) extends PartialAggregate with trees.UnaryNode[Expression]
- case class LastFunction(expr: Expression, base: AggregateExpression) extends AggregateFunction
- case class Abs(child: Expression) extends UnaryExpression

SparkQA · 2014-09-10T21:03:39Z

QA tests have started for PR 2164 at commit ee687cd.

This patch does not merge cleanly!

SparkQA · 2014-09-10T21:53:17Z

QA tests have started for PR 2164 at commit dc7cb6e.

This patch merges cleanly.

SparkQA · 2014-09-10T22:15:57Z

QA tests have finished for PR 2164 at commit ee687cd.

This patch fails unit tests.
This patch does not merge cleanly!

SparkQA · 2014-09-10T23:12:08Z

QA tests have finished for PR 2164 at commit dc7cb6e.

This patch fails unit tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2014-09-13T19:49:15Z

QA tests have started for PR 2164 at commit 2dabae3.

This patch merges cleanly.

SparkQA · 2014-09-13T20:58:48Z

QA tests have finished for PR 2164 at commit 2dabae3.

This patch fails unit tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2014-09-13T21:39:18Z

QA tests have started for PR 2164 at commit 2dabae3.

This patch merges cleanly.

SparkQA · 2014-09-13T21:39:19Z

QA tests have started for PR 2164 at commit 0da1e8c.

This patch merges cleanly.

SparkQA · 2014-09-13T23:01:59Z

QA tests have finished for PR 2164 at commit 2dabae3.

This patch passes unit tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2014-09-13T23:04:45Z

QA tests have finished for PR 2164 at commit 0da1e8c.

This patch passes unit tests.
This patch merges cleanly.
This patch adds no public classes.

marmbrus · 2014-09-13T23:05:49Z

I'm going to merge this to avoid more test timeouts.

Author: Michael Armbrust <michael@databricks.com>

Closes apache#2164 from marmbrus/shufflePartitions and squashes the following commits:

0da1e8c [Michael Armbrust] test hax
ef2d985 [Michael Armbrust] more test hacks.
2dabae3 [Michael Armbrust] more test fixes
0bdbf21 [Michael Armbrust] Make parquet tests less order dependent
b42eeab [Michael Armbrust] increase test parallelism
80453d5 [Michael Armbrust] Decrease partitions when testing

This PR backports apache#2843 to branch-1.1. The key difference is that this one doesn't support Hive 0.13.1 and thus always returns `0.12.0` when `spark.sql.hive.version` is queried.

6 other commits on which apache#2843 depends were also backported, they are:

- apache#2887 for `SessionState` lifecycle control
- apache#2675, apache#2823 & apache#3060 for major test suite refactoring and bug fixes
- apache#2164, for Parquet test suites updates
- apache#2493, for reading `spark.sql.*` configurations

Author: Cheng Lian <lian@databricks.com>
Author: Cheng Lian <lian.cs.zju@gmail.com>
Author: Michael Armbrust <michael@databricks.com>

Closes apache#3113 from liancheng/get-info-for-1.1 and squashes the following commits:

d354161 [Cheng Lian] Provides Spark and Hive version in HiveThriftServer2 for branch-1.1
0c2a244 [Michael Armbrust] [SPARK-3646][SQL] Copy SQL configuration from SparkConf when a SQLContext is created.
3202a36 [Michael Armbrust] [SQL] Decrease partitions when testing
7f395b7 [Cheng Lian] [SQL] Fixes race condition in CliSuite
0dd28ec [Cheng Lian] [SQL] Fixes the race condition that may cause test failure
5928b39 [Cheng Lian] [SPARK-3809][SQL] Fixes test suites in hive-thriftserver
faeca62 [Cheng Lian] [SPARK-4037][SQL] Removes the SessionState instance created in HiveThriftServer2

marmbrus mentioned this pull request Aug 27, 2014

Set Spark SQL Hive compatibility test shuffle partitions to 2. #1784

Closed

rxin reviewed Aug 27, 2014

marmbrus force-pushed the shufflePartitions branch from b035325 to ee687cd Compare September 9, 2014 01:58

marmbrus force-pushed the shufflePartitions branch from ee687cd to 0bcaafa Compare September 10, 2014 20:59

marmbrus force-pushed the shufflePartitions branch from 0bcaafa to dc7cb6e Compare September 10, 2014 21:05

marmbrus mentioned this pull request Sep 11, 2014

[SPARK-3294][SQL] Eliminates boxing costs from in-memory columnar storage #2327

Closed

3 tasks

marmbrus added 4 commits September 13, 2014 12:41

Decrease partitions when testing

80453d5

increase test parallelism

b42eeab

Make parquet tests less order dependent

0bdbf21

more test fixes

2dabae3

marmbrus force-pushed the shufflePartitions branch from dc7cb6e to 2dabae3 Compare September 13, 2014 19:43

marmbrus added 2 commits September 13, 2014 14:30

more test hacks.

ef2d985

test hax

0da1e8c

marmbrus mentioned this pull request Sep 13, 2014

[SPARK-2890][SQL] Allow reading of data when case insensitive resolution could cause possible ambiguity. #2209

Closed

asfgit closed this in 0f8c4ed Sep 13, 2014

marmbrus deleted the shufflePartitions branch September 22, 2014 19:54

scwf mentioned this pull request Nov 8, 2014

[SPARK-3971][SQL] Backport #2843 to branch-1.1 #3113

Closed
