[SPARK-1678][SPARK-1679] In-memory compression bug fix and made compression configurable, disabled by default #608

liancheng · 2014-05-01T04:50:20Z

In-memory compression is now configurable in SparkConf by the spark.sql.inMemoryCompression.enabled property, and is disabled by default.

To help code review, the bug fix is in the first commit, compression configuration is in the second one.

CompressibleColumnAccessor.hasNext and RunLengthEncoding.decoder.hasNext were not correctly implemented.

AmplabJenkins · 2014-05-01T04:52:57Z

Merged build triggered.

AmplabJenkins · 2014-05-01T04:53:06Z

Merged build started.

AmplabJenkins · 2014-05-01T06:07:23Z

Merged build finished. All automated tests passed.

AmplabJenkins · 2014-05-01T06:07:23Z

All automated tests passed.
Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/14601/

liancheng · 2014-05-01T07:38:44Z

Notice that the RLDecoder in Shark has the same bug fixed in RunLengthEncoding.decoder, and may lose repeated values. @rxin

AmplabJenkins · 2014-05-05T04:17:58Z

Merged build triggered.

AmplabJenkins · 2014-05-05T04:18:08Z

Merged build started.

AmplabJenkins · 2014-05-05T05:32:46Z

Merged build finished. All automated tests passed.

AmplabJenkins · 2014-05-05T05:32:46Z

All automated tests passed.
Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/14658/

marmbrus · 2014-05-05T19:53:13Z

This looks great! Thanks for doing this (and even splitting the commits into nice chunks).

One question is about the naming of the config option. I'm thinking something more like spark.sql.inMemoryColumnarStorage.compressed. That way we have a separate group for all "inMemoryColumnarStorage" things and it clear which version this is if we add another in-memory option. Open to other suggestions though.

@pwendell, we should try to include this in 1.0 if possible.

liancheng · 2014-05-06T01:21:37Z

Done :)

AmplabJenkins · 2014-05-06T01:22:57Z

Merged build triggered.

AmplabJenkins · 2014-05-06T01:23:06Z

Merged build started.

AmplabJenkins · 2014-05-06T02:28:36Z

Merged build finished. All automated tests passed.

AmplabJenkins · 2014-05-06T02:28:36Z

All automated tests passed.
Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/14693/

pwendell · 2014-05-06T02:38:43Z

Thanks - I can merge this.

…ession configurable, disabled by default In-memory compression is now configurable in `SparkConf` by the `spark.sql.inMemoryCompression.enabled` property, and is disabled by default. To help code review, the bug fix is in [the first commit](liancheng@d537a36), compression configuration is in [the second one](liancheng@4ce09aa). Author: Cheng Lian <lian.cs.zju@gmail.com> Closes #608 from liancheng/spark-1678 and squashes the following commits: 66c3a8d [Cheng Lian] Renamed in-memory compression configuration key f8fb3a0 [Cheng Lian] Added assertion for testing .hasNext of various decoder 4ce09aa [Cheng Lian] Made in-memory compression configurable via SparkConf d537a36 [Cheng Lian] Fixed SPARK-1678 (cherry picked from commit 6d721c5) Signed-off-by: Patrick Wendell <pwendell@gmail.com>

…ession configurable, disabled by default In-memory compression is now configurable in `SparkConf` by the `spark.sql.inMemoryCompression.enabled` property, and is disabled by default. To help code review, the bug fix is in [the first commit](liancheng@d537a36), compression configuration is in [the second one](liancheng@4ce09aa). Author: Cheng Lian <lian.cs.zju@gmail.com> Closes apache#608 from liancheng/spark-1678 and squashes the following commits: 66c3a8d [Cheng Lian] Renamed in-memory compression configuration key f8fb3a0 [Cheng Lian] Added assertion for testing .hasNext of various decoder 4ce09aa [Cheng Lian] Made in-memory compression configurable via SparkConf d537a36 [Cheng Lian] Fixed SPARK-1678

Author: Andrew Ash <andrew@andrewash.com> Closes apache#608 from ash211/patch-7 and squashes the following commits: bd85f2a [Andrew Ash] Worker registration logging fix (cherry picked from commit c0795cf) Signed-off-by: Aaron Davidson <aaron@databricks.com>

* Create ISSUE_TEMPLATE.md * add dev mailing list and jira links

Bump bazel to 0.23

liancheng added 2 commits April 30, 2014 20:19

Fixed SPARK-1678

d537a36

CompressibleColumnAccessor.hasNext and RunLengthEncoding.decoder.hasNext were not correctly implemented.

Made in-memory compression configurable via SparkConf

4ce09aa

Added assertion for testing .hasNext of various decoder

f8fb3a0

Renamed in-memory compression configuration key

66c3a8d

asfgit closed this in 6d721c5 May 6, 2014

liancheng deleted the spark-1678 branch September 24, 2014 00:14

rvesse pushed a commit to rvesse/spark that referenced this pull request Mar 2, 2018

Create ISSUE_TEMPLATE.md (apache#608)

90a204c

* Create ISSUE_TEMPLATE.md * add dev mailing list and jira links

bzhaoopenstack pushed a commit to bzhaoopenstack/spark that referenced this pull request Sep 11, 2019

Bump bazel to 0.23 (apache#608)

a3a9633

Bump bazel to 0.23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SPARK-1678][SPARK-1679] In-memory compression bug fix and made compression configurable, disabled by default #608

[SPARK-1678][SPARK-1679] In-memory compression bug fix and made compression configurable, disabled by default #608

liancheng commented May 1, 2014

AmplabJenkins commented May 1, 2014

AmplabJenkins commented May 1, 2014

AmplabJenkins commented May 1, 2014

AmplabJenkins commented May 1, 2014

liancheng commented May 1, 2014

AmplabJenkins commented May 5, 2014

AmplabJenkins commented May 5, 2014

AmplabJenkins commented May 5, 2014

AmplabJenkins commented May 5, 2014

marmbrus commented May 5, 2014

liancheng commented May 6, 2014

AmplabJenkins commented May 6, 2014

AmplabJenkins commented May 6, 2014

AmplabJenkins commented May 6, 2014

AmplabJenkins commented May 6, 2014

pwendell commented May 6, 2014

[SPARK-1678][SPARK-1679] In-memory compression bug fix and made compression configurable, disabled by default #608

[SPARK-1678][SPARK-1679] In-memory compression bug fix and made compression configurable, disabled by default #608

Conversation

liancheng commented May 1, 2014

AmplabJenkins commented May 1, 2014

AmplabJenkins commented May 1, 2014

AmplabJenkins commented May 1, 2014

AmplabJenkins commented May 1, 2014

liancheng commented May 1, 2014

AmplabJenkins commented May 5, 2014

AmplabJenkins commented May 5, 2014

AmplabJenkins commented May 5, 2014

AmplabJenkins commented May 5, 2014

marmbrus commented May 5, 2014

liancheng commented May 6, 2014

AmplabJenkins commented May 6, 2014

AmplabJenkins commented May 6, 2014

AmplabJenkins commented May 6, 2014

AmplabJenkins commented May 6, 2014

pwendell commented May 6, 2014