Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[SPARK-17350][SQL] Disable default use of KryoSerializer in Thrift Server #14906

Closed

Conversation

JoshRosen
Copy link
Contributor

In SPARK-4761 / #3621 (December 2014) we enabled Kryo serialization by default in the Spark Thrift Server. However, I don't think that the original rationale for doing this still holds now that most Spark SQL serialization is now performed via encoders and our UnsafeRow format.

In addition, the use of Kryo as the default serializer can introduce performance problems because the creation of new KryoSerializer instances is expensive and we haven't performed instance-reuse optimizations in several code paths (including DirectTaskResult deserialization).

Given all of this, I propose to revert back to using JavaSerializer as the default serializer in the Thrift Server.

/cc @liancheng

@JoshRosen JoshRosen changed the title Disable kryo in thriftserver [SPARK-17350][SQL] Disable default use of KryoSerializer in Thrift Server Sep 1, 2016
@SparkQA
Copy link

SparkQA commented Sep 1, 2016

Test build #64740 has finished for PR 14906 at commit aa18bb6.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@rxin
Copy link
Contributor

rxin commented Nov 1, 2016

LGTM if tests pass again.

@SparkQA
Copy link

SparkQA commented Nov 1, 2016

Test build #3390 has finished for PR 14906 at commit aa18bb6.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@rxin
Copy link
Contributor

rxin commented Nov 1, 2016

Merging in master.

@asfgit asfgit closed this in 6e62981 Nov 1, 2016
@JoshRosen JoshRosen deleted the disable-kryo-in-thriftserver branch November 1, 2016 23:38
uzadude pushed a commit to uzadude/spark that referenced this pull request Jan 27, 2017
…rver

In SPARK-4761 / apache#3621 (December 2014) we enabled Kryo serialization by default in the Spark Thrift Server. However, I don't think that the original rationale for doing this still holds now that most Spark SQL serialization is now performed via encoders and our UnsafeRow format.

In addition, the use of Kryo as the default serializer can introduce performance problems because the creation of new KryoSerializer instances is expensive and we haven't performed instance-reuse optimizations in several code paths (including DirectTaskResult deserialization).

Given all of this, I propose to revert back to using JavaSerializer as the default serializer in the Thrift Server.

/cc liancheng

Author: Josh Rosen <joshrosen@databricks.com>

Closes apache#14906 from JoshRosen/disable-kryo-in-thriftserver.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants