[SPARK-19937] Collect metrics for remote bytes read to disk during shuffle. #18249

Closed
wants to merge 2 commits into from

Conversation

jinxing64

In the current code (#16989), big blocks are shuffled to disk.
This PR proposes to collect metrics for the remote bytes fetched to disk.
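
As a rough illustration of the idea (not the code in this patch), the sketch below keeps a separate counter for remote bytes that end up on disk next to the usual remote-bytes-read counter. The names `ShuffleReadCounters`, `recordRemoteBlock`, `ShuffleReadCountersDemo`, and the 1024-byte `maxBytesInMemory` threshold are all hypothetical and chosen only for this example. Treating the to-disk total as a subset of the overall remote total is also an assumption.

```scala
// Hypothetical, self-contained sketch; these are not Spark's actual classes.
final class ShuffleReadCounters {
  private var _remoteBytesRead = 0L
  private var _remoteBytesReadToDisk = 0L

  def remoteBytesRead: Long = _remoteBytesRead
  def remoteBytesReadToDisk: Long = _remoteBytesReadToDisk

  /** Record one fetched remote block; `toDisk` marks a block that was written to disk. */
  def recordRemoteBlock(sizeInBytes: Long, toDisk: Boolean): Unit = {
    require(sizeInBytes >= 0, "block size must be non-negative")
    // Every remote block counts toward the overall total (assumption).
    _remoteBytesRead += sizeInBytes
    // Only blocks fetched to disk count toward the to-disk total.
    if (toDisk) _remoteBytesReadToDisk += sizeInBytes
  }
}

object ShuffleReadCountersDemo {
  // Assumed threshold, for illustration only: larger blocks go to disk.
  val maxBytesInMemory: Long = 1024L

  def run(blockSizes: Seq[Long]): ShuffleReadCounters = {
    val counters = new ShuffleReadCounters
    blockSizes.foreach { size =>
      counters.recordRemoteBlock(size, toDisk = size > maxBytesInMemory)
    }
    counters
  }
}
```

Calling `ShuffleReadCountersDemo.run(Seq(100L, 2048L, 4096L))` would count all three blocks as remote bytes read and only the two larger ones as read to disk.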

@SparkQA

SparkQA commented Jun 9, 2017

Test build #77840 has finished for PR 18249 at commit ae859fb.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@jinxing64 jinxing64 changed the title [WIP][SPARK-19937] Collect metrics for remote bytes read to disk during shuffle. [SPARK-19937] Collect metrics for remote bytes read to disk during shuffle. Jun 13, 2017
Contributor

@vanzin vanzin left a comment

Looks ok. At first I wasn't really sold on the name but can't think of something better either.

@@ -38,6 +38,10 @@ object MimaExcludes {
  lazy val v23excludes = v22excludes ++ Seq(
    // [SPARK-20495][SQL] Add StorageLevel to cacheTable API
    ProblemFilters.exclude[ReversedMissingMethodProblem]("org.apache.spark.sql.catalog.Catalog.cacheTable")
  ) ++ Seq(
Contributor

Could you follow the same style as v22excludes, which avoids this extra line?

@SparkQA

SparkQA commented Jun 16, 2017

Test build #78140 has finished for PR 18249 at commit 9768860.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@jinxing64
Author

@vanzin
Would you mind giving more comments when you have time? Then I can continue working on this :)

@vanzin
Contributor

vanzin commented Jun 22, 2017

I was just waiting in case anyone else wanted to comment.

LGTM, merging to master.

@asfgit asfgit closed this in 58434ac Jun 22, 2017
robert3005 pushed a commit to palantir/spark that referenced this pull request Jun 29, 2017
…uffle.

In current code(apache#16989), big blocks are shuffled to disk.
This pr proposes to collect metrics for remote bytes fetched to disk.

Author: jinxing <jinxing6042@126.com>

Closes apache#18249 from jinxing64/SPARK-19937.