Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[SPARK-32907][ML] adaptively blockify instances - revert blockify gmm #29782

Closed
wants to merge 3 commits into from

Conversation

zhengruifeng
Copy link
Contributor

What changes were proposed in this pull request?

revert blockify gmm

Why are the changes needed?

@WeichenXu123 and I thought we should use memory size instead of number of rows to blockify instance; then if a buffer's size is large and determined by number of rows, we should discard it.
In GMM, we found that the pre-allocated memory maybe too large and should be discarded:

@transient private lazy val auxiliaryPDFMat = DenseMatrix.zeros(blockSize, numFeatures)

We had some offline discuss and thought it is better to revert blockify GMM.

Does this PR introduce any user-facing change?

blockSize added in master branch will be removed

How was this patch tested?

existing testsuites

@SparkQA
Copy link

SparkQA commented Sep 17, 2020

Test build #128806 has finished for PR 29782 at commit 3e616ef.

  • This patch fails Python style tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA
Copy link

SparkQA commented Sep 17, 2020

Test build #128807 has finished for PR 29782 at commit d4d3b14.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@@ -432,13 +429,6 @@ def setAggregationDepth(self, value):
"""
return self._set(aggregationDepth=value)

@since("3.1.0")
def setBlockSize(self, value):
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@zero323 FYI

Copy link
Member

@srowen srowen left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Seems OK if this is just reverting something that went into 3.1.

@srowen
Copy link
Member

srowen commented Sep 22, 2020

Feel free to merge

@WeichenXu123
Copy link
Contributor

Jenkins retest this

@WeichenXu123
Copy link
Contributor

@zhengruifeng
Copy link
Contributor Author

@WeichenXu123 Good catch! Thanks

@SparkQA
Copy link

SparkQA commented Sep 23, 2020

Test build #129002 has finished for PR 29782 at commit 8d7113c.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@zhengruifeng
Copy link
Contributor Author

Merged to master. Thanks all for reviewing!

@zhengruifeng zhengruifeng deleted the unblockify_gmm branch September 23, 2020 07:56
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants