SPARK-3039: Allow spark to be built using avro-mapred for hadoop2 #1945
Can one of the admins verify this patch?
I've looked at this part of the build a lot and can say LGTM
Should I also add the Since now to build it according to the README one should run: |
You have to specify a Hadoop profile already, and you added the classifier to all of them. So that's fine. Building with YARN is orthogonal, so doesn't belong elsewhere I think.
The problem I see is that if you build according to the README:
Either the README should be changed to account for this, or the property should be added to the yarn and yarn-alpha profiles (not the mapr profile, I think). Or is there a way to fix this with Maven?
Yeah that's out of date I believe. For example |
Yeah, you're right about yarn being orthogonal to the Hadoop version. Apart from the maven/sbt question there is another issue: The |
I think it works with the invocation you describe. Honestly it's not a big priority, this version, but nice to get it right. Want to open a JIRA to track updating/deleting the info from README.md? I think it needs to be fixed one way or the other.
Created the issue: https://issues.apache.org/jira/browse/SPARK-3069 (Build instructions in README are outdated) @srowen: Thank you for your input!
Can one of the admins verify this patch?
Yeah - LGTM pending tests.
QA tests have started for PR 1945 at commit
QA tests have finished for PR 1945 at commit
SPARK-3039: Adds the maven property "avro.mapred.classifier" to build spark-assembly with avro-mapred with support for the new Hadoop API. Sets this property to hadoop2 for Hadoop 2 profiles.

I am not very familiar with maven, nor do I know whether this potentially breaks something in the hive part of spark. There might be a more elegant way of doing this.

Author: Bertrand Bossy <bertrandbossy@gmail.com>

Closes #1945 from bbossy/SPARK-3039 and squashes the following commits:

c32ce59 [Bertrand Bossy] SPARK-3039: Allow spark to be built using avro-mapred for hadoop2

(cherry picked from commit c243b21)
Signed-off-by: Patrick Wendell <pwendell@gmail.com>
Hey @pwendell @srowen @bbossy this is actually causing issues for SBT applications that use the |
SPARK-3039: Adds the maven property "avro.mapred.classifier" to build spark-assembly with avro-mapred with support for the new Hadoop API. Sets this property to hadoop2 for Hadoop 2 profiles.
I am not very familiar with maven, nor do I know whether this potentially breaks something in the hive part of spark. There might be a more elegant way of doing this.
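For readers who want to see the shape of the change, here is a rough sketch of how a property like this is typically wired up in a pom.xml. It is not the actual diff from this PR: the profile id `hadoop-2.4` and the `${avro.version}` property are assumptions for illustration, and only the `avro.mapred.classifier` property name and its `hadoop2` value come from the description above. How the property is defined when no Hadoop 2 profile is active is left out.

```xml
<!-- Illustrative pom.xml fragment, not the actual diff from this PR. -->
<project>
  <profiles>
    <profile>
      <!-- Assumed profile id; each Hadoop 2 profile would set the same property. -->
      <id>hadoop-2.4</id>
      <properties>
        <avro.mapred.classifier>hadoop2</avro.mapred.classifier>
      </properties>
    </profile>
  </profiles>

  <dependencies>
    <dependency>
      <groupId>org.apache.avro</groupId>
      <artifactId>avro-mapred</artifactId>
      <!-- Assumed version property, not taken from this PR. -->
      <version>${avro.version}</version>
      <!-- Resolves to "hadoop2" when a Hadoop 2 profile is active. -->
      <classifier>${avro.mapred.classifier}</classifier>
    </dependency>
  </dependencies>
</project>
```

With a layout like this, activating the profile (for example `mvn -Phadoop-2.4 -DskipTests clean package`, again assuming that profile id) would pull in the `hadoop2` build of avro-mapred.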