We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
If both data sets are stored sorted on the join key, then its possible to perform the join on the map side. The general idea is to:
There are already implementations in both pig and hive, and would be a nice addition to scoobi.
Pigs implementation - http://wiki.apache.org/pig/PigMergeJoin Hives implementation - https://issues.apache.org/jira/browse/HIVE-1194
The text was updated successfully, but these errors were encountered:
i need code to implement sort merge join any suggestions ?
Sorry, something went wrong.
No branches or pull requests
If both data sets are stored sorted on the join key, then its possible to perform the join on the map side. The general idea is to:
There are already implementations in both pig and hive, and would be a nice addition to scoobi.
Pigs implementation - http://wiki.apache.org/pig/PigMergeJoin
Hives implementation - https://issues.apache.org/jira/browse/HIVE-1194
The text was updated successfully, but these errors were encountered: