Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: Add initial index lookup join operator implementation #12218

Closed
wants to merge 1 commit into from

Conversation

xiaoxmeng
Copy link
Contributor

Summary:
Add the initial index lookup operator implementation which supports inner join and left join
from an index source.
The index lookup join operator takes input from the probe side with one batch at a time. For each
probe input batch, it sends the lookup request from the index source and gets a lookup result iterator.
The result iterator produces the output in batches through a future for async processing at the
storage backend. For each lookup result, the index join operator produces one or more output batches
based on the operator's output batch size limit. For left join, the index join operator needs to detect the
input request rows that have no matches and produce output with nulls for the columns from the lookup
table.

Differential Revision: D68906030

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jan 30, 2025
Copy link

netlify bot commented Jan 30, 2025

Deploy Preview for meta-velox canceled.

Name Link
🔨 Latest commit c4b44fa
🔍 Latest deploy log https://app.netlify.com/sites/meta-velox/deploys/679c16a5b84b7f00081d4278

@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D68906030

xiaoxmeng added a commit to xiaoxmeng/velox that referenced this pull request Jan 30, 2025
…incubator#12218)

Summary:

Add the initial index lookup operator implementation which supports inner join and left join
from an index source.
The index lookup join operator takes input from the probe side with one batch at a time. For each
probe input batch, it sends the lookup request from the index source and gets a lookup result iterator.
The result iterator produces the output in batches through a future for async processing at the
storage backend. For each lookup result, the index join operator produces one or more output batches
based on the operator's output batch size limit. For left join, the index join operator needs to detect the
input request rows that have no matches and produce output with nulls for the columns from the lookup
table.

Differential Revision: D68906030
@xiaoxmeng xiaoxmeng requested a review from assignUser as a code owner January 30, 2025 21:51
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D68906030

xiaoxmeng added a commit to xiaoxmeng/velox that referenced this pull request Jan 30, 2025
…incubator#12218)

Summary:

Add the initial index lookup operator implementation which supports inner join and left join
from an index source.
The index lookup join operator takes input from the probe side with one batch at a time. For each
probe input batch, it sends the lookup request from the index source and gets a lookup result iterator.
The result iterator produces the output in batches through a future for async processing at the
storage backend. For each lookup result, the index join operator produces one or more output batches
based on the operator's output batch size limit. For left join, the index join operator needs to detect the
input request rows that have no matches and produce output with nulls for the columns from the lookup
table.

Differential Revision: D68906030
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D68906030

Copy link
Contributor

@mbasmanova mbasmanova left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you, @xiaoxmeng

…incubator#12218)

Summary:

Add the initial index lookup operator implementation which supports inner join and left join
from an index source.
The index lookup join operator takes input from the probe side with one batch at a time. For each
probe input batch, it sends the lookup request from the index source and gets a lookup result iterator.
The result iterator produces the output in batches through a future for async processing at the
storage backend. For each lookup result, the index join operator produces one or more output batches
based on the operator's output batch size limit. For left join, the index join operator needs to detect the
input request rows that have no matches and produce output with nulls for the columns from the lookup
table.

Reviewed By: mbasmanova

Differential Revision: D68906030
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D68906030

@facebook-github-bot
Copy link
Contributor

This pull request has been merged in dcafd32.

@xiaoxmeng xiaoxmeng deleted the export-D68906030 branch January 31, 2025 01:06
ArnavBalyan pushed a commit to ArnavBalyan/velox that referenced this pull request Jan 31, 2025
…incubator#12218)

Summary:
Pull Request resolved: facebookincubator#12218

Add the initial index lookup operator implementation which supports inner join and left join
from an index source.
The index lookup join operator takes input from the probe side with one batch at a time. For each
probe input batch, it sends the lookup request from the index source and gets a lookup result iterator.
The result iterator produces the output in batches through a future for async processing at the
storage backend. For each lookup result, the index join operator produces one or more output batches
based on the operator's output batch size limit. For left join, the index join operator needs to detect the
input request rows that have no matches and produce output with nulls for the columns from the lookup
table.

Reviewed By: mbasmanova

Differential Revision: D68906030

fbshipit-source-id: 4038c31653bc96893497b3c26d935fa2f7b58f43
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. fb-exported Merged
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants