Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: add column_index_operation #396

Merged
merged 1 commit into from
Dec 4, 2024
Merged

feat: add column_index_operation #396

merged 1 commit into from
Dec 4, 2024

Conversation

iajoiner
Copy link
Contributor

@iajoiner iajoiner commented Nov 27, 2024

Please be sure to look over the pull request guidelines here: https://github.com/spaceandtimelabs/sxt-proof-of-sql/blob/main/CONTRIBUTING.md#submit-pr.

Please go through the following checklist

Rationale for this change

In sort-merge joins we need to be able to apply columns to possibly repetitive indexes. Hence let's add this before actually adding sort_merge_join.

What changes are included in this PR?

  • add ColumnOperationError::IndexOutOfBounds.
  • add apply_slice_to_indexes in slice_operation.rs.
  • add IndexOp and ApplyIndexOp to apply the new op to Columns.

Are these changes tested?

Yes.

@iajoiner iajoiner mentioned this pull request Nov 27, 2024
5 tasks
@iajoiner iajoiner enabled auto-merge November 27, 2024 20:53
@iajoiner iajoiner requested a review from tlovell-sxt December 2, 2024 19:12
@iajoiner iajoiner force-pushed the feat/apply-index branch 5 times, most recently from ccca278 to d763e97 Compare December 4, 2024 14:56
@iajoiner iajoiner requested a review from tlovell-sxt December 4, 2024 14:56
@iajoiner iajoiner removed the request for review from JayWhite2357 December 4, 2024 15:03
column: &Column<'a, S>,
alloc: &'a Bump,
indexes: &[usize],
) -> ColumnOperationResult<Column<'a, S>>
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

NIT: This comment might not be useful depending on how this function will be used later. Without that context, I'd say it would be better if the error type was just a new IndexOutOfBounds error type since most of the column operation errors cannot occur in this function. But if this function is only ever used in contexts that are dealing with that error type, then it's probably fine.

@iajoiner iajoiner merged commit 533d8d1 into main Dec 4, 2024
11 checks passed
Copy link

github-actions bot commented Dec 4, 2024

🎉 This PR is included in version 0.57.1 🎉

The release is available on GitHub release

Your semantic-release bot 📦🚀

@iajoiner iajoiner deleted the feat/apply-index branch December 11, 2024 02:18
iajoiner added a commit that referenced this pull request Dec 18, 2024
Please be sure to look over the pull request guidelines here:
https://github.com/spaceandtimelabs/sxt-proof-of-sql/blob/main/CONTRIBUTING.md#submit-pr.

# Please go through the following checklist
- [x] The PR title and commit messages adhere to guidelines here:
https://github.com/spaceandtimelabs/sxt-proof-of-sql/blob/main/CONTRIBUTING.md.
In particular `!` is used if and only if at least one breaking change
has been introduced.
- [x] I have run the ci check script with `source
scripts/run_ci_checks.sh`.
- The following upstream PRs have been approved and merged:
  - [x] #391 
  - [x] #396


# Rationale for this change
This PR adds the actual sort-merge join process which completes a part
of #394.
<!--
Why are you proposing this change? If this is already explained clearly
in the linked issue then this section is not needed.
Explaining clearly why changes are proposed helps reviewers understand
your changes and offer better suggestions for fixes.

 Example:
 Add `NestedLoopJoinExec`.
 Closes #345.

Since we added `HashJoinExec` in #323 it has been possible to do
provable inner joins. However performance is not satisfactory in some
cases. Hence we need to fix the problem by implement
`NestedLoopJoinExec` and speed up the code
 for `HashJoinExec`.
-->

# What changes are included in this PR?
- add `sort_merge_join`.
<!--
There is no need to duplicate the description in the ticket here but it
is sometimes worth providing a summary of the individual changes in this
PR.

Example:
- Add `NestedLoopJoinExec`.
- Speed up `HashJoinExec`.
- Route joins to `NestedLoopJoinExec` if the outer input is sufficiently
small.
-->

# Are these changes tested?
<!--
We typically require tests for all PRs in order to:
1. Prevent the code from being accidentally broken by subsequent changes
2. Serve as another way to document the expected behavior of the code

If tests are not included in your PR, please explain why (for example,
are they covered by existing tests)?

Example:
Yes.
-->
Yes.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants