Experimental parquet decoder with first-class selection pushdown support #6921
Filed #7014
I think there is something wrong with how GitHub is displaying this diff -- this callsite was changed to use `simdutf8` in this PR: #6668. The version on main doesn't have any calls to `from_utf8`: https://github.com/apache/arrow-rs/blob/main/parquet/src/arrow/array_reader/byte_view_array.rs#L381
I just checked and this function goes away if `main` is merged in.
Ah ok, yeah, this is a different one than the one I mentioned in #7014, sorry for the confusion.
The good news is that now it should be much faster.