-
Notifications
You must be signed in to change notification settings - Fork 3.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Correctness issues in S3 Select Pushdown #17775
Comments
Bullets 2 and 3 can be fixed within the Trino code, but the first one looks like a bug on the AWS side of things. @pettyjamesm do you have much background with s3 select? The test I wrote that hit this is here: https://github.com/trinodb/trino/pull/17563/files#diff-60fcb98253871d1bddcafc2121e18998f5f0b98a5f127731450f98eb446d928fR133 |
Not specifically, let's see if @dnanuti can help out with that |
Hello! Happy to help.
|
Thanks Diana (replied with the decimal comparison example in a different thread) For anyone looking at potentially picking this up, these are some additional tests that don't pass. I believe all the issues here (besides decimal comparison) are on the Trino side, not AWS:
Additionally if you add
to the test data provider here |
@pettyjamesm @dnanuti can you reply to the above? @alexjo2144 has #18102 to disable S3 Select CSV support until this can be fixed (or worked around on our side) |
The approach in #18102 seems reasonable- S3 select currently ignores scale and precision on The other issues do sound like they're fixable from the Trino side (something about a mismatch between |
S3 select was removed in #18241 |
An initial set of corrections is here: #17563
While testing that PR there were a few other issues I ran into:
textfile_field_separator, textfile_field_separator_escape, null_format
S3SelectCsvRecordReader
checks for a quote character in the Hive schema, but that property can't be set when creating tables in Trino using theTEXTFILE
formatThe text was updated successfully, but these errors were encountered: