Limit Parquet Page Row Count By Default to reduce writer memory requirements with highly compressable columns #5797
Labels
enhancement
Any new improvement worthy of a entry in the changelog
parquet
Changes to the parquet crate
Is your feature request related to a problem or challenge? Please describe what you are trying to do.
There is a discussion on the mailing list about default settings for page sizing, and one of the suggestions is that the page row count limit should be enabled by default, where currently it is not. Row groups are limited to 1M rows by default, there is some suggestion pages should be limited to 20,000.
Creating this issue to track
Describe the solution you'd like
Describe alternatives you've considered
Additional context
The text was updated successfully, but these errors were encountered: