diff --git a/bad_data/ARROW-GH-41317.parquet b/bad_data/ARROW-GH-41317.parquet
new file mode 100644
index 0000000..a51eec7
Binary files /dev/null and b/bad_data/ARROW-GH-41317.parquet differ
diff --git a/bad_data/ARROW-GH-41321.parquet b/bad_data/ARROW-GH-41321.parquet
new file mode 100644
index 0000000..b78c544
Binary files /dev/null and b/bad_data/ARROW-GH-41321.parquet differ
diff --git a/bad_data/README.md b/bad_data/README.md
index 472865b..885af61 100644
--- a/bad_data/README.md
+++ b/bad_data/README.md
@@ -22,3 +22,7 @@ These are files used for reproducing various bugs that have been reported.
 
 * PARQUET-1481.parquet: tests a case where a schema Thrift value has been
   corrupted
+* ARROW-GH-41321.parquet: test case of https://github.com/apache/arrow/issues/41321
+  where the number of decoded rep / def levels is less than num_values in page_header.
+* ARROW-GH-41317.parquet: test case of https://github.com/apache/arrow/issues/41317
+  where not all columns have the same size.