[Spark] Support List and Map columns in Uniform #2459

LukasRupprecht · 2024-01-09T19:54:17Z

Which Delta project/connector is this regarding?

Description

This PR adds support for List and Map columns in Uniform. To support these types, Delta column mapping needs to write additional field IDs to the parquet schema. List columns require one additional field ID for the 'element' subfield and Map columns require two additional field IDs for the 'key' and 'value' subfields inside the parquet file. These nested field IDs are added to the table schema during the generation of the IDs and physical names for column mapping. They are added to the parquet schema through a new class, DeltaParquetWriteSupport, that hooks into Spark's parquet write path and rewrites the parquet schema based on the additional field IDs.

This PR is part of #2297.

How was this patch tested?

Unit tests will be added soon in a separate PR.

Does this PR introduce any user-facing changes?

No

lzlfred

very excited to see Map and List support for Uniform Iceberg !

implements support for list and map

679ac48

lzlfred approved these changes Jan 9, 2024

View reviewed changes

vkorukanti closed this in b4e5d5c Jan 10, 2024

LukasRupprecht deleted the list-map-support branch April 3, 2024 00:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Spark] Support List and Map columns in Uniform #2459

[Spark] Support List and Map columns in Uniform #2459

LukasRupprecht commented Jan 9, 2024

lzlfred left a comment

[Spark] Support List and Map columns in Uniform #2459

[Spark] Support List and Map columns in Uniform #2459

Conversation

LukasRupprecht commented Jan 9, 2024

Which Delta project/connector is this regarding?

Description

How was this patch tested?

Does this PR introduce any user-facing changes?

lzlfred left a comment

Choose a reason for hiding this comment