[Delta Protocol] Follow up comments for ClusteringTable feature #2294

dabao521 · 2023-11-15T15:18:15Z

Which Delta project/connector is this regarding?

Description

Follow up comments for ClusteringTable feature #2264

We had late comments after the original PR merged, and this PR is addressing those missed comments.

How was this patch tested?

N/A

Does this PR introduce any user-facing changes?

No

dabao521 · 2023-11-15T15:19:42Z

PROTOCOL.md

  - A clustering implementation is free to add additional information such as adding a new user-controlled metadata domain to keep track of its metadata.
+- Writers must not define clustered and partitioned table at the same time.


FYI @ryan-johnson-databricks / @imback82 , this is the new rule to disallow partitioned table . This is to address comment and comment

dabao521 · 2023-11-15T15:23:07Z

PROTOCOL.md

+  }
+}
+```
+The example above converts `configuration` field into JSON format, including escaping characters. Here's how it looks in plain JSON for better understanding.


FYI @ryan-johnson-databricks , this is to address your comment https://github.com/delta-io/delta/pull/2264/files#r1393336240

dabao521 · 2023-11-15T18:20:32Z

PROTOCOL.md

@@ -1057,27 +1059,48 @@ When Row Tracking is enabled (when the table property `delta.enableRowTracking`

 The Clustered Table feature facilitates the physical clustering of rows that share similar values on a predefined set of clustering columns.
 This enhances query performance when selective filters are applied to these clustering columns through data skipping.
-Clustering columns must be specified during the initial definition of a clustered table, and they can be modified after the table has been created.
+Clustering columns can be sprecified when creating a table or later, as long as the table doesn't have partition columns.


FYI @imback82 / @ryan-johnson-databricks , I have updated here and below to let clustering table feature to be enabled either during table creation or at a later stage . This is to address comment.

PROTOCOL.md

ryan-johnson-databricks

LGTM

PROTOCOL.md

imback82

LGTM

dabao521 added 3 commits November 15, 2023 06:33

fix comments

c31158e

address comments

058e4f1

follow up comments

d7f400d

dabao521 commented Nov 15, 2023

View reviewed changes

Let clustering table feature added either during creation or later stage

6689304

dabao521 commented Nov 15, 2023

View reviewed changes

imback82 reviewed Nov 15, 2023

View reviewed changes

PROTOCOL.md Outdated Show resolved Hide resolved

ryan-johnson-databricks approved these changes Nov 15, 2023

View reviewed changes

fix comments

adec52b

dabao521 requested a review from imback82 November 16, 2023 04:39

imback82 reviewed Nov 16, 2023

View reviewed changes

PROTOCOL.md Show resolved Hide resolved

fix comment

0f7a442

dabao521 requested a review from imback82 November 16, 2023 04:58

imback82 approved these changes Nov 16, 2023

View reviewed changes

dabao521 added 2 commits November 16, 2023 06:03

add back partitionValues

48402f4

remove extra size

1695e92

scottsand-db approved these changes Nov 16, 2023

View reviewed changes

allisonport-db closed this in 266a2fc Nov 21, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Delta Protocol] Follow up comments for ClusteringTable feature #2294

[Delta Protocol] Follow up comments for ClusteringTable feature #2294

dabao521 commented Nov 15, 2023 •

edited

Loading

dabao521 Nov 15, 2023

dabao521 Nov 15, 2023

dabao521 Nov 15, 2023 •

edited

Loading

ryan-johnson-databricks left a comment

imback82 left a comment

		- A clustering implementation is free to add additional information such as adding a new user-controlled metadata domain to keep track of its metadata.
		- Writers must not define clustered and partitioned table at the same time.

[Delta Protocol] Follow up comments for ClusteringTable feature #2294

[Delta Protocol] Follow up comments for ClusteringTable feature #2294

Conversation

dabao521 commented Nov 15, 2023 • edited Loading

Which Delta project/connector is this regarding?

Description

How was this patch tested?

Does this PR introduce any user-facing changes?

dabao521 Nov 15, 2023

Choose a reason for hiding this comment

dabao521 Nov 15, 2023

Choose a reason for hiding this comment

dabao521 Nov 15, 2023 • edited Loading

Choose a reason for hiding this comment

ryan-johnson-databricks left a comment

Choose a reason for hiding this comment

imback82 left a comment

Choose a reason for hiding this comment

dabao521 commented Nov 15, 2023 •

edited

Loading

dabao521 Nov 15, 2023 •

edited

Loading