Redundant file system access to Delta's checkpoint parquet file #18916

Open
ebyhr opened this issue Sep 5, 2023 · 1 comment
Labels
delta-lake Delta Lake connector performance

Comments

@ebyhr
Member

ebyhr commented Sep 5, 2023

See TODO being added in #18917

@jkylling
Contributor

jkylling commented Sep 5, 2023

This relates to #17406 and #17516
We should keep track of the protocol and metadata entries incrementally. Currently we re-read protocol entries from the checkpoint on every read, and metadata entries on every new commit to a table.
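A minimal sketch of what incremental tracking could look like, assuming the last known protocol and metadata entries are cached per table version and only newer commits are replayed on top of them. This is not Trino's actual code: `IncrementalSnapshotState`, `TableState`, `Commit`, `ProtocolEntry`, and `MetadataEntry` are hypothetical stand-ins for illustration.

```java
import java.util.List;
import java.util.Optional;

// Hypothetical sketch: names and shapes are illustrative, not Trino's classes.
public class IncrementalSnapshotState
{
    // Simplified stand-ins for the Delta log "protocol" and "metaData" actions.
    public record ProtocolEntry(int minReaderVersion, int minWriterVersion) {}

    public record MetadataEntry(String id, String schemaString) {}

    // One commit in the transaction log; most commits carry neither entry.
    public record Commit(long version, Optional<ProtocolEntry> protocol, Optional<MetadataEntry> metadata) {}

    // Cached state for a table at a given version.
    public record TableState(long version, ProtocolEntry protocol, MetadataEntry metadata)
    {
        // Replays only the commits newer than this state, keeping the cached
        // entries unless a commit overrides them. The checkpoint is not re-read.
        public TableState apply(List<Commit> newCommits)
        {
            long currentVersion = version;
            ProtocolEntry currentProtocol = protocol;
            MetadataEntry currentMetadata = metadata;
            for (Commit commit : newCommits) {
                if (commit.version() != currentVersion + 1) {
                    throw new IllegalArgumentException(
                            "Expected commit " + (currentVersion + 1) + " but got " + commit.version());
                }
                currentProtocol = commit.protocol().orElse(currentProtocol);
                currentMetadata = commit.metadata().orElse(currentMetadata);
                currentVersion = commit.version();
            }
            return new TableState(currentVersion, currentProtocol, currentMetadata);
        }
    }
}
```

With a state like this, the checkpoint would only need to be read once to seed the initial `TableState`; later reads and commits would call `apply` with the commits written since the cached version.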
