Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature Request]On simple crc generation, check if the (commit version - 1)'s crc file present. #4177

Open
8 tasks
huan233usc opened this issue Feb 19, 2025 · 0 comments
Labels
enhancement New feature or request

Comments

@huan233usc
Copy link
Collaborator

Feature request

Which Delta project/connector is this regarding?

  • Spark
  • Standalone
  • Flink
  • Kernel
  • Other (fill in here)

Overview

Current implementation (#4134) generate crc_simple post commit hook at commit version only when (commit version - 1)'s crc is used for building the snapshot.

crc_simple will not present if retry happen.

One optimization could be -- if crc at (commit version - 1) is not present, try to read it and build new crc info. This will increase the chance that crc_simple got build

Motivation

Further details

Willingness to contribute

The Delta Lake Community encourages new feature contributions. Would you or another member of your organization be willing to contribute an implementation of this feature?

  • Yes. I can contribute this feature independently.
  • Yes. I would be willing to contribute this feature with guidance from the Delta Lake community.
  • No. I cannot contribute this feature at this time.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant