-
Notifications
You must be signed in to change notification settings - Fork 21
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Implement data format versioning #44
Comments
I don't think right now the goal is to support multiple versions in the same release, but we must make the version explicit. We should add in the metadata that we are using a specific version of the Qbeast format. The reader should then check if it supports that version and throws an exception if not. The user should then look for the correct version git and compile the proper version. In the future, we can support upgrades, but now it is not necessary. Therefore the todo should me more: start versioning the format evolution. I guess the first one was 0.0.1 and now we use 0.0.2?
|
We may not have that many format versions to use semantic versioning, so I'd say the first one was v1 and now we are using v2. This way it won't be confused with the proper qbeast-spark version. For the rest I think it's fine, it will also help issue #53 to be solved. |
Mmh, I think you're right! And with just the number should be enough (1, 2...) because it's more like a |
I've been thinking about developing this, and I thought of doing it the following way: Use DeltaLog's So basically, my idea is to include our tags inside the
I think is a clean solution; I did some really quick tests with it and everything seems to work fine. The problem I found is to extend/modify the Protocol class from Delta. Could someone provide a hint? As a summary, I've seen that when committing a transaction, it includes a |
As the issue has been inactive for an extended period, and there has been no recent activity or updates, we will close this frozen issue |
What went wrong?
During the development and cleaning of the software, we missed having a version control on each of the implementations.
Without this track, in the main branch, we have two different ways of writing the metadata with no possible hint to differentiate between the releases.
For example, one dataset written before 7eb77dd would contain this information:
While the new ones have a different structures:
We should: [EDITED]
The text was updated successfully, but these errors were encountered: