-
Notifications
You must be signed in to change notification settings - Fork 21
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Optimization of the Unindexed Files [Staging Area] #294
Comments
@osopardo1 Are these unindexed files the result of a transaction that did not use Qbeast to append? |
I will update the issue with a better problem formulation. |
Let's use "unindexed files", the term unindexed exists in English: https://en.wiktionary.org/wiki/unindexed#English |
Oh, at first it sounded weird to me. agree with Unindexed |
Opened PR: #440 |
Qbeast Spark supports reading files not indexed with Qbeast Metadata. There's different situations that can cause a table to have a hybrid state.
The current behavior is to ignore the non-indexed files when reading and writing, thus disabling part of the Sampling capabilities and reducing the precision when estimating the index. Also, optimization of this "staging area", does not select that subset of files for any rearrangement operation.
This issue is to record and analyze which is the best storyline to follow when Optimizing the Non-Indexed files.
The text was updated successfully, but these errors were encountered: