Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Spill out query and database detail at the time of bug report, to help r... #47

Closed
wants to merge 1 commit into from

Conversation

jondot
Copy link
Contributor

@jondot jondot commented Nov 13, 2013

...eproduction of bugs.

@jvshahid jvshahid closed this Nov 13, 2013
jvshahid pushed a commit that referenced this pull request Aug 12, 2014
jvshahid pushed a commit that referenced this pull request Aug 12, 2014
mgattozzi added a commit that referenced this pull request Feb 10, 2025
This commit adds a FileIndex to the compactor. This lets us index
values for column names passed into the compactor function by being
able to look up the path for the name of the file.

Future work will need to be done for path rewriting in the index if
parquet files are moved to different paths. For now it provides us
an index of the files as they are written out into object storage.

This commit does a few things:
- The compactor now returns a struct with ParquetFile data, a FileIndex, and the necessary Object Store Paths from the compactor
- Provides a method to index a record batch passed into it
- Extends our compactor tests to include assertions for the indexes
- Changes the FileIndex to use a combined key for one lookup, rather
  than two lookups and two maps, simplifying it's logic

With these changes we can now create an index as we compact for
queries.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants