-
Notifications
You must be signed in to change notification settings - Fork 13
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
* Parquet export IR Exporting data through a CSV intermediary is subject to loss of specificity and type info. This is particularly noticable for read_pandas, where the resulting dataframe has every column of type `object` and NULLs are indistinguishable from zero values. I used a small hack to export data from Dolt into a DataFrame using parquet instead of CSV. This requires the pyarrow dependency. I left TODOs for improvements on the Dolt side that would make this code cleaner and Dolt issues for the associated features. There is one bug with NULL datetime values that I added a Dolt issue for. * fix fmt * revert python changes, use early pyarrow
- Loading branch information
1 parent
f3c83cc
commit 0043e7e
Showing
4 changed files
with
106 additions
and
14 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters