Update to DataFrames v0.11, CSV 0.2.1, RData 0.3 #48
Thanks @alyst. When this is ready for merging, let me know.
@quinnj I've switched from
or
or
Maybe you have an easy fix or explanation for those.
I've failed to resist the temptation of caching the
@alyst, can you list the individual files you saw the errors on? Happy to take a look.
The
(the CSV.jl line numbers could be off, because I've made some changes to support all-nulls columns):
Some datasets need JuliaData/CSV.jl#115 to load properly.
Thanks for working on this. I believe RDatasets is the only package that needs to be updated before we can patch up Gadfly to work on top of DataFrames v0.11.
I'm waiting for JuliaData/CSV.jl#115 before removing [WIP] and considering a merge. I hope that PR will be reviewed in a few days.
Just ping me when this is ready for merging.
@alyst JuliaData/CSV.jl#115 has been merged.
Even with the PR being merged, it still needs to be tagged. And that tag should be the minimum version in REQUIRE here.
@quinnj could you tag a minor release? It would help with updating RDatasets and Gadfly.
@Nosferican Thanks! Actually, I haven't forgotten this PR at all :) Besides JuliaData/CSV.jl#115 there are a few other outstanding issues (notably JuliaData/CSV.jl#132), which we are trying to resolve. Once it's done and CI is green for DataFrames and CSV, I'll update this PR and it's ok to merge.
A new tag of CSV was just merged. |
@alyst Travis fails overall because it still mentions 0.4 and 0.5, but 0.6 passes. If you could please modify .travis.yml to just test 0.6 and master, then we should be able to tag a new version of RDatasets |
@randyzwitch I hope to have a look tomorrow.
Thanks to @StoneCypher, this is now passing on 0.6 as well. Up to you @alyst if there are more updates you want to do, or if I should merge this in and tag a new release.
It would be great to have this merged and a new release tagged as soon as possible. Thanks everyone for getting this up and running. (esp @alyst!)
- decompress csv.gz and feed the stream to CSV using CodecZlib
- raise the row detection limit for CSV files to 15000, as vcd::Bundesliga needs to read up to row #10804 to detect that the 7th column can contain missing values
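For readers unfamiliar with the streaming approach in this commit, here is a minimal sketch of the same idea: decompress a `.csv.gz` file on the fly and hand the decompressed stream straight to a CSV parser, without writing a temporary file. It is written in Python with the standard `gzip` and `csv` modules purely as an illustration; it is not the CodecZlib/CSV.jl code from this PR, and the function name `read_gzipped_csv` is hypothetical.

```python
import csv
import gzip


def read_gzipped_csv(path):
    """Return (header, rows) parsed from a gzip-compressed CSV file.

    Hypothetical helper for illustration only; the PR itself does this
    in Julia with CodecZlib and CSV.jl.
    """
    # gzip.open in text mode returns a decompressing stream, and csv.reader
    # consumes that stream line by line, so nothing is unpacked to disk.
    with gzip.open(path, mode="rt", newline="", encoding="utf-8") as stream:
        reader = csv.reader(stream)
        header = next(reader)  # first record holds the column names
        rows = list(reader)    # remaining records, as lists of strings
    return header, rows
```

The point of the design is that the parser only ever sees a readable stream, so the same parsing path could serve both compressed and uncompressed files.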
so that test results for non-problematic packages are collapsed into a single row in the report
allow digits in the first position to make psych::cubits pass, but disallow whitespace at any other position
use 20 rows by default and define Dataset_typedetect_rows dictionary that stores the exceptions
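The two commits on type detection can be illustrated together with a rough sketch: a small default number of rows is inspected, and a dictionary holds per-dataset exceptions. The sketch below is Python, not the Julia code in this PR. The names `DEFAULT_TYPEDETECT_ROWS`, `DATASET_TYPEDETECT_ROWS`, `typedetect_rows` and `column_allows_missing` are hypothetical, as is the choice of `"NA"` and the empty string as missing markers. Only the numbers 20, 15000 and row 10804 come from the thread.

```python
# Default number of rows inspected when inferring column types (from the thread).
DEFAULT_TYPEDETECT_ROWS = 20

# Exceptions keyed by (package, dataset). The vcd/Bundesliga limit is the one
# mentioned in the thread; the key layout is an assumption for illustration.
DATASET_TYPEDETECT_ROWS = {("vcd", "Bundesliga"): 15000}


def typedetect_rows(package, dataset):
    """Look up the row limit for a dataset, falling back to the default."""
    return DATASET_TYPEDETECT_ROWS.get((package, dataset), DEFAULT_TYPEDETECT_ROWS)


def column_allows_missing(values, limit):
    """Inspect only the first `limit` values, as a row-limited detector would.

    Assumes missing entries appear as "" or "NA" (an illustrative choice).
    """
    return any(v in ("", "NA") for v in values[:limit])


# A column whose first missing value sits at row 10804 (index 10803).
column = ["1"] * 10803 + ["NA"]

# With the default limit the missing value is never seen ...
assert not column_allows_missing(column, typedetect_rows("datasets", "iris"))
# ... while the per-dataset exception reads far enough to find it.
assert column_allows_missing(column, typedetect_rows("vcd", "Bundesliga"))
```

Keeping the default low and listing exceptions explicitly means only the few datasets with late-appearing missing values pay for the deeper scan.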
I've updated REQUIRE, removed enclosing modules from tests (now all test.jl files are executed in their own testset) and finally removed [WIP].
A part of JuliaData/DataFrames.jl#1232.
Still have to update .yml so that CI loads master versions of DataFrames, CSV and RData.