Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Parsing improvements and more misc stuff #60

Merged
merged 24 commits into from
Feb 9, 2024
Merged

Parsing improvements and more misc stuff #60

merged 24 commits into from
Feb 9, 2024

Conversation

Vindaar
Copy link
Member

@Vindaar Vindaar commented Jan 31, 2024

Some fixes to spread and adds few helpers (see changelog). Mainly fixes issue #58 about parsing of CSV files with quoted fields (I hope fully).

Full changelog:

* v0.4.0
- add ~shuffle~ to shuffle a DF. Either using stdlib global RNG or
  given RNG
- add ~extend~ helpers for ~seq/Tensor/Column~ to add a single element
  to any collection and return the extended version
- throw custom ~OSError~ if CSV file cannot be read
- add =%~= for ~(string, T)~ to construct a ~VObject~ Value
- fix ~spread~ implementation if more keys are present
- correctly handle quoted fields in CSV files, fixes issue #58
- add ~allowLineBreaks~ option to also allow for line breaks in quoted
  fields. It's an optional option (despite being commonly useful for
  files with quotes), because counting the lines is otherwise a useful
  sanity check the parsing worked successfully.  
- [IO] fix for space separated files with quoted fields as columns

This is useful for some things... For me at least. (e.g. histogram calls)
In the old version adding a `colConstant` (of type int) to a `colInt`
lead to it being converted to `colFloat`.
Note: the changelog is not up to date. More was added later!
If a space separated file has a quoted column name containing a space
itself, the parsing algo was broken.
@Vindaar Vindaar merged commit 8a26c32 into master Feb 9, 2024
12 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant