Pan Deng: Integrating pandas.Panel and xarray Features
#127
OXPHOS commented Mar 23, 2016
- I contacted NumFOCUS mentors before writing my proposal
- I showed you my contribution (it can be any form: proof-of-concept project idea, some sample code or just a link to your commits from other project)
- I linked to my sample contribution from the proposal
- I linked to my opened issues in numfocus/gsoc repository from the proposal
## Technical Details

Most of my proposal can be carried out with features currently implemented in pandas and xarray. For the PCA part, I might switch to C++ and the Eigen3 library to improve performance.
Currently xarray is pure Python, so this would be a very large change. Using Cython and/or Numba might be acceptable. I suspect that xarray will eventually have to go down this route. @shoyer can shed some more light.
xarray is currently pure Python built on top of NumPy and dask.array. Indeed, Cython or Numba might be acceptable, but Numba at least would need to be an optional dependency.
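To illustrate what "optional dependency" could look like in practice, here is a minimal sketch, not anything taken from xarray itself. It assumes a hypothetical helper `grouped_sum` and falls back to plain Python when Numba is not installed; the name, signature, and fallback decorator are all illustrative assumptions.

```python
import numpy as np

try:
    # Optional accelerator: used only if Numba happens to be installed.
    from numba import njit
    HAVE_NUMBA = True
except ImportError:
    HAVE_NUMBA = False

    def njit(func=None, **kwargs):
        """No-op stand-in for numba.njit when Numba is missing."""
        if func is None:
            return lambda f: f
        return func


@njit
def grouped_sum(values, labels, n_groups):
    # Hypothetical helper: sum `values` into one bucket per integer label.
    out = np.zeros(n_groups)
    for i in range(values.shape[0]):
        out[labels[i]] += values[i]
    return out


values = np.array([1.0, 2.0, 3.0, 4.0])
labels = np.array([0, 1, 0, 1])
totals = grouped_sum(values, labels, 2)
```

The same function body runs either way; only the speed differs, so users without Numba lose nothing but performance.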
For PCA in particular, it would make sense to wrap existing implementations/wrappers in SciPy rather than rolling your own.
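As a rough sketch of that suggestion, PCA can be built on SciPy's existing SVD routine instead of a hand-written C++/Eigen3 implementation. The function name `pca` and its return values are assumptions made for illustration, and the example works on a plain NumPy array; wrapping the result back into xarray objects is left out.

```python
import numpy as np
from scipy import linalg


def pca(data, n_components):
    """PCA of a 2-D (samples x features) array using scipy.linalg.svd."""
    data = np.asarray(data, dtype=float)
    # Center each feature (column) on zero mean.
    centered = data - data.mean(axis=0)
    # Thin SVD: centered == u @ diag(s) @ vt
    u, s, vt = linalg.svd(centered, full_matrices=False)
    components = vt[:n_components]                   # principal axes (rows)
    scores = u[:, :n_components] * s[:n_components]  # projected samples
    explained_variance = s[:n_components] ** 2 / (data.shape[0] - 1)
    return scores, components, explained_variance


rng = np.random.default_rng(0)
samples = rng.normal(size=(200, 5))
scores, components, explained_variance = pca(samples, n_components=2)
```

Because the heavy lifting happens inside LAPACK through SciPy, this stays pure Python at the package level, which fits the constraint described above.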
I get it. I have no experience with Cython or SciPy, but I will check them out.
One feature that would have a lot of impact would be bringing groupby performance in xarray up to par with pandas: pydata/xarray#659
Honestly, I didn't find much to transfer from pandas to xarray, and I am trying to dig something out of nowhere. Or maybe I should focus on add-on features like
Also, I have the following todo list from pydata/xarray#702. Can I include all of this in my proposal?