-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add packages for everyday usage #2
Conversation
Add the full complement of scientific python packages I use on a fairly regular basis.
BTW, I ran a quick check of versions and imports off the binder link, and all so far looks good:
Thanks! |
Thank you so much, @fperez! We should find ways to automate this even further, and with more people being hired at 2i2c I'm hopeful that can happen :) |
Awesome, thanks so much! It took me a bit to figure out how to get the script to work in-place in the yaml structure while preserving structure (ruyaml is essentially undocumented in this aspect, and no methods have docstrings, so it's kind of hunting in the dark). But the script now works well for this purpose, so we can eventually put it in a more automated pipeline if needed. BTW - building the env on the hub took forever. I timed it on my laptop where it took 1 min 25s, while on the hub it took easily ~ 20 minutes (I didn't time it precisely). I know those filesystem-intensive operations are slow, but this seemed unusually slow. Just figured I'd let you know. |
@fperez yeah, that's sort of the performance I'd expect on NFS honestly! Another reason I recommend against putting user environments on NFS... :D |
I hear you @yuvipanda! TBH I was (naïvely) expecting the gap to be a bit less brutal. In the cloud, it really seems to be horrid (on a local, "classic" linux cluster the penalties are manageable). In any case - given this, do you have any other suggestions for users to experiment when they need to build and keep around an env to use? No matter what we do with the base env, users will always need to experiment with new environments, they'll have a few one-off packages they need to build, etc. What is being recommended to other 2i2c users in general? |
Bump xarray from 2022.11.0 to 2022.12.0 which contains a [bugfix](pydata/xarray#7304) useful for reading multiple groups from ICESat-2 HDF5 files in AWS S3 buckets. Also taking the opportunity to sort packages alphabetically in the `environment.yml` and reorganize some sections originally added in #2. Hopefully this will make it clearer on where new conda packages can be added in the future!
Add the full complement of scientific python packages I use on a fairly regular basis.
As per @yuvipanda's comment, I went through the various environment files in images I use, and I tried to capture all the packages I actually use in practice on a reasonably regular basis, leaving out more obscure things that might end up going stale or not actually be used.
All the packages here are things I've used in some capacity for either research or teaching either at Berkeley or in the JMTE hub in the last couple of years.