Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[DH-140] clean up the datahub image python install #5274

Conversation

shaneknapp
Copy link
Contributor

DO NOT MERGE (yet)

Copy link
Collaborator

@ryanlovett ryanlovett left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yay for deleting a whole bunch of stuff for courses that now have their own hub!

I left one comment about prob140 dependencies.

Ideally, packages should be moved from the pip section to the upper conda package list, if those packages are available through conda. My feeling is that this will improve the performance of the resolver during installation.

Lastly, for dependencies with descriptions referring to very old terms (like Fall 2019), either those dependencies should be removed, or the comments should be updated to reflect that the dependencies are need through specific upcoming semesters. (like if it is known that the course will be taught by the same instructor for the next X semesters)

That being said, even if you agree with any of this, it doesn't all need to happen in this PR.

deployments/datahub/images/default/environment.yml Outdated Show resolved Hide resolved
@shaneknapp
Copy link
Contributor Author

shaneknapp commented Dec 15, 2023 via email

@shaneknapp
Copy link
Contributor Author

i ran the following command on a list of pip packages i gleaned from environment.yaml:

for x in $(cat check-conda ); do conda search $x > /dev/null 2>&1 && echo $x >> in-conda; done

then i went through, confirmed they were there and moved them out of the pip block: 6d57c86

now we can check the mamba build steps w/timing and compare to how it was before!

@shaneknapp
Copy link
Contributor Author

tl;dr: it didn't appear that having conda resolve more packages than pip would have been worth the effort to implement. also, bumping the circleci runner size from 'default' to 'large' didn't appear to make the build go any quicker that before.

the final build is running and i'll be merging to staging afterwards.

@shaneknapp shaneknapp merged commit 83aa012 into berkeley-dsep-infra:staging Dec 18, 2023
2 checks passed
@shaneknapp shaneknapp deleted the dh-140-audit-datahub-python-packages branch December 18, 2023 19:30
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants