-
-
Notifications
You must be signed in to change notification settings - Fork 15k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
pytorch-bin: 1.10.0 -> 1.11.0 #164712
pytorch-bin: 1.10.0 -> 1.11.0 #164712
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you for creating a PR!
torchvision-bin and torchaudio-bin also depend on this.
We should include them in another commit or require another PR.
Did prefetch.sh work fine? |
Technically, the binary does not need nvcc with cudatoolkit. |
I'll double check it (I ran it in two steps before, so possible I made a mistake somewhere in updating the script) |
I think the title and commit should be 'python3Packages.pytorch-bin: 1.10.2 -> 1.11.0'. |
I was a little misunderstood. This includes both full build and binary derivations. |
cc @NixOS/cuda-maintainers This failure is due to the CUDA version being picked up from the Long story short, try the following in python-packages.nix: pytorch = callPackage ../development/python-modules/pytorch {
cudaSupport = pkgs.config.cudaSupport or false;
cudatoolkit = pkgs.cudatoolkit_11;
cudnn = pkgs.cudnn_8_3_cudatoolkit_11;
}; if that doesn't work, we'll need to figure out the latest version of cuDNN that pytorch supports compiling against. cuDNN v8.3.5 is currently the latest upstream and the latest packaged in nixpkgs. But there are other options as well. Don't hesitate to reach out with any questions! I guarantee that the CUDA/cuDNN versions you need are packaged... just a matter of figuring out what works. |
Source build is already on staging. 5446ad8 |
Ah I missed comments here - glad to see others are beating me to it (I'll turn this into a pytorch-bin only bump unless I'm too slow again :)) |
@mweinelt |
@junjihashimoto nixpkgs-unstable has been failing for the last 7 days, so we likely won't be able to see any builds until that is cleared up. Check out https://status.nixos.org for more info. |
deba706
to
b8c24e1
Compare
pkgs/development/python-modules/coqui-trainer/conqui-trainer.patch
Outdated
Show resolved
Hide resolved
Is |
My 2c is that I think it's best to avoid the complexity associated with multiple versions if possible, but I'm not a pytorch maintainer so I'll leave that to @junjihashimoto @teh @thoughtpolice and @tscholak. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Keeping multiple versions may just put off the problem.
We may need more than one version for another reason.
I'm not sure if we should manage the set of packages or follow conda or anything else.
I had a draft PR for torchaudio source that I never quite got to finishing up properly. #160210 with some test dependencies #160206 and #160197 It had a lot of failing tests that took crazy times to run, so it was a bit difficult to verify. (I don't use torchaudio myself either, and really don't want to be a maintainer). Anyway, if anyone is so inclined, feel free to reappropriate it |
I also tried it, but I couldn't complete many dependencies other than python. |
8137cb7
to
17d9e38
Compare
Squashed superfluous commits & rebased |
Result of 1 package failed to build:
8 packages built:
|
coqui-trainer isn't building on python310 but I don't view that as a blocker for merging this. Lots of packages still failing on python310.
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
let's use fetchpatch and then I think this PR is good to go!
pkgs/development/python-modules/coqui-trainer/conqui-trainer.patch
Outdated
Show resolved
Hide resolved
pkgs/development/python-modules/coqui-trainer/conqui-trainer.patch
Outdated
Show resolved
Hide resolved
Let's add a disable for python3.10 on coqui-trainer with a reference to coqui-ai/Trainer#22.
I agree with the remarks made by @samuela, patches need to go, fetchpatch is the way. |
17d9e38
to
163450e
Compare
Result of 8 packages built:
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ok LGTM! I'll go ahead and merge tomorrow unless anyone objects
Thanks for all the help! |
Great work, and thank you for your persistence with this @rehno-lindeque! It's not easy getting PRs through for some of these larger packages but it's absolutely crucial work! |
This pull request has been mentioned on NixOS Discourse. There might be relevant details there: https://discourse.nixos.org/t/nixpkgss-current-development-workflow-is-not-sustainable/18741/53 |
Description of changes
This is still a work in progress (needs testing, etc)
1.11 Release of PyTorch:
Nix changes:
TODO
CUDA appears to need updatingThings done
sandbox = true
set innix.conf
? (See Nix manual)nix-shell -p nixpkgs-review --run "nixpkgs-review rev HEAD"
. Note: all changes have to be committed, also see nixpkgs-review usage./result/bin/
)nixos/doc/manual/md-to-db.sh
to update generated release notes