@@ -801,10 +801,10 @@ cat << _EOF_ > $BCJOB
#SBATCH --output=$EXPDIR/$OUTDIR/logs/$BCNAME.log
#SBATCH --error=$EXPDIR/$OUTDIR/logs/$BCNAME.err
#SBATCH --account=$group
#SBATCH --time=03:00:00
#SBATCH --ntasks=28
#SBATCH --time=12:00:00
#SBATCH --nodes=1
biljanaorescanin
Author
Contributor
set NCPUS = 20
Does that make sense?
mathomp4
Mar 23, 2023
Member
@gmao-rreichle You can't do it there. That's before you are in SLURM. What you need to do is, at some point after every set of #SBATCH pragmas in a heredoc, do:
set NCPUS = $SLURM_CPUS_ON_NODE
because that variable is only set inside a SLURM allocation.
So pretty much somewhere after each of the file changes in this PR.
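A minimal sketch of what that could look like, assuming the job file is written through an unquoted heredoc as in the hunk above. In that case csh expands variables while the file is being written, so the `$` is escaped here to defer expansion until the job runs inside the allocation. The values of `BCJOB` and `group` are hypothetical stand-ins for whatever make_bcs sets elsewhere, and the `--nodes=1` line is only illustrative:

```csh
#!/bin/csh -f
# Hypothetical stand-ins for values that make_bcs defines elsewhere
set BCJOB = ./example_bcs.j
set group = my_account

# $group is expanded now, while the job file is written.
# \$SLURM_CPUS_ON_NODE is written out literally and only expanded
# later, when the job runs inside the SLURM allocation.
cat << _EOF_ > $BCJOB
#!/bin/csh -f
#SBATCH --account=$group
#SBATCH --nodes=1

set NCPUS = \$SLURM_CPUS_ON_NODE
_EOF_
```

Without the backslash, the generating script would try to expand `$SLURM_CPUS_ON_NODE` itself, outside any allocation, where the variable is not set.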
gmao-rreichle
Mar 23, 2023
Contributor
Thanks again, @mathomp4! Makes eminent sense, of course. I generally should refrain from coding and commenting after 5pm... my brain is fried.
@biljanaorescanin, @weiyuan-jiang: The important thing is to update the python version accordingly. Fixing this in the c-shell make_bcs is only so helpful.
@biljanaorescanin, are you sure this is what we're supposed to do? Here's an excerpt from the SI team's email:
"...be sure you ask for --ntasks= rather than nodes since the tasks per node on sky|cas varies (make sure any run scripts use the Slurm environment variable $SLURM_CPUS_ON_NODE rather than hardwiring as 36 or 45)."
Maybe we need to do something like:
#SBATCH --ntasks=$SLURM_CPUS_ON_NODE
Or would we automatically get ntasks=SLURM_CPUS_ON_NODE?
Also, what happened to the instruction from a year or so ago that suggested using fewer than the max number of CPUs on the newer (cas?) nodes?
I might well be misunderstanding the email from the SI team.
cc: @weiyuan-jiang