Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ocean/global_ocean/QU240/PHC/files_for_e3sm failing on Cori #335

Closed
xylar opened this issue Mar 25, 2022 · 0 comments · Fixed by #555
Closed

ocean/global_ocean/QU240/PHC/files_for_e3sm failing on Cori #335

xylar opened this issue Mar 25, 2022 · 0 comments · Fixed by #555
Assignees
Labels
bug Something isn't working ocean

Comments

@xylar
Copy link
Collaborator

xylar commented Mar 25, 2022

running: srun -n 18 /global/homes/x/xylar/miniconda3/envs/dev_compass_1.0.0/bin/ESMF_RegridWeightGen --source ./src_mesh.nc --destination ./dst_mesh.nc --weight map_QU240E2r1_to_0.5x0.5degree_bilinear.nc --method bilinear --netcdf4 --no_log --src_loc center --src_regional --ignore_unmapped
running: srun -n 18 /global/homes/x/xylar/miniconda3/envs/dev_compass_1.0.0/bin/ESMF_RegridWeightGen --source ./src_mesh.nc --destination ./dst_mesh.nc --weight map_QU240E2r1_to_6000.0x6000.0km_10.0km_Antarctic_stereo_bilinear.nc --method bilinear --netcdf4 --no_log --src_loc center --src_regional --dst_regional --ignore_unmapped
running: srun -n 18 /global/homes/x/xylar/miniconda3/envs/dev_compass_1.0.0/bin/ESMF_RegridWeightGen --source ./src_mesh.nc --destination ./dst_mesh.nc --weight map_QU240E2r1_to_6000.0x6000.0km_10.0km_Arctic_stereo_bilinear.nc --method bilinear --netcdf4 --no_log --src_loc center --src_regional --dst_regional --ignore_unmapped
srun: error: nid00229: task 17: Exited with exit code 1
srun: launch/slurm: _step_signal: Terminating StepId=56638155.27
slurmstepd: error: *** STEP 56638155.27 ON nid00226 CANCELLED AT 2022-03-25T02:52:22 ***
srun: error: nid00229: tasks 14-16: Terminated
srun: error: nid00226: tasks 1-2: Terminated
srun: error: nid00227: tasks 6-9: Terminated
srun: error: nid00228: tasks 10-13: Terminated
srun: Force Terminated StepId=56638155.27

It's odd that 2 calls to ESMF_RegridWeightGen are successful but the 3rd fails.

This is failing with both Intel and Gnu on Cori-Haswell, and with Intel on Cori-KNL.

@xylar xylar added bug Something isn't working ocean python package DEPRECATED: PRs and Issues involving the python package (master branch) labels Mar 25, 2022
@xylar xylar self-assigned this Mar 25, 2022
@xylar xylar changed the title ocean/global_ocean/QU240/PHC/files_for_e3sm failing on Cori-Haswell with Gnu ocean/global_ocean/QU240/PHC/files_for_e3sm failing on Cori-Haswell Mar 25, 2022
@xylar xylar changed the title ocean/global_ocean/QU240/PHC/files_for_e3sm failing on Cori-Haswell ocean/global_ocean/QU240/PHC/files_for_e3sm failing on Cori Mar 25, 2022
@xylar xylar removed the python package DEPRECATED: PRs and Issues involving the python package (master branch) label Apr 1, 2022
@xylar xylar mentioned this issue Mar 15, 2023
64 tasks
@xylar xylar closed this as completed in #555 Apr 7, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working ocean
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant