-
Notifications
You must be signed in to change notification settings - Fork 383
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ne1024 F case run on Cori aborted from namelist_mod #3417
Comments
based on the stack trace, there might be a message for why the code called abort. can you check near the end of the atm.log and e3sm.log files? I suspect it's because the "theta-l" dycore has a new more rational way to set the timesteps, so the new defaults interfere with the namelist above, which is setting the old style splitting parameters. To keep using this old namelist, you might just need to add: "se_tstep=-1" |
@mt5555 |
@mt5555 |
@mt5555
However, the case still failed, with exactly the same trace stack and error message. |
can you send me the atm.log file? |
@mt5555 |
based on the log file, can you try adding all of these: se_tstep=-1 |
@mt5555 |
@dqwu this does not look like a bug, it looks like just a problem setting the timesteps in a yet pretty custom configuration. We are actively working on dialing in the configuration for ne1024 before putting the defaults into the CIME configuration, so this is very much a work in progress. For now, I suggest using @mt5555 suggestion above that seems like it worked for you. I am testing a more "out of the box" default right now, but I'd consider this issue closed since the "error" appears to be just erroneous custom settings. The code does not appear to be doing anything unexpected to me. |
@brhillman @mt5555 |
This failure occurred in less than 5 minutes after case.run started. The error stack trace is:
It has been confirmed that this issue is caused by PR #3368. In the conversation of that PR, a similar stack trace was reported from failed SMS_D_Ln5.ne4_ne4.FC5AV1C-L.anvil_intel.cam-cosplite.
Below are detailed steps to reproduce it on Cori. The job wall-time can be set to 10 minutes to reduce wait time.
PS, in previous successful ne1024 F case runs on Cori, "se_phys_tscale = 0" is set in user_nl_cam. This obsolete namelist variable has been removed by PR #3368 so it is no longer used in the steps below.
The text was updated successfully, but these errors were encountered: