
Plea for more testing. #2521

Closed
jgfouca opened this issue May 1, 2018 · 10 comments

jgfouca commented May 1, 2018

Many basic aspects of CIME functionality are still untested. For example, the key concepts:

  1. Upon model failure, the user will have a message "Model fail, see $logfile" at the bottom of their output.
  2. Upon model build failure, the user will have a message "$component build failure, see $logfile" at the bottom of their output.

... are completely untested.
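
As an illustration, here is a minimal sketch of what an automated check for the first behavior could look like; the case directory, the run command, and the exact message format are placeholders, not CIME's actual interfaces:

```python
# Hypothetical sketch: assert that a failed model run ends its output with the
# expected "Model fail, see <logfile>" pointer. The case directory, the run
# command, and the message format are placeholders, not CIME's real interfaces.
import subprocess
import unittest


class TestFailureMessages(unittest.TestCase):
    def test_model_failure_points_at_log(self):
        # Submit a case that is known to fail (placeholder path and command).
        proc = subprocess.run(
            ["./case.submit"], cwd="/path/to/known-bad-case",
            capture_output=True, text=True,
        )
        self.assertNotEqual(proc.returncode, 0)
        # The last non-empty line of the output should point the user at a log.
        last_line = [l for l in proc.stdout.splitlines() if l.strip()][-1]
        self.assertRegex(last_line, r"Model fail, see .+")


if __name__ == "__main__":
    unittest.main()
```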

The barriers to testing CIME are significant:

  1. Two different models
  2. 20+ different machines
  3. How do I test that a feature works for cheyenne on melvin?
  4. Much of CIME's configuration is "hard wired", i.e., it lives in files in the repo, making it hard to test various XML configurations on the fly.
  5. How do I test that sandiatoss3 would work with X,Y,Z changes to config_batch.xml on melvin?

Proposal:

  1. A new --test option to case.setup that will make it work on any --machine=X regardless of the underlying actual machine.
  2. A new "CIME VM/docker" that will allow us to load and test any arbitrary XML configurations and test them from any machine.
  3. Some tracking of coverage, at least of key features, maybe assisted by LOC (lines-of-code) coverage; a possible starting point is sketched below.
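
For proposal 3, a minimal sketch of measuring LOC coverage with coverage.py; the `run_regression_tests` entry point and the `CIME` package name are assumptions, not existing interfaces:

```python
# Sketch only: measure line (LOC) coverage of a CIME test run with coverage.py.
# The "run_regression_tests" entry point and the "CIME" package name are assumptions.
import coverage

cov = coverage.Coverage(source=["CIME"])  # restrict measurement to the CIME package
cov.start()

from scripts_regression_tests import run_regression_tests  # hypothetical entry point
run_regression_tests()

cov.stop()
cov.save()
cov.report(show_missing=True)  # print per-file line coverage, with uncovered lines
```

In practice this would more likely be driven from the command line (`coverage run` followed by `coverage report`), but the in-process form shows the idea.
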
@jedwards4b

  1. This should be straightforward; the only issue is looking for directories and filesystems that don't exist. I suppose you can create these relative to the test directory if the --test flag is used (a sketch of that idea follows this list).
  2. I'm not sure I understand the value of this or what additional testing it would allow
  3. https://github.com/marketplace/coveralls might be useful
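
A minimal sketch of the directory idea in item 1, assuming a hypothetical --test mode; the function name, the sandbox root, and the example path are illustrative only:

```python
# Sketch: when a hypothetical --test mode is active, remap machine-specific paths
# (scratch, input data, baselines, ...) under a sandbox rooted in the test
# directory and create them, so that later existence checks succeed.
import os


def resolve_path(configured_path, test_root=None):
    """Return the real path, or a sandboxed stand-in when running in test mode."""
    if test_root is None:
        return configured_path  # normal behavior: use the machine's real path
    sandboxed = os.path.join(test_root, configured_path.lstrip(os.sep))
    os.makedirs(sandboxed, exist_ok=True)  # create it so path checks pass
    return sandboxed


# Example: on melvin, stand in for another machine's scratch space.
scratch = resolve_path("/scratch/someuser", test_root="/tmp/cime_test_sandbox")
```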


jgfouca commented May 1, 2018

@jedwards4b, as for (2), it would allow us to test a specific config_machines/config_batch XML block without that block having to exist in cime/config/$model/machines.
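
A rough sketch of what loading and checking an arbitrary machine block could look like; the element names only approximate the shape of config_machines.xml, and the required-field list is an assumption:

```python
# Sketch: validate a candidate <machine> block without it having to live in
# cime/config/$model/machines. The element and attribute names only approximate
# the real config_machines.xml schema; the required-field list is an assumption.
import xml.etree.ElementTree as ET

candidate = """
<machine MACH="mymachine">
  <DESC>Hypothetical machine entry under test</DESC>
  <OS>LINUX</OS>
  <COMPILERS>gnu</COMPILERS>
  <MPILIBS>openmpi</MPILIBS>
</machine>
"""

REQUIRED = ["DESC", "OS", "COMPILERS", "MPILIBS"]  # assumed minimal set of children

node = ET.fromstring(candidate)
missing = [tag for tag in REQUIRED if node.find(tag) is None]
if missing:
    raise SystemExit(f"candidate machine block is missing: {missing}")
print(f"machine '{node.get('MACH')}' block looks structurally OK")
```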

jedwards4b added a commit that referenced this issue Jun 12, 2018
…etup

Allow users to go up through the setup phase on non-local machines
Also, remove a bit of unrelated dead code.

Test suite: scripts_regression_tests, by-hand
Test baseline:
Test namelist changes:
Test status: bit for bit

Addresses #2521

User interface changes?: Yes, new --non-local flag for create_test/create_newcase

Update gh-pages html (Y/N)?: N

Code review: @jedwards4b

ekluzek commented Sep 17, 2019

Has there been any work on the second proposal of this issue, 'A new "CIME VM/docker" that will allow us to load and test any arbitrary XML configurations and test them from any machine'? @rgknox


rljacob commented Sep 17, 2019

I'm not sure how (2) will do what @jgfouca proposed. A real virtual machine could do that, but containers (Docker) are not VMs.

A container that could run cime should be a short step from the existing SCAM container I heard about at the CESM workshop.


jgfouca commented Oct 23, 2019

We still have much work to do in this area. CIME has the classic symptom of a poorly-tested system: I'm always afraid to touch things for fear of breaking something, somewhere; i.e., I cannot rely on CIME's testing to tell me if I made a mistake.

We have some unique challenges to overcome in the testing of CIME:

  1. CIME's behavior is extremely coupled to the underlying system. The fact that CIME runs correctly on your current system is not a good indicator that it will run on other systems.
  2. It's not possible to run CIME for a target system that is different from your current system. Missing file paths and environment modules will break CIME.
  3. Full testing of CIME requires building and submitting lots of cases, which is very expensive. It currently takes about 2 hours to run scripts_regression_tests.
  4. CIME standalone testing does not necessarily catch mistakes that impact the full models (E3SM, CESM).
  5. Combinatorics. There are currently three very impactful configuration selections for running CIME: the system (e.g. summit or yellowstone), the model (e.g. E3SM vs CESM), and the driver (mct, nuopc). Full coverage of this combination space would require hundreds of runs of scripts_regression_tests across ~30 systems (a rough count is sketched below).
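
A rough count of the space described in item 5, using the ~30 systems and ~2 hours per run quoted in this thread and only the model/driver examples named above:

```python
# Rough count of the configuration space described above. The machine count and
# per-run time come from this thread (~30 systems, ~2 hours per
# scripts_regression_tests run); the model/driver lists are just the examples named.
from itertools import product

systems = [f"machine_{i:02d}" for i in range(30)]  # ~30 supported systems
models = ["E3SM", "CESM"]
drivers = ["mct", "nuopc"]

combos = list(product(systems, models, drivers))
print(len(combos))                       # 120 full scripts_regression_tests runs
print(len(combos) * 2, "machine-hours")  # ~240 hours at ~2 h per run
```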

Possible solutions:

  1. Dramatically increase the number of machines running nightly tests. We'd want to increase coverage of the combination space mentioned above as much as possible. One risk here is that we put a lot of effort into setting up all this testing and then we don't monitor it; we've had failures on our CIME dashboard for years. The advantage of this solution is that it would not require any new CIME development.
  2. Set up a "mock" testing environment for CIME. This mock environment would provide a virtual file system and possibly a virtual module system. For obvious reasons, we could not expect real builds and runs to work in the mock environment, but we could dump key outputs and diff them against a baseline to at least get fairly good confidence that CIME behavior is the same as it was before our change (a rough sketch of this idea follows this list). This will require significant CIME development to support.
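
A minimal sketch of the baseline-diff idea in item 2; the dumped settings, the dump step, and the file layout are all placeholders for whatever key outputs CIME would actually emit:

```python
# Sketch of the "mock environment + baseline diff" idea. Everything here is a
# placeholder: the dumped settings, the dump step, and the baseline layout stand
# in for whatever key outputs CIME would actually emit.
import difflib
import json
import tempfile
from pathlib import Path


def dump_case_settings(machine):
    """Placeholder "dump key outputs" step: pretend-resolved settings for a machine."""
    return {"machine": machine, "batch_system": "slurm", "mpi_lib": "openmpi"}


def diff_against_baseline(machine, baseline_file):
    current = json.dumps(dump_case_settings(machine), indent=2, sort_keys=True)
    baseline = Path(baseline_file).read_text()
    return list(difflib.unified_diff(
        baseline.splitlines(), current.splitlines(),
        fromfile="baseline", tofile="current", lineterm=""))


with tempfile.TemporaryDirectory() as mock_root:
    baseline = Path(mock_root) / "cheyenne.baseline.json"
    baseline.write_text(json.dumps(dump_case_settings("cheyenne"), indent=2, sort_keys=True))
    delta = diff_against_baseline("cheyenne", baseline)
    print("no behavior change" if not delta else "\n".join(delta))
```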

We discussed this on our 10/23/2019 telecon.

@jedwards4b

I do monitor the CDash dashboard and I am aware of the known failures on the CESM side. I have, however, stopped informing E3SM of failures on their systems and expect them to monitor as well.


jgfouca commented Oct 23, 2019

Yeah, that's on me. I have tolerated the failures on E3SM for a long time.


github-actions bot commented Aug 7, 2023

This issue is stale because it has been open 90 days with no activity. Remove stale label or comment or this will be closed in 5 days.

github-actions bot added the Stale label on Aug 7, 2023.
@github-actions

This issue was closed because it has been stalled for 5 days with no activity.

github-actions bot closed this as not planned on Aug 13, 2023.

jgfouca commented Aug 14, 2023

We've made good progress in this area so I'm letting this issue stay closed.
