GFS v16 GSI minimization issues #154

RussTreadon-NOAA · 2021-04-30T13:42:53Z

The operational MinMon package occasionally detects and reports problematic GFS v16 GSI minimizations. One such case was the operational 2021041718 gfs cycle (prod curve in figure below). Investigation of this case found that placing a lower bound of 1.e-07 on the saturation specific humidity computed in genqsat.f90 yielded a much smoother minimization (master_qmin in figure below).

Constant qmin is defined in constants.f90 as

real(r_kind),parameter:: qmin = 1.e-07_r_kind ! lower bound on ges_q

The corresponding change in genqsat.f90 is

qsat(i,j,k) = max(qmin,qsat(i,j,k))

This issue is opened to document further evaluation of this change. Pending evaluation results this change will be proposed for merger into the master.

The text was updated successfully, but these errors were encountered:

RussTreadon-NOAA · 2021-04-30T18:54:01Z

GSI tests for normal minimization cases

Compile release/gfsda.v16.1.0 twice. The is the release branch for GFS v16.1. The first compilation did not include the qmin addition to genqsat.f90 described above. The second compilation includes the qmin addition to genqsat.f90.

Curves labeled cntrl on the plots below pertain to global_gsi.x executable build from release/gfsa.v16.1.0 without any changes. Curves labeled qmin on the plots below pertain to the global_gsi.x executable with the modified genqsat.f90.

The GSI was run for 4 cycles using a stand-alone run script. These runs are not cycled. Each run is independent. Input for each run was taken from the cycled GFS v16.1 real-time parallel. The cycles run were 2021042912 to 2021043006. None of these cycles were by flagged by MinMon as having anomalous minimizations. Adding the qmin bound does not significantly alter the nature of the minimization found in the control as shown by the figures below.

Again, these four cycles were not cycled. The GSI was individually run for each cycle. Below is a collection over all four cycles of the (observation - analysis) fits for sonde temperature, winds, and humidity.

The similarity between the fits is not surprising given the similarity in the convergence curves. This test indicates that the qmin change does not alter the analysis significantly for normal minimization cases.

RussTreadon-NOAA · 2021-04-30T19:23:10Z

The qmin bound in genqsat.f90 has been added to PR #149. Done at a5d32a17.

RussTreadon-NOAA · 2021-05-03T13:30:47Z

2021041718 gfs case

Rerun global_gsi.x for 2021041718 gfs case with analysis fits added to gsistat. Do this for global_gsi.x built from master (cntrl) and global_gsi.x build from master with qmin change to genqsat.f90 (qmin). Plotted below are observation minus analysis fits for sonde temperature, winds, and RH. The fits are comparable between the two runs. Close inspection of the RMSE plots shows qmin to have slightly smaller rmse than cntrl at certain levels. Even smaller differences are found on the BIAS plot.

None of the plots included thus far in this issue are from cycled DA runs. All plots are based on independent executations of the global_gsi.x for a separate cases. It would be interesting to see the effect of the qmin change to genqsat.f90 in a cycled parallel.

RussTreadon-NOAA · 2021-05-10T15:08:15Z

2021050418 gfs case

The minimization for the operational 2021050418 gfs analysis exhibited a saw-tooth pattern on the second outer loop as shown in the figure below
.

NOAA-EMC/GSI PR #149 includes the qsat change which improved minimization in the 2021041718 gfs case. The PR #149 global_gsi.x was run for the 2021050418 gfs case. The qmin change did not improve minimization (see figure below).

Through trial and error it was found that turning off correlated error over sea surfaces yielded a smoother minimization for this case

Correlated error is applied to both IASI and CrIS. Additional runs of the PR #149 global_gsi.x show a smoother minimization curve when only turning off correlated error over sea points for CrIS

than for only turning off correlated error over sea points for IASI

The two runs indicate, though, that turning off correlated error over IASI sea points has a greater "smoothing" effect on the gnorm curve in the first outer loop than turning off correlated error over CrIS sea points.

Other cases, gfs or gdas, should be examined. It's possible minimizations for other cases are not sensitive to correlated error. A horizontal plot of the IASI and/or CrIS sea points for the 2021050418 case might prove enlightening. This issue only reports a sensitivity. Much work remains to understand the nature of this sensitivity.

RussTreadon-NOAA · 2021-05-10T19:45:33Z

Kristen suggested increasing kmult in global_anavinfo.l127.txt from 1.0 to 2.0. This was done for IASI and CrIS sea points and the 2021050418 case rerun. The resulting gnorm curve is included below

2021051000 gfs
To increase the sample size another case was examined - 2021051000 gfs. Here is the gnorm curve using the #PR 149 global_gsi.x with correlated error active for CrIS and IASI

Below is the gnorm curve for this case with kmult = 2.0 for CrIS and IASI sea points

Discussion between Kristen and John led to suggesting a run with 0 < kreq < 1. The case kreq=0.5 for CrIS and IASI sea points was run with the resulting gnorm curve below.

kmult=2.0 yields a smoother minimization on the first outer loop than kreq=0.5. Both settings yield comparably smooth second outer loop minimizations.

RussTreadon-NOAA · 2021-05-10T19:55:26Z

@jderber-NOAA , @KristenBathmann-NOAA , @dtkleist

Additional runs and cases have been made based on today's email exchange. Thank you for the kmult and kreq suggestions. It's not hard to find cases of saw tooth or "spiky" minimization in v16 gfs or gdas cycles. Click here for the operational MinMon page. I've been choosing gfs for my tests simply because the gfs cycle runs a bit faster (fewer iterations and no o-a stats) than the gdas cycle.

Files to run the 2021051000 gfs case are being rsync'd to Hera. Look in /scratch2/NCEPDEV/stmp1/Russ.Treadon/gfs/prod for enkfgdas, gdas, and gfs directories and files.

RussTreadon-NOAA · 2021-05-11T20:59:00Z

2021051100 gfs
The gnorm curve for this case has cyclic behavior in the second outer loop

Toggling correlated error parameters kmult or kreq for observations over sea points yields smoother minimizations on the second outer loop.

2021051012 gfs
The gnorm curve for this case is very choppy.

Toggling correlated error parameters do not reduce the choppiness of the gnorm curve (not shown). Even removing all IASI and CrIS data (and therefore no correlated error) does not alter the choppiness.

There's more to learn about cost function minimization in GFS v16.

FYI, both the 2021051012 and 2021051100 cases were run using global_gsi.x built from PR #149.

RussTreadon-NOAA · 2021-05-12T12:09:14Z

2021051012 gfs

Sensitivity tests found that placing ASCAT winds in monitor mode yields a smooth gnorm curve.

Placing METOP-2(A) ASCAT (report type 290, subtype 4) in monitor mode yields a smoother gnorm curve than placing METOP-1(B) ASCAT (report type 290, subtype 3) in monitor mode.

It would be interesting to dig deeper into this case to better understand why METOP-2(A) ASCAT impacts the minimization as it does.

One of the proposed changes for GFS v16.x is to turn on thinning for ASCAT winds. 100 km thinning was active in GFS v15. GFS v16.0 turned off this thinning. The 2021051012 gfs case was rerun with METOP-2(A) and METOP-1(B) ASCAT assimilated with two thinning grids: 50 and 100 km. The resulting gnorm curves are shown below

Both thinning grids yields comparably smooth gnorm curves.

RussTreadon-NOAA · 2021-05-12T13:01:58Z

Add Emily @emilyhcliu

jderber-NOAA · 2021-05-12T14:54:46Z

Russ,

I am looking into a different issue with the case you put on Hera. I noted a call to reset_predictors_var within the outer loop in setuprhsall. This changes the background error covariance within the outer loop and thus the relationship between xhatsave and yhatsave will no longer be consistent. I suppose this has been included to account for the change in numbers of channels and aircraft that pass the quality control in setuprhs, but I am afraid it is leading to issues. Also, what is done in reset_predictors_var is inconsistent with what is done in set_predictors_var. The first tests I have done (just removing the call to reset_predictors_var) showed a much smoother second outer iteration. Also, the final value for the gradient went from 41 to 10. I don't think just removing the call to reset_predictors_var is the necessarily the best solution (I am trying some other things now), but I think it shows that this may be an issue. Also, it may explain why we see the sawtooth pattern mostly in the second outer iteration.

RussTreadon-NOAA · 2021-05-12T15:16:58Z

Thanks, John, for the update. This sounds like a fruitful path to investigate. Perhaps making reset_predictors_var consistent with set_predictors_vars will help. You may be exploring this. I wonder if there's anything we do at the end of pcgsoi (e.g., in update_guess) which also adds inconsistency between outer loops.

jderber-NOAA · 2021-05-12T16:03:09Z

OK - I was wrong for this case. It turns out the background error does not change because the number of observations do not change enough to hit the criterion in reset_predictors_var from the first to second outer iteration. However, the routine does change the values specified in set_predictors_var and the improved convergence appears to be coming from using those original values in set_predictors_var. I think the inconsistency between xhatsave and yhatsave could occur, just not in this case. What is done in reset_predictors_var is very crude and does not seem to be as good as that in set_predictors_var. Wish I knew why this routine was included. Will look at set_predictors_var to see if we can make an improvement in these values so that reset_predictors_var is not needed.

jderber-NOAA · 2021-05-14T18:19:41Z

Russ,

I was finally able to get things where I was happy. The output is in /scratch2/NCEPDEV/stmp1/John.Derber/tmp766/ pr149_gfs_b6.2021051000, I don't know what my changes to the background error to the radiances and aircraft temperatures, but the convergence looks better to me.

Code changes are in /scratch1/NCEPDEV/da/John.Derber/jderber/src/gsi in routines berror, aircraftinfo, radinfo, and setuprhsall.

John

RussTreadon-NOAA · 2021-05-14T18:36:19Z

Thanks, John. I am cloning your corr branch on WCOSS_D and will try it in the cases of poor minimization.

jderber-NOAA · 2021-05-14T20:17:51Z

Thanks Russ. Hope it helps!JohnSent from my Verizon, Samsung Galaxy smartphone -------- Original message --------From: RussTreadon-NOAA ***@***.***> Date: 5/14/21 2:36 PM (GMT-05:00) To: NOAA-EMC/GSI ***@***.***> Cc: jderber-NOAA ***@***.***>, Mention ***@***.***> Subject: Re: [NOAA-EMC/GSI] GFS v16 GSI minimization issues (#154) Thanks, John. I am cloning your corr branch on WCOSS_D and will try it in the cases of poor minimization. —You are receiving this because you were mentioned.Reply to this email directly, view it on GitHub, or unsubscribe.

RussTreadon-NOAA · 2021-05-14T20:53:55Z

2021051000 gfs
Clone and build jderber-NOAA:corr on WCOSS_D. Rerun 2021051000 gfs case with resulting global_gsi.x. Shown below is the original gnorm curve on the left with the corr gnorm curve on the right.

The minimization is improved. Earlier sensitivity tests altering kmult or kreq indicate that tuning of correlated error parameters might also improve the minimization for this case. It is possible that updates to the correlated error fix files or formulation could also yield improvements in the minimization.

It would be interesting to test the corr branch for other cases and see its impact.

RussTreadon-NOAA · 2021-05-14T21:06:51Z

2021051318 gfs

The operational MinMon package flagged the operational 2021051318 gfs analysis with the following message:
Final gnorm gross check failure: suffix = GFS, cycle = 2021051318, final gnorm = 0.00130159302389528 File source for report is: /gpfs/dell1/nco/ops/com/gfs/prod/gfs.20210513/18/atmos/gfs.t18z.gsistat

Below is the MinMon gnorm curve for this case

Through a series of sensitivity tests it was found that placing npp cris-fsr channel 1008 in monitor mode results in a much smoother gnorm curve (right). The choppy gnorm curve (left) was generated by running PR #149 global_gsi.x for this case with all data, including npp cris-fsr channel 1008, assimilated. The same executable was run to generate the curve on the right.

CrIS fsr channel 1008 is a water vapor channel. It would be good to dig deeper into this case to better understand why assimilation of npp cris-fsr channel 1008 has this impact on the minimization.

RussTreadon-NOAA · 2021-05-15T11:18:11Z

Summary (thus far)

The following cases have been examined and sensitivities identified.

case	behavior	sensitivity / action
2021041718 gfs	choppy gnorm on second outer loop	placing lower bound of 1.0e-7 on qsat in `genqsat.f90` smooths gnorm curve. This change is included in PR #149
2021050418 gfs	sawtooth gnorm on first and second outer loop. higher frequency sawtooth pattern on second outer loop	gnorm curve smoothed by (1) turning off correlated error over sea points (larger impact for CrIS than IASI). (2) setting correlated error parameter `kmult = 2` for sea points in anavinfo
2021051000 gfs	sawtooth gnorm on first and second outer loop	(1) `kmult = 2` or `kreq = 0.5` smooth gnorm curve with greater improvement from `kmult = 2`, (2) jderber-NOAA:corr includes source code changes which smooths gnorm curve without adjusting `kmult` or `kreq`
2021051012 gfs	choppy gnorm on first and second outer loop, sawtooth pattern evident on second outer loop	thinning ASCAT winds to 50 or 100 km via settings in convinfo smooths gnorm curve on both loops
2021051100 gfs	cyclic gnorm increase / decrease on second outer loop	setting (1) `kmult = 2` or `kreq = 0.5` smooth gnorm curve with greater improvement from `kmult = 2`, (2) jderber-NOAA:corr does not remove cyclic gnorm behavior for this case
2021051318 gfs	choppy gnorm on second outer loop	placing NPP CrIS FSR channel 1008 (water vapor channel) in monitor mode smooths gnorm curve on second outer loop

Note: gfs (early dump) cases have been examined thus far due to quicker turn around for stand alone gsi run script. The gfs gsi has 50 fewer iterations on the second outer loop than the gdas (late dump) gsi.

jderber-NOAA · 2021-05-16T17:35:37Z

Russ,

I have been trying various values of kreq on the case I have on Hera. I think .5 is too large and I was trying some smaller values. I was hoping that .1 (a small value that does not change the variance too much), but I am not seeing much improvement. In fact it looks a little worse than 0.. However, somewhat smaller values (.15 and .25) seem to be a bit better. I think .25 might still be a bit too large. I really think kmult = 2. is way too large, eventhough it produces smoother results. kmult will increase the errors assigned to the radiances by a lot and I am afraid this will change things too much.

John

KristenBathmann · 2021-05-17T13:55:29Z

What about testing the sensitivity to a group of channels, for example only applying kreq or kmult to the water vapor channels? The GSI code doesn't allow this, but I can do it offline.

RussTreadon-NOAA · 2021-05-17T21:06:47Z

2021051318 gfs update
Comment out all calls to upd_positive_fldr3 in PR #149 update_guess.f90. Recompile and rerun 2021051318 gfs case with all operational data assimilated and without any changes to info files. Resulting gnorm curve below

John pointed out the error in the above test. By commenting out update_postive_fldr3, the increment was NOT added to the background. This is wrong. When the increment is added in update_guess without applying the lower bound imposed by upd_positve_fldr3 the gnorm curve remains choppy on the second outer loop (see below).

jderber-NOAA · 2021-05-17T23:05:20Z

Russ,

Very nice. My test is still waiting on Hera.

I hope you replaced the calls with "ptr3dges = ptr3dges + ptr3dinc" for 3d fields ("ptr2dges = ptr2dges + ptr2dinc" for 2d). Otherwise the fields are not updated. This update is not necessary for the jacobian calculations and I cannot see any issues with replacing the calls. We might want to add something that limits the fields before writing out so that the model does not get negative values.

John

RussTreadon-NOAA · 2021-05-18T01:12:37Z

Russ,

Very nice. My test is still waiting on Hera.

I hope you replaced the calls with "ptr3dges = ptr3dges + ptr3dinc" for 3d fields ("ptr2dges = ptr2dges + ptr2dinc" for 2d). Otherwise the fields are not updated. This update is not necessary for the jacobian calculations and I cannot see any issues with replacing the calls. We might want to add something that limits the fields before writing out so that the model does not get negative values.

John

Touché, you caught my mistake. I simply commented out the call to upd_positive_ptr* without looking at the routines. I corrected my mistake as you indicated. Below is the gnorm curve for the 2021051318 case with all upd_positive_ptr* calls commented out AND the increment added to the guess.

The gnorm curve remains choppy on the second outer loop. Back to the drawing board ... at least for this case.

jderber-NOAA · 2021-05-18T13:03:58Z

When I removed the check on minimum values in both update_guess and compute derived I am getting a much smoother reduction in gradient in the second outer iteration for the case I have on Hera. This case was not too bad in the second outer iteration, so I cannot say how much it will help in other cases. The first outer iteration was identical, indicating the that qmin was being applied earlier in the code as well. This looks very promising.

jderber-NOAA · 2021-05-24T15:15:43Z

The third case Russ put on Hera (2021052006) does not appear to be moisture related. By changing preconditioning I have been able to get a significant change in the convergence properties, but still resulted in a magnitude of the gradient that was a bit too large at the end of the second outer iteration. I am looking into some changes that might help.

RussTreadon-NOAA · 2021-05-24T16:05:59Z

This sounds very promising. The operational 2021052318 gfs has a choppy gnorm curve on both outer loops. We may want to test your preconditioning changes on this case, too.

jderber-NOAA · 2021-05-26T18:57:31Z

Not much progress through the preconditioning route. Some improvement on the smoothness, but gradient still ends too high. Appears to be a problem related to the ensembles, but getting weird results (i.e., input parameters are changed and after read in they are set correctly, but then they change back to the original values.)

RussTreadon-NOAA · 2021-06-09T14:35:45Z

John opened issue #170 to document changes to bias correction variances. These changes improve convergences in certain cases and overall improve consistency between outer loops. Please see this issue for details regarding his changes.

RussTreadon-NOAA · 2021-06-09T22:16:13Z

2021060806 gfs

The operational MinMon package flagged the operational 2021060806 gfs analysis with the following message:
Final gnorm gross check failure: suffix = GFS, cycle = 2021060806, final gnorm = 0.000517371673207889 File source for report is: /gpfs/dell1/nco/ops/com/gfs/prod/gfs.20210608/06/atmos/gfs.t06z.gsistat

Below is the MinMon gnorm curve for this case

Through trial and error it was found that placing Metop-A and Metop-B IASI channels 2701 to 3310 (8 channels in all) in monitor mode results in a much smoother gnorm curve.

Tests placing eight IASI channels from 2701 to 3310 in monitor mode from a single satellite show Metop-A contributes more to the choppy gnorm curve than Metop-B for this case.

The above subset of monitored channels can be narrowed to {2993, 3002, 3049} and still yield a smoother gnorm curve than the original operational run.

What atmospheric constituents are these IASI channels sensitive to? What parts of the atmospheric column are these channels sensitive to? It would be interesting to generate horizontal maps to see where large innovations occur.

RussTreadon-NOAA · 2021-06-22T17:17:46Z

2021061918 gfs

While MinMon did not flag the operational gfs analysis for 2021061918, the gnorm curve is choppy for this cycle.

The NOAA-EMC/GSI master at a67b816 was built and used to run the 2021061918 case. The gnorm curve remains choppy with the master global_gsi.x.

Three tests were subsequently run:

thin ASCAT winds to 75 km
monitor Metop-A and Metop-B IASI water vapor channels 2889 to 5480 (10 channels for each satellite)
monitor CrIS-FSR N20 water vapor channels 882 to 1058 (7 channels). All CrIS-FSR NPP channels are monitored due to an instrument anomaly.

The resulting gnorm curves are shown below.

Thinning ASCAT data to 75 km does not yield a smoother gnorm curve for this case.

Monitoring Metop-A/B IASI water vapor channels does not impact the gnorm curve on the first outer loop but the gnorm curve on the second outer loop is much smoother.

Monitoring CrIS-FSR N20 water vapor channels impacts both the first and second outer loops.

The improvement in convergence from small changes (e.g. placing 7 channels in monitor mode) is considerable. These results and those for other cycles suggest that we should take a closer look at the assimilation of CrIS and IASI water vapor channels in v16. This is not meant to say that there is a problem with assimilation of these water vapor channels. The sensitivity of the gnorm curve to these channels may lead other aspects of the analysis (e.g., ensembles, correlated error, increased vertical resolution, ...).

jderber-NOAA · 2021-06-23T13:21:16Z

Russ, Your results would indicate to me that we should be using larger observation errors and/or not using these channels for the cloudy radiance assimilation. John

…

On Tue, Jun 22, 2021 at 1:18 PM RussTreadon-NOAA ***@***.***> wrote: *2021061918 gfs* While MinMon did not flag the operational gfs analysis for 2021061918, the gnorm curve is choppy for this cycle. [image: GFS_gfs 2021061918 gnorms] <https://user-images.githubusercontent.com/26926959/122967772-cf186f80-d358-11eb-80a8-6d7489d8484e.png> The NOAA-EMC/GSI master at a67b816 <a67b816> was built and used to run the 2021061918 case. The gnorm curve remains choppy with the master global_gsi.x. [image: gnorm_master all_data_2021061918] <https://user-images.githubusercontent.com/26926959/122969004-279c3c80-d35a-11eb-9125-0cb8f78b1f69.png> Three tests were subsequently run: 1. thin ASCAT winds to 75 km 2. monitor Metop-A and Metop-B IASI water vapor channels 2889 to 5480 (10 channels for each satellite) 3. monitor CrIS-FSR N20 water vapor channels 882 to 1058 (7 channels). All CrIS-FSR NPP channels are monitored due to an instrument anomaly. The resulting gnorm curves are shown below. [image: gnorm_master thin_ascat75_2021061918] <https://user-images.githubusercontent.com/26926959/122969019-2c60f080-d35a-11eb-9dc0-cc7aa6d50456.png> Thinning ASCAT data to 75 km does not yield a smoother gnorm curve for this case. [image: gnorm_master iasi_mon2889_5480_2021061918] <https://user-images.githubusercontent.com/26926959/122969086-426eb100-d35a-11eb-8068-76cf3a67d891.png> Monitoring Metop-A/B IASI water vapor channels does not impact the gnorm curve on the first outer loop but the gnorm curve on the second outer loop is much smoother. [image: gnorm_master crisn20_mon882_1058_2021061918] <https://user-images.githubusercontent.com/26926959/122969217-63370680-d35a-11eb-9c6c-e994632a8085.png> Monitoring CrIS-FSR N20 water vapor channels impacts both the first and second outer loops. The improvement in convergence from small changes (e.g. placing 7 channels in monitor mode) is considerable. These results and those for other cycles suggest that we should take a closer look at the assimilation of CrIS and IASI water vapor channels in v16. This is not meant to say that there is a problem with assimilation of these water vapor channels. The sensitivity of the gnorm curve to these channels may lead other aspects of the analysis (e.g., ensembles, correlated error, increased vertical resolution, ...). — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#154 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/ASD2M5TWDQOGKFZEBQRNTN3TUDAUTANCNFSM434IKOQQ> .

RussTreadon-NOAA · 2021-06-24T10:41:58Z

Reran 2021061918 gfs case with satinfo error changed for 7 assimilated CrIS-FSR N20 water vapor channels. No impact. Looking at the code I don't think the satinfo error is used when correlated error is on. Is this correct, Kristen?

Rerain 2021061918 gfs case with satinfo ermax changed for 7 assimilated CrIS-FSR N20 water vapor channels. Operations currently uses ermax = 0.900 for these water vapor channels. Reduce this to 0.450 with the resulting gnorm curve

ermax=0.450 yields a smoother gnorm curve on the first outer loop but is still spiky on the second outer loop.

Rerun with incrementally smaller ermax. Below are gnorm curves with ermax=0.420, 0.410, and 0.400 for the 7 assimilated CrIS-FSR N20 water vapor channels

The second outer loop gnorm curve is progressively smoother as ermax is decreased.

One final run was made with ermax=0.200 for the 7 assimilated water vapor channels

For previous cases we experimented with adjustments to correlated error parameters kreq and kmult in the anavinfo file. Sensitivity tests with adjustments to these parameters could be run for this case. However, adjustments to kreq and kmult affect all channels.

Earlier on in this issue Kristen mentioned the following channel specific option

What about testing the sensitivity to a group of channels, for example only applying kreq or kmult to the water vapor channels? The GSI code doesn't allow this, but I can do it offline.

Kristen, do we have the necessary code modifications in hand to run channel specific kreq and kmult sensitivity tests?

KristenBathmann · 2021-06-24T12:15:25Z

We use correlated error for CrIS over sea surfaces only, so non-sea surfaces continue to use the satinfo errors. Running channel specific tests with kreq and kmult is very instrument-dependent, so I can either supply separate covariance files for each option you want to test, or make some inelegant code changes to the GSI.

RussTreadon-NOAA · 2021-06-24T13:08:17Z

Rather than generate a bunch of Rcov files for cris-fsr_n20, I'd opt for the inelegant GSI code changes for exploratory testing. Which option do you prefer?

…

On Thu, Jun 24, 2021 at 8:15 AM Kristen Bathmann ***@***.***> wrote: We use correlated error for CrIS over sea surfaces only, so non-sea surfaces continue to use the satinfo errors. Running channel specific tests with kreq and kmult is very instrument-dependent, so I can either supply separate covariance files for each option you want to test, or make some inelegant code changes to the GSI. — You are receiving this because you were assigned. Reply to this email directly, view it on GitHub <#154 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AGNN634RO25HFLFXVDTE6LTTUMOWRANCNFSM434IKOQQ> .

KristenBathmann · 2021-06-24T14:41:58Z

Changing the GSI code would probably be the easiest option for testing. Are these tests are based on the GSI master, or on John's corr branch?

RussTreadon-NOAA · 2021-06-24T14:49:28Z

The 2021061918 tests use the current head of the master ( a67b816 <a67b816> )

…

On Thu, Jun 24, 2021 at 10:42 AM Kristen Bathmann ***@***.***> wrote: Changing the GSI code would probably be the easiest option for testing. Are these tests are based on the GSI master, or on John's corr branch? — You are receiving this because you were assigned. Reply to this email directly, view it on GitHub <#154 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AGNN634KPERP5AYBTBHD2XDTUM74BANCNFSM434IKOQQ> .

KristenBathmann · 2021-06-24T18:17:35Z

I created a branch called kreqtune to allow for separate values of kreq for surface, water vapor and other channels. This is an additive inflation value
Rcov(r,r)=(sqrt(Rcov(r,r)+kreq)^2.
The code is also on hera, here:
/scratch1/NCEPDEV/stmp4/Kristen.Bathmann/mintest/GSI
This update requires a change to the anavinfo file, and I regrettably had difficultly committing that change. If it is not there, two columns need to be added to the correlated obs table:
correlated_observations::
! isis method kreq kreq_surf kreq_wv kmult type cov_file
iasi_metop-a 2 0.0 0.0 0.0 1.0 sea Rcov_iasiasea

kreq_surf and kreq_wv are the inflations for the surface and water vapor channels. I also updated the Rcov_cris* covariance files to the versions I plan to use when IASI-C is implemented.
...

RussTreadon-NOAA · 2021-06-24T18:21:47Z

Rerun with correlated error turned off for CrIS-FSR N20.

Interesting cyclic behavior observed in gnorm curve.

Runs using John's biasvar and corr branches do not yield smoother gnorm curves.

CrIS-FSR N20 water vapor channels seem to be the key for this case. Rather than adjusting correlated error would it suffice to adjust satinfo parameter ermax? This parameter is satellite, sensor, and channel specific. Might comparison of CrIS-FSR N20 diagnostic files for ermax=0.9 (control) and ermax=0.4 (test) point to useful CrIS-FSR QC to add to qcmod?

KristenBathmann · 2021-06-24T19:49:38Z

It might be useful to look at the jacobains in the diag files, but this requires re-running with certain options turned on. I have a code somewhere that can plot jacobians. It would be sufficient to just look at the control file, and examine observations that have 3(O-G)>0.4.

RussTreadon-NOAA · 2021-06-24T20:44:40Z

Install Kristen's kreqtune on Venus and run 2021061918 case. Below is the resulting gnorm curve.

This run used updated Rcov files in the kreqtune branch. CrIS-FSR N20 kreq_surf=0.1 and kreq_wv=0.2. The gnorm curve is smoother on the first outer loop but remains choppy on the second loop.

Repeat this test with kreq_wv=0.5. The resulting gnorm cuve is below

Repeat this test with kreq_uv=1.0. The resulting gnorm curve is below:

KristenBathmann · 2021-06-25T15:28:10Z

Here are spaghetti plots of the temperature, humidity and ozone jacobians for CrIS channel 1058, from the operational gfs diag file at 2021061918:

The jacobians are definitely larger with errmax>0.4. The ozone jacobians look a bit odd.

KristenBathmann · 2021-06-25T15:34:18Z

To clarify on my last comment, plots with errmax>0.4 show jacobians of observations that would pass quality control if errmax is greater than 0.4. Plots with errmax<0.4 show jacobians of observations that would pass quality control if errmax is less than 0.4.

RussTreadon-NOAA · 2021-07-05T11:14:23Z

2021070106 gfs

MinMon flagged the operational GFS 2021070106 gfs cycle with the following alert

 Final gnorm gross check failure:  suffix = GFS,  cycle = 2021070106, final gnorm = 0.000454794453772121   File source for report is:  /gpfs/dell1/nco/ops/com/gfs/prod/gfs.20210701/06/atmos/gfs.t06z.gsistat

 http://www.emc.ncep.noaa.gov/gmb/gdas/gsi_stat/index.html?src=GFS&typ=gnorm&cyc=2021070106

Below is the gnorm curve for this case

Replicated this gnorm behavior using NOAA-GSI/EMC master at 17cddf0.

Rerun this case using stand-alone GSI script testing various configurations. Find gnorm sensitivities to ASCAT thinning and correlated error tuning for CrIS and IASI water vapor channels. Use Kristen's kreqtune branch at 7a1b66a to explore gnorm sensitivity to CrIS and IASI water vapor error tuning. Best gnorm behavior obtained with the following configuration

thin ASCAT winds to 75 km
set kreq_wv = 0.2 for CrIS
set kreq_wv = 0.2 for IASI

The resulting gnorm curve for this configuration is shown below

RussTreadon-NOAA · 2021-07-20T19:13:39Z

2021070106 gfs (update)
Kristen found that removing the ensemble yields a smooth minimization.

Turning off assimilation of CrIS and IASI does not yield a smooth minimization.

Kristen's kreqtune branch with kreq_wv = 0.2 for CrIS (sea) and IASI (sea and land) yields the following gnorm curve with ensembles (ie, 4denvar) used.

RussTreadon-NOAA · 2021-07-26T21:58:53Z

2021072312 gfs

The operational MinMon package flagged 2021072312 gfs with the following alert

Final gnorm gross check failure: suffix = GFS, cycle = 2021072312, final gnorm = 0.00103280304291467 File source for report is: /gpfs/dell1/nco/ops/com/gfs/prod/gfs.20210723/12/atmos/gfs.t12z.gsistat

This is a case of poor convergence.

Various sensitivity tests have been run by Kristen. In all tests correlated error was turned off.

Turn off ensemble, turn off cris WV channels, no improvement.
Turning off IASI WV channels in addition to this showed considerable improvement, but the final gradient value was still larger than it should be.

Additional sensitivity tests have been run on Mars. The best convergence thus far has been obtained by not assimilating IASI and CrIS along with thinning ASCAT winds to 75 km.

Additional tests should be run.

jderber-NOAA · 2021-07-29T23:38:34Z

Some changes in minimization technique. In the right direction, but more improvement necessary

RussTreadon-NOAA · 2021-08-02T20:12:13Z

2021073018 gfs & gdas

Both the gfs and gdas cycles for 2021073018 were flagged by MinMon for minimization resets (4 gfs, 3 gdas). Below are the gnorm curves for each

Various sensitivity tests were run for the gfs case using a standalone GSI run script and the master at 3656d50.

Thinning ASCAT winds to 75 km by itself did not noticeably improve the minimization. This agrees with a check of the b terms from fort.220. The largest negative b occurs for the radiance term. Turning off or adjusting the amplitude of correlated error for CrIS and IASI did not improve the minimization. Turning off the moisture (and even all) constraints did not improve convergence. Not assimilating IASI data along with thinning ASCAT winds to 75 km yielded smoother gnorm curve.

The best convergence for 4denvar was obtained by thinning ASCAT winds to 75 km and setting the ensemble contribution to the q background error to zero.

Attempts to reduce the ensemble q perturbations above the tropopause did not improve convergence. Sensitivity to q perturbations may be in the lower troposphere where the L127 model has extra layers compared with L64.

RussTreadon-NOAA · 2021-08-04T16:45:08Z

2021073018 gfs

Additional tests were run in which the ensemble q perturbations were set to zero on specified ranges of model layers. The greatest impact on convergence (in a positive sense) was found when zeroing the ensemble q perturbations on layers 42 to 55.

A check of the atmanl.nc shows layers 42 to 55 approximately correspond to 669 to 455 hPa.

RussTreadon-NOAA · 2021-08-19T21:01:35Z

2021081906_gfs

MinMon flagged the 2021081906 gfs cycle with the following warning

Final gnorm gross check failure: suffix = GFS, cycle = 2021081906, final gnorm = 0.00084927489879786 File source for report is: /gpfs/dell1/nco/ops/com/gfs/prod/gfs.20210819/06/atmos/gfs.t06z.gsistat

Below is the gnorm curve.

No resets occurred in the minimization. This is a case of poor convergence.

global_gsi.x was built from the master at e306315 and run for this case. The master global_gsi.x exhibits similar choppy gnorm behavior with poor convergence.

Examination of fort.220 showed large negative b values for term 20 - winds. The master global_gsi.x was run with ASCAT winds thinned to 75 km. This improved convergence, though the odd cyclic behavior we've seen before is evident on the second outer loop.

Previous cases have shown sensitivity to CrIS and/or IASI water vapor channels. The master global_gsi.x was run with ASCAT winds thinned to 75 km and CrIS and IASI water vapor channels placed in monitor mode. The gnorm curve for this configuration does not exhibit cyclical behavior on the second outer loop.

Three more runs were made, each placing water vapor channels from only one sensor in monitor mode. The goal here was to see if the smoothness of the above gnorm curve correlates with a specific satellite.

Placing N20 CrIS or Metop-B IASI water vapor channels in monitor mode yields gnorm curves with cyclic behavior on the second outer loop. Placing Metop-A IASI water vapor channels in monitor mode yields the smoothest gnorm curve. Additional tests could be run to quantify the impact of specific Metop-A IASI water vapor channels on the gnorm curve.

RussTreadon-NOAA · 2021-11-19T19:17:11Z

20211118 06 & 18Z gfs

MinMon flagged the operational gfs analyses for insufficient convergence on 2021111818 in the 06, and 18Z cycles. Sensitivity tests using the master at f442020 were run for each cycle. The resulting gnorm curves are available as links in the table below. All rows except the first, gfs.v16.1.x, ran the master global_gsi.x with the indicated configuration. Configurations are explained below the table.

run	2021111806	2021111818
gfs.v16.1.5 (opr)	plot	plot
master	plot	plot
ascat75	plot	plot
mon_iasi_cris_wv	plot	plot
ascat75.mon_iasi_cris_wv	plot	plot
no_corerr	plot	plot
mon_iasi_cris_wv.no_corerr	plot	plot
ascat75.mon_iasi_cris_wv.no_corerr	plot	plot

The following configurations were tested

master - global_gsi.x built from f442020 with operational configuration
ascat75 - master with ASCAT winds thinned to 75 km and observation error increased to 2.5 m/s
mon_iasi_cris_wv - master with IASI and CrIS water vapor channels placed in monitor mode
ascat75.mon_iasi_cris_wv - combine ascat75 and mon_iasi_cris_wv
no_corerr - master with correlated error turned off for IASI and CrIS
mon_iasi_cris_wv.no_corerr - combine mon_iasi_cris_wv and no_corerr
ascat75.mon_iasi_cris_wv.no_corerr - combine ascat75, mon_iasi_cris_wv, and no_corerr

For both cases placing IASI and CrIS water vapor channels in monitor mode and turning off correlated error for the remaining assimilated IASI and CrIS channels yielded much smoother gnorm curves. Adding to this the thinning of ASCAT winds and increased ASCAT wind observation error yielded additional improvements.

Gnorm curves for sensitivity tests not mentioned above are found in

RussTreadon-NOAA · 2021-11-20T14:04:06Z

20211118 06 & 18Z gfs follow up

Run an additional tests toggling on/off correlated error for entries in global_anavinfo.l127.txt. Gnorm curves for various tests are tabulated below

run	2021111806	2021111818
no rcov_crisn20 (sea)	plot	plot
no rcov_iasib land	plot	plot
no rcov_iasic land	plot	plot
no rcov_iasib & c land	plot	plot
no rcov_iasib sea	plot	plot
no rcov_iasic sea	plot	plot
no rcov_iasib & c sea	plot	plot

Examination of the above plots shows that turning off correlated error for Metop-C IASI over sea has the greatest impact in terms of yielding a smoother gnorm curve.

For the 2021111818 case this change alone yields a very smooth gnorm curve.
For 2021111806 the gnorm curve is smoother but still exhibits cyclical behavior on the second outer loop.
For the 2021111806 case turning off correlated error for both Metop-B and Metop-C IASI over sea yields the smoothest gnorm curve.

Turning off correlated error for N20 CrIS does not yield as great an improvement in convergence as does turning off correlated error over sea for IASI.

Some interconnected questions:

Why does turning off correlated error over sea, especially for IASI, result in a smoother gnorm curve?
Are the above findings limited to the given cases (2021111806 and 2021111818) or is a similar, yet perhaps smaller, response found in other cases?
For 2021111818 turning off correlated error for Metop-C IASI over sea significantly improves convergence. Why?
Would tuning correlated error parameters in the anavinfo file (master or Kristen's extension) result in smoother convergence with correlated error still on?
Would correlated error benefit from regeneration of the Rcov files using a larger database?

Other ideas or thoughts?

KristenBathmann · 2021-11-21T22:07:43Z

(Try testing with the kreqtune branch and remember to add columns to the anavinfo. Use negative values for kreq* for CrIS NPP.)

RussTreadon-NOAA · 2021-11-22T13:57:42Z

2021112118 gdas and 2021112200 gfs

MinMon flagged the operational 2021112118_gdas and 2021112200_gfs with minimization resets. Given findings from the 20211118 gfs cases, the f442020 master global_gsi.x was run with the following configuration

ASCAT winds thinned to 75 km with 2.5 m/s observation error
Monitor IASI and CrIS water vapor channels
Turn off correlated error over sea points (e.g, do not use Rcovsea files)

The resulting 2021112118_gdas and 2021112200_gfs gnorm curves are much smoother with significantly improved reduction in the gradient norm.

Additional tests for the 2021112200_gfs case show that the combination of monitoring IASI and CrIS water vapor channels along with turning off correlated error over sea points contributes the most to smoothing the gnorm curve and improving convergence. Thinning ASCAT to 75 km and increasing the observation error improves convergence, but this is a secondary effect for this case. Click here to view gnorm curves for other tests run for 2021112200_gfs.

RussTreadon-NOAA · 2021-11-25T20:06:05Z

2021112506 gfs

MinMon flagged the operational 2021112506 gfs cycle for insufficient reduction.

Final gnorm gross check failure: suffix = GFS, cycle = 2021112506, final gnorm = 0.000768855949438719 File source for report is: /gpfs/dell1/nco/ops/com/gfs/prod/gfs.20211125/06/atmos/gfs.t06z.gsistat

Various sensitivity tests were run

thin ASCAT winds to 75 km & increase observation error to 2.5 m/s
monitor IASI & CrIS water vapor channels
combine 1 and 2
omit various Rcov (correlated error) files without 1 and 2
combine 4 with 2
combine 4 with 3

For this case the combination of thinning ASCAT winds to 75 km with an increased observation error along with monitoring IASI and CrIS water vapor channels (test 3) yields the greatest improvement in convergence with respect to operations.

Turning off correlated error over all land or sea points in test 3 improves convergence a bit. The overall best gnorm performance is obtained by combining test 3 with turning off correlated error off over sea points.

Gnorm curves for all tests run for this case are in https://www.emc.ncep.noaa.gov/gmb/wd20rt/plots/gnorm/gfs_2021112506/.

RussTreadon-NOAA · 2021-11-29T21:59:12Z

2021112906 gfs and gdas and 2021112912 gfs

MinMon flagged the 2021112906 gfs, 2021112906 gdas, and 2021112912 gfs cycles for minimization resets and insufficient convergence.

Sensitivity tests were run for the two gfs cycles. The greatest gnorm reduction was obtained with the following combinations:

2021112906 gfs: thin ASCAT to 75 km with 2.5 m/s observation error, monitor IASI and CrIS water vapor channels, turn off correlated error over sea points.
- Note that while the overall reduction in the gradient norm is improved the gnorm curve exhibits cyclical spiky behavior.
- Gnorm curves for other sensitivity tests run for this cycle are found in gfs_2021112906.
2021112912_gfs: thin ASCAT to 75 km with 2.5 m/s observation error, monitor IASI and CrIS water vapor channels, turn off correlated error over sea points.
- Simply monitoring IASI and CrIS water vapor channels significantly smooths the gnorm curve.
- Turning off correlated error for all IASI and CrIS data also yields a smooth gnorm curve with a greater gnorm reductions than retaining correlated error with IASI and CrIS water vapor channels monitored.
- Gnorm curves for other sensitivity tests are found in gfs_2021112912.

RussTreadon-NOAA · 2022-05-04T15:51:24Z

@jderber-NOAA has extensively investigated GSI minimizations issues. He has developed a set of changes which significantly improve the minimization. NOAA-EMC/GSI issue #375 has been opened to document these changes.

RussTreadon-NOAA self-assigned this Apr 30, 2021

jderber-NOAA mentioned this issue Sep 30, 2021

Improve minimization and fix bug in vqc #219

Closed

jderber-NOAA mentioned this issue May 4, 2022

Improvements to minimization #375

Closed

RussTreadon-NOAA closed this as completed Dec 6, 2023

GFS v16 GSI minimization issues #154

GFS v16 GSI minimization issues #154

Comments

RussTreadon-NOAA commented Apr 30, 2021

RussTreadon-NOAA commented Apr 30, 2021 • edited Loading

RussTreadon-NOAA commented Apr 30, 2021

RussTreadon-NOAA commented May 3, 2021

RussTreadon-NOAA commented May 10, 2021

RussTreadon-NOAA commented May 10, 2021

RussTreadon-NOAA commented May 10, 2021

RussTreadon-NOAA commented May 11, 2021

RussTreadon-NOAA commented May 12, 2021

RussTreadon-NOAA commented May 12, 2021

jderber-NOAA commented May 12, 2021

RussTreadon-NOAA commented May 12, 2021

jderber-NOAA commented May 12, 2021

jderber-NOAA commented May 14, 2021

RussTreadon-NOAA commented May 14, 2021

jderber-NOAA commented May 14, 2021 via email

RussTreadon-NOAA commented May 14, 2021

RussTreadon-NOAA commented May 14, 2021

RussTreadon-NOAA commented May 15, 2021

jderber-NOAA commented May 16, 2021

KristenBathmann commented May 17, 2021

RussTreadon-NOAA commented May 17, 2021 • edited Loading

jderber-NOAA commented May 17, 2021

RussTreadon-NOAA commented May 18, 2021

jderber-NOAA commented May 18, 2021

jderber-NOAA commented May 24, 2021

RussTreadon-NOAA commented May 24, 2021

jderber-NOAA commented May 26, 2021

RussTreadon-NOAA commented Jun 9, 2021

RussTreadon-NOAA commented Jun 9, 2021

RussTreadon-NOAA commented Jun 22, 2021

jderber-NOAA commented Jun 23, 2021 via email

RussTreadon-NOAA commented Jun 24, 2021

KristenBathmann commented Jun 24, 2021

RussTreadon-NOAA commented Jun 24, 2021 via email

KristenBathmann commented Jun 24, 2021

RussTreadon-NOAA commented Jun 24, 2021 via email

KristenBathmann commented Jun 24, 2021

RussTreadon-NOAA commented Jun 24, 2021

KristenBathmann commented Jun 24, 2021

RussTreadon-NOAA commented Jun 24, 2021

KristenBathmann commented Jun 25, 2021

KristenBathmann commented Jun 25, 2021

RussTreadon-NOAA commented Jul 5, 2021

RussTreadon-NOAA commented Jul 20, 2021

RussTreadon-NOAA commented Jul 26, 2021

jderber-NOAA commented Jul 29, 2021 • edited Loading

RussTreadon-NOAA commented Aug 2, 2021 • edited Loading

RussTreadon-NOAA commented Aug 4, 2021

RussTreadon-NOAA commented Aug 19, 2021 • edited Loading

RussTreadon-NOAA commented Nov 19, 2021

RussTreadon-NOAA commented Nov 20, 2021

KristenBathmann commented Nov 21, 2021

RussTreadon-NOAA commented Nov 22, 2021

RussTreadon-NOAA commented Nov 25, 2021

RussTreadon-NOAA commented Nov 29, 2021

RussTreadon-NOAA commented May 4, 2022

RussTreadon-NOAA commented Apr 30, 2021 •

edited

Loading

RussTreadon-NOAA commented May 17, 2021 •

edited

Loading

jderber-NOAA commented Jul 29, 2021 •

edited

Loading

RussTreadon-NOAA commented Aug 2, 2021 •

edited

Loading

RussTreadon-NOAA commented Aug 19, 2021 •

edited

Loading