
inplace refactor train_lv.py #167

Merged: 3 commits from refactor_train_lv into MESMER-group:main on Jul 15, 2022

Conversation

mathause (Member) commented:

This is a small refactoring of train_lv_AR1_sci and train_lv_find_localized_ecov before actually tackling #153. It helps me understand what happens & should make it slightly faster.

mathause marked this pull request as draft on June 22, 2022, 12:05
codecov-commenter commented Jun 22, 2022:

Codecov Report

Merging #167 (940e5b1) into master (0c5b265) will decrease coverage by 0.01%.
The diff coverage is 94.11%.

@@            Coverage Diff             @@
##           master     #167      +/-   ##
==========================================
- Coverage   79.23%   79.22%   -0.02%     
==========================================
  Files          30       30              
  Lines        1387     1386       -1     
==========================================
- Hits         1099     1098       -1     
  Misses        288      288              
Flag        Coverage Δ
unittests   79.22% <94.11%> (-0.02%) ⬇️

Flags with carried forward coverage won't be shown.

Impacted Files                        Coverage Δ
mesmer/calibrate_mesmer/train_lv.py   80.35% <94.11%> (-0.52%) ⬇️
mesmer/core/auto_regression.py        100.00% <0.00%> (ø)

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 0c5b265...940e5b1.

mathause (Member, Author) left a comment:

@leabeusch do you want to take a look at this? I have some questions for you anyway.

Comment on lines 256 to 258
# ATTENTION: STILL NEED TO CHECK IF THIS IS TRUE.
# THIS FORMULA IS DIFFERENT IN THE ESD PAPER! (coef not squared)
# But I am pretty sure that code is correct and the error is in the paper)
mathause (Member, Author) commented:

@leabeusch questions:

  1. Will I find the answer to this in Cressie and Wikle 2011?
  2. Given that this is the 'analytic formula for an AR(1) process', does this mean it is not possible to use an AR(2) process for the local variability (unless the formula is adapted)?
  3. Do I read this correctly: loc_ecov_AR1_innovs is the 'localized empirical spatial covariance matrix for the innovations of the AR(1) process'?
  4. Optional: Do you have a better name for c below? Maybe I'll also find something in the book.

I'm also pinging @yquilcaille, who may be interested in the answer to (2).

yquilcaille (Collaborator) replied:

  1. You can also check Humphrey and Gudmundsson 2019, equation (13). As far as I know, the square was forgotten in the ESD paper for MESMER; the equation in the code is correct.
  2. I discussed this with Lukas, and it seems there is no straightforward way to generalize this equation.
  3. Yes, I think you got it right.
  4. The term c actually scales the correlations, but calling it scale could be misleading; why not multip_ecov?

leabeusch (Collaborator) replied:

Sorry for taking quite some time to answer @mathause & thanks for getting these answers started @yquilcaille! I elaborate some more on them below:

  1. Oh no, I completely forgot to follow up on that! Good thing you're going through this refactoring @mathause and uncovering things I should have done months, maybe even years (?!) ago. 🙈 😅 As @yquilcaille correctly states: the squares got forgotten in the ESD paper & I should still publish a corrigendum for it (dealing with that has re-entered my TODO list now). I have the formula from Vincent (he used it for the reconstruction work that @yquilcaille also pointed to); I just wrote it down a bit more like code / less mathematically than he did. As far as I know, he originally found it in the Cressie & Wikle 2011 book. I only glanced at it quickly once & couldn't find it right away, but Lukas reassured me that it must be in there, so I kept the reference nevertheless. Maybe you have more luck checking than I did? Or you could ask Vincent directly? He's back at the institute, no? (If you find the exact equation inside the book, please let me know where!)
  2. Exactly. I'm pretty sure Vincent found a (considerably) more complicated analytical formula for AR(2) somewhere as well & he may have even used it for some work? But I'm not aware of clean analytical solutions for higher-order AR processes. Originally (i.e., including at the preprint stage of the first MESMER method paper: https://doi.org/10.5194/esd-2019-34), I allowed for up to an AR(4) process in the local variability of MESMER. & since I had no analytical formula for it, I just computed the covariance matrix of the innovations empirically after extracting the innovation terms from Equation 8 of the preprint (i.e., after extracting the ν). However, when I delved more into AR-process-targeted verification for the review, I realized that the AR terms are quite noisy (e.g., Fig. 10 of the final paper: https://doi.org/10.5194/esd-11-139-2020) & that I gain more by allowing just a single lag & having an analytical solution for it, as this considerably increased the localization radii I obtained during cross validation & thus allowed me to reproduce further-reaching spatial cross-correlations between grid points.
  3. Yes, sounds & looks right to me.
  4. What's your rationale for calling it c here? Generally, I have extremely negative feelings towards single-letter variables without meaning. 😅 I think what it is, is the grid-point-specific reduction in variance when going from the full residual local variability (i.e., what is called η in the ESD paper) to the innovations (i.e., the spatially (but not temporally) correlated noise term that is called ν in the ESD paper). I.e., the innovations have a smaller variability than the full residual local variability, because the latter also contains the variability induced by the AR memory terms (see the sketch after this list). -> maybe call it something along the lines of ar_induced_reduction_factor? But please feel free to come up with something much better if you can. I'm aware my suggestion isn't great either. ^^
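
For reference, the standard stationary AR(1) identity behind point 1: for η_t = φ η_{t-1} + ν_t, Var(ν) = (1 - φ^2) Var(η), i.e. the coefficient enters squared. Below is a minimal sketch of a covariance adjustment along these lines; the names adjust_ecov_ar1, ecov, and ar1_coefs are illustrative, not necessarily those used in train_lv.py, and the off-diagonal scaling is one plausible form that should be checked against Humphrey and Gudmundsson 2019, equation (13), as referenced above:

    import numpy as np

    def adjust_ecov_ar1(ecov, ar1_coefs):
        """Scale the empirical covariance of the full residual local
        variability (eta) down to the covariance of the AR(1)
        innovations (nu): Var(nu_s) = (1 - phi_s**2) * Var(eta_s).
        """
        reduction_factor = np.sqrt(1 - np.asarray(ar1_coefs) ** 2)
        # the outer product applies the per-grid-point factor to both
        # the rows and the columns of the covariance matrix
        return np.outer(reduction_factor, reduction_factor) * ecov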

mathause (Member, Author) replied:

  1. Thanks - I will update the comment.
  2. Ok, this is good to know - will also add a comment.
  3. Thanks.
  4. No good reason to call it c but I needed some name.

yquilcaille marked this pull request as ready for review on July 1, 2022, 14:34
yquilcaille (Collaborator) left a comment:

It looks ready for merge.

leabeusch (Collaborator) left a comment:

I hope this is actually helpful & not just an overwhelming amount of text. Let me know if you have other questions or would rather discuss it in person. & I won't hit the approve button yet, because I hope you'll still improve the naming of c & rewrite my ugly (although useful) inline reminder comment about the need to check whether the formula is accurate (i.e., maybe just write that it is wrong in the original paper & that a corrigendum should be published at some point?). 😉

Btw: I didn't actually run the code, but in trying to think through the way you rewrote the nested loop, it seems to me that it should produce the same thing as the original code. I assume our tests are good enough that they would have caught it if you (or better: we) had made a thinking error there, correct? 😅


@@ -306,31 +303,30 @@ def train_lv_find_localized_ecov(y, wgt_scen_eq, aux, cfg):

"""

return _find_localized_empirical_covariance(
leabeusch (Collaborator) commented:

What’s the rationale for introducing this private function here? Just making it easier for later refactoring by knowing which parts of aux & cfg are actually needed by this function?

mathause (Member, Author) replied:

Yes, the refactored function should eventually take 'simpler' input parameters. No compelling reason to do it here.
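
The idea, as a sketch with hypothetical argument names (only train_lv_find_localized_ecov and _find_localized_empirical_covariance come from the diff; the unpacked parameters are illustrative):

    def train_lv_find_localized_ecov(y, wgt_scen_eq, aux, cfg):
        # public entry point keeps the established signature and only
        # unpacks the container arguments (keys/attributes assumed here)
        return _find_localized_empirical_covariance(
            y, wgt_scen_eq, aux["phi_gc"], cfg.max_iter_cv
        )

    def _find_localized_empirical_covariance(y, wgt, localizer, max_iter_cv):
        # the private worker sees only the plain inputs it needs, which
        # makes the used parts of aux & cfg explicit for later refactoring
        ...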

    if llh_cv_sum[L] > llh_max:
        L_sel = L
        llh_max = llh_cv_sum[L]
        print("Newly selected L =", L_sel)
    else:
        print("Final selected L =", L_sel)
        idx_break = True
        break
leabeusch (Collaborator) commented:

Haha good to know there’s actually python functionality that does exactly what I wanted to do here. Fingers crossed, I’ll remember it next time I need it. ^^
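
The thread doesn't name the feature, but this is presumably Python's for/else clause, which makes the idx_break flag unnecessary: the else block runs only when the loop completes without hitting break. A minimal, self-contained sketch with made-up numbers:

    # hypothetical cross-validated log-likelihood sums per localization radius
    llh_cv_sum = {250: -10.0, 500: -8.0, 750: -9.0}

    L_sel = None
    llh_max = float("-inf")
    for L in sorted(llh_cv_sum):
        if llh_cv_sum[L] > llh_max:
            L_sel, llh_max = L, llh_cv_sum[L]
            print("Newly selected L =", L_sel)
        else:
            print("Final selected L =", L_sel)
            break  # no idx_break flag needed
    else:
        # only reached if the likelihood kept improving up to the largest L
        print("Final selected L =", L_sel)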

mathause (Member, Author) left a comment:

Thanks for the comments - I will update accordingly.


mathause (Member, Author) commented:

I extracted the adjustment of the empirical covariance matrix into its own function and tried to add the discussed comments to the docstring. I am not yet 100 % happy, but I suggest leaving it as is for the moment in order to move forward. Any objections?

mathause merged commit 723eab5 into MESMER-group:main on Jul 15, 2022
mathause deleted the refactor_train_lv branch on July 15, 2022, 07:17