Refactor implementation for (skipping) validation #2

danielhuppmann · 2021-05-29T06:19:34Z

Description of PR

While reviewing IAMconsortium#532, I noticed that there was still a duplication of effort - first computing the aggregate values, then appending, then (optionally, true by default) again computing the aggregate values for the consistency evaluation.

This PR implements a more efficient approach: first compute the aggregate values, split them into two groups (already existing vs. new data), (optionally) perform the validation on existing data, then append the new data.

To do this efficiently, I moved the compare feature into a separate submodule and made it compatible with receiving a pd.Series directly.

Also, I moved the test data for the recursive aggregation feature into conftest.py to remove duplicate code, and I added an explicit assertion that calling the recursive aggregation with inconsistent data raises an error.

danielhuppmann added 8 commits May 28, 2021 17:24

Switch order to ['left', 'right'] in returned object from compare()

364fe01

Move internal implementation of compare to own module

66cabcb

Save _data as pd.Series in swap_time_for_year()

c7d7168

Implement a once-through aggregate-and-validate method

d188e00

Move recursive-aggregation data to conftest.py

d43351d

Add validation that recursive aggregation fails if data is inconsistent

e3775bc

Fix the test of the compare function (changed order of cols)

b4f65f4

Fix calling the internal compare function

1d386fa

pjuergens merged commit f6464a8 into pjuergens:intermediate-aggregate Jun 10, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor implementation for (skipping) validation #2

Refactor implementation for (skipping) validation #2

danielhuppmann commented May 29, 2021

Refactor implementation for (skipping) validation #2

Refactor implementation for (skipping) validation #2

Conversation

danielhuppmann commented May 29, 2021

Description of PR