Post-processing for H models #287

damonbayer · 2025-01-08T19:20:55Z

Unfortunately, this has become a bit of monster PR, but it will result in a lot more consistency, clarity, and adaptability in the project.

Done

Out of Scope

Dashboard expects E models only.
- Make dashboards work with arbitrary PyRenew-HEW models #311
Notions of "daily" vs "epiweekly" data are not necessarily correct throughout. Generally, "daily" means "unaggregated" and "epiweekly" means "aggregated."
- Therefore, the "epiweekly" hubverse table is more of an "aggregated" hubverse table, and there is not yet an "unaggregated" hubverse table.
- Revise workflow to omit incorrect "daily" and "epiweekly" language. #330
Some aspects of the batch post-processing should be re-worked as needed to accommodate some concept of a "preferred" forecast, which may be a mixture of different models for different locations. For now, we generate many disease_category_pointintervals plots, one for each model. We may wish to instead make a single plot for the "preferred" forecast.

Closes

Closes #308
Closes #296

codecov · 2025-01-08T19:22:13Z

Codecov Report

Attention: Patch coverage is 25.49505% with 301 lines in your changes missing coverage. Please review.

Project coverage is 24.47%. Comparing base (3a0a5bf) to head (792d3b1).
Report is 1 commits behind head on main.

Files with missing lines	Patch %	Lines
hewr/R/process_state_forecast.R	5.00%	209 Missing ⚠️
pipelines/collate_plots.py	0.00%	34 Missing ⚠️
hewr/R/make_forecast_figure.R	0.00%	25 Missing ⚠️
hewr/R/directory_utils.R	15.38%	22 Missing ⚠️
pipelines/postprocess_forecast_batches.py	0.00%	7 Missing ⚠️
pipelines/prep_data.py	0.00%	4 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #287      +/-   ##
==========================================
- Coverage   25.56%   24.47%   -1.10%     
==========================================
  Files          22       22              
  Lines        1682     1704      +22     
==========================================
- Hits          430      417      -13     
- Misses       1252     1287      +35

Flag	Coverage Δ
hewr	`41.68% <28.69%> (-7.94%)`	⬇️
pipelines	`4.67% <0.00%> (+0.32%)`	⬆️
pyrenew_hew	`27.96% <ø> (ø)`

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

commit 724d09b Author: Dylan H. Morris <dylanhmorris@users.noreply.github.com> Date: Sat Jan 18 17:31:57 2025 +0000 Add helper data `PyrenewHEWData` class to hold input data to `PyrenewHewModel`s (#283) commit bc76d59 Author: Subekshya Bidari <37636707+sbidari@users.noreply.github.com> Date: Fri Jan 17 14:18:11 2025 -0500 add subpopulation to LatentInfectionProcess (#282) commit a70e296 Author: Subekshya Bidari <37636707+sbidari@users.noreply.github.com> Date: Wed Jan 15 15:29:44 2025 -0500 add setup-r (#295)

dylanhmorris

Thanks for this, @damonbayer. All my comments are addressed. Let me know when remaining test issues are fixed.

dylanhmorris · 2025-02-07T19:13:37Z

Going to try an end-to-end run now.

dylanhmorris · 2025-02-07T20:23:50Z

Fitting and postprocessing run end to end! Thank you, @damonbayer.

Two things before merge:

Afaict everything looks as expected, except that the titles on the prop_disease_ed_visits plots are wrong. They say "Other Emergency Department Visits". Presumably a bug in the figure labeling logic somewhere. I think this is worth fixing before merge, as it should be a quick fix.
The mega hubverse table is big enough that I don't think it's going to be readily human-readable, and any submitted hubverse tables will be downsampled from it. Given that, worth saving it as parquet rather than .tsv?

…enew-hew into dmb_multi_postprocess

damonbayer · 2025-02-07T23:31:40Z

@dylanhmorris I've addressed both of your most recent comments.

pipelines/hubverse_score.R

dylanhmorris

LGTM. Thanks for all your hard work, @damonbayer!

damonbayer added 6 commits January 6, 2025 16:01

correct bug in eval data

d09eb90

epiweekly combined

3faae9f

fix object name

164788f

Merge branch 'main' into dmb_multi_postprocess

16831db

checkin

3f87477

checkin

8b74781

damonbayer and others added 23 commits January 8, 2025 20:01

add progress notes

85d0529

location code

fea6f24

make figures

17f84ee

Merge branch 'main' into dmb_multi_postprocess

e7d4d00

update fake data workflow

04323e6

fix bug in file name

0a86505

revise all model combos test

da9c968

update process_state_forecast

d831c99

update some functions

7e5ff70

add more processing to testing

2d881f6

start making forecast figure

aeef87f

mild cleanup

4ba21cf

cleaning up before pivoting

49bd883

start better columns

6f3afdc

remove old data write and simplify epiweekly comp

85b47c6

rework baseline forecasting

68a9380

save correct timeseries forecasts

af2a8df

data type not needed in ts forecasting

17e91be

process_state_forecast working

5e7a77c

fix bug in process_state_forecast

e6dcb00

updating hewr

aa8032c

hewr namespace

0df7fa2

damonbayer added 4 commits February 4, 2025 18:08

rename samples_forecast_data

1fe9c60

edit tests

81e0109

rework to_epiweekly_quantile_table

078b779

tidyeval cleanup

dd49446

damonbayer mentioned this pull request Feb 5, 2025

Separate plotting from process_state_forecast #325

Open

damonbayer added 4 commits February 5, 2025 14:31

last_training_date -> first_forecast_date

b3d348e

comment on create_more_model_test_data

955d136

separate functionality in process_state_forecast

dec6999

multiple variables in hubverse table

7dc90f4

dylanhmorris reviewed Feb 6, 2025

View reviewed changes

damonbayer added 2 commits February 6, 2025 15:13

work around for pyrenew-h only

d783a82

Merge branch 'main' into dmb_multi_postprocess

8eec226

damonbayer marked this pull request as ready for review February 7, 2025 15:10

damonbayer requested a review from sbidari as a code owner February 7, 2025 15:10

Merge branch 'main' into dmb_multi_postprocess

589085e

damonbayer changed the title ~~(DRAFT) Post-processing for H models~~ Post-processing for H models Feb 7, 2025

damonbayer and others added 5 commits February 7, 2025 16:16

fix plot titles

761ce69

Merge branch 'dmb_multi_postprocess' of https://github.com/CDCgov/pyr…

b564498

…enew-hew into dmb_multi_postprocess

Merge branch 'main' into dmb_multi_postprocess

207a717

parquet hubverse

06c5587

Merge branch 'dmb_multi_postprocess' of https://github.com/CDCgov/pyr…

e901012

…enew-hew into dmb_multi_postprocess

dylanhmorris reviewed Feb 7, 2025

View reviewed changes

pipelines/hubverse_score.R Outdated Show resolved Hide resolved

Update pipelines/hubverse_score.R

792d3b1

dylanhmorris approved these changes Feb 7, 2025

View reviewed changes

dylanhmorris enabled auto-merge (squash) February 7, 2025 23:52

dylanhmorris merged commit 216a81b into main Feb 7, 2025
14 of 15 checks passed

dylanhmorris deleted the dmb_multi_postprocess branch February 7, 2025 23:52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Post-processing for H models #287

Post-processing for H models #287

damonbayer commented Jan 8, 2025 •

edited

Loading

codecov bot commented Jan 8, 2025 •

edited

Loading

dylanhmorris left a comment

dylanhmorris commented Feb 7, 2025

dylanhmorris commented Feb 7, 2025

damonbayer commented Feb 7, 2025

dylanhmorris left a comment

Post-processing for H models #287

Post-processing for H models #287

Conversation

damonbayer commented Jan 8, 2025 • edited Loading

Done

Out of Scope

Closes

codecov bot commented Jan 8, 2025 • edited Loading

Codecov Report

dylanhmorris left a comment

Choose a reason for hiding this comment

dylanhmorris commented Feb 7, 2025

dylanhmorris commented Feb 7, 2025

damonbayer commented Feb 7, 2025

dylanhmorris left a comment

Choose a reason for hiding this comment

damonbayer commented Jan 8, 2025 •

edited

Loading

codecov bot commented Jan 8, 2025 •

edited

Loading