Implement Replace and Route as Step Function with Lambdas #508

shawncrawley · 2023-08-16T22:26:45Z

This contains all of the changes necessary to run replace and route as a step function with a custom lambda.

New AWS Resources

hv-vpp-${var.environment}-execute-replace-route Step Function
replace_route_${var.environment} Lambda Function

New Database Items

wrds_rfcfcst schema (populated by wrds_rfcfcst foreign data server pointing to rfcfcst schema of ingest db)
rnr schema
rnr.nwm_crosswalk table (manually created by code found in a file committed herein: Core\LAMBDA\replace_route\lambda_content\sql\dba_stuff.sql)
rnr.nwm_routelink table (manually created by code found in both a Jupyter notebook [manual ingest of the RouteLink.nc file] and a file committed herein: Core\LAMBDA\replace_route\lambda_content\sql\dba_stuff.sql [for db table optimization])
rnr.nwm_lakeparm table (manually created by code found in both a Jupyter notebook and a file committed herein (Core\LAMBDA\replace_ro
rnr.staggered_curves table (manually created by code found in a file committed herein: Core\LAMBDA\replace_route\lambda_content\sql\dba_stuff.sql)

The following schemas (followed by the dumped tables in parentheses) have been dumped and uploaded to s3://hydrovis-ti-deployment-us-east-1/viz_db_dumps/:

wrds_rfcfcst (schema only, since the foreign db connection will be set up against this schema on deploy)
rnr (nwm_routelink, nwm_lakeparm, nwm_crosswalk)

The rnr.nwm_routelink and rnr.nwm_lakeparm tables should be manually updated anytime we update the underlying version of WRF-Hydro. The rnr.nwm_crosswalk and rnr.staggered_curves tables should technically be updated every time the wrds_location database is updated. We'll need to make a TODO item for that.

A new target_cols property can now be specified on an entry to the "ingest_files" section of the product_configs. This has been documented in the template.yml. The viz_initialize_pipeline lambda and viz_db_ingest lambdas were both updated to properly use this parameter. This was essential because replace and route requires that additional columns be present on the nwm_channel_rt_ana table. The common default columns that were previously hard-coded in the viz_db_ingest lambda are now used as defaults in viz_initialize_pipeline if the target_cols property is left blank.
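As a rough illustration of the defaulting behavior described above, here is a minimal Python sketch. It is not the code in viz_initialize_pipeline: the function name, the entry key "target_table", and the contents of DEFAULT_TARGET_COLS are assumptions made up for this example. Only the target_cols property and the "fall back to defaults when blank" rule come from the description.

```python
# Hypothetical defaults; the real list is whatever was previously
# hard-coded in the viz_db_ingest lambda.
DEFAULT_TARGET_COLS = ["feature_id", "streamflow"]


def resolve_target_cols(ingest_file):
    """Return the columns to ingest for one "ingest_files" entry.

    A missing, None, or empty target_cols is treated as "left blank"
    and falls back to the default columns.
    """
    cols = ingest_file.get("target_cols")
    return list(cols) if cols else list(DEFAULT_TARGET_COLS)


# An entry that needs extra columns (as replace and route does) lists them
# explicitly; an entry without target_cols gets the defaults.
rnr_entry = {
    "target_table": "ingest.nwm_channel_rt_ana",  # hypothetical key and value
    "target_cols": ["feature_id", "streamflow", "velocity"],
}
plain_entry = {"target_table": "ingest.some_other_table"}

print(resolve_target_cols(rnr_entry))
print(resolve_target_cols(plain_entry))
```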

At a high level, here is the workflow for the hv-vpp-${var.environment}-execute-replace-route Step Function:

All steps beginning with "Create Domain" call the replace_route_${var.environment} Lambda Function. When that function is called with "step": "domain", SQL is executed to create the following dynamic, domain-specific tables in the rnr schema: temporal_domain_flow_forecasts, domain_forecasts, domain_routelink, and domain_lakeparm. Within the Map Function, the various dynamic WRF-Hydro input files are created by running the appropriate baseline SQL query (largely referencing one of the dynamic domain-specific SQL tables mentioned above), then converting the query results to a Pandas DataFrame, then to an xarray object, and finally to a NetCDF file. These files are then uploaded to an S3 bucket. WRF-Hydro is then kicked off from the step function. This function was modified to pull the domain-specific files from S3 and then kick off WRF-Hydro in an otherwise normal fashion. Once complete, a signal is sent back to the step function to proceed, at which point the Initialize Pipeline function is called, which will kick off the "replace_route" configuration for viz processing.
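The dispatch on the "step" value can be pictured with a small Python sketch. This is an assumption-laden outline, not the committed lambda: the helper names are invented, the helpers are placeholders that do no database or S3 work, and any step value other than "domain" is simply treated here as a request for one input file.

```python
# Table names taken from the description above.
DOMAIN_TABLES = [
    "temporal_domain_flow_forecasts",
    "domain_forecasts",
    "domain_routelink",
    "domain_lakeparm",
]


def create_domain_tables(event):
    # Placeholder: the real step executes SQL that builds these rnr tables.
    return {"step": "domain", "tables": list(DOMAIN_TABLES)}


def create_input_file(event):
    # Placeholder for the Map Function work: SQL query -> Pandas DataFrame
    # -> xarray object -> NetCDF file -> upload to S3.
    return {"step": event["step"], "status": "not implemented in this sketch"}


def lambda_handler(event, context):
    """Route one Step Function invocation based on the "step" key."""
    step = event.get("step")
    if step is None:
        raise ValueError('event must include a "step" key')
    if step == "domain":
        return create_domain_tables(event)
    # Hypothetical: every other step value names an input file to build.
    return create_input_file(event)
```

Calling lambda_handler({"step": "domain"}, None) returns the list of domain tables, while any other step value falls through to the input-file placeholder.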

Refs #339

shawncrawley · 2023-08-16T22:30:45Z

Before I forget - I didn't have time to add the EventBridge trigger to Terraform that kicks off the hv-vpp-${var.environment}-execute-replace-route Step Function every 15 minutes. If someone could do that for me, that'd be great. Otherwise, I'll get to it once I'm back on Monday.
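For reference, the schedule being asked for looks roughly like the following. The PR itself does this in Terraform; this Python sketch only illustrates the same two pieces (a 15-minute rule and a Step Function target) using the boto3 EventBridge client calls put_rule and put_targets. The function name, target Id, and all ARNs and rule names passed in are placeholders, not values from this repository.

```python
def schedule_step_function(events_client, rule_name, state_machine_arn, role_arn):
    """Create a 15-minute EventBridge rule that starts a Step Function.

    events_client is expected to behave like boto3.client("events").
    """
    # Rule that fires every 15 minutes.
    events_client.put_rule(
        Name=rule_name,
        ScheduleExpression="rate(15 minutes)",
        State="ENABLED",
    )
    # Point the rule at the state machine; the role must be allowed to
    # call states:StartExecution on it.
    events_client.put_targets(
        Rule=rule_name,
        Targets=[
            {
                "Id": "execute-replace-route",  # placeholder target id
                "Arn": state_machine_arn,
                "RoleArn": role_arn,
            }
        ],
    )
```

With real credentials this would be called as schedule_step_function(boto3.client("events"), ...), passing the rule name, the state machine ARN, and an IAM role ARN.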

TylerSchrag-NOAA

Wow - This is impressive Shawn. Thank you for doing what I've only been able to speculate on for the last couple of years. This will be hugely helpful at diagnosing issues with these products.

One testing suggestion, if you haven't done this already: there are enough variables in play here that it would be good to do an apples-to-apples comparison of the output of your new workflow vs. the old... either before merging if you have it set up to do that, or after merging into TI, comparing to UAT.

Way to go!
Tyler

CoreyKrewson-NOAA · 2023-08-17T16:23:49Z

I added the EventBridge trigger for a 15-minute kickoff, and I also updated some of the folder/lambda names to align with the other functions.

Tyler, Shawn has been doing that as he went and has confirmed that everything is working the same (and even better in some locations).

shawncrawley added 4 commits August 16, 2023 15:08

fully implements rnr as step function and lambdas

f99d8d5

Refs #339

updates db schema name: wrds_rfcfcst

8e08421

Adds changes to owp-viz-replace-route code

b5c03b1

Add new target_cols documentation to template.yml

2f78481

shawncrawley and others added 2 commits August 16, 2023 21:48

Adds cmds in rds bastion RnR install

0d17628

added eventbridge to step function

6865ea8

updated eventbridge TF to separate it out

TylerSchrag-NOAA reviewed Aug 17, 2023

CoreyKrewson-NOAA added 3 commits August 17, 2023 11:05

Updated naming convention and folder structure to match other lambdas

9d4737e

Updated more folders for aligning naming convention

26d5ab9

Updated replace_route lambda to be more specific with rnr_domain_generator

added missing region variable

ab0906f

CoreyKrewson-NOAA approved these changes Aug 17, 2023

CoreyKrewson-NOAA merged commit 2f3cc00 into ti Aug 17, 2023

CoreyKrewson-NOAA added this to the V2.1.2 milestone Aug 23, 2023

shawncrawley deleted the rnr-overhaul branch August 24, 2023 13:43

shawncrawley restored the rnr-overhaul branch August 24, 2023 13:43

shawncrawley deleted the rnr-overhaul branch August 24, 2023 13:50

This was linked to issues Aug 30, 2023

Retrofit RnR Model for AWS Development #339

Closed

Configure RnR to run on the same schedule as ahps_max_stage (i.e. every 15 mins) #475

Closed
