A new diffusive_utils file without features designed only for refactored hydrofabric #710

kumdonoaa · 2023-12-07T18:26:41Z

The current diffusive_utils.py includes features utilizing refactored hydrofabric that was originally developed for removing short stream segments of NHDv2.0 hydrofabric of NWMv3.0. The removal was mainly for reducing routing compute time and possible numerical instability. The development of refactored hydrofabric wasn't completed and was stopped. Now the nextgen HYfeature hydrofabric replaces the refactored hydrofabric, so features related to the refactored hydrofabric aren't needed any more. So the cleaned diffusive_utils_v02.py is created to be used for both NHDv2.0 hydrofabric and HYfeature hydrofabric. More features to come in this file for HYfeature pretty soon.

Additions

Removals

Changes

Testing

Screenshots

Notes

Todos

Checklist

Testing checklist

Target Environment support

Windows
Linux
Browser

Accessibility

Keyboard friendly
Screen reader friendly

Other

…drofabric found in diffusive_utils.py

shorvath-noaa

This ran fine for me. While you're updating this file, do you think we should fix this warning message?:

t-route/src/troute-routing/troute/routing/diffusive_utils_v02.py:539: PerformanceWarning: DataFrame is highly fragmented. This is usually the result of calling "frame.insert" many times, which has poor performance. Consider joining all columns at once using pd.concat(axis=1) instead. To get a de-fragmented frame, use "newframe = frame.copy()" usgs_df_complete.insert(i, timestamps[i], -4444.0*np.ones(len(usgs_df)), allow_duplicates=False)

kumdonoaa · 2023-12-07T19:52:09Z

Yes. Sean. please take a try if you have time after this PR is merged, I think.

AminTorabi-NOAA · 2023-12-07T19:54:39Z

I have a fix for it: In line 536 instead of for loop replace it with this

missing_timestamps = [ts for ts in timestamps if ts not in usgs_df.columns]
    
        missing_data = pd.DataFrame(-4444.0*np.ones((len(usgs_df), len(missing_timestamps))), 
                                    columns=missing_timestamps, 
                                    index=usgs_df.index)
usgs_df_complete = pd.concat([usgs_df_complete, missing_data], axis=1)`

shorvath-noaa · 2023-12-07T20:08:50Z

@kumdonoaa , @AminTorabi-NOAA 's fix looks good. Might want to add another line after to make sure columns are in the correct order. Something like:
usgs_df_complete = usgs_df_complete[timestamps]

AminTorabi-NOAA · 2023-12-07T20:09:15Z

src/troute-routing/troute/routing/diffusive_utils_v02.py

+
+        usgs_df_complete = usgs_df.replace(np.nan, -4444.0)
+
+        for i in range(len(timestamps)):


Instead of for loop because we keep adding column to dataframe it slow it down and give warning. This should solve the issue

missing_timestamps = [ts for ts in timestamps if ts not in usgs_df.columns] missing_data = pd.DataFrame(-4444.0*np.ones((len(usgs_df), len(missing_timestamps))), columns=missing_timestamps, index=usgs_df.index) usgs_df_complete = pd.concat([usgs_df_complete, missing_data], axis=1)

@AminTorabi-NOAA @shorvath-noaa Do these lines replace existing lines 537~539?:
https://github.com/kumdonoaa/t-route/blob/4c56cc615b49f6cc75ede3c6b8879689940bee9a/src/troute-routing/troute/routing/diffusive_utils_v02.py#L537-L539

Yes. As Sean suggested you can add usgs_df_complete = usgs_df_complete[timestamps] at the end of those lines too

…eature only with syn x-sec for now

JurgenZach-NOAA

I tested it, and it works.

a new diffusive input preprocessor without features for refactored hy…

4c56cc6

…drofabric found in diffusive_utils.py

kumdonoaa requested review from shorvath-noaa, AminTorabi-NOAA and JurgenZach-NOAA December 7, 2023 18:27

shorvath-noaa reviewed Dec 7, 2023

View reviewed changes

AminTorabi-NOAA reviewed Dec 7, 2023

View reviewed changes

temporary fix to qlats to run both NHD with syn/natural x-sec and HYf…

ff6f3f5

…eature only with syn x-sec for now

AminTorabi-NOAA approved these changes Dec 8, 2023

View reviewed changes

JurgenZach-NOAA approved these changes Dec 8, 2023

View reviewed changes

kumdonoaa merged commit 07d511b into NOAA-OWP:master Dec 8, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

A new diffusive_utils file without features designed only for refactored hydrofabric #710

A new diffusive_utils file without features designed only for refactored hydrofabric #710

kumdonoaa commented Dec 7, 2023

shorvath-noaa left a comment

kumdonoaa commented Dec 7, 2023 •

edited

Loading

AminTorabi-NOAA commented Dec 7, 2023 •

edited

Loading

shorvath-noaa commented Dec 7, 2023

AminTorabi-NOAA Dec 7, 2023

kumdonoaa Dec 8, 2023 •

edited

Loading

AminTorabi-NOAA Dec 8, 2023

JurgenZach-NOAA left a comment


		usgs_df_complete = usgs_df.replace(np.nan, -4444.0)

		for i in range(len(timestamps)):

A new diffusive_utils file without features designed only for refactored hydrofabric #710

A new diffusive_utils file without features designed only for refactored hydrofabric #710

Conversation

kumdonoaa commented Dec 7, 2023

Additions

Removals

Changes

Testing

Screenshots

Notes

Todos

Checklist

Testing checklist

Target Environment support

Accessibility

Other

shorvath-noaa left a comment

Choose a reason for hiding this comment

kumdonoaa commented Dec 7, 2023 • edited Loading

AminTorabi-NOAA commented Dec 7, 2023 • edited Loading

shorvath-noaa commented Dec 7, 2023

AminTorabi-NOAA Dec 7, 2023

Choose a reason for hiding this comment

kumdonoaa Dec 8, 2023 • edited Loading

Choose a reason for hiding this comment

AminTorabi-NOAA Dec 8, 2023

Choose a reason for hiding this comment

JurgenZach-NOAA left a comment

Choose a reason for hiding this comment

kumdonoaa commented Dec 7, 2023 •

edited

Loading

AminTorabi-NOAA commented Dec 7, 2023 •

edited

Loading

kumdonoaa Dec 8, 2023 •

edited

Loading