Checking for non-preferred file/folder path names (may take a long time depending on the number of files/folders) ...
This resource contains some files/folders that have non-preferred characters in their name. Show non-conforming files/folders.
This resource contains content types with files that need to be updated to match with metadata changes. Show content type files that need updating.
Harnessing Backcasting to Identify Drivers of Critical Warming at Hoover Dam Using Hydrodynamic and Machine Learning Models
| Authors: |
|
|
|---|---|---|
| Owners: |
|
This resource does not have an owner who is an active HydroShare user. Contact CUAHSI (help@cuahsi.org) for information on this resource. |
| Type: | Resource | |
| Storage: | The size of this resource is 216.7 MB | |
| Created: | May 12, 2026 at 4:07 p.m. (UTC) | |
| Last updated: | May 21, 2026 at 11:40 p.m. (UTC) | |
| Citation: | See how to cite this resource | |
| Content types: | File Set Content Single File Content CSV Content |
| Sharing Status: | Public |
|---|---|
| Views: | 190 |
| Downloads: | 9 |
| +1 Votes: | Be the first one to this. |
| Comments: | No comments (yet) |
Abstract
This HydroShare resource supports the journal article “Harnessing Backcasting to Identify Drivers of Critical Warming at Hoover Dam Using Hydrodynamic and Machine Learning Models.” The repository provides the complete computational and data framework used to conduct the analyses presented in the publication, with the goal of enabling transparency, reproducibility, and reuse by the hydrologic research community.
The resource includes curated input datasets, Python-based workflows, and supporting scripts used to develop surrogate machine learning models that emulate hydrodynamic simulation outputs. These models are leveraged within a backcasting framework to identify the physical and operational drivers associated with critical thermal conditions at Hoover Dam. The repository also contains code for applying explainable artificial intelligence techniques to interpret model behavior and quantify the relative influence of key drivers, as well as scripts for generating all figures and visualizations presented in the article.
All materials are organized to facilitate replication of the study results and to support adaptation of the methods for related hydrologic systems and climate-impact analyses. This resource is intended for researchers, practitioners, and students interested in coupled hydrodynamic–machine learning modeling, explainable AI, and scenario-based assessment of extreme environmental conditions.
Subject Keywords
Coverage
Spatial
Content
README.md
SSI Calculation, LSTM, and SHAP Analysis
Overview
This directory contains the data products, trained models, and Jupyter notebooks used for the surrogate modeling and explainable AI components of the study:
Harnessing Backcasting to Identify Drivers of Critical Warming at Hoover Dam Using Hydrodynamic and Machine Learning Models
The materials in this folder support workflows for:
- calculation of the Schmidt Stability Index (SSI)
- training and applying Long Short-Term Memory (LSTM) surrogate models,
- storing optimized trained models for lower- and upper-intake analyses,
- computing and storing Shapley Additive ExPlanation (SHAP) values for model interpretation, and
- generating SHAP-based visualizations used in the study.
These files are provided to support transparency, reproducibility, and reuse of the machine learning analysis associated with the publication.
Folder Contents
Notebooks
-
LSTM_lower.ipynb
Jupyter notebook for the lower-intake LSTM workflow, including SSI calculation, model development, evaluation, prediction analysis, and visualization. -
LSTM_upper.ipynb
Jupyter notebook for the upper-intake LSTM workflow. -
shapplots_lower_v3_CRtemp.ipynb
Notebook for SHAP-based interpretation and visualization of the lower-intake optimized model. -
shapplots_upper_v3_CRtemp.ipynb
Notebook for SHAP-based interpretation and visualization of the upper-intake optimized model.
Trained Model Files
-
optimal_model_lower.keras
Saved optimized Keras model for the lower-intake analysis. -
optimal_model_upper.keras
Saved optimized Keras model for the upper-intake analysis.
Precomputed SHAP Outputs
-
shap_values_lower_CRtemp_optim.npy
Precomputed SHAP values for the lower-intake optimized model. -
shap_values_upper_CRtemp_optim.npy
Precomputed SHAP values for the upper-intake optimized model.
Data Directories
-
mergedfilled_daily_CRinflowtemp/
Daily merged and gap-filled input datasets used in the lower-intake workflow across multiple elevation and scenario combinations. -
mergedfilled_upper_daily_CRinflowtemp/
Daily merged and gap-filled input datasets used in the upper-intake workflow. -
merged_with_schmidt/
Merged datasets that include Schmidt stability-related variables. -
wtrtempdepth/
Water temperature-by-depth datasets derived from hydrodynamic model outputs. -
wtrtempdepth_mergedfilled_daily/
Daily-averaged water temperature datasets derived from the depth-resolved water temperature files.
Data Organization
This folder includes multiple datasets corresponding to combinations of:
- analysis elevation:
900ft950ft1000ft1050ft-
1100ft -
scenario or time period labels such as:
201620252026203020262030_2091209720998075pct
Many filenames also indicate processing steps such as:
daily_avgfilledmergedwtrtempschmidt
Users should preserve the original filenames and folder structure where possible, as notebooks may reference these paths directly.
Typical Workflow
A typical use of this directory is:
- Access prepared input datasets
Use files in: mergedfilled_daily_CRinflowtemp/-
mergedfilled_upper_daily_CRinflowtemp/ -
Run or inspect the LSTM notebooks
LSTM_lower.ipynb-
LSTM_upper.ipynb -
Reuse optimized trained models
optimal_model_lower.keras-
optimal_model_upper.keras -
Interpret model behavior using SHAP
shap_values_lower_CRtemp_optim.npyshap_values_upper_CRtemp_optim.npyshapplots_lower_v3_CRtemp.ipynb-
shapplots_upper_v3_CRtemp.ipynb -
Reproduce figures and interpretation results
Run the SHAP notebooks to regenerate visual summaries of driver influence.
Notes on Reproducibility
- Saved model files and SHAP arrays are included as precomputed artifacts to reduce runtime and facilitate reproduction of published results.
- Notebooks may depend on specific Python package versions for TensorFlow/Keras, SHAP, NumPy, pandas, and plotting libraries.
- Relative file paths may be hard-coded in some notebooks, so preserving the directory structure is recommended.
- Large CSV and NumPy files are included because they serve as intermediate or analysis-ready products used in the study.
Recommended Citation
If you use these materials, please cite the associated journal article and the HydroShare resource accompanying the publication.
How to Cite
This resource is shared under the Creative Commons Attribution CC BY.
http://creativecommons.org/licenses/by/4.0/
Comments
There are currently no comments
New Comment