Checking for non-preferred file/folder path names (may take a long time depending on the number of files/folders) ...

DATASETS FOR MACHINE LEARNING BASED CONTINENTAL SCALE MODELING FOR PREDICTING RUNOFF-COEFFICIENTS


Authors:
Owners: This resource does not have an owner who is an active HydroShare user. Contact CUAHSI (help@cuahsi.org) for information on this resource.
Type: Resource
Storage: The size of this resource is 141.4 MB
Created: Mar 10, 2026 at 5:48 p.m. (UTC)
Last updated: Mar 10, 2026 at 6:11 p.m. (UTC)
Citation: See how to cite this resource
Content types: CSV Content 
Sharing Status: Public
Views: 32
Downloads: 0
+1 Votes: Be the first one to 
 this.
Comments: No comments (yet)

Abstract

Accurate prediction of the event-based runoff coefficient (ERC) is fundamental to understanding watershed response, yet achieving reliable predictions across diverse climatic and geophysical regions remains a significant challenge. This study presents a robust, data-driven framework for predicting ERC at the continental scale across the Contiguous United States (CONUS). Utilizing a large-sample hydrology approach, we harmonized a comprehensive dataset of 479,872 rainfall-runoff events from USGS-gauged watersheds. The model integrates sub-daily atmospheric forcing from the NLDAS-2 dataset with static watershed attributes, including topography, land cover, and soil characteristics. To address spatial heterogeneity, we implemented a regime-based modeling approach that utilizes K-means clustering to categorize watersheds into distinct hydrological regimes before training. Predictive performance was evaluated using advanced machine learning algorithms, specifically eXtreme Gradient Boosting (XGBoost) and Support Vector Regression (SVR). Our results demonstrate that the regime-based ML framework significantly outperforms traditional global modeling approaches, particularly in regions with high seasonal variability. Sensitivity analysis reveals that antecedent moisture conditions and land-use patterns are the primary drivers of ERC variability at the continental scale. To support community research and reproducibility, the processed dataset and model scripts are made publicly available via HydroShare. This work provides a scalable tool for predicting watershed response in ungauged basins and offers new insights into the process controls governing runoff generation across the CONUS.

Subject Keywords

Coverage

Spatial

Coordinate System/Geographic Projection:
WGS 84 EPSG:4326
Coordinate Units:
Decimal degrees
Place/Area Name:
CONUS (Contiguous United States)
North Latitude
49.4000°
East Longitude
-124.8000°
South Latitude
23.8000°
West Longitude
-66.9000°

Temporal

Start Date:
End Date:

Content

How to Cite

Xu, T. (2026). DATASETS FOR MACHINE LEARNING BASED CONTINENTAL SCALE MODELING FOR PREDICTING RUNOFF-COEFFICIENTS, HydroShare, http://www.hydroshare.org/resource/77f5d52f961d4ec092861f122b396a29

This resource is shared under the Creative Commons Attribution CC BY.

http://creativecommons.org/licenses/by/4.0/
CC-BY

Comments

There are currently no comments

New Comment

required