Compiled data for modeling episodic salinization of Midwestern and Northeastern US rivers by road salt


Authors:
Owners: This resource does not have an owner who is an active HydroShare user. Contact CUAHSI (help@cuahsi.org) for information on this resource.
Type: Resource
Storage: The size of this resource is 34.0 MB
Created: Oct 18, 2024 at 9:31 p.m. (UTC)
Last updated: Oct 15, 2025 at 8:32 p.m. (UTC)
Published date: Oct 15, 2025 at 8:32 p.m. (UTC)
DOI: 10.4211/hs.d3efc527a6b64b9890cfe5154954ce09
Citation: See how to cite this resource
Content types: CSV Content 
Sharing Status: Published

Abstract

This resource contains data used to understand characteristics of rivers that experience episodic salinization from winter road salting events. These data products represent snapshots of public data compiled to support the AGU Water Resources Research publication, Platt and Dugan 2025. The code used to generate the data in this resource is available via a reproducible R pipeline published at https://doi.org/10.5281/zenodo.17095176.

Coverage

Spatial

Coordinate System/Geographic Projection:
WGS 84 EPSG:4326
Coordinate Units:
Decimal degrees
North Latitude
47.0927°
East Longitude
-67.2967°
South Latitude
35.5951°
West Longitude
-96.8171°

Temporal

Start Date:
End Date:

Content

README.md

The companion manuscript is published in AGU Water Resources Research: Episodic salinization of Midwestern and Northeastern US rivers by road salt (Platt and Dugan, 2025). The companion code is captured in a data pipeline published on Zenodo at https://doi.org/10.5281/zenodo.17095176.

Data and file descriptions

There are 3 data files included in this data publication. The first two are snapshots of publicly available data, and the third gives the specific conductance regime assigned to each site based on a time series analysis (see the companion manuscript, Platt and Dugan 2025, for more information). Ultimately, the static attributes and site classifications were used as inputs to a random forest classification model to determine which stream attributes may be associated with which specific conductance regime.

  1. 1_specific_conductance.csv contains a time series of continuous specific conductance measurements across 437 sites from the US Geological Survey National Water Information System (USGS NWIS). The dates range from January 2, 2000 to May 1, 2025.
  2. 2_static_attributes.csv contains 17 different stream attributes matched to each of the 437 USGS sites. The attributes were gathered from several sources, including the US Environmental Protection Agency's National Hydrography Dataset Plus (US EPA NHDPlus), the National Oceanic and Atmospheric Administration National Water Model (NOAA NWM), Zell and Sanford, 2020 (DOI: 10.1029/2019WR026724), and Falcone et al., 2023 (DOI: 10.5066/P96IX385). More information about each of these sources and exactly what was accessed is available in the companion paper, Platt and Dugan 2025. See the table further below for definitions and units for each attribute.
  3. 3_site_classification.csv contains the classification model output stating whether a site's specific conductance patterns were "Episodic" or "Not episodic".
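As a quick orientation to the three files listed above, the following is a minimal Python sketch (not part of the companion R pipeline) that reports the column names and row count of a CSV file. The helper names `summarize_csv` and `summarize_file` and the demo column `site_id` are hypothetical; check the actual headers in the files before relying on any column name.

```python
import csv
import io

# File names as listed in this README.
DATA_FILES = [
    "1_specific_conductance.csv",
    "2_static_attributes.csv",
    "3_site_classification.csv",
]


def summarize_csv(stream):
    """Return (column_names, row_count) for CSV text read from an open stream."""
    reader = csv.DictReader(stream)
    row_count = sum(1 for _ in reader)
    return list(reader.fieldnames or []), row_count


def summarize_file(path):
    """Open a CSV file on disk and summarize it with summarize_csv."""
    with open(path, newline="", encoding="utf-8") as handle:
        return summarize_csv(handle)


# Demo on a tiny in-memory stand-in. "site_id" is a hypothetical column name;
# "attr_medianFlow" is taken from the attribute table in this README.
demo = io.StringIO("site_id,attr_medianFlow\nA,1.5\nB,2.0\n")
demo_columns, demo_rows = summarize_csv(demo)
```

With the files downloaded into the working directory, calling `summarize_file(name)` for each entry in `DATA_FILES` gives a first look at each file's structure.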

How to use the data here

The data in this resource represent a snapshot of data downloaded and processed from public resources (USGS National Water Information System and ScienceBase) in May 2025. To recreate the results from the companion manuscript, use these data as inputs to the reproducible data pipeline built in R, available at https://github.com/lindsayplatt/episodic-river-salinization-model. Details on how to do this are provided below. To instead build the data pipeline using the most recent data available from the raw inputs, see the GitHub repository README file.

By using the files in this data release, you can skip phases 1-3 in the companion pipeline (1_Download, 2_Prepare, 3_Filter).

  1. By using data from 1_specific_conductance.csv instead of p3_ts_sc_qualified in 4_EpisodicSalinization.R, you can rerun code used to classify a site as "Episodic" or "Not episodic". You can compare these outcomes to the data in 3_site_classification.csv.
  2. By using data from 2_static_attributes.csv instead of p3_static_attributes in 5_DefineCharacteristics.R, you can rerun code used to evaluate which characteristics are more associated with episodic vs not episodic through the random forest modeling code.
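The comparison suggested in step 1 could be sketched as follows, assuming the rerun classifications have been exported to CSV. This is an illustration in Python rather than part of the R pipeline. The column names `site_id` and `classification` are assumptions and must be replaced with the actual headers; only the labels "Episodic" and "Not episodic" come from this README.

```python
import csv
import io

# The two labels described for 3_site_classification.csv.
VALID_LABELS = {"Episodic", "Not episodic"}


def read_labels(stream, site_col, label_col):
    """Read a site -> label mapping from CSV text; reject unexpected labels."""
    labels = {}
    for row in csv.DictReader(stream):
        label = row[label_col]
        if label not in VALID_LABELS:
            raise ValueError(f"Unexpected label {label!r} for site {row[site_col]!r}")
        labels[row[site_col]] = label
    return labels


def compare_labels(published, rerun):
    """Summarize agreement between published and rerun classifications."""
    shared = sorted(set(published) & set(rerun))
    return {
        "n_shared": len(shared),
        "mismatched": [s for s in shared if published[s] != rerun[s]],
        "unmatched_sites": sorted(set(published) ^ set(rerun)),
    }


# Demo with made-up sites; column names here are hypothetical.
published_csv = io.StringIO(
    "site_id,classification\n01,Episodic\n02,Not episodic\n03,Episodic\n"
)
rerun_csv = io.StringIO(
    "site_id,classification\n01,Episodic\n02,Episodic\n04,Not episodic\n"
)
result = compare_labels(
    read_labels(published_csv, "site_id", "classification"),
    read_labels(rerun_csv, "site_id", "classification"),
)
```

An empty `mismatched` list and an empty `unmatched_sites` list would indicate that the rerun reproduces the published classification for every site.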

Information on units

In 1_specific_conductance.csv, specific conductance values are in microsiemens per centimeter at 25 degrees Celsius (µS/cm at 25 °C).

Static attributes in 2_static_attributes.csv are described in this table (if the source is not explicitly stated, it is the US EPA NHDPlus):

| Attribute (Units) | colname | Description |
| --- | --- | --- |
| Median Flow (m³/s) | attr_medianFlow | Median streamflow derived from hourly streamflow time series. Source: NOAA NWM |
| Streamorder | attr_streamorder | Strahler stream order as defined in NHDPlus |
| Watershed Area (km²) | attr_areaCumulativeSqKm | Total upstream area as calculated by summing catchment areas of all upstream COMIDs |
| Basin Slope (%) | attr_basinSlope | Average topographic slope to indicate basin steepness; uses NHDPlus attribute CAT BASIN SLOPE |
| Agriculture (% area) | attr_pctAgriculture | Percent of basin area that is agriculture; sum of NHDPlus attribute categories CAT NLCD19 81 (pasture/hay) and CAT NLCD19 82 (cropland) |
| Developed (% area) | attr_pctDeveloped | Percent of basin area that is developed; sum of NHDPlus attribute categories CAT NLCD19 22–25 |
| Forested (% area) | attr_pctForested | Percent of basin area that is forested; sum of NHDPlus attribute categories CAT NLCD19 41–43 |
| Open Water (% area) | attr_pctOpenWater | Percent of basin area that is open water (lakes, rivers, etc.); uses NHDPlus category CAT NLCD19 11 |
| Wetland (% area) | attr_pctWetland | Percent of basin area that is wetland; sum of NHDPlus attribute categories CAT NLCD19 90 and 95 |
| Precipitation (mm/year) | attr_annualPrecip | Total annual precipitation; uses NHDPlus attribute CAT PPT7100 ANN |
| Snowfall (mm/year) | attr_annualSnow | Total annual snowfall; calculated from annual precipitation × CAT PRSNOW |
| Winter Air Temperature (°C) | attr_winterAirTemp | Average winter air temperature from Dec–Mar; uses CAT TAV7100 attributes |
| Baseflow Index (% total flow) | attr_baseFlowInd | Average percent of total streamflow that is baseflow; uses CAT BFI |
| GW Recharge (mm/year) | attr_gwRecharge | Annual groundwater recharge; uses CAT RECHG |
| Subsurface Contact (days) | attr_subsurfaceContact | Days water resides in saturated subsurface before entering a stream; uses CAT CONTACT |
| Depth to Water Table (m) | attr_depthToWT | Modeled depth to saturation zone. Source: Zell and Sanford, 2020 |
| Transmissivity (m²/day) | attr_transmissivity | Modeled water transmissivity through shallow subsurface. Source: Zell and Sanford, 2020 |
| Road Salt (kg/km²) | attr_roadSaltCumulativePerSqKm | Estimated weight of road deicing salt applied by area. Source: Falcone et al., 2023 |
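The column names in the table above can be used for a simple sanity check before feeding 2_static_attributes.csv into a model. The Python sketch below is an illustration only, not part of the companion pipeline. It checks that the listed columns are present and flags land cover percentages outside 0 to 100. The function names are hypothetical, and the treatment of blank or "NA" cells (skipped) is an assumption about how missing values might appear.

```python
import csv
import io

# Column names copied from the attribute table in this README.
EXPECTED_COLUMNS = [
    "attr_medianFlow", "attr_streamorder", "attr_areaCumulativeSqKm",
    "attr_basinSlope", "attr_pctAgriculture", "attr_pctDeveloped",
    "attr_pctForested", "attr_pctOpenWater", "attr_pctWetland",
    "attr_annualPrecip", "attr_annualSnow", "attr_winterAirTemp",
    "attr_baseFlowInd", "attr_gwRecharge", "attr_subsurfaceContact",
    "attr_depthToWT", "attr_transmissivity", "attr_roadSaltCumulativePerSqKm",
]

# Land cover columns reported as percent of basin area.
PERCENT_AREA_COLUMNS = [
    "attr_pctAgriculture", "attr_pctDeveloped", "attr_pctForested",
    "attr_pctOpenWater", "attr_pctWetland",
]


def missing_columns(header):
    """Return the expected attribute columns that are absent from a header."""
    present = set(header)
    return [name for name in EXPECTED_COLUMNS if name not in present]


def percent_out_of_range(rows):
    """Return (row_index, column, value) for land cover percents outside 0-100."""
    problems = []
    for index, row in enumerate(rows):
        for column in PERCENT_AREA_COLUMNS:
            try:
                value = float(row.get(column))
            except (TypeError, ValueError):
                continue  # absent, blank, or non-numeric (e.g. "NA"): skipped
            if value < 0 or value > 100:
                problems.append((index, column, value))
    return problems


# Demo on made-up values, not real data from this resource.
demo_reader = csv.DictReader(
    io.StringIO("attr_pctAgriculture,attr_pctDeveloped\n40.0,12.5\n-3,101\n")
)
demo_rows = list(demo_reader)
demo_missing = missing_columns(demo_reader.fieldnames)
demo_problems = percent_out_of_range(demo_rows)
```

On the real file, an empty result from both functions would suggest the header matches the table and the land cover percentages are plausible. The check says nothing about the other attributes' ranges.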

Credits

Funding Agencies

This resource was created using funding from the following sources:
| Agency Name | Award Title | Award Number |
| --- | --- | --- |
| NSF | | 2144750 |

How to Cite

Platt, L., H. Dugan (2025). Compiled data for modeling episodic salinization of Midwestern and Northeastern US rivers by road salt, HydroShare, https://doi.org/10.4211/hs.d3efc527a6b64b9890cfe5154954ce09

This resource is shared under the Creative Commons Attribution CC BY.

http://creativecommons.org/licenses/by/4.0/
