Checking for non-preferred file/folder path names (may take a long time depending on the number of files/folders) ...

Cleaned Water Quality and Weather Dataset for AI-based Alum Prediction (2011–2024)


Authors:
Owners: This resource does not have an owner who is an active HydroShare user. Contact CUAHSI (help@cuahsi.org) for information on this resource.
Type: Resource
Storage: The size of this resource is 561.2 KB
Created: Oct 14, 2025 at 4:57 p.m. (UTC)
Last updated: Oct 14, 2025 at 7:49 p.m. (UTC)
Citation: See how to cite this resource
Content types: CSV Content 
Sharing Status: Public
Views: 56
Downloads: 0
+1 Votes: Be the first one to 
 this.
Comments: No comments (yet)

Abstract

This dataset was developed to support research on predicting alum dosage in small water treatment plants. It combines daily plant records with weather data, including maximum temperature (TMAX). To make the data reliable for analysis and modeling, outliers and incorrect readings were carefully removed using logical and domain-based rules.

Records with clearly impossible or error values, such as extremely high or negative numbers, were deleted. Each variable was kept within realistic operating limits—for example, alum between 0 and 3500 mg/L, hardness between 5 and 1000 mg/L, and alkalinity between 2 and 1000 mg/L. Unusual readings like pH = 0.54 were also removed. Missing value rows were entirely removed from the dataset.

Through this cleaning process, the dataset became consistent, accurate, and ready for machine-learning models that can better predict chemical dosing and support safer, more efficient water treatment operations.

Subject Keywords

Coverage

Spatial

Coordinate System/Geographic Projection:
WGS 84 EPSG:4326
Coordinate Units:
Decimal degrees
Longitude
-97.0971°
Latitude
36.1213°

Temporal

Start Date:
End Date:

Content

Related Resources

The content of this resource was created by a related App or software program https://www.hydroshare.org/resource/6e58232cbf3346619ec37bbc51ba513d/
The content of this resource is derived from https://www.hydroshare.org/resource/e9e15a82b2ea4a7b98146a99e4b52614/
This resource belongs to the following collections:
Title Owners Sharing Status My Permission
Water & Weather Datasets for Alum Prediction (2011–2024) Saikumar Payyavula · Jeff Sadler  Public &  Shareable Open Access

How to Cite

payyavula, s., J. Sadler (2025). Cleaned Water Quality and Weather Dataset for AI-based Alum Prediction (2011–2024), HydroShare, http://www.hydroshare.org/resource/f18c4d34bc444e8eac1c12dee58b67aa

This resource is shared under the Creative Commons Attribution CC BY.

http://creativecommons.org/licenses/by/4.0/
CC-BY

Comments

There are currently no comments

New Comment

required