Checking for non-preferred file/folder path names (may take a long time depending on the number of files/folders) ...

U.S. Community Water Systems Service Boundaries, v3.0.0


An older version of this resource http://www.hydroshare.org/resource/b11b8982eebd4843833932f085f71d92 is available.
Authors:
Owners: This resource does not have an owner who is an active HydroShare user. Contact CUAHSI (help@cuahsi.org) for information on this resource.
Type: Resource
Storage: The size of this resource is 193.9 MB
Created: Nov 01, 2022 at 12:53 a.m.
Last updated: Nov 01, 2022 at 1:14 a.m.
Citation: See how to cite this resource
Sharing Status: Public
Views: 25438
Downloads: 2520
+1 Votes: 3 others +1 this
Comments: No comments (yet)

Abstract

This is a layer of water service boundaries for 45,973 community water systems that deliver tap water to 307.7 million people in the US. This amounts to 97% of the population reportedly served by active community water systems and 93% of active community water systems. The layer is based on multiple data sources and a methodology developed by SimpleLab and collaborators called a Tiered, Explicit, Match, and Model approach–or TEMM, for short. The name of the approach reflects exactly how the nationwide data layer was developed. The TEMM is composed of three hierarchical tiers, arranged by data and model fidelity. First, we use explicit water service boundaries provided by states. These are spatial polygon data, typically provided at the state-level. We call systems with explicit boundaries Tier 1. In the absence of explicit water service boundary data, we use a matching algorithm to match water systems to the boundary of a town or city (Census Place TIGER polygons). When multiple water systems match to the same TIGER boundary, we employ a "best match" algorithm that assigns one water system to one TIGER place based on features like population served and other locational information about the water system. Finally, in the absence of an explicit water service boundary (Tier 1) or a TIGER place polygon match (Tier 2), a statistical model trained on explicit water service boundary data (Tier 1) is used to estimate a reasonable radius at provided water system centroids, and model a spherical water system boundary (Tier 3). Water system centroids are taken from the ECHO database; however, where a system centroid is labeled as a county or state centroid, we take several steps to assign a better centroid (using sources like UCMR or TIGER). A summary of the systems and population assigned to different tiers is as follows:

Population coverage rates per Tier, for systems with population reported:
- Tier 1: 49.3% population covered (155,869,771 people)
- Tier 2: 35.13% population covered (111,074,087 people)
- Tier 3: 12.9% population covered (40,771,645 people)

Active community water systems coverage rates per Tier:
- Tier 1: 35.7% system covered (17645 systems)
- Tier 2: 22.42% system covered (11079 systems)
- Tier 3: 34.9% system covered (17249 systems)
- No Tier/Geometry: 6.98% system covered (3451 systems)

Several limitations to this data exist–and the layer should be used with these in mind. The case of assigning a Census Place TIGER polygon to the "best match" water system first introduced in v2.0.0 requires further validation. Tier 3 boundaries have modeled radii stemming from a lat/long centroid of a water system facility; but the underlying lat/long centroids for water system facilities are of variable quality. It is critical to evaluate the "geometry quality" column (included from the EPA ECHO data source) when looking at Tier 3 boundaries; fidelity is very low when geometry quality is a county or state centroid– but we did not exclude the data from the layer. Since v 2.0.0 we have improved the percentage of Tier 3 geometries with state centroids and county centroids from 50% of Tier 3 boundaries to 30% of Tier 3 boundaries. Missing water systems are typically those without a centroid, in a U.S. territory, or missing population and connection data. Finally, Tier 1 systems are assumed to be high fidelity, but rely on the accuracy of state data collection and maintenance.

Changelog:
# 3.0.0 (2022-10-31)
* Adding manually-contributed systems from the Internet of Water's Github: https://github.com/cgs-earth/ref_pws/raw/main/02_output/contributed_pws.gpkg
* Refactored to use geopackage through most of pipeline instead of geojson
* Added `geometry_source_detail` column to, where possible, include notes provided by the data sources themselves about how the geometry was sourced

Subject Keywords

Coverage

Spatial

Coordinate System/Geographic Projection:
WGS 84 EPSG:4326
Coordinate Units:
Decimal degrees
Place/Area Name:
United States
North Latitude
71.3402°
East Longitude
-66.9914°
South Latitude
19.0652°
West Longitude
-176.6967°

Content

readme.md

Credits

The water service layer boundary was created by SimpleLab, with funding and strategy from EPIC, and technical advising from Internet of Water.

The technical code and method available on Github was developed by SimpleLab, Inc. As this is an MIT License, the repository code and data herein can be reused and re-purposed.

SimpleLab website

Collaboration

Water Data Lab contributed technical code and methods development for the development of this boundary layer and the TEMM methodology found on the Github.

WaDL website

Environmental Policy Innovation Center (EPIC) financed the development of this boundary layer and supported data collection and methodology development of this boundary layer as part of their efforts with the Justice40 Initiative.

EPIC website

Internet of Water (IoW) provided technical advising and feedback on the approach and is collaborating with as part of the broader effort to expand use and improvement of water service boundaries.

IoW

For more information about this project, please contact Jess Goddard at <jess at gosimplelab dot com>.

Related Resources

This resource updates and replaces a previous version SimpleLab, EPIC (2022). U.S. Community Water Systems Service Boundaries, v2.4.0, HydroShare, http://www.hydroshare.org/resource/b11b8982eebd4843833932f085f71d92
The content of this resource was created by a related App or software program https://github.com/SimpleLab-Inc/wsb

Credits

Funding Agencies

This resource was created using funding from the following sources:
Agency Name Award Title Award Number
Environmental Policy Innovation Center Funding for Version 1.0 Feb-April, 2022

How to Cite

SimpleLab, EPIC (2022). U.S. Community Water Systems Service Boundaries, v3.0.0, HydroShare, http://www.hydroshare.org/resource/9ebc0a0b43b843b9835830ffffdd971e

The MIT License (MIT)
Copyright © 2022 SimpleLab

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the “Software”), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED “AS IS”, WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

https://opensource.org/licenses/MIT

Comments

There are currently no comments

New Comment

required