Checking for non-preferred file/folder path names (may take a long time depending on the number of files/folders) ...
This resource contains some files/folders that have non-preferred characters in their name. Show non-conforming files/folders.
This resource contains content types with files that need to be updated to match with metadata changes. Show content type files that need updating.
Authors: |
|
|
---|---|---|
Owners: |
|
This resource does not have an owner who is an active HydroShare user. Contact CUAHSI (help@cuahsi.org) for information on this resource. |
Type: | Resource | |
Storage: | The size of this resource is 193.9 MB | |
Created: | Nov 01, 2022 at 12:53 a.m. | |
Last updated: | Nov 01, 2022 at 1:14 a.m. | |
Citation: | See how to cite this resource |
Sharing Status: | Public |
---|---|
Views: | 27541 |
Downloads: | 2911 |
+1 Votes: | 3 others +1 this |
Comments: | No comments (yet) |
Abstract
This is a layer of water service boundaries for 45,973 community water systems that deliver tap water to 307.7 million people in the US. This amounts to 97% of the population reportedly served by active community water systems and 93% of active community water systems. The layer is based on multiple data sources and a methodology developed by SimpleLab and collaborators called a Tiered, Explicit, Match, and Model approach–or TEMM, for short. The name of the approach reflects exactly how the nationwide data layer was developed. The TEMM is composed of three hierarchical tiers, arranged by data and model fidelity. First, we use explicit water service boundaries provided by states. These are spatial polygon data, typically provided at the state-level. We call systems with explicit boundaries Tier 1. In the absence of explicit water service boundary data, we use a matching algorithm to match water systems to the boundary of a town or city (Census Place TIGER polygons). When multiple water systems match to the same TIGER boundary, we employ a "best match" algorithm that assigns one water system to one TIGER place based on features like population served and other locational information about the water system. Finally, in the absence of an explicit water service boundary (Tier 1) or a TIGER place polygon match (Tier 2), a statistical model trained on explicit water service boundary data (Tier 1) is used to estimate a reasonable radius at provided water system centroids, and model a spherical water system boundary (Tier 3). Water system centroids are taken from the ECHO database; however, where a system centroid is labeled as a county or state centroid, we take several steps to assign a better centroid (using sources like UCMR or TIGER). A summary of the systems and population assigned to different tiers is as follows:
Population coverage rates per Tier, for systems with population reported:
- Tier 1: 49.3% population covered (155,869,771 people)
- Tier 2: 35.13% population covered (111,074,087 people)
- Tier 3: 12.9% population covered (40,771,645 people)
Active community water systems coverage rates per Tier:
- Tier 1: 35.7% system covered (17645 systems)
- Tier 2: 22.42% system covered (11079 systems)
- Tier 3: 34.9% system covered (17249 systems)
- No Tier/Geometry: 6.98% system covered (3451 systems)
Several limitations to this data exist–and the layer should be used with these in mind. The case of assigning a Census Place TIGER polygon to the "best match" water system first introduced in v2.0.0 requires further validation. Tier 3 boundaries have modeled radii stemming from a lat/long centroid of a water system facility; but the underlying lat/long centroids for water system facilities are of variable quality. It is critical to evaluate the "geometry quality" column (included from the EPA ECHO data source) when looking at Tier 3 boundaries; fidelity is very low when geometry quality is a county or state centroid– but we did not exclude the data from the layer. Since v 2.0.0 we have improved the percentage of Tier 3 geometries with state centroids and county centroids from 50% of Tier 3 boundaries to 30% of Tier 3 boundaries. Missing water systems are typically those without a centroid, in a U.S. territory, or missing population and connection data. Finally, Tier 1 systems are assumed to be high fidelity, but rely on the accuracy of state data collection and maintenance.
Changelog:
# 3.0.0 (2022-10-31)
* Adding manually-contributed systems from the Internet of Water's Github: https://github.com/cgs-earth/ref_pws/raw/main/02_output/contributed_pws.gpkg
* Refactored to use geopackage through most of pipeline instead of geojson
* Added `geometry_source_detail` column to, where possible, include notes provided by the data sources themselves about how the geometry was sourced
Subject Keywords
Coverage
Spatial
Content
readme.md
Credits
The water service layer boundary was created by SimpleLab, with funding and strategy from EPIC, and technical advising from Internet of Water.
The technical code and method available on Github was developed by SimpleLab, Inc. As this is an MIT License, the repository code and data herein can be reused and re-purposed.
Collaboration
Water Data Lab contributed technical code and methods development for the development of this boundary layer and the TEMM methodology found on the Github.
Environmental Policy Innovation Center (EPIC) financed the development of this boundary layer and supported data collection and methodology development of this boundary layer as part of their efforts with the Justice40 Initiative.
Internet of Water (IoW) provided technical advising and feedback on the approach and is collaborating with as part of the broader effort to expand use and improvement of water service boundaries.
For more information about this project, please contact Jess Goddard at <jess at gosimplelab dot com>.
Related Resources
This resource updates and replaces a previous version | SimpleLab, EPIC (2022). U.S. Community Water Systems Service Boundaries, v2.4.0, HydroShare, http://www.hydroshare.org/resource/b11b8982eebd4843833932f085f71d92 |
The content of this resource was created by a related App or software program | https://github.com/SimpleLab-Inc/wsb |
Credits
Funding Agencies
This resource was created using funding from the following sources:
Agency Name | Award Title | Award Number |
---|---|---|
Environmental Policy Innovation Center | Funding for Version 1.0 Feb-April, 2022 |
How to Cite
The MIT License (MIT)
Copyright © 2022 SimpleLab
Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the “Software”), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:
The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.
THE SOFTWARE IS PROVIDED “AS IS”, WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.
Comments
There are currently no comments
New Comment