High Resolution Residential Water Use Data in Cache County, Utah, USA


Authors:
Owners:
Resource type: Composite Resource
Storage: The size of this resource is 1.1 GB
Created: Oct 27, 2020 at 9:04 p.m.
Last updated: Nov 01, 2021 at 4:55 p.m.
Citation: See how to cite this resource
Sharing Status: Public
Views: 226
Downloads: 716
+1 Votes: Be the first one to 
 this.
Comments: No comments (yet)

Abstract

This resource contains high resolution residential water use data and classified end uses of water for 31 residential homes located in Logan City and Providence City in Cache County, Utah, USA. Data were collected using a low-cost, open source monitoring device that was designed to operate on magnetically driven residential water meters. Data were recorded with a temporal frequency of 4 seconds and were collected for a period of at least two weeks during the summer when outdoor water use was active and two weeks during the winter when no outdoor water use was expected. The events disaggregation and classification was conducted using the tools available on http://www.hydroshare.org/resource/3143b3b1bdff48e0aaebcb4aedf02feb. The data were measured on the meter located on the water supply line to each home and represent a trace of the total water use for each residence. The dataset also includes secondary data about each of the residences at which data were collected. These data have been anonymized to remove any personally identifiable information from participants in this data collection effort.

Subject Keywords

Deleting all keywords will set the resource sharing status to private.

Resource Level Coverage

Spatial

Coordinate System/Geographic Projection:
WGS 84 EPSG:4326
Coordinate Units:
Decimal degrees
Place/Area Name:
Logan and Providence Cities, Cache County, Utah, USA
North Latitude
41.7865°
East Longitude
-111.7636°
South Latitude
41.6774°
West Longitude
-111.9112°

Content

readme.md

Folders are organized as follows:

  • 1_SecondaryData contains all secondary information collected for each site in a single file (sites.csv) with the following columns:

    1. SiteID: Identifier assigned to each participant in the study. Type: Integer.
    2. City: Name of the city where the participant lives. Type: Logan / Providence.
    3. N_Residents: Total number of residents in the site. Type: Integer.
    4. N_Residents_0-10: Number of residents in the age range listed. Type: Integer.
    5. N_Residents_10-25: Number of residents in the age range listed. Type: Integer.
    6. N_Residents_25-40: Number of residents in the age range listed. Type: Integer.
    7. N_Residents_40-60: Number of residents in the age range listed. Type: Integer.
    8. N_Residents_Over60: Number of residents in the age range listed. Type: Integer.
    9. IrrigationType: Descriptor of the irrigation system in the property. Type: Hose / SprinklerSystem
    10. MeterBrand: Manufacturer of the meter used by the utility company to record water use at the site. Type: Neptune / MasterMeter
    11. MeterSize: Size of the meter used by the utility company to record water use at the site. Units: Inches.
    12. MeterResolution: Volumetric meter resolution for the meter located at the site. Units: Gallons. Multiplying pulses at any location by this value will result in the volume (in gallons) used at the site.
    13. N_Bathrooms: Number of bathrooms in the site. Type: Integer
    14. Own/Rent: Indicates if the habitants of a site own or rent the property. Type: Own / Rent.
    15. LegalAcreage_SqFt: Legal area of the parcel. Units: Square Foot - Rounded to the nearest 10.
    16. YearBuilt: Year the property was built. Unit: Year - rounded to the nearest 5.
    17. BuildingSqFt: Built area of each site. Units: Square Foot - rounded to the nearest 10.
    18. ZipCode: Zip code for each site. Type: 5-digit integer.
    19. UserPercentile_City_LastYear: Ranking of the participants sites compared with all residential properties, by city, for the last year of data available. Logan (2018), Providence (2019). Units: ranking.
    20. MonthlyAverageWinter: Average monthly water use during winter months (November - April) at each site. Units: 1,000 Gallons.
    21. MonthlyAverageSummer: Average water use during winter months (May - October) at each site. Units: 1,000 Gallons.
    22. Irr_Area: Irrigated Area, computed from satellite imagery. Units: Square Foot.
  • 2_LogFiles contains a log for each site (in CSV format) with the following columns:

    1. DataCollectionPeriod: an id for each data collection period from 1 to n.
    2. StartDate: the start date of each data collection period. Format: 'YYYY-MM-DD HH:MM:SS'
    3. EndDate: the end date of each data collection period. Format: 'YYYY-MM-DD HH:MM:SS'
    4. Meter_WaterUse: volume of water used in each data collection period, obtained from reading the actual meter installed at each site. Unit: Gallons.
    5. CIWS-DL_WaterUse: volume of water used in each data collection period, measured by the CIWS-Datalogger installed at each site. Unit: Gallons.
    6. PercentError_Vol: percent error, in volume, of the CIWS-Dl measurement. PE_vol = (column 4 - column 3)*100 / column 3. Unit: %.
    7. N_ExpectedValues: number of values expected in each data collection period, calculated as seconds(column 2 - column 1)/4/ Unit: count
    8. N_ActualValues: number of actual values measured by the CIWS-Dl in each data collection period. Unit: count.
    9. PercentError_Count: percent error, in count, of the CIWS-Dl measurement. PE_count = (column 7 - column 6)*100 / column 6. Unit: %.
    10. OutdoorWaterUse_Expected: an indication of whether outdoor water use is expected during the data collection period or not. Type: Binary (Yes / No).
The log files are named sitenqc_log.csv, where n represents the siteID for each site where data was collected.
  • 3_QC_Data contains all the data collected that passed the quality control procedure defined. There is 1 CSV file per site with the following columns:
    1. Time: date and time stamp for each value collected. Format: 'YYYY-MM-DD HH:MM:SS'
    2. Pulses: the number of pulses recorded in every 4-second time period. Type: integer.
The QC_Data files are named siteNNNqc_data.csv, where NNN represents the siteID for each site where data was collected.
The meter resolution (volumetric) for each pulse is included in the SitesInformation_HS.csv file (MeterResolution)
  • 4_EventFilesOriginal contains an events file for each site (in CSV format) with the following columns:
    1. StartTime: Start date and time of each individual event. Format: 'YYYY-MM-DD HH:MM:SS'
    2. EndTime: End date and time of each individual event. Format: 'YYYY-MM-DD HH:MM:SS'
    3. Duration: Duration of each individual event (end time - start time). Units: Minutes
    4. OriginalVolume: Volume of water used in each individual event. Unit: Gallons.
    5. OriginalFlowRate: Average flow rate of each individual event. Unit: Gallons per minute.
    6. Peak_Value: Maximum value observed in each 4-seconds period within each event. Unit: Gallons
    7. Mode_Value: Most frequent value observed in an event. Unit: Gallons
    8. Label: Event classification. Values: faucet, toilet, shower, irrigation, clotheswasher, bathtub.
    9. Site: Identifier assigned to each participant in the study. Type: Integer.
The event files are named LabelledEvents_site_n.csv, where n represents the siteID for each site where data was collected. The files in this folder are the generated from processing the files in the 3_QC_Data folder using the tools available on Attallah and Bastidas., (2021), included in the references of this resource.
  • 5_EventFiles_Processed contains an events file for each site (in CSV format) with the following columns:
    1. StartTime: Start date and time of each individual event. Format: 'YYYY-MM-DD HH:MM:SS'
    2. EndTime: End date and time of each individual event. Format: 'YYYY-MM-DD HH:MM:SS'
    3. Duration: Duration of each individual event (end time - start time). Units: Minutes
    4. OriginalVolume: Volume of water used in each individual event. Unit: Gallons.
    5. OriginalFlowRate: Average flow rate of each individual event. Unit: Gallons per minute.
    6. Peak_Value: Maximum value observed in each 4-seconds period within each event. Unit: Gallons
    7. Mode_Value: Most frequent value observed in an event. Unit: Gallons
    8. Label: Event classification. Values: faucet, toilet, shower, irrigation, clotheswasher, bathtub, unclassified, unknown.
    9. Site: Identifier assigned to each participant in the study. Type: Integer.
The event files are named Events_site_n.csv, where n represents the siteID for each site where data was collected. These files were processed using the methodology described in Bastidas Pacheco and Horsburgh (2021), included in the references of this resource.

References

Related Resources

This resource cites: Bastidas Pacheco, C. J., Horsburgh, J. S., Tracy, R. J. (2020). A low-cost, open source monitoring system for collecting high-resolution water use data on magnetically-driven residential water meters, Sensors, 20(13), 3655, https://doi.org/10.3390/s20133655.
This resource cites: Attallah, N., C. J. Bastidas Pacheco (2021). Supporting data and tools for "Tools for Evaluating, Developing, and Testing Water End Use Disaggregation Algorithms", HydroShare, http://www.hydroshare.org/resource/3143b3b1bdff48e0aaebcb4aedf02feb
This resource cites: Bastidas Pacheco, C. J., J. S. Horsburgh (2021). Supporting data and tools for "Variability in Consumption and End Uses of Water for Residential Users in Logan and Providence, Utah, USA", HydroShare, http://www.hydroshare.org/resource/379d9e7037f04478a99d5aec22e841e6

Credits

Funding Agencies

This resource was created using funding from the following sources:
Agency Name Award Title Award Number
National Science Foundation Cyberinfrastructure for Intelligent Water Supply (CIWS): Shrinking Big Data for Sustainable Urban Water CBET 1552444

How to Cite

Bastidas Pacheco, C. J., N. Atallah, J. S. Horsburgh (2021). High Resolution Residential Water Use Data in Cache County, Utah, USA, HydroShare, http://www.hydroshare.org/resource/0b72cddfc51c45b188e0e6cd8927227e

This resource is shared under the Creative Commons Attribution CC BY.

 http://creativecommons.org/licenses/by/4.0/
CC-BY

Comments

There are currently no comments

New Comment

required