Global Flood Database


Authors:
Owners: This resource does not have an owner who is an active HydroShare user. Contact CUAHSI (help@cuahsi.org) for information on this resource.
Resource type: Composite Resource
Storage: The size of this resource is 14.4 MB
Created: Mar 16, 2021 at 9:12 p.m.
Last updated: Mar 16, 2021 at 9:35 p.m.
DOI: 10.4211/hs.6461528501c14f7c9d6b10d20dd4f657
Citation: See how to cite this resource
Content types: Single File Content  Geographic Feature Content 
Sharing Status: Published
Views: 1574
Downloads: 72
+1 Votes: Be the first one to 
 this.
Comments: No comments (yet)

Abstract

## Global Flood Database Scripts & Data

This repository includes code and supporting data for the Global Flood Database. This include descriptions of the data and code, and how they relate to *Tellman et al, Satellite observations indicate increasing proportion of population exposed to floods*

Subject Keywords

Resource Level Coverage

Spatial

Coordinate System/Geographic Projection:
WGS 84 EPSG:4326
Coordinate Units:
Decimal degrees
Place/Area Name:
global
North Latitude
90.0000°
East Longitude
-180.0000°
South Latitude
-90.0000°
West Longitude
180.0000°

Temporal

Start Date:
End Date:

Content

README.md

Global Flood Database Scripts & Data

This repository includes code and supporting data for the Global Flood Database. Below are descriptions of the data and code and how they relate to Tellman et al, Satellite observations indicate increasing proportion of population exposed to floods

Data

Flood Maps

The flood maps (.tif files) can be accessed through a visualization and data portal at: http://global-flood-database.cloudtostreet.info/

You can also download the entire database as GeoTIFF files directly from Google Cloud Storage (GCS) using the gsutil cp command from the GCS bucket "gfd_v3". You can use the following command to download the entire database to a local directory:

gsutil -m cp gs://gfd_v3 your/local/directory/to/save/to

Flood Mapping

  1. data\shp_files\dfo_polys_20191203.shp: the Dartmouth Flood Observatory (DFO) flood polygon dataset used in our analyses and processing of satellite imagery.
  2. data\gfd_qcdatabase_2019_08_01.csv: the Quality Control (QC) database described in Tellman et al.

Validation

  1. data\gfd_validation_points_2018_12_17.csv: validation data of 123 selected flood events that includes geo-location of each assessment point, the classified data for different methods (e.g. 3day Standard), analyst initials and spectral data from the interpretation imagery (i.e. Landsat-5, 7 & 8). Field values are explained in the main_validation.pynb (see below)
  2. data\gfd_validation_sensitivity.csv: assessed validation points up to 400 points for selected flood events to test appropiate sampling intensity
  3. data\gfd_validation_metrics.csv: summarized validation metrics (e.g. commission error) for each validation flood
  4. data\sample_frame_CC20_D1_051618.csv: a summary of available Landsat images (5, 7 & 8) for each flood event. Used to determine which flood events can be used to collect validation data. The field DELTA is the number of days following max flood extent, CLOUD_COVER is the maximum allowable percent cloud cover for a validation image, X and Y are the centroid of the flood event from the DFO polygon.

Exposed Population Estimates

  1. data\SSP2010.csv: 2010 population estimates from the SSP2 (Socioeconomic pathways scenario)
  2. data\SSP2030.csv: 2030 population estimates from the SSP2 (Socioeconomic pathways scenario)
  3. data\aqueductcountrydata.csv: WRI Aqueduct flood exposure estimates for various return periods for 2010 and 2030
  4. data\aqueduct_dictionary.xlsx: data dictionary to explain columns in WRI Aqueduct flood exposure estimates
  5. data\gfd_popsummary.csv: Global Flood Database population exposure estimates per country, in 2000 and 2015, and associated statistics.
  6. data\GFDabove_13_wBias.csv: Global Flood Database population exposure estimates per country, in 2000 and 2015, with bias correction factor based on comparison to HRSL data.
  7. data\gfd_popdictionary.xlsx: data dictionary to explain columns in Global Flood Database exposure estimates
  8. Population Exposed Per Event: Population exposure estimate per event. To access click on the INFO button on our data portal at: http://global-flood-database.cloudtostreet.info/
  9. Population Exposed Per Country Per Event: Population exposure estimates per country by event. To access click on the INFO button on our data portal at: http://global-flood-database.cloudtostreet.info/

Pop Sensitivity & Uncertainty

  1. data\gfd_popsensitivity.csv: Global Flood database population exposure estimates per country using the Global Human Settlement Layer (GHSL), High Resolution Settlement Layer (HRSL) and GridPop3. Countries are limited to those with HRSL data.

Flood Mechanism

  1. data\gfd_floodmechanism.csv: Global Flood database disaggregated by "flood type" (data from the Dartmouth Flood Observatory) and estimated population exposure estimate per in 2000 and 2015.

Code

Our code includes modules written in Python, Javascript and R. In the case of Javascript, this code is stored as a .txt file (.js files are prohibited as Gmail attachments) and can be run by "copy and pasting" into Google Earth Engine's code editor. Python scripts are based on Google Earth Engine's Python API and require installation before running. Additional code in R require publicly available downloads of R or RStudio.

Below is a short description of scripts within our repository and how they relate to Tellman et al, Satellite observations indicate increasing proportion of population exposed to floods

Flood Mapping

  1. main_gfd.py - uses GEE Python API to create flood maps for each Dartmouth Flood Observatory flood event. This script relies on modules found in the flood_detection folder. The exports are stored in Google Cloud Storage which can be accessed as described above.

Validation

  1. gee_sampleFrameLandsat.txt - uses GEE Code Editor to determine what floods have available Landsat imagery coincident within 1-day of the max extent of a flood event. This code produces data\sample_frame_CC20_D1_051618.csv.
  2. gee_validationGUI.txt - used GEE Code Editor to collect validation data using a custom tool designed in GEE that retrieves a flood event, coincident Landsat imagery and creates a statrified sample. An example of our validation GUI can be seen below in Figure 1. Analysts can then interpret sample points based on Landsat imagery and results are recorded. This code relies upon gee_landsatTools.txt and gee_misc.txt sub-modules. The outputs of assessment points by each analyst were stored in Google Cloud Storage and is compiled here data\gfd_validation_points_2018_12_17.csv.
  3. main_validation.pynb - This script uses the accuracy assessment points (i.e. data\gfd_validation_points_2018_12_17.csv) to calculate various accuracy metrics including ommission and commission errors. The results are stored in data\gfd_validation_metrics.csv. This script also analyzes the validation sensitivity (Extended Data Fig 8).

Exposed Population Estimates

  1. main_popstats.py - uses GEE Python API to estimate exposed populations for each flood event and country. This script relies on modules found in the flood_stats folder. Outputs are available on our data portal by clicking on the INFO button. These population estimates do not filter out isolated pixels as described in the methods.
  2. main_popchange.txt - uses GEE Code Editor to calculate population change in areas of observed inundation from GFD between years 2000 and 2015 for each country. This method removed isolated pixels for a conservative estimate of change. This script yields data\gfd_popsummary.csv. Additional fields in data\gfd_popsummary.csv are described in data\gfd_popdictionary.xlsx.
  3. ext.datafig10.R- This script was used to make extended data figure 10, which compares the population exposed to at least one flood event between 2000-2018 from the Global Flood Database to floods in 2010 in the WRI Aqueduct flood exposure 100 year return period at the country scale.
  4. ext.datafig7.R - This script was used to make extended data figure 7, which is a sensitivity analysis of the proportion of population exposed to floods under climate change and population growth across return period
  5. main_gfdsummarystats.R- This script was used to generate summary statistics from the Global Flood Database for the paper.

Pop Sensitivity & Uncertainty

  1. main_popsensitivity.txt - uses GEE Code Editor to calculate population exposure using the Global Human Settlement Layer (GHSL), High Resolution Settlement Layer (HRSL) and GridPop3. This method removed isolated pixels for a conservative estimate of change. This script yields per region files that are later compiled into data\gfd_popsensitivity.csv.
  2. main_sensitivityanalysis.R - R script that compiles individual region files generated from main_popsensitivity.txt and then calculates a bias factor. This script additionally joins the bias factor to a number of datasets including data\gfd_popsummary.csv and data\gfd_floodmechanism.csv
  3. uncertaintyanalysis.R - R script that estimates uncertainty in population trend estimates per country using the population dataset data\GFDabove_13_wBias.csv. It identifies countries we deem uncertainty and reproduces Figure 2 in the Supplementary discussion. This script recalcualtes the global flood exposure trend analysis removing the "uncertain" countries.

Flood Mechanism

  1. main_floodmechanism.txt - uses GEE Code Editor to disaggregate the Global Flood database into flood plains representing different causes/ drivers. Population exposure is calculated using the Global Human Settlement Layer (GHSL) for 2000 and 2015. This script yields per mechanism files that are later compiled into data\gfd_floodmechanism.csv.

Figure 1. Example of the GFD Validaiton GUI

Data Services

The following web services are available for data contained in this resource. Geospatial Feature and Raster data are made available via Open Geospatial Consortium Web Services. The provided links can be copied and pasted into GIS software to access these data. Multidimensional NetCDF data are made available via a THREDDS Data Server using remote data access protocols such as OPeNDAP. Other data services may be made available in the future to support additional data types.

Related Resources

This resource is referenced by http://global-flood-database.cloudtostreet.info/

Credits

Funding Agencies

This resource was created using funding from the following sources:
Agency Name Award Title Award Number
Google Google Earth Engine

How to Cite

Tellman, B. (2021). Global Flood Database, HydroShare, https://doi.org/10.4211/hs.6461528501c14f7c9d6b10d20dd4f657

This resource is shared under the Creative Commons Attribution-NoCommercial-NoDerivs CC BY-NC-ND.

http://creativecommons.org/licenses/by-nc-nd/4.0/
CC-BY-NC-ND

Comments

There are currently no comments

New Comment

required