Checking for non-preferred file/folder path names (may take a long time depending on the number of files/folders) ...

Modelling Data for Predicting Cyanobacteria Blooms - JPIWater Project BLOOWATER


Authors:
Owners: This resource does not have an owner who is an active HydroShare user. Contact CUAHSI (help@cuahsi.org) for information on this resource.
Type: Resource
Storage: The size of this resource is 113.2 MB
Created: Oct 07, 2020 at 12:26 p.m.
Last updated: Jun 20, 2023 at 8:05 p.m.
DOI: 10.4211/hs.10e1281196d34550b42501f611e268f9
Citation: See how to cite this resource
Content types: Multidimensional Content 
Sharing Status: Published
Views: 292
Downloads: 1
+1 Votes: Be the first one to 
 this.
Comments: No comments (yet)

Abstract

The European Union Water JPI (http://www.waterjpi.eu/) has funded the project BLOOWATER (Supporting tools for the integrated management of drinking water reservoirs contaminated by Cyanobacteria and cyanotoxins (https://www.bloowater.eu/) The main objective of the BLOOWATER project is to produce information resources for Public water supply systems to prepare and respond to the risk of the cyanotoxins in drinking water. Practically the project proposes innovative technological solutions aim to develop a methodological approach based on the integration of monitoring techniques and treatment of water affected by toxic blooms. BLOOWATER aims to create forecasting models and systems of surveillance and early warning of toxic blooms to perform immediate actions such as opportune potabilization treatment. The project intends to develop and implement methods to treat cyanobacteria laden water with more efficient processes, to define diagnostic protocols through the use of innovative techniques for water monitoring, and create forecasting models and systems of surveillance and early warning of toxic blooms. Combined these actions will allow water treatment fallibilities to optimally adjust treatment plant operations in response to the onset of cyanobacteria blooms.

To develop cyanobacteria forecasts two different but complimentary methods are being tested

1) The use of Process based models, in this case the combination of the GOTM Hydrodynamic model and the SELMA biogeochemical model coupled using the Framework for Biogechemical Models (FABM) SELMA simulates the biomass of a generic cyanobacteria group and we will test if this can be of useful predictor of cyanobacteria blooms

2) Use of machine learning based models that will be forced and trained on the same data sets used to simulate and verify the process based models, but which may also take as imput data generated by the process based models.

Here we provide an archive of forcing data and measured lake chemistry and phytoplankton data that will be used by BLOOWATER to develop and test model forecasts using both process based modeling and machine learning approaches.

Data are provided for Lake Erken Sweden a primary case study site in the BLOOWATER project

All data files are formatted for use with the GOTM version 5.3 (https://gotm.net/) and SELMA models that are coupled by the frame work for biogeochemical models (https://github.com/fabm-model). The lake model was calibrated using the Parallel Sensitivity Analysis and Calibration tool ParSAC (https://bolding-bruggeman.com/portfolio/parsac/) The measured data used for calibration in the format used by ParSAC are also included in this archive

Additional data and machine learning workflows developed by the BLOOWATER project are available at https://github.com/Shuqi-Lin/Algal-bloom-prediction-machine-learning

Subject Keywords

Coverage

Spatial

Coordinate System/Geographic Projection:
WGS 84 EPSG:4326
Coordinate Units:
Decimal degrees
Longitude
18.6290°
Latitude
59.8390°

Temporal

Start Date:
End Date:

Content

README.MD

Archive Description

LAKE ERKEN

All data were collected from Lake Erken the site of the Uppsala University Limnology field station (http://www.ieg.uu.se/erken-laboratory/). The simulation period is between 2004-2018, and we used a 5 year (1999-2003) model spin-up period prior to simulation and calibration. Model forcing data are obtained from meteorological stations at the Erken laboratory, and from Erken's routine monitoring program. To calibrate the model, data sets of high frequency automated monitoring of water temperature, chlorophyll fluorescence and dissolved oxygen are combined with laboratory measurement from the routine monitoring program. Calibration data for nutrients are only available from the Erken laboratories routine monitoring program All data files are formated as required by the lake models. Files that define the model parameterization (yaml) are also included

Files Defining Model Setup

_ hypsograph.dat _

File containing data to define the depth vs area relationship for the lake basin, which in turn are used to calculate the water volume associated with a given depth. Data are in the format expected by the GOTM model. Line 1 contains two numbers the first is the number of depths at which the surface area is calculated the second defines the direction of the profile bottom to top = 1 or top to bottom =2 the remaining lines contain the surface area (m2) of the lake at the specified depth.

_ gotm.yaml _

Parameter settings for the GOTM model These have been optimized to give the best match between simulated and measured water temperature. In yaml format expected by GOTM. Parameter values in this file are default values for the Erken simulations

_ fabm.yaml _

Parameter settings for the Selma model. These have been optimized to give the best match between simulated and measured values of chlorophyll oxygen NOX, PO4-P and Total P. This file is in yaml format expected by the GOTM/SELMA model Parameter values in this file are default values for Erken Simulations

Model Forcing Data

Meteorological data

_ Erken_MetFile_1960-2018.dat _

These are data collected from an automated monitoring station located on Malma Island, a small island 500 m offshore from the Erken Laboratory (59.83909N 18.629558W). SMHI meteorological station at Svanberga (59.83232N 18.654550W)

More information on this data set is also available from Moras, S. (2019). Simulation of Lake Erken water temperature (1961-2017) with GOTM: Model configuration, input data, output data and observed water temperature, HydroShare, https://doi.org/10.4211/hs.7e5ec8c0e2b245199ab13cc9ae08b841

The data are formatted to work with the GOTM model

  • Date/Time yyyy-mm-dd hh:mm:ss
  • UWind U wind component (m s-1) For 1D GOTM only this component is used
  • VWind U wind component (m s-1) Not used
  • AirPressure_hPa Air pressure (hPa)
  • Air_Temp Air temperature (C)
  • RelHumidity_Avg Relative Humidity (percent)
  • Cloud Cover Fraction of sky (0-1)

Data in this file are at an hourly time step

_ ErkenSWR_1960-2018.dat _

Short wave radiation data read in a separate input file, which has an hourly time step

  • Date/Time yyyy-mm-dd hh:mm:ss
  • SW_Rad_Avg Shortwave radiation (watts m-2)

_ ErkenDailyPrec_1960-2018.dat _

Precipitation data in a separate input file, which has an daily time step

  • Date/Time yyyy-mm-dd hh:mm:ss
  • Precip Precipitaion (mm d-1)

_ ErkenWSE_1995-2018.dat _

Lake water level elevation data in a separate input file, which has an daily time step

  • Date/Time yyyy-mm-dd hh:mm:ss
  • Water_Level_Avg Level of the lake surface water meters above sea level

Nutrient Loading Data

_ Kristine_InputQ1999-2018.dat _

This is the file of the daily discharge and water temperature for the largest input to Lake Erken that comprises 50.7% of the total watershed area. Data are obtained from the SMHI S-HYPE model (http://www.smhi.se/en/research/research-departments/hydrology/hype-in-sweden-s-hype-1.7891) with the Kristineholm station having an outlet at, SWEREF99: 694802, 6640401. Note that these data begin 2004-01-01. To enable a 5 year spin-up of the model we have taken the period 2004-2008 and appended it to this file as the time period 1999-2003

  • Date/Time yyyy-mm-dd hh:mm:ss

  • Inflow Stream discharge (m3 s-1)

  • Temp C Stream water temperature (C)

_ Unguaged_InputQ1999-2018.dat _

This file is for the discharge for the remaining area of the watershed that is not in the sub-basin measured at Kristineholm. Discharge is adjusted based on the areas of the Kristineholm watershed and the remaining ungauged area. Stream water temperature is assumed to be the same as that at Kristineholm. This file is in the same format as the Kristineholm file above.

_ InFlowChem1999-2018_Selma.dat _

This is a file of water chemistry measurements made at Kristineholm as part of the Erken laboratory routine monitoring program. Sampling frequency is monthly to every two weeks. GOTM does a linear interpolation between the measured concentrations and combines these estimates with daily discharge data to calculate nutrient loads. Note that we assume the same nutrient concentrations would be representative for the ungauged portion of the watershed. All concentrations are in mmole per liter

  • Date/Time yyyy-mm-dd hh:mm:ss
  • PO4-P (mmole/l) PO4-P concentration in the stream water(mmole l-1)
  • TP (mmole/l) Total P concentration in the stream water(mmole l-1)
  • NOX-N (mole/l) Nitrate + Nitrite concentration in the stream water(mmole l-1)
  • NH4 (mole/l) Ammonia concentration in the stream water(mmole l-1)

Model Calibration Data

Files used for model calibration are not directly used force model simulations, but were used to calibrate the model in order to obtain the optimized parameter values in the gotm.yaml and fabm.yaml files. They are included here to allow independent attempts to calibrate GOTM/SELMA or other models, and also to provide data that can be compared against the model simulations. Model calibration was done using the Program ParSAC (https://bolding-bruggeman.com/portfolio/parsac/). The file config_acpy.xml is the file that is used by ParSAC to define the model state variables and sources of data to be used for comparison in the model calibrations. Also in this file are all of the parameters that were optimized and the range of acceptable calibration values for each parameter.

Water temperature data

_ Erken_DailyTempData_2004-2018_NoWinter.obs _

During ice free conditions full profiles of water temperature are automatically recorded at the Eastern end of Lake Erken approximately 500 m to the NE of the Malma island meteorological station at a depth of 15.5 m (59.84297N 18.635433W). During the PROGNOS project 2 different systems were used both based on thermocouple sensors. Between 1989-2016 temperatures were measured every 30 min at 30 depths between 0.5 and 15 meters depth using a seasonally deployed floating system. Starting in 2016 an upgraded system measured temperatures at 50 depths between 1.0 and 15.5 m depth at hourly intervals. This system was designed to operate year round and is moored underwater below the depth of ice formation. Finally at the Malma Island Meteorological station water temperature were measured year round at 3 depths 1.0m, 3.0m, and 15.0 m. These three data sources were combined to create as complete a data record as possible. Prior to the development of the new temperature measurement system, the Malma Island temperature measurements were merged with data from the moored system in order to provide a continuous data record that covered the periods when the moored system was not in operation. Once the new system was in operation its results were append onto the end of the data file created from the other two data sources. All data collected between 15 December - 31 March are excluded from the temperature data used for model calibration. This was done since GOTM does not simulate sediment heat loss and warming of the bottom water during ice covered conditions. The data file format is that expected by the ParSAC program that we used to calibrate the mode.

Each file line contains:

  • Time stamp
  • Depth
  • Water temperature (C)

Water Quality Data

HF water quality measurements collected by a YSI EXO2 sonde (https://www.ysi.com/exo2) that was deployed on a profiling system (https://www.ysi.com/Pontoon-Vertical-Profiling-System) were used to calibrate the GOTM/SELMA model. Data used were dissolved oxygen, and chlorophyll fluorescence. This system was also deployed near Malma Island in the main basin of the lake the exact location (and maximum depth) varied from year to year. The present location is at 59.84530N 18.624217W and at depth of 18.0 meters. Profiles are collected every hour at 0.5 meter intervals. There is also a long-term routine monitoring program which collects a volume weighted sample of the epilimnion and hypolimnion water at one to two week intervals. Laboratory analysis of these samples provided measurements of PO4-P, Total P, NH4, NOX (nitrate + nitrite) and Chlorophylla that could be used for model calibration. Separate yearly files are provided for each of the water quality parameters and they are in the format expected by ParSAC

_ Comb_Chl_YSI_Selma.obs _

_ Spring_Comb_Chl_YSI_Selma.obs _

_ Remaining_Comb_Chl_YSI_Selma.obs _

These are chlorophyll data files that are created by merging data from both the automated and manual measurements. The automated profiles are averaged to calculate a mean daily profile. For the laboratory measurements we assigned a depth of -3 meters to the epilimnion sample and a depth of -15 meters to the hypolimnion sample. Evaluation of the chlorophyll data from both sources showed that the two data sources matched well during most of the year, but that the YSI data underestimated the lab measured chlorophyll concentrations during summer when large colonial cyanobacteria were present , especially in the presence of Gloeotrichia.colonies This is most likely because the larger (500 ml -1000ml) water sample used for the lab chlorophyll measurements more effectively sampled the inhomogenously distributed chlorophyll rich cyanobacteria colonies. During periods when cyanobacteria were present and there were large discrepancies between the laboratory and automated measurements, the manual measurements were removed from the data set. There are also three data files used for calibration Comb_Chl_YSI_Selma.obs contains the entire data record and can be used to calibrate all of the modeled phytoplankton groups (diatoms, flagellates and cyanobacteria) simultaneously. The remaining two files are used in a different calibration stragegy. Spring_Comb_Chl_YSI_Selma is a file containing data collected between (1 March and 16 May) the period during which the spring bloom normally occurs and it is used to specifically calibrate the model parameters associated with diatoms. Remaining_Comb_Chl_YSI_Selma.obs is a file containing the measured chlorophyll data from the remainder of the year and is used to calibrate the flagellate and cyanobacteria groups.

Each file line contains:

  • Time stamp
  • Depth
  • Chlorophyll concentration (mg l-1)

_ MeanDaily_2004-2018_Erken_O2_mmole.obs _

This file contains oxygen data from both the YSI automated profiling system and field measurements made during the routine sampling program. The hourly automated profiles are averaged to calculate a mean daily profile. During the routine sampling program measurements were made between the surface and 20 meters at 1 meter intervals using a temperature /oxygen probe at the deepest area of the lake. Mean daily oxygen profiles from the automated sampling were merged with those collected by the routine monitoring program in this file. The format is that expected by ParSac.

Each file line contains:

  • Time stamp
  • Depth
  • Oxygen concentration (mmole l-1)

_ Comb_TotP_Selma.obs _

_ Comb_PO4_Selma.obs _

_ Comb_NOX_Selma.obs _

_ Comb_NH4_Selma.obs _

These are Lake Erken nutrient data collected by the routine monitoring program. No automated measurements of these nutrient are collected. A depth of -3 meters is assigned to the volume weighted epilimnion sample and a depth of -15 is assigned to the volume weighted hypolimnion sample.

Each file line contains:

  • Time stamp
  • Depth
  • Nutrient concentration (mmole l-1)

Example Simulation Output

_ Output.nc _

This is a file in NetCDF format and contains a large amount of simulated output as well as the model forcing data. The simulation that produced this output is are forced by the data described above and model parameterization is defined by the setup files described above. The simulation period was between 2004-2018, as defined in gotm.yaml. To simply view the data contained in the file it is recommended to use the program PyNcView (https://sourceforge.net/projects/pyncview/)

Calib_Plot

This is a directory of plots that show time series and scatter plot of the simulated vs measured data for the example simulation related to the gotm.yaml, fabm.yaml files deposited here. Much of these data can also be viewed in the Output.nc file above.

LAKE VANSJÖ

Data Services

The following web services are available for data contained in this resource. Geospatial Feature and Raster data are made available via Open Geospatial Consortium Web Services. The provided links can be copied and pasted into GIS software to access these data. Multidimensional NetCDF data are made available via a THREDDS Data Server using remote data access protocols such as OPeNDAP. Other data services may be made available in the future to support additional data types.

Credits

Funding Agencies

This resource was created using funding from the following sources:
Agency Name Award Title Award Number
FORMAS BLOOWATER Supporting tools for the integrated management of drinking water reservoirs contaminated by Cyanobacteria and cyanotoxins 2018-02771
Swedish Research council Swedish Infrastructure for Ecosystem Science (SITES), funding to the Erken Laboratory Uppsala University
EU Water JPI ERA-NET WaterWorks2017 Cofunded Call. This ERA-NET is an integral part of the 2018 Joint Activities developed by the Water Challenges for a Changing World Joint Programme

How to Cite

Pierson, D. (2023). Modelling Data for Predicting Cyanobacteria Blooms - JPIWater Project BLOOWATER, HydroShare, https://doi.org/10.4211/hs.10e1281196d34550b42501f611e268f9

This resource is shared under the Creative Commons Attribution CC BY.

http://creativecommons.org/licenses/by/4.0/
CC-BY

Comments

There are currently no comments

New Comment

required