One Ecosystem : Research Article
PDF
Research Article
The performance of two urban flood regulation models using different input data
expand article infoVanya Stoycheva, Stoyan Nedkov
‡ National Institute of Geophysics, Geodesy and Geography - Bulgarian Academy of Sciences, Sofia, Bulgaria
Open Access

Abstract

Increasing climate change has led to an increase in urban flood events. Events with a return period of twenty years become events with a period of two to three years. The primary objectives of this study are to evaluate the performance of flood regulation models using different input data and compare their performance for flood regulation supply (FRS) assessment. The assessment of the ecosystem services of urban FRS was performed using two models: Urban Flood Risk Mitigation (UFRM) by Integrated Valuation of Ecosystem Services and Tradeoffs (InVEST) and IMECOGIP Waterflow Regulation (Retention Model) (WRRM). They rely on relatively accessible open-source data, require a short time to model each scenario and provide opportunities to interpret results in different spatial and measurement units. We performed several improvements to the input data, including updating the national ecosystem-type assessment for the case study area by connecting it with the Urban Atlas typology and proposing an approach for soil data refinement. The outputs of the two models show significant differences in FRS values for each land-use/land-cover (LULC) class. The UFRM assesses different impervious urban classes with different values, varying from very low to medium. In contrast, the WRRM assesses the FRS in densely sealed areas with one fixed value for low supply (where nearly all rainfall is transformed into runoff).

Keywords

flood regulation supply, InVEST, IMECOGIP, Sofia, Urban Atlas, MAES ecosystem types

Introduction

Floods are amongst the primary impacts of climate change (UN DESA 2022), leading to widespread losses and damage to cities, settlements and infrastructure in highly dense urban ecosystems (Calvin et al. 2023). The predominance of impervious surfaces in these ecosystems is one of the primary factors that increase flood risk (Jacobson 2011). Although floods are primarily hydrological phenomena, flood risk management requires a complex approach, particularly in urban planning. To mitigate the damage to the requirements of urban planning, it is necessary to establish reasonable methods for identifying disaster prevention areas (Park and Lee 2019). Hydrological cycle and water flow regulation (including flood control and coastal protection), regulation of the chemical composition of the atmosphere and oceans (air quality) and regulation of temperature and humidity, including ventilation and transpiration, are key regulating ecosystem services (ES) in cities (Maes et al. 2016, Stoycheva and Geneletti 2023) that provide essential benefits for the urban population.

Flood regulation mapping and assessment can provide the necessary data to define the flood mitigation capacity of different areas and, thus, contribute to effective urban planning related to flood risk reduction. The flood regulation ES is based on the function of the ecosystems to retain a part of the incoming rainwater in cases of flood events (Nedkov and Burkhard 2012). Wetlands, floodplains and forests are the primary water collectors in natural ecosystems. However, flood regulation in urban areas has a different mechanism for regulating water flows, as sealed surfaces increase runoff levels and green and blue infrastructure reduce them (Warhurst et al. 2014, Wübbelmann et al. 2022). The interception by vegetation and water infiltration into the soil are the primary processes driving this reduction. Their measurements are crucial for quantifying the flood regulation supply (FRS).

Such quantification requires data, which are typically unavailable through either direct or indirect measurements. To overcome this difficulty, modelling approaches for water regulation have been used to provide data on different aspects of the water cycle (Nedkov et al. 2022). Stoycheva and Geneletti (2023) identified the modelling methods most commonly used for the quantification of FRS. Hydrological models, such as KINEROS (Nedkov and Burkhard 2012), STREAM (Stürck et al. 2014), SWAT (Cheng et al. 2017), SCS (Shen et al. 2019), HEC-RAS (Wübbelmann et al. 2021) and LEAFlood (Wübbelmann et al. 2022) have been used for flood regulation ES assessment. However, hydrological modelling requires considerable resources in terms of time, data and expertise, which renders its usage by a wide range of users a challenging task. Specialised ES tools, such as the Integrated Valuation of Ecosystem Services and Tradeoffs (InVEST), the Land Utilisation and Capability Indicator (LUCI) and the Integrated Model to Assess Global Environment (IMAGE), utilise modelling approaches that can assess trade-offs and scenarios for multiple services using relatively simple and user-friendly techniques (Palomo et al. 2017). Therefore, the study of their performance in large-scale flood regulation assessments in different urban areas using widely available data is a crucial task that can contribute to both research and decision-making. A comparison of different models tested with various input data would provide valuable information for selecting appropriate tools in urban planning.

Integrated tools for the mapping and assessment of ES have developed rapidly in recent years. They are designed specifically for ES modelling and mapping to assess trade-offs and scenarios for multiple services (Vihervaara et al. 2019). These methods are usually organised into modules, each designed for the assessment of a particular service. Flood regulation, one of the most crucial and frequently studied ES, is an integral part of most of the tools. Тhe Urban Flood Risk Mitigation (UFRM) model of InVEST and CICES 2.2.1.3: Waterflow Regulation (Retention Model) (WRRM) which is part of IMECOGIP toolbox are two models that ensure appropriate means for flood regulation assessment that can be highly useful for urban planning purposes. There are three primary types of floods: riverine (fluvial)—caused by extreme river flows; coastal—caused by sea storms or tsunamis; and stormwater (pluvial, flash floods)—caused by heavy and intensive rainfall. Both UFRM and WRRM quantify the effect of pluvial floods.

Since the release of the InVEST UFRM model in 2020 (Natural Capital Project 2020), several publications have demonstrated its applicability in various ES studies. A subset of these studies has focused on flood risk issues, such as the identification of flood vulnerability areas (Salata and Arslan 2022), the ability of the urban system to reduce the runoff effect (Quagliolo et al. 2021) and the assessment of damage to the shoreline (AlRuheili 2022). However, most studies (Dai et al. 2021, Cortinovis et al. 2022, Rodríguez González et al. 2022, Claron et al. 2022, Sebastiani and Fares 2023, Veerkamp et al. 2023, Fang et al. 2023, Dai et al. 2023, Orta-Ortiz and Geneletti 2023) have focused on a bundle of ES issues and flood regulation is only one of the modelled services. Therefore, there has been no in-depth study on the performance of this model to the assessment of FRS. The sensitivity of the model to different types of input data has also not yet been studied. Furthermore, there is a lack of comparative studies on the model results with other similar models that can reveal their actual advantages and disadvantages. IMECOGIP is a relatively new tool and its models (including WRRM) have not been widely applied (Zepp et al. 2023).

In this study, we applied two open-source ES models, InVEST UFRM and IMECOGIP WRRM, to quantify FRS in urban ecosystems, based on the city of Sofia, Bulgaria as a case study for pluvial floods. The primary objectives of this study are to evaluate the performance of the models using different input data and compare their usage for FRS assessment. The specific tasks were as follows:

  1. collect the available free input data;
  2. execute the models with the available input data;
  3. update the quality of the input data;
  4. execute the models with updated data and
  5. assess the FRS based on different inputs.

This study focuses on ES supply assessment because most models used for ES assessment have been developed and specialised in this part of the assessment process. In addition, the study is dedicated only to ensuring the quality of inputs and outputs and their analysis.

Material and methods

Methodological approach

The two models—InVEST UFRM (ver. 3.13.0, (Natural Capital Project (2023))) and IMECOGIP WRRM (Alpha Release, Ruhr-Universität Bochum, Geogr. Institut (2021))—were selected for this study because they can be run with open access data and their processing time for each case/scenario is relatively low, provided that their results are applicable for different uses, including urban planning (Assumma et al. 2022). They have relatively similar modelling approaches using the SCS-CN method and similar outputs that can represent the FRS-runoff retention index (RRI) (Natural Capital Project 2023) and discharge coefficient (CW) (Ruhr-Universität Bochum, Geogr. Institut 2021). However, there are certain differences which render them appropriate for comparison, both in terms of the quality of the input data and the precision of the results for the flood regulation ES.

We developed a specific five-stage approach to assess the performance of the two models, based on different inputs, which enables the execution of models with different input data and comparative analyses of the results (Fig. 1). The first stage consisted of collecting and analysing all available and applicable data for the case study area, as well as studying the two models and their characteristics. In the second stage, the models were executed using the initial datasets. The next stage involved updating the initial datasets to improve their quality. The improvement in quality and accuracy was performed for the model input data—soil types, ecosystem types datasets and Urban Atlas classes by updating the already existing datasets and applying an approach for soil types update. Stage 4 included new modelling with already updated datasets and the final stage was data analysis and FRS assessment (Fig. 1). Each step is described in more detail in Subsections Input data, Flood regultion modelling and Comparison of model results.

Figure 1.

Conceptual scheme of the proposed approach.

UFRM—InVEST Urban Flood Risk Mitigation;

WRRM—IMECOGIP CICES 2.2.1.3: Waterflow Regulation (Retention Model);

CW—Discharge coefficient;

MAES BG—Mapping and Assessment of Ecosystems and their Services in Bulgaria.

Case study area

Sofia is the capital city and largest urban area in Bulgaria. The case study area of Sofia is located in western Bulgaria and covers an area of 221.53 km2 (Fig. 2). It represents urban ecosystems and their subtypes, as well as other ecosystem types, such as croplands, grasslands, woodlands and forests, heathlands and freshwater. The same case study has been studied as part of the INES project (Petkova et al. 2022).

Figure 2.

Case study area. A) Ecosystem subtypes (Level 3) (see Table S1, Suppl. material 1, Apostolova et al. (2017a), Apostolova et al. (2017b), Kostov et al. (2017), Velev et al. (2017), Yordanov et al. (2017), Zhiyanski et al. (2017)). B) Urban Atlas classes (2018) (see Table S2, Suppl. material 1, European Environment Agency 2020).

The boundaries of the area do not follow the administrative division of the Sofia Municipality, but reflect the spatial extent of the impact of the densely urbanised area. The case study was delineated following the extent of the urban heat island intensity in the 20–22 h interval (August 2019) (Sofiaplan 2020) and the boundaries were corrected according to the extent of the polygons of the ecosystem types from the national methodologies for Mapping and Assessment of Ecosystems and their Services in Bulgaria (MAES BG) (Maes et al. 2013, Apostolova et al. 2017a, Apostolova et al. 2017b, Karamfilov et al. 2017, Kostov et al. 2017, Sopotlieva et al. 2017, Uzunov et al. 2017, Velev et al. 2017, Yordanov et al. 2017, Zhiyanski et al. 2017).

Six of eight terrestrial ecosystem types from the MAES classification (Maes et al. 2013) were presented in the case study (Fig. 2A). These include urban, cropland, grassland, woodland and forest, heathland and shrub and freshwater ecosystems. Urban ecosystems were the predominant type presented by all ten subtypes developed for Bulgaria according to the methodology for urban ecosystems (Zhiyanski et al. 2017). The largest proportion was in J1: residential and public areas; J5: urban green areas (including sports and leisure facilities); J6: industrial sites (including commercial sites); and J7: transport networks and other constructed hard surface sites.

We also used classes from the Urban Atlas dataset (European Environment Agency 2020) to evaluate the performance of the models using different input data. There were 22 of 27 classes in the case study area, which corresponded to a highly diverse urban environment (Fig. 2B). Class 12100, ‘Industrial, commercial, public, military and private units’, was the most common as it covers over 1/5 of the case study area (21.74%). The other highly represented areas were classes 11210 ‘Discontinuous Dense Urban Fabric (S.L.: 50– 80%)’ (17.72%) and 14100 ‘Green urban areas’ (15.64%).

Sofia has most recently experienced flooding in 2005 (Council of Ministers of the Republic of Bulgaria 2022), July 2020 (Council of Ministers of the Republic of Bulgaria 2022) and June 2023 (National Institute of Meteorology and Hydrology of Bulgaria 2023). In the 2020 flood (around 50 mm/24 h in different locations), the most affected were the southwest districts and certain areas alongside the urban rivers Vladayska, Perloska and Slatinska (National Institute of Meteorology and Hydrology of Bulgaria 2020). The most recent flood, in June 2023 (61.30 mm on 15 June), had a slightly different spatial extent because the affected areas were in the south and southeast of Sofia. The three flooding events were preceded by periods of rainfall that exceeded the rainfall norms for each month.

Input data

Initial dataset

As described above, these models were selected because their input data are commonly used, open-source and relatively easy to set. Both models used similar sets of input data (Table 1); however, their outputs are relatively different. The area of interest (AOI), which is a vector file, is common and mandatory in both models. The precipitation in both models is set to be the rainfall depth for a specific event in a specific period. There is a difference in the LULC datasets because the UFRM requires a raster layer, whereas the WRRM requires a vector layer. For UFRM, there are no restrictions on the use of different classifications because users can define their specific classifications for the case study area. On the contrary, WRRM requires the LULC classes to be adapted to the sample data provided in the toolbox or own datasets with CW values have to be provided.

Table 1.

Correspondence of input data between InVEST UFRM and IMECOGIP WRRM.

InVEST UFRM

IMECOGIP WRRM

Input data

Type/Unit

State

Input data

Type

State

Area of Interest (AOI)

vector

Required

AOI

vector

Required

Rainfall Depth

mm

Required

Precipitation

mm

Required

Land Use/Land Cover

raster

Required

Land Use

vector

Required

Soil Hydrologic Group

raster

Required

-

Biophysical Table

.csv

Required

Predefined CW Values

.csv

Required

Built Infrastructure

vector

Optional

-

Damage Loss Table

.csv

Optional

-

-

Symbology

.qml

Required

Both models use LULC classes. The difference is that, in the WRRM, the LULC dataset already has predefined CW values for each class, which accompanied the model input sample data, whereas the UFRM does not use a specific predefined value and the user can decide which source to use for the data provision. Another difference is the use of the curve number approach in UFRM, which is incorporated into the biophysical table that connects LULC classes with soil hydrological groups. The WRRM uses a slightly different approach that does not consider soil types in the calculation. The model uses predefined values for the CW that are connected only to biotopes (equal to LULC classes) and do not correspond to different soil types.

UFRM has an optional usage of built infrastructure and damage loss table for the assessment of demand of the flood regulation service, whereas IMECOGIP WRRM does not use this approach and assesses only ES supply. This module was not used because the focus of the current study was on FRS rather than demand.

We used two available sources for LULC data: ecosystem subtypes (MAES BG, level 3) (Bratanova-Doncheva et al. 2017, Zhiyanski et al. 2017) and Urban Atlas classes (European Environment Agency 2020). Urban Atlas data were used as a LULC input, as they cover the Sofia functional urban areas (FUA), which is one of the scales used for ecosystem assessment in MAES (Semerdzhieva and Borisova 2021). We extracted part of the Sofia FUA within the case study area and performed additional analyses and cross-walking between the 2018 version of the Urban Atlas database and ecosystem types, as well as verified them with orthophotos. Further verification was conducted using Google Maps (Google Maps 2023) to update the database.

InVEST UFRM uses the Curve Number method, which combines the LULC dataset with soil hydrological groups to derive runoff retention values per pixel in the watershed (AOI) (United States Department of Agriculture 2007). Soil datasets were collected from several data sources (Geography Institute at the Bulgarian Academy of Sciences 2002, Japan International Cooperation Agency 2008, Hiederer 2013, Gyurov and Artinova 2015, International Union of Soil Sciences Working Group WRB 2022, Todorova and Zhiyanski 2023) and for usage in the UFRM, a cross-walking between different classifications was required (see Soil dataset update: additional information, Suppl. material 1). The case study had three of four soil hydrological groups (B, C and D), whereas the only group not presented in the area was group A.

Precipitation depths (Table 2) were derived from the National Disaster Risk Profile in Bulgaria (NDRPB) (Council of Ministers of the Republic of Bulgaria 2022). These values have been used for the scenarios in the modelling of flood risk in Sofia Municipality, where rainfall is the source of flooding (Council of Ministers of the Republic of Bulgaria 2022).

Table 2.

Precipitation depth used for modelling.

Precipitation (mm)

Description

Frequency

Source

69

High-probability event

1 per 20 y

NDRPB

91.6

Medium-probability event

1 per 100 y

NDRPB

140.7

Low-probability event

1 per 1000 y

NDRPB

The precipitation depths correspond to the low-probability event (1000 y rainfall frequency occurrence), medium-probability event (100 y rainfall frequency occurrence), and high-probability event (20 y rainfall frequency occurrence) (Council of Ministers of the Republic of Bulgaria 2022). These depths meet the requirements of the European Floods Directive and the National Flood Hazard and Risk Mapping Methodology (Council of Ministers of the Republic of Bulgaria 2022). The data themselves is a 24-hour sum of rainfall derived from the National Institute of Meteorology and Hydrology (Council of Ministers of the Republic of Bulgaria 2022). All these scenarios were tested with the models; however, the high-probability event (69.00 mm) was used for most of the analyses in the results section as the frequency of similar events had increased in the last 5–10 years for the case study.

The two models function with look-up tables that combine previously prepared data for each LULC class. For UFRM, it is called a biophysical table (Tables S5–6 Suppl. material 1), which combines the LULC classes with the curve number for the respective hydrological soil group. The CN values were derived from the EU-level-based study by Vallecillo et al. (2020) and were first assigned to the ecosystem types datasets and after the cross-walking between the ecosystem types and Urban Atlas typology assigned to each corresponding Urban Atlas class. The Table contains columns corresponding to each soil group. The WRRM also requires previously prepared data, which include the CW predefined values for each biotope (Ruhr-Universität Bochum, Geogr. Institut 2021). The model provides sample data which can be adapted, based on the local LULC or datasets of CW values that must be determined previously. In this study, the available biotope classifications were related to the MAES BG ecosystem subtypes and UA classes.

Data update

All the datasets have limitations and shortcomings that reflect the manner in which they have been developed. Improvement in the initial data may lead to more precise and accurate modelling results. The purposes of this stage are to analyse the quality of the initial data, apply a set of procedures that can improve their quality and further assess the sensitivity of the models to the quality of the input data.

The analyses and updates of the LULC datasets include different approaches for both ecosystem types and UA classes. The primary issues with ecosystem subtypes are topological errors, including gaps and overlaps between all layers of different ecosystem datasets in the case study area (Petkova et al. 2022). The ecosystem types covering the case study were corrected and verified. Detail correcting steps can be found in Suppl. material 1.

Cross-walking for the UA–MAES was developed, based on the CORINE Land Cover (CLC)–MAES BG cross-walking performed by Hristova and Stoycheva (2021). The cross-walking results showed a lack of full coverage of all UA classes in the case study area (a subset of the arable land classes, the port area class and a subset of the forest classes); several classes that did not have matching classes in the CLC and a class that is not presented in the UA classification, but is presented in the CLC classification.

After analysing the cross-walking results, a specific approach was developed for database correction, which was applied to a subset of the UA classes (for a detailed approach explanation, see Suppl. material 1).

Soils in urban areas are mostly sealed, resulting in changes in their infiltration functions and an increase in surface runoff (McGrane 2016). The soil datasets suggested in the InVEST user guide and other open sources are too coarse and do not reflect the reality of the urban environment, where most of the area is sealed. The latter is the driver for deriving an improved method for downscaling available soil datasets (Table S3, Suppl. material 1). This approach includes specifying the soil types in urban areas to decrease modelling uncertainties. Flood plains, with respect to their alluvial soils, are identified as areas that can be derived from available open-source data, such as DEM. Soil dataset updating was necessary because soil is one of the primary factors driving the runoff retention potential of each LULC unit. The collected soil dataset was analysed and updated in a GIS environment. A 5-m DEM from the Sofia Municipality dataset (Sofiaplan 2017), was used to derive the slopes in ArcGIS (Environmental Systems Research Institute 2018). They were classified into five classes (Huggett 2016): 0–3, 3.01–5, 5.01–15, 15.01–35 and 35.01– 45°, where the flood plains covered the area between 0 and 3°. An intersection between the slopes and river channels was determined to define the exact spatial distribution of the floodplains. River geospatial data from the MAES BG dataset were used as it was the most appropriate and had full coverage for the case study. For precise identification of the alluvial plains, a 20-m buffer (10 m left and 10 m right) was used. The upgraded spatial distribution of alluvial soils was derived from the later intersections.

The resulting dataset was verified using urban soil sampling. The primary attribute for performing the verification was the clay-sand ratio (required for the derivation of soil hydrological groups for InVEST UFRM) of the corresponding sampling points within the case study floodplains (Table S3, Suppl. material 1).

Adjustment of the CN based on the imperviousness was made for the artificial Urban Atlas datasets. This follows the suggested by Vallecillo et al. (2020) approach for refinement of the Urban Atlas dataset (which is derived from the Corine Land Cover). We have used the following equation (1):

(1) CNTotal = 98 * (Imp/100) + (1-Imp/100) * CNCLC

where Imp is the level of imperviousness (soil sealing) and CNCLC (in our case UA) is the CN derived from Vallecillo et al. (2020) without correction.

The biophysical table was updated according to LULC and soil datasets. The added and extracted LULC classes were corrected in the initial table and the hydrological soil groups were updated with the new classes.

Overall verification of the updated datasets was performed through two approaches. The first one includes creating a grid with random points in GIS with a distance of 200 m between each validation point. Changes in the ecosystem subtypes cannot be traced through a simple count of the changed features, as the dataset has been updated by correcting the topological errors, as well as by removing the overlaps between different polygons. The verification of the Urban Atlas dataset through grid points shows a 13.4% change in the overall features (Suppl. material 2).

The second approach was comparing the average polygon feature area, as lowering the average polygon area leads to an increase in the precision of each mapped feature (Suppl. material 3). Nine out of twenty-five Urban Atlas classes showed a decrease in the average feature area with an average of 35,200 m2, mostly due to the reclassification of class 23,000 ‘Pastures’ to other classes. The ecosystem subtypes have a more distinctive lowering of the average feature area, where 24 out of 35 subtypes have changes and the average lowering is around 46,500 m2.

Flood regulation modelling

InVEST Urban Flood Risk Mitigation

The InVEST UFRM calculates the runoff retention per pixel (based on the input data resolution) of the LULC layer (Natural Capital Project 2023). We executed the model with different input data, simulating different scenarios (Table S4, Suppl. material 1), as follows: three precipitation depths, ecosystem subtypes as LULC (initial and updated), Urban Atlas classes as LULC (initial and updated), soils (initial and updated) and the required input data of the biophysical table for each of the LULC scenarios.

Another input required is the specification of the AOI. The input layer must be in a vector format that predefines the possibility of using the file as a tool for cross-referencing between different models. The output of the model (Table 3) uses vector features such that common geographical units can be compared, as was performed in this study. Urban Atlas polygons were used as a common unit between the two models and each specific LULC class was assessed.

Table 3.

Correspondence of the outputs between InVEST UFRM and IMECOGIP WRRM.

* Outputs used for analysis

InVEST UFRM IMECOGIP WRRM
Outputs Type Outputs Type
Parameter log text (.txt) No similarity
Runoff retention raster (no unit) Partial similarity
Runoff retention raster (m3) No similarity
Runoff values raster (mm) No similarity

Flood risk service:

- rnf_rt_idx*: average of runoff retention values per watershed;

- rnf_rt_m3: sum of runoff retention volumes in m3 per watershed;

- flood_vol: the flood volume per watershed;

- aff_bld: potential damage to built infrastructure in currency units per watershed;

serv_blt: service.built values for this watershed.
vector (permanent)

Cw Values

- CW value*: multiplier/factor: 0 avoid runoff to 1 full runoff;

- Area (A): m2;- P (mm): precipitation intensity (mm = l/m²);

- R_R (km2): Present-Retention Situation of avoid runoff;

- R_R_gew (%): Present-Retention Situation area weighted;

- F_R (km2): Best Retention case (whole area is forest CW = 0.15 and CW = 1 for waterbodies);

- F_R_gew (%): Best Retention case (Forest Retention) area weighted;

- W_R (km2): Worst Retention case (whole area is sealed surface CW = 0.9);

vector (temporary and permanent)
- ES Balance: a standardised scale (0 to 1) showing the deficit in ES supply compared to the ideal conditions. vector (temporary and permanent)

The model uses raster layers as input for LULC and soil data; therefore, the resolution of the input (initial) data was modified to match the overall resolution of the input data. This included the transformation from a vector to a raster of soil and LULC geospatial data, as well as resampling the soil dataset to the LULC resolution. For this study, we selected a raster resolution of 10 m for all analyses, as the resolution matches the minimum mapping width of the UA.

IMECOGIP

The IMECOGIP toolbox was developed under the "Implementation of the ecosystem services framework in green infrastructure planning for resilient urban development in the Ruhr and in Chinese megacities" (IMECOGIP) project within the research initiative "Sustainable Development of Urban Regions" (SURE) funded by the German Federal Ministry of Education and Research (Ruhr-Universität Bochum, Geogr. Institut 2021). It is an open GIS-based toolbox for ES assessment and trade-off evaluation for different urban environments, consisting of six different modules for regulating ES (Alpha Release, IMECOGIP 2021).

The IMECOGIP WRRM is designed to model individual heavy rain events (Ruhr-Universität Bochum, Geogr. Institut 2021) to calculate the runoff avoidance capacity of a case study area. The model is used as a toolbox in the QGIS software and consists of a platform for input datasets. The processing time of the results is similar to that of InVEST UFRM, with slightly more time when modelling with UA classes.

The model was executed under three rainfall scenarios (Table 2). In contrast to UFRM, the WRRM input data format is a vector and additional conversion of the working files is not required. This contributed to preserving the vector outputs and their scales (Table 3). The model provides an opportunity to save the output as different options, including as a shapefile, as well as other QGIS-available files (QGIS 2023), geopackage and database tables. The WRRM can be performed as a batch of processes, enabling easier usage of the model.

Apart from CW values, the other output parameters from the model (Table 3) are the results of simulating different scenarios for the case study area, including the current condition (present retention case), the entire area covered in forest (best retention case) and the entire area being sealed (worst retention case). Another primary output is the ES diagram (output within the model result, which cannot be exported as a separate file), which presents the absolute amount of ES, the ideal state for the area and the least favourable scenario for it.

Comparison of model results

The evaluation of model performance using different input data was the primary objective of this study. To perform the comparison, we have selected a few approaches to identify how the two models react to different input datasets and rainfall scenarios. We conducted a comparative analysis of FRS results in terms of two primary aspects. The assessment of FRS necessitates the identification of appropriate indicators to quantify the ecosystem regulation function. First, FRS maps derived from different models were compared and the results were analysed. The FRS values were normalised so they could be compared in each simulation. The final score for the FRS varied between zero (no retention capacity) and one (full retention capacity). Second, the FRS maps derived using different input data were compared and analysed. The analyses were based on the type and area coverage, as well as distribution of each LULC class and FRS capacity. FRS for each ecosystem type (subtype) and Urban Atlas class; the initial and updated datasets, were derived, as well as overall statistics (average, median, quartiles) for these datasets.

Another crucial aspect of input data analysis is modelling using different rainfall scenarios. We applied three rainfall scenarios (Table 2) using different combinations of LULC classes. After thoroughly reviewing and comparing the results of each simulation, we determined the performance of the models under different rainfall scenarios calculating the FRS for each LULC scenario and overall statistics (average, median, quartiles). For the analysis of different scenarios, a set of data was selected to better represent the differences in the model outputs. For this analysis, the simulation with initial and updated ecosystem subtypes, Urban Atlas datasets (for the LULC) and the 69 mm scenario were selected.

The geospatial data visualisation was performed using ArcMap, where each indicator provided on the assessment value had a different colour scheme. Data processing was performed using statistical operations within ArcMap and analysed using data processing software (MS Excel).

Results

FRS: InVEST vs. IMECOGIP

The InVEST UFRM outputs include three runoff retention parameters representing the amount of precipitation captured per grid cell (Table 4). Runoff values varied between 0 (no retention) and 69 mm (the entire amount of precipitation captured by the ecosystem). For the runoff retention parameters, these values are transformed into quantities (m3) and relative retention capacities ranging from zero to one. The latter was defined as the most appropriate to depict the spatial distribution of runoff retention (Fig. 3) and was used for further analyses of the FRS. These outputs provide the option for estimating the FRS of each ecosystem subtype and Urban Atlas classes.

Table 4.

Results of InVEST UFRM and IMECOGIP WRRM simulations with the 69 mm precipitation and initial dataset UA.

InVEST UFRM

IMECOGIP WRRM

Outputs

Result

Average

Outputs

Result

Average

Runoff retention

0–1

0.55

Runoff retention

0–1

0.35

Runoff retention (m3)

0–6.9

2.73

-

Runoff values (mm)

0–69

40.85

-

Figure 3.

InVEST UFRM (A) and IMECOGIP WRRM (B) outputs that can be used for flood regulation supply mapping (based on the initial Urban Atlas dataset).

The only IMECOGIP WRRM output that has similarity to the UFRM output is the CW value, which serves as an avoided discharge (or avoided runoff) captured per unit (in our case, Urban Atlas classes). The CW varied from zero (fully avoided runoff) to one (no retention and full runoff) (Table 4). CW is the most appropriate for analysing the spatial distribution of runoff retention and it is a non-parametric coefficient which can be used for comparison with the RRI from InVEST UFRM.

The analyses showed that the RRI (InVEST UFRM) and CW values (IMECOGIP WRRM) were the most appropriate parameters to be used as indicators for quantifying FRS. Both parameters were measured in the same dimensions (zero–one) but in reverse, which resulted in inverse representations in the maps (Fig. 3). Therefore, the CW values were normalised for the FRS, whereas the RRI derived from the InVEST UFRM was used without changes. The runoff retention values vary from zero to one, where zero is full retention (full runoff, for example, roads) and one is avoided runoff (full infiltration, for example, forests).

The differences between the two models are due to the different approaches used to calculate the water flow components in them. The FRS from the UFRM model varies between zero and one based on the type of LULC datasets. Therefore, the results (FRS) for UFRM showed extensive variability within the LULC classes (Fig. 3 and see FRS using different data sources). In contrast, the FRS from the WRRM consisted of fixed values for each LULC class and changes in the class area did not reflect the final FRS values.

The outputs of the two models showed significant differences in the FRS values for each LULC class. The UFRM assesses different impervious urban classes with different values, as for the case study they vary between 0.1 to 0.5, whereas the WRRM assesses the FRS in these densely sealed areas with one fixed value (0.2) (where nearly all rainfall is transformed into runoff). The most significant difference was the lower FRS in the WRRM modelling results. The lower results (0.2–0.3) were derived from the lower assessment values assigned for impervious urban classes (primarily residential units).

Urban green areas received relatively similar values, varying between 0.8 and 1.0, where the UFRM values exceeded those from the WRRM. Neither model assessed green areas, urban green areas or urban forests, with large differences in flood regulation values. UFRM assessed green areas with similar values, varying between 0.9 and 1.0, whereas the WRRM provided slightly lower values (0.8–0.9) for the same areas.

In contrast, sub-urban areas within the case study received fluctuating values. The UFRM indicated relatively medium values (0.3–0.5), while the WRRM assessed the FRS as lower, between 0.3 and 0.4. Moreover, the variability of the flood regulation values was a notable difference between the two models. The WRRM provided lower assessment values to sub-urban areas, substantially lower than those in the same classes in the UFRM.

Artificial (impervious) areas, including industrial sites and transportation networks, also received varying FRS values in both models. UFRM assessed certain industrial sites with values varying between 0 and 0.3, whereas the WRRM values were closely clustered between 0.3 and 0.4, indicating a propensity for less rainfall to be transformed into direct runoff.

FRS using different data sources

Initial vs. Updated datasets

A comparison of the results using different LULC sources revealed particular differences in the assessment of the FRS. The primary differences between the two models arose from the use of different data sources. Although the UА classes had a larger minimum mapping unit (0.25–1 ha) than that of the ecosystem types (0.1–0.25 ha), the Urban Atlas classes more precisely covered each of the different LULC classes (Figs 4, 6). The data coverage for each ecosystem subtype was aggregated into larger spatial units, whereas the UA classes were substantially more diverse and reflected real-world situations more precisely.

Figure 4.

FRS based on modelling results using ecosystem subtypes. Flood regulation supply for: A) Initial ecosystem subtypes (without improvement)—UFRM; B) Initial ecosystem subtypes (without improvement)—WRRM; C) Updated ecosystem subtypes—UFRM; and D) Updated ecosystem subtypes—WRRM.

Figure 5.

Area distribution (km2) of FRS based on modelling results using ecosystem subtypes. Flood regulation supply for: A) Initial ecosystem subtypes (without improvement)—UFRM; B) Initial ecosystem subtypes (without improvement)—WRRM; C) Updated ecosystem subtypes—UFRM; and D) Updated ecosystem subtypes —WRRM.

Figure 6.

FRS based on modelling results using Urban Atlas classes. E) Initial Urban Atlas classes (without improvement)—UFRM; F) Initial Urban Atlas classes (without improvement)—WRRM; G) Urban Atlas classes—UFRM; and H) Updated Urban Atlas classes—WRRM.

Updating the initial datasets improved the details of each LULC class and increased the precision of the FRS values. The results of the comparative analysis of ecosystem subtypes and Urban Atlas classes showed significant differences in the final results for FRS, mainly for the FRS modelled with the WRRM. The usage of ecosystem subtypes resulted in lower values for FRS, as the average distribution was between 0.2 and 0.3, whereas the usage of the Urban Atlas classes led to higher FRS values, based on the difference in the size of each assessed feature (average smaller spatial units) and a more precise classification system.

The differences between the initial and updated datasets were more prominent for the FRS modelled with the UFRM, whereas the results from the WRRM did not show significant changes. The latter was due to the use of only the LULC dataset and its assigned CW for the final results. Therefore, the changes in the FRS for the results of this model were derived solely from the updated spatial units (both ecosystem subtypes and UA).

The update in the initial datasets was implied by the increasing diversity of the FRS values for both ecosystem subtypes and UA spatial units. Improvements in the modelling input data enhanced the assessment results. This is because of the integration of different types of data: LULC and soil hydrological groups (as part of the biophysical table in the UFRM). The significant changes caused by the update of the soil hydrological groups and, to a lesser extent, by the changes in ecosystem subtypes and UA classes were those in the eastern and north-western parts of the case study. They were derived from the changes in the soils from group D (highest runoff potential when saturated) to group B (moderately low runoff potential when saturated).

The FRS values using the initial and updated ecosystem subtypes with UFRM (Fig. 4A and Fig. 6E) showed significant differences in both the spatial distribution assessment and FRS values for each ecosystem subtype. The updated FRS modelled with UFRM (Fig. 4C and Fig. 6G) exhibited a similar pattern to the overall initial datasets. The updated ecosystem subtype-based dataset showed an increase in the overall FRS. This included an increase of the FRS of primary urban ecosystem subtypes (‘Residential and public areas of cities and towns’ and ‘Sub-urban areas’) as a result of updating both the geospatial and soil-related datasets. Overall, the FRS results were lower than those of the UA-based LULC data.

Overall, the FRS modelled with the WRRM had lower values for each ecosystem subtype and UA class than modelled values with UFRM, especially for the urban core area. The initial ecosystem subtypes dataset (Fig. 4B) provided lower values (0.2–0.4) to most of the ‘Residential and public areas of cities and towns’ and ‘Sub-urban areas’, whereas the UA-based assessment (Fig. 6F) had higher FRS values (primarily 0.3–0.4) for the same urban areas. Another difference between the two datasets was the green urban area. The iteration of the ecosystem subtype-based model had higher FRS values (0.8–0.9) than the UA-based model did (0.7–0.8) (Fig. 4B and Fig. 6F).

The FRS for Sofia Airport also showed another bias due to differences in classifications. The airport areas were part of the ‘Transport networks and other constructed hard surfaced sites’ urban ecosystem subtype, whereas the UA classified them as being in a separate class with a 1.28-fold lower CW value. The updated ecosystem subtypes and UA datasets (Fig. 4D, Fig. 6, Figs 4, 5H) had similar FRS distributions because the only difference between these two sets was based on the changes in the UA classes themselves.

Ecosystem subtypes vs. Urban Atlas classes

The changes in area distribution using the ecosystem subtypes (Figs 4, 5) showed slightly different results compared to those of the UA. The results within each model showed a relatively similar pattern, based on changes within the dataset. The distribution of each FRS class exhibited particular differences. The low to medium FRS values (0.2–0.5) prevailed, whereas the areas with high FRS values had a limited extent (0.6–0.8) (Fig. 5).

The changes in the area distribution of FRS between each UA class were a result of updating the LULC, soil datasets and curve number-related data (Figs 6, 7). The area distributions within the UA classes of the two models exhibited different internal dimensions. The UFRM results showed higher FRS values (0.4–0.6) (around 60%), whereas the WRRM results were inclined to very low to low values (0.0–0.4) between 66 and 68% (Fig. 7).

Figure 7.

Area distribution (km2) of FRS, based on modelling results using Urban Atlas classes. Flood regulation supply for: E) Initial Urban Atlas classes (without improvement)—UFRM; F) Initial Urban Atlas classes (without improvement)—WRRM; G) Urban Atlas classes— UFRM; and H) Updated Urban Atlas classes—WRRM.

The area distribution for the WRRM results showed nearly no change in the FRS values because of the fixed predefined CW values for each ecosystem subtype. Overall, the UFRM was more likely to assess LULC classes with lower, but diverse FRS values, whereas the WRRM usually provided lower assessment values to the same LULC classes.

The box-plot diagram (Fig. 8) provides an overview of the distribution of the FRS values in the UFRM with the initial and updated datasets. The results from the WRRM did not show any variance within each LULC class, which is why the analysis of these results is not included in this section. The results showed significant variance in most of the studied LULC classes in both the initial and updated datasets. The initial datasets were characterised by fluctuations within each ecosystem subtype. A decrease in variances and an increase in FRS values were observed in residential areas, whereas an increase in FRS values was observed for both urban green areas and industrial sites. A similar pattern was found in the FRS values for the Urban Atlas classes. The highest reduction was found in the low- (11230) and very-low-density urban fabric (11240) areas, similar to the results for the ecosystem subtypes.

Figure 8.

FRS variances within: 1. initial EC subtypes (without improvement); 2. updated EC subtypes; 3. initial Urban Atlas classes (without improvement); and 4. updated Urban Atlas classes.

For the codes of ecosystem subtypes and Urban Atlas land-use classes, please refer to Suppl. material 1.

Several changes in FRS results were led by the updating of the soil datasets (Fig. 6E and G). The predominantly industrial area in the north-eastern part of the case study, south of Sofia Airport, had a perceptible increase in FRS values (twice as high). These changes were due to the improvement of the datasets from hydrological soil groups B to D, as well as certain changes in the LULC (in this case, the Urban Atlas classes). In contrast, the changes in WRRM were based only on the correction of the LULC datasets.

FRS: InVEST vs. IMEGOGIP for different precipitation scenarios

The two models have different results when used for a series of rainfall events for the same case study area. The UFRM shows changes in its output, especially the FRS values for different classes, whereas the WRRM did not adapt its FRS values for each rainfall scenario, which results in preserving the same values for all modelled events, regarding different precipitation depths. The FRS values of the two models were highly diverse (Fig. 9). First, their comparison showed a gradual increase in the FRS values for the UFRM results, whereas the WRRM results did not change in all modelled rainfall scenarios. This is a result of the input data, which included only LULC and CW values, whereas the use of soil datasets in UFRM can be interpreted as a second adjustment of the results.

Figure 9.

FRS based on different precipitation depth scenarios.

The FRS values derived from the UFRM had distinctive differences based on rainfall scenarios. Green urban areas (14100; Fig. 2) preserved or lowered (decreased by 7.5%) their FRS in all scenarios. The pattern of preserving the same FRS values was also observed in open waterbodies.

The changes in FRS are primarily within several urban classes, which correspond to a decrease in the efficiency of rainfall-runoff reduction. The most remarkable changes in FRS values can be found in the urban fabric-related classes (primarily 11100 and 11210) and industrial, commercial, public, military and private units (12100) which had decreased FRS by 18% (for the 20-years probability scenario) and by 40% (for the 1000-years probability scenario; from 0.44 to 0.26), based on the percentage changes.

Discussion

Performance of the models for FRS assessment

The current study compared the performance of two ecosystem service assessment models, InVEST and IMECOGIP, while using two LULC classifications: ecosystem subtypes and Urban Atlas classes. The study added more insights to the current research as a comparative model assessment study was performed. The proposed soil data updating approach could be applied to different case studies so the used data matches the needs in the local context.

UFRM and WRRM are applicable to stormwater floods because they estimate the retention capacity of urban ecosystems to store part or the entire amount of incoming water. Both models have various outputs that can be used as indicators to quantify the FRS. In the UFRM, these are the runoff retention parameters that estimate the amount of captured water quantitatively (mm, m3 and RRI). The RRI is most appropriate for complex ES assessments when flood regulation is assessed together with other services (e.g. Cortinovis et al. (2022), Marino et al. (2023)). These quantitative parameters can be useful in flood risk assessment studies that need to calculate the amount of water captured to propose specific management measures. The flood risk service parameters of the UFRM (Table 3), in actuality, normalise the runoff retention values per spatial unit. This is useful for further analyses of the FRS per ecosystem subtype or LULC class. In the WRRM, the most appropriate output for the assessment of FRS is the CW, which is relatively similar to the RRI (UFRM).

The soil characteristics (including their initial water conditions) could affect the flood regulation potential of the ecosystems (Zepp et al. 2023). The UFRM requirement of soil data and their incorporation in the modelling process ensure more precise results of flood regulation assessment, in comparison to the WRRM.

After comparing results from the two models and different modelling scenarios, the InVEST UFRM using the Urban Atlas dataset (but connected with ecosystem subtypes) could be recommended for usage in FRS assessment in urban environments, as it is more comprehensive and precise, based on the input data and internal modelling processes. The model has been used in different environments and the results have been found to have different applicability (Cortinovis et al. 2022, Marino et al. 2023). However, the IMECOGIP WRRM could also be applied, at the initial stages of flood regulation ES assessment.

In this study, the models were tested to assess FRS, based on the available data. However, the analyses of the model outputs showed that the values built by the UFRM service can potentially be used to assess also flood regulation demand, but after including other sets of indicators representing the human activity concentration area (Dai et al. 2021) and socio-demographic indicators (Fang et al. 2023). However, they cannot fully cover the entire range of flood exposure factors. Our experience with the WRRM does not provide evidence for possible applications for the assessment of flood regulation demands. The least favourable scenario for runoff retention can be interpreted as a step towards generating a quantitative measure for assessing demand (as part of the supply scenarios).

Sensitivity of the models to different data sources

The InVEST UFRM is sensitive to the presence of specific data types. For example, a biophysical table (.csv) must contain all LULC classes in the raster geospatial file; otherwise, the model would not complete the iteration. The same applies to the presence/absence of the soil type in the raster file. The modelling process has certain uncertainties related to the different stages of data processing. The spatial units, as well as the minimum mapping unit (for vector data) and resolution (for raster data), can be sources of uncertainty, as they both can cause uncertainty in the final results, especially when converting from one format to another. The other source of uncertainty for this input dataset is the LULC classification used. This can be an uncertainty in the interpretation of the final results, as different researchers use different classifications and cross-walking might be challenging. For the IMECOGIP WRRM, a predefined LULC was prepared and used, but its linkage (within the model literature) with other commonly used LULC classes was not fully described. This can lead to possible mismatches in the results. Another possible source of uncertainty is the lack of soil data in the WRRM input data. The soil data (infiltration) could be used as an adjustment point (one of the major influencing factors for flood regulation) of the other input datasets which are similar between both models.

Limitations and possibilities for further improving the models

With the growing number of models and tools for flood regulation assessment, the number of studies has increased; however, there is a lack of comparative work on model results with other similar models that can reveal their actual advantages and disadvantages (Veerkamp et al. 2023). Both models used in this study have strengths and limitations. The primary advantage of using the InVEST UFRM is that it includes the required data for the curve number (incorporated into the biophysical table) and soil data. This enables users to obtain comparatively more precise outputs. The soil hydrological groups are essential for the second verification (along with the curve number) and for improving the final results by connecting them to the local characteristics of the case study area.

A common limitation of both models is the lack of inclusion of the specific topography of the area (e.g. elevation, slopes), referring to the lack of relief input data in both models. The inclusion of such elements in the model can serve as a second round of verification of the model outputs (if we interpret the soils as the first). The approach suggested by Vallecillo et al. (2020) to adjust the curve number by slope is a good example of integrating topography into the modelling process and improving the results. We have tested applying the adjustment; therefore, the results were not prominent, as the urban territory of the case study does not exceed the 5° average slope. Regarding this, this approach could apply to areas with more than a 5° slope. The other suggested by Vallecillo et al. (2020) improvement that we applied includes the adjustment of the CN values by the percentage of imperviousness for each LULC class. This adjustment enables the differentiation of the artificial classes (sealed) and refining the CN number.

Another possible method of improving the input data and overall model performance could be to use the vector file format of the input and output data, which would contribute to preserving the original scale and accuracy of the input datasets rather than converting them to raster format (especially applicable for the UFRM model). A possible method to increase the performance of the UFRM model, including its usage with other models, is by preserving the attribute data from the initial AOI such that the output data can be used for comparison between different spatial scales.

The possibilities for improving the IMECOGIP WRRM can be categorised into two groups. The first includes all improvements in modelling data preparation. There are several mismatches between the LULC classes in the English and German versions of the sample data table, including the names and CW data. This could be solved by the possible inclusion of a clearer explanation of the urban classes, as there are two versions of them in the predefined table and sample data. The second group is related to possible output enhancements, which include suggestions for calculating the output layer of the ES balance output (diagram and vector file) within the spatial dimensions of the AOI (spatial units). Thus, the outputs (spatial units) should remain stable for comparison across scenarios following the changes in their time and spatial evolution. This can help in the following analysis, including the possibility of further applying box-plot analysis for the spatial units, in which the performance in the current version of the WRRM does not present differences in the LULC units.

The WRRM has the same FRS results when used for a series of rainfall events for the same case study area. The model did not adapt its FRS values for each rainfall scenario, which results in preserving the same values for all modelled events, regarding different precipitation depths. The reason behind this outcome is the lack of usage of soil dataset, resulting in fixed FRS values, without further adjustment of the LULC and CW values, which in the UFRM are included in the CN values (soil hydrological groups).

As both models assess the capacity of the ecosystem to provide specific flood regulation services due to extreme rainfall events and not river flood events, uncertainties regarding validation could be identified. The validation is done when there are enough field measurements, which in our study is not applicable, as the models use modelling variables which are not studied in detail and further research is needed for validation.

Conclusions

This study presents an approach for the comparison of two urban flood regulation models using different input data and adapting them to the local characteristics of the case study area of Sofia for the assessment of urban FRS. To achieve this aim, two models were used with three different rainfall events with return periods of 20, 100 and 1000 y, as well as two different sets of available spatial data—ecosystem subtypes and Urban Atlas classes— which were updated to improve their accuracy and real-world reflection as of 2023.

The results showed slightly different values for the FRS from the two models and their executions with different inputs (both initial and updated). The InVEST UFRM assessed urban flood regulation for the case study with relatively medium to low values, both with initial and updated inputs. Its assessment values were comparably lower for the ecosystem subtypes (very low and low to medium-level FRS), whereas the urban FRS in the Urban Atlas classes was higher. In contrast, the IMECOGIP WRRM FRS values were relatively low compared to the InVEST UFRM values, especially when assessing different urban ecosystem subtypes.

The two models use similar input datasets, and their outputs are relatively similar, which enables comparison in this study. However, several differences (including the available datasets and their precision, the area of the case study and the application of the expected results) need to be considered when selecting the correct model for modelling urban FRS.

In this study, we demonstrate how the improvement of input data can increase the precision of modelling results and, thus, ensure more precise and detailed data for designing measures to reduce urban flood risk. This approach is an objective trade-off between data accuracy, accessibility of datasets and relevant adaptation to the local characteristics of the case study area. The developed approach for urban FRS assessment, by updating different input modelling data, is feasible for applications in different case study areas. The proposed method for improving the initial datasets for executing the models can be applied to any case study or spatial unit (any type of LULC), as well as different scales. Further development of the approach will be based on flood regulation demand assessment, as well as on the balance between the supply and demand of flood regulation ES.

Acknowledgements

The study was carried out within the INES project (INtegrated assessment and mapping of water-related Ecosystem Services for nature-based solutions in river basin management), funded by the National Science Fund of the Bulgarian Ministry of Education and Science, under Grant No KP-06-N-54/4.

The authors would like to thank the EU-funded project SELINA (Science for Evidence-based and sustainabLe decIsions about NAtural capital, No 101060415) and Miglena Zhiyanski for her valuable support for processing the soils data.

Conflicts of interest

The authors have declared that no competing interests exist.

References

Supplementary materials

Suppl. material 1: Additional information on the update of the input datasets 
Authors:  Vanya Stoycheva, Stoyan Nedkov
Data type:  text and tables
Brief description: 

Case study details, ecosystem subtypes, Urban Atlas classes, soil types and biophysical tables (part of UFRM input data).

Suppl. material 2: Grid point validation 
Authors:  Vanya Stoycheva
Data type:  table
Brief description: 

Validation of performed data improvement throughout random grid points in the GIS environment.

Suppl. material 3: Average area (m2) of ecosystem subtype and Urban Atlas class 
Authors:  Vanya Stoycheva
Data type:  table
Brief description: 

Average area (m2) of ecosystem subtype and Urban Atlas class (polygon features) before and after improvement.

login to comment