One Ecosystem :
Research Article
|
Corresponding author: Ina M. Sieber (sieber@phygeo.uni-hannover.de)
Academic editor: Joachim Maes
Received: 25 May 2021 | Accepted: 15 Oct 2021 | Published: 02 Nov 2021
© 2021 Ina M. Sieber, Malte Hinsch, Marta Vergílio, Artur Gil, Benjamin Burkhard
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Sieber IM, Hinsch M, Vergílio M, Gil A, Burkhard B (2021) Assessing the effects of different land-use/land-cover input datasets on modelling and mapping terrestrial ecosystem services - Case study Terceira Island (Azores, Portugal). One Ecosystem 6: e69119. https://doi.org/10.3897/oneeco.6.e69119
|
|
Modelling ecosystem services (ES) has become a new standard for the quantification and assessment of various ES. Multiple ES model applications are available that spatially estimate ES supply on the basis of land-use/land-cover (LULC) input data. This paper assesses how different input LULC datasets affect the modelling and mapping of ES supply for a case study on Terceira Island, the Azores (Portugal), namely: (1) the EU-wide CORINE LULC, (2) the Azores Region official LULC map (COS.A 2018) and (3) a remote sensing-based LULC and vegetation map of Terceira Island using Sentinel-2 satellite imagery. The InVEST model suite was applied, modelling altogether six ES (Recreation/Visitation, Pollination, Carbon Storage, Nutrient Delivery Ratio, Sediment Delivery Ratio and Seasonal Water Yield). Model outcomes of the three LULC datasets were compared in terms of similarity, performance and applicability for the user. For some InVEST modules, such as Pollination and Recreation, the differences in the LULC datasets had limited influence on the model results. For InVEST modules, based on more complex calculations and processes, such as Nutrient Delivery Ratio, the output ES maps showed a skewed distribution of ES supply. Yet, model results showed significant differences for differences in all modules and all LULCs. Understanding how differences arise between the LULC input datasets and the respective effect on model results is imperative when computing model-based ES maps. The choice for selecting appropriate LULC data should depend on: 1) the research or policy/decision-making question guiding the modelling study, 2) the ecosystems to be mapped, but also on 3) the spatial resolution of the mapping and 4) data availability at the local level. Communication and transparency on model input data are needed, especially if ES maps are used for supporting land use planning and decision-making.
InVEST, ecosystem services modelling, Islands, geospatial data
Modelling ecosystem services (ES) allows us to predict the spatial distribution of different ES that sustain and support human life. Moreover, modelling ES enables us to assess changes in spatio-temporal distribution and the state of ecosystems and how they affect the flows of ES to people. Therefore, modelling ES has become an essential tool for mapping and assessing ES, which is heavily used, for instance, in the context of the European Union's (EU) Initiative Mapping and Assessing Ecosystems and their Services (MAES*
The number of readily available model suites has grown in the last decades, with different options to map the actual or potential supply, demand or use of ES. In 2013, more than 17 decision-support tools for ES quantification and valuation models were identified for the assessment of ES (
However, issues related to ES maps and models remain unaddressed, especially those related to uncertainty assessments (
The InVEST model has been applied in numerous ES mapping applications and first calibration studies highlight the model sensitivities to specific input parameters (
Therefore, the rationale for this ES assessment is two-fold:
In this chapter, the case study area, the selected models and selected input datasets, as well as the model parameterisation, are described.
Terceira Island is part of the Azores Archipelago, a European Outermost Region (OR) and an Autonomous Region of Portugal with political and administrative autonomy. The Region is an isolated oceanic archipelago located in the Northern Atlantic, approximately between 37° and 40ºN and 24° and 31ºW. Terceira Island is the third largest island of the archipelago and the third oldest (ca. 3.5 million years) with an area of approximately 400 km2 (maximum length and width of 21 km and 14 km, respectively). It is part of the Central Group, together with the Islands of Faial, Pico, São Jorge and Graciosa (Fig.
Terceira Island preserved some pristine areas at high elevation (
Additionally to the balanced representation of Azores-specific ecosystems, Terceria Island provides well-documented land use information (see Chapter 2.1). With three official LULC datasets, Terceira surpasses other islands of the Archipelago and, therefore, presents an ideal case study area for conducting an ES modelling study.
The MAES process is comparably advanced in the Azores Archipelago. Many scientific ES assessments of the archipelago have been published (
For this study, a modelling approach, based on biophysical models, has been selected (Tier 3; see
InVEST is organised in 21 final ES modules and four supporting tools. For this case study, six ES were selected considering the data availability for Terceira Island: recreation, pollination, carbon storage, erosion control, water quality and flow retention (Table
Selected ecosystem services for the modelling on Terceira Island, Azores
Ecosystem service |
Ecosystem service type |
InVEST module |
Recreation |
Cultural |
Visitation: Recreation and Tourism |
Pollination |
Regulating and maintenance |
Pollinator Abundance: Crop pollination |
Carbon Storage | Regulating and maintenance |
Carbon Storage and Sequestration |
Erosion control |
Regulating and maintenance |
Sediment Delivery Ratio (SDR) |
Water quality | Regulating and maintenance | Nutrient Delivery Ratio (NDR) |
Flow retention |
Regulating and maintenance |
Seasonal Water Yield (SWY) |
Even though InVEST is stand-alone software, additional GIS software is required to pre-process data, visualise results and perform any further analysis (e.g. data overlays). In this study, data processing was based on ArcGIS ArcMap 10.5. Model post-processing is described in Section 2.5.
Modelling ES is often implemented on the basis of geospatial data, such as LULC data. The identification of the ecosystems assessed in this study was dependent on the categories of LULC included in each geodataset used (CLC, COS.A 2018 and Sentinel 2-based LULC map, all available as shapefiles).
CLC presents a standardised classification of land use for the entire European Union and some associated countries. The data are publicly available from Copernicus Land Monitoring Services*
Overview of the different LULC classes from the three LULC geodatasets, land cover area (ha) and percentage of land cover for Terceira Island.
CORINE Code |
Land cover (ha) |
% |
COS.A Code |
Land cover (ha) |
% |
Sent2 Code |
Land cover (ha) |
% |
|||
111 |
Continuous urban fabric |
101.39 |
0.25 |
111 |
Continuous urban fabric |
408.07 |
1.02 |
1 |
Urban |
3799.99 |
9.49 |
112 |
Discontinuous urban fabric |
2730.38 |
6.79 |
112 |
Discontinuous urban fabric |
1318.95 |
3.29 |
||||
121 |
Industrial or commercial units |
314.36 |
0.78 |
121 |
Industrial or commercial units |
215.36 |
0.54 |
||||
122 |
Road and rail networks and associated land |
68.95 |
0.17 |
||||||||
123 |
Port areas |
289.09 |
0.72 |
123 |
Port areas |
60.10 |
0.15 |
||||
124 |
Airports |
319.23 |
0.79 |
124 |
Airports |
309.39 |
0.77 |
||||
131 |
Mineral extraction sites |
140.15 |
0.35 |
131 |
Mineral extraction sites |
166.76 |
0.42 |
||||
132 |
Dump sites |
40.60 |
0.10 |
132 |
Dump sites |
22.80 |
0.06 |
||||
133 |
Construction sites |
1.02 |
0.00 |
2 |
Bare Soil |
493.69 |
1.23 |
||||
141 |
Green urban areas |
120.24 |
0.30 |
141 |
Green urban areas |
26.53 |
0.07 |
1 |
Urban |
||
142 |
Sport and leisure facilities |
86.81 |
0.22 |
142 |
Sport and leisure facilities |
108.74 |
0.27 |
||||
211 |
Non-irrigated arable land |
3007.67 |
7.47 |
211 |
Non-irrigated arable land |
1118.25 |
2.79 |
5 |
Arable crops |
13124.52 |
32.78 |
212 |
Permanently irrigated land |
359.05 |
0.90 |
5 |
Arable crops |
5547.42 |
13.86 |
||||
221 |
Vineyards |
107.26 |
0.27 |
3 |
Other crops |
3558.65 |
8.89 |
||||
231 |
Pastures |
14862.60 |
36.93 |
231 |
Pastures |
23034.29 |
57.53 |
3 |
Other crops |
131.49 |
0.33 |
242 |
Complex cultivation patterns |
1629.68 |
4.05 |
||||||||
243 |
Land principally occupied by agriculture |
6516.78 |
16.19 |
243 |
Land principally occupied by agriculture |
124.40 |
0.31 |
||||
311 |
Broad-leaved forest |
2348.66 |
5.84 |
311 |
Broad-leaved forest |
4123.12 |
10.30 |
6 |
Pittosporum |
2607.05 |
6.51 |
7 |
Eucalyptus |
1534.72 |
3.83 |
||||||||
8 |
Acacia |
337.66 |
0.84 |
||||||||
312 |
Coniferous forest |
1355.01 |
3.37 |
312 |
Coniferous forest |
2836.15 |
7.08 |
10 |
Pinus |
8.69 |
0.02 |
11 |
Cryptomeria |
2576.86 |
6.44 |
||||||||
313 |
Mixed forest |
115.22 |
0.29 |
313 |
Mixed forest |
918.85 |
2.29 |
13 |
Calluna-Juniperus |
420.92 |
1.05 |
14 |
Juniperus-Ilex |
1562.04 |
3.90 |
||||||||
321 |
Natural grasslands |
954.81 |
2.37 |
321 |
Natural grasslands |
45.39 |
0.11 |
||||
322 |
Moors and heathland |
3387.98 |
8.42 |
322 |
Moors and heathland |
11.36 |
0.03 |
4 |
Erica |
955.43 |
2.39 |
324 |
Transitional woodland-shrub |
1223.68 |
3.04 |
324 |
Natural herbaceous vegetation |
1117.03 |
2.79 |
||||
325 |
Shrubland, bushland, heathlands |
1476.33 |
3.69 |
12 |
Shrub peatland |
2391.50 |
5.97 |
||||
411 |
Bare rock |
321.15 |
0.80 |
||||||||
412 |
Peat bogs |
589.42 |
1.46 |
421 |
Flooded zones |
1407.63 |
3.52 |
9 |
Peatlands |
797.64 |
1.99 |
511 |
Water courses with vegetation |
429.67 |
1.07 |
||||||||
512 |
Water bodies |
7.84 |
0.02 |
||||||||
Total land-cover |
40241,1 ha |
Total land-cover |
40037.2 ha |
Total landcover |
40046 ha |
The official Azores Region LULC Map for 2018 (COS.A 2018) was developed by the Azorean Regional Government (
The Sentinel 2-based LULC map of Terceira Island was developed by the Azorean Biodiversity Group*
All three LULC geodatasets vary in purpose LULC classes and MMU. These differences make them suitable for a comparison of the effects that input datasets can have on ES model outcomes. A visualisation of the three LULC with their individual land use classes is shown in Fig.
In addition to the LULC data, the InVEST modules require additional input data, for example, geospatial or statistical information. The amount of needed input data strongly depends on the complexity of the modules, more specifically, the model processes. These range from very simplistic modules, such as Visitation, to highly complex, multi-parametrical modules, such as Nutrient Delivery or Seasonal Water Yield.
The majority of input data was obtained from a thorough literature review. As locally-specific information on Terceira Island was often scarce, these data gaps were filled with the best available comparable data. For example, most of the data, such as information on Azores-specific soils, were not available online as geodata. In many available datasets, the Azores were not included or were represented as hardly visible, undistinguishable pixels (e.g. FAO World Soil Database (
Input data per module |
Data type |
Data sources |
Visitation |
||
Photo User Days |
Image hosting service |
Flickr (2017)* |
Pollinator abundance: Crop Pollination |
||
Floral availability and nesting suitability |
Literature |
|
Carbon |
||
Carbon storage values |
Literature |
|
Nutrient delivery ratio (NDR) |
||
Digital Elevation Model (DEM) |
Grid (cell size 25 m x 25 m) |
Copernicus (2018) |
Watershed boundaries |
Web Map Service |
|
Nutrient loads |
Literature |
|
Sediment delivery ratio (SDR) |
||
Digital Elevation Model (DEM) |
Grid (cell size 25 m x 25 m) |
|
Soil Data |
30 arc-second raster database |
JRC (2009)
|
Rainfall erosivity |
Literature |
|
P Factor |
Literature |
|
C Factor |
Literature |
|
Seasonal water yield (SWY) |
||
Soil Data |
30 arc-second raster database |
JRC (2009)
|
Annual precipitation |
Literature |
World Weather Online (2018) |
Soil hydraulic parameters |
Literature |
|
Runoff curve numbers |
Grey literature |
|
Crop evapotranspiration |
CROPWAT Model, Literature |
|
Soil types |
Geotiff |
|
Watershed boundaries |
Web Map Service |
|
In order to assess the relative importance and effects of each input LULC dataset for the outcomes of the model, it was necessary to statistically compare the individual model results. Such an analysis exceeds the scope of existing pairwise map comparison (
Statistical analyses were run for the new data frames, looking at normal distribution, variances and standard deviations for the six InVEST modules with their three LULC datasets. As a normal distribution was not given for any model outputs, a Kruskal-Wallis H test was performed (
In a last step, the one-dimensional arrays with the results of the statistical evaluation were reshaped and re-assembled to its original format, so that each value was assigned to the former cell in the raster, visualising the difference between the arrays in maps for variance and standard deviation per InVEST module per cell in ArcMap.
The six InVEST model modules were applied for Terceira Island. Each module was run with the three input datasets. This process resulted in 18 different ES maps for CLC, COS.A and Sentinel-2-based LULC maps (Figure 3 and Figure 4). Despite the differences amongst the input LULC maps, some models show similar output maps, for example, recreation and pollination. Other model results, for example, carbon storage and nutrient delivery ratio, show differences in the model outputs, indicating differences in the spatial modelled ES supply derived from the differences amongst the input LULC datasets.
Recreation
The InVEST Recreation module quantifies recreation, based on Photo User Days (PUD) uploaded on the online photo-community Flickr*
Pollination
Pollinator abundance was modelled, based on average pollinator abundance, a dimensionless index considering 0 as no to very low abundance to 1 as maximum abundance. Results showed highest potential for pollinator abundance inland of the Island, with the strongest supply in moors and heathlands and coniferous forests edges (CLC) as seen in Figure 3. Both COS.A and Sentinel2-based LULC maps had a lower average pollinator abundance modelled for the same location around the natural reserves (e.g. "Reserva Florestal Natural Parcial do Biscoito da Ferraria" and "Reserva Florestal Parcial da Serra de S. Barbara e dos Misterios Negros") in the Island's centre, with higher potential. The modelled species abundances in the three model results ranged from an average of 0 in port areas and dump sites to average maxima in grasslands (0.316), moors and heathlands (0.329) and territories mainly occupied by agriculture, with significant areas of natural vegetation (0.21). Forested areas, heath and moorlands scored highest in the CLC dataset, but overall pollinator abundance patterns throughout the three LULC remained similar. Agricultural sites, urban areas and port areas scored lowest with all three LULC datasets, with no to very little pollinator abundance (0 – 0.09).
Carbon storage
The carbon storage module calculates a carbon balance for above-ground, below-ground and soil carbon, including carbon stored in dead material. The model computed high capacities for the ES "carbon storage" (Fig.
Nutrient Delivery Ratio (NDR)
The NDR module spatially depicts the outwash of nitrogen (N) from different LULC types. The ES "nutrient export" shows overall low potential for the Island. As shown in Fig.
Sediment Delivery Ratio (SDR)
This module estimates the export of sediment particles (Fig.
Flow retention (Seasonal Water Yield)
The Seasonal Water Yield module calculates the quick flow water recharge and allows to calculate a flow retention index (1-Qn/P). The ES “flow retention" computed very good retention capacities (0.81 – 1) for the majority of the Island, with coastal areas showing slightly lower retention capacities (0.61-0.8) for all three input LULC datasets. The model results (Fig.
A statistical analysis of the different datasets reveals trends for each set of model outputs. The arithmetic mean for all modules and LULC datasets is visualised in boxplots (Fig.
Nevertheless, Kruskal-Wallis-H proves significant for all six modules and all three model runs. With p-values < 2.2e-16 < 0.05 = α, the null hypothesis can be rejected in all cases and we conclude that there are significant differences amongst the three model outputs and that there is a significant effect of input LULC datasets on model results. This difference is most distinct for the Nutrient Delivery Ratio module, with differences in nitrogen outwash rates of up to 23 kg ha-1 y-1 for the agricultural areas deriving from the high nitrogen loading rates from the CLC dataset. In addition, differences between the LULC maps for the Pollination module were distinct on Terceira Island, as results of the size and type of the different forest patches. For Flow retention, based on the SWY module, the deviation reached up to 0.46, an effect of the aquatic LULC classes present in the COS.A dataset. For the Visitation module, the differences mainly occurred in a few small spots, showing overall the highest consensus of the three input LULC datasets (Fig.
The applications of the six different InVEST model modules on Terceira Island present an overview of the spatial distribution of ES, based on three different input datasets. The statistical analysis showed that the choice of input LULC data largely affects the model outcomes. The maps of the ES model outputs of CLC, COS.A and Sentinel-2 clearly demonstrated significant differences in terms of the modelled distribution of ES supply. Hence, the decision on the input LULC data largely affected the modelled distribution of ES in a case study region. In the following, the reasons for the differences, as well as some practical guidance on what factors to look for when choosing an appropriate LULC dataset, will be given.
For those InVEST modules that model ES, based on specific indicators, such as recreation (Photo User Days), input LULCs only determined the spatial extent of ES supply as seen in the Visitation module. Differences arise, based on the size of LULC features when visualised as maps. Due to the overall low number of PUDs throughout large parts of the Island, differences between the input LULCs are highly local and small, as shown in Fig.
As the InVEST modules require LULC as the major input for their modelling, modules such as Carbon Storage, Visitation or Pollination, showed little differences between the three output maps - potential reason for this could be the similar type of land use. Whether the classification foresees an area to be covered by "broad leaved forest" (CLC and COS.A) or "Pittosporum" (Sentinel-2) or "transitional woodland-shrub" (CLC), "shrubland, bushland, heathlands" (COS.A) or "Shrub peatland" (Sentinel-2), the corresponding data, for example, Carbon Storage values, remain similar and therewith, differences in ES distribution remain rather small, where ES classes closely resemble each other. For example, the outcomes of the pollinator abundance reflect similar trends throughout all three LULC maps. For other modules drawing upon more complex calculations and combinations of input data, the differences in ES maps are substantial. For example, the results of the NDR model show higher deviations between the individual datasets (Fig.
The differences in model results can - as was expected - be explained by the use of different input datasets. Differences in input data influence the model outcomes. The different categorisation of LULC classes and the MMU of each dataset are important for spatial accuracy. The InVEST models assume that the supply of ES changes linearly with the land-use change (
Based on our results, the choice of input LULC datasets depends on different factors. The proper selection not only depends on data availability, but also on: 1) the research or policy/decision-making question guiding the modelling study, 2) the ecosystems to be mapped, but also on 3) the spatial resolution of the mapping and 4) data availability at the local level.
Following the MAES guidelines, the purpose of an ES modelling and mapping exercise is often linked to a guiding decision-making or policy question (
Another factor of importance for the selection of input LULC data is the ecosystem(s) in the focus of the mapping exercise. Here, the level of detail in the classification scheme adopted for each LULC dataset is important for the model output. For example, the carbon storage maps increase in detail with more specific LULC types. Especially for agricultural areas, the difference becomes visible. Both CLC and COS.A datasets contain >=4 subcategories, whereas the Sentinel-2-based LULC map only uses two out of 13 LULC types for agriculture. This is important, when ES maps are to be used for urban or agricultural areas.
The spatial resolution of the modelling exercise is closely linked to the investigated ecosystem. To model ES that highly depend on the structure and composition of the natural mosaic landscape, with its interactions and processes, this study recommends to select habitat-specific LULC, such as the Sentinel-2-based LULC map. Particularly ES, such as pollination, require spatial information on (small) patch sizes of different ecosystem types. It is recommended to use the highest possible spatial resolution for modelling pollinator abundance, even though the results of all three input LULC datasets show a high degree of similarity. A comparison of these results with the work of
Depending on the size of the study area, the application of different LULC datasets can be useful. For national ES accounting and large scale ES modelling in the European Union, CLC is recommended, as its large average feature size might be sufficient. For regional and local ES assessments, LULC data, such as COS.A and Sentinel-2-based maps, can be used, especially for smaller areas with detailed landscape structures or feature sizes, including small patches and locally-specific ecosystem types. Examples of this are the EU's Outermost Regions and Overseas Countries and Territories, for which CLC is available, but this is unable to capture the dominant local ecosystem types ranging from tropical rainforest to arctic steppe (
Lastly, data availability at local level can impact the choice of LULC. With abundant data on a local level, it is possible to model ES for locally-specific ES types, as the example of Sentinel-2 with its endemic ecosystem types shows. Where such local data are limited, COS.A or CLC can be suitable options to conduct the modelling, as reference data on European level can be used as a proxy. However, this comes at the cost of model robustness and introduces uncertainties to the model outputs. Such effects can be severe for small islands as this affects the modelled distribution of ES and hence, the quality of ES maps.
The approach taken in this paper, running ES models with different available LULCs, attempts to minimise uncertainties, based on LULC input datasets. Each model entails uncertainties. As in all computer-based model approaches, model outputs are strongly dependent on the precision, quality and quantity of model input data. Many of the InVEST models require only few data inputs – for example, the carbon storage or visitation modules - which constitute an advantage of InVEST. This makes it broadly applicable across a variety of social-ecological contexts (
This study shows that the choice of input LULC datasets can have a significant impact on the outcomes of ES maps computed with the InVEST model suite on small islands, such as the EU Outermost Region of Terceira on the Azores Archipelago. Comparing three input LULC datasets and six InVEST Modules, significant differences are found between each input LULC. The choice for a particular LULC can either lead to visually enlarging or visually diminishing areas of ES supply on the ES maps. Furthermore, this choice can affect the magnitude of ES supply through the inclusion or omission of certain LULC types. While studies acknowledge sensitivities of the models to input parameters, our work highlights the implications of selecting proper LULC input data - a novel aspect. The use of different input LULC maps in the modelling process can enhance the accuracy of ES maps. Studies and researchers should not only include information on their input parameters, but also on the input LULC dataset with its different classes and feature sizes in order to ensure transparency of the maps for potential users. This is especially relevant if policy and decision-makers or land-use planners are to base their decisions on ES model results.
A big thank you to Phil Maurischat for the statistical discussions, to Fernando Santos-Martín for his critical comments and to Angie Faust for language editing. Additionally, Carolina Parelho deserves to be mentioned for her endless efforts to coordinate the MOVE project. Lastly, we would like to thank the reviewers for their valuable feedback. Without them, this paper would not have been possible.
The research leading to these results has received funding from the European Union, under the programme Pilot Project — Mapping and Assessing the State of Ecosystems and their Services in the Outermost Regions and Overseas Countries and Territories: establishing links and pooling resources, MOVE Project (MOVE- Facilitating MAES to support regional policy in Overseas Europe: mobilizing stakeholders and pooling resources, grant agreement Nº 07.027735/2018/776517/SUB/ENV.D2, www.moveproject.eu).