One Ecosystem : Research Article
PDF
Research Article
Comparative assessment of SoilGrids system with regional soil data for advancing ecosystem reporting in Bulgaria
expand article infoLora Stoeva, Miglena Kircheva Zhiyanski
‡ Forest Research Institute - Bulgarian Academy of Sciences, Sofia, Bulgaria
Open Access

Abstract

The aim of this study is to test the suitability of the SoilGrids system for ecosystem reporting, research and monitoring. The study is conducted in the Rila Mountains in Bulgaria, an area characterised by diverse ecological factors. We propose a methodological approach to compare SoilGrids predictions with independent point observations, addressing issues of inconsistency across survey layers when combining data from different sources. The comparative analysis is discussed in respect to point data, soil type, altitude and climate. The results show that the SoilGrids represents the main soil parameters well in terms of their dynamics over the altitudinal range. There is a good agreement between the observed and predicted values for the averages of the parameters - bulk density, coarse fraction, soil organic carbon (SOC) content and SOC stock. The average measured SOC stock (0-30 cm) is 58.54 t/ha, while the average predicted SOC stock (0-30 cm) is 55.38 t/ha. However, the study also showed that the predicted values for nitrogen content are almost two times higher than the observed figures and the pH values from the SoilGrids are less acidic than those measured in the field.

Keywords

soil data, soil parameters, SOC stock, independent validation, ecosystem reporting

Introduction

The various socio-economic and environmental challenges which modern society is facing nowadays, such as climate change, biodiversity loss and overexploitation of natural resources, call for the need of different systems and tools to monitor and inform the policy-makers and businesses about the development ways towards sustainability (Mooney et al. 2004, IPCC 2014 (Pachauri and Meyer 2014), IPBES 2018 (Scholes et al. 2018), IPBES 2019 (Diaz et al. 2019)). These systems and tools (Banerjee et al. 2016), embedded in academic research, offer policy-makers robust analytical frameworks, facilitating evidence-based decision-making and fostering a deeper and common understanding of the complex interplay between economic, social and environmental dimensions in the pursuit of sustainability.

There are numerous examples of such systems. In the context of climate actions, the enhanced transparency framework (ETF) under the United Nations Framework Convention on Climate Change (UNFCCC) is a reporting and review system for climate data, including GHG inventories, to track progress and provide information for policy development. The natural capital accounting represents another example of an important additional tool for providing information on sustainable development (Guerry et al. 2015). The natural capital is defined as the stock of natural resources, which includes rocks, soil, air, water and living organisms (Pearce 1988), that generate a flow of ecosystem services (Costanza et al. 2014). The framework, providing a systematic way to measure and report on stocks and flows of natural capital, is the Ecosystem accounting under the System of the Environment and Economic Accounts (SEEA-EA). It represents a valuable tool, developed by a long-lasting science-policy interface (Ekins et al. 2003, Edens and Hein 2013, Guerry et al. 2015, Costanza et al. 2017, Ruijs et al. 2018) aiming to integrate the economic and environmental data together with social data into a single, coherent framework for holistic decision-making (Edens et al. 2022). The implementation of the SEEA-EA requires diverse knowledge and expertise, such as economics, environmental science and data analysis. Accurate and reliable data, along with advanced modelling capacity, are fundamental to the effective implementation of the SEEA-EA (Hein et al. 2020).

The Ecosystem accounting includes spatial modelling of ecosystems towards organising biophysical data, measuring ecosystem services, tracking changes in ecosystem assets and linking this information to economic and other human activity through five separate accounts (Hein et al. 2015, La Notte et al. 2019, Hein et al. 2020). Like this, it presents a coherent and comprehensive view of ecosystems. The SEEA-EA framework also addresses the so called 'thematic accounting', a method that arranges data based on specific policy-relevant environmental themes like biodiversity, climate change, oceans and urban areas (Lange et al. 2022). For example, carbon accounting is part of a dedicated thematic account on climate change, but at the same time, the analysis of the carbon stock and carbon sequestration is an important part of the assessment of the ecosystem condition and services.

Soil organic carbon (SOC) stock is a critical component of the global carbon cycle, making it essential not only for carbon accounting within the SEEA-EA natural capital accounting framework, but also under the UNFCCC, where accurate calculations on the emissions due to SOC stock changes should be reported.

Estimating soil carbon content and stocks requires information on soil properties, which are measured through sampling and laboratory analyses. This is a labour and cost consuming activity (Singh et al. 2012). Soil parameter measurements on a national scale are usually part of a dedicated soil inventory or monitoring programme. In Bulgaria, the National Soil Monitoring Programme, established in 2004, is the main source of information on the relevant soil characteristics in the country. The system operates on a 16 km x 16 km grid and re-measures every 5 years. Data coming from the soil monitoring programme are used to derive country-specific emission factors to report emissions and removals from the soil carbon pool due to land-use changes as part of the national GHG inventory in line with the UNFCCC regulations. However, the current climate change policy introduces reporting-based targets, which require improvements in methodologies for reporting to achieve a higher level of accuracy. This poses a challenge for GHG reporting authorities as the application of greater precision in emissions estimates is linked to the availability of high-resolution measurement data and modelling approaches. This requires a combination of various high quality data sources in order for the calculation to reflect differences in parameters and management across different areas.

There are similar challenges when implementing ecosystem accounting at different scale – national, regional or local. As the ecosystem accounts take a spatial approach, the analysis and the assessments are presented using maps that bring together geographical, environmental, ecological and economic information in one place. If the accounting is performed at regional or national level, it is usually not sufficient to rely on individual point observations to spatially represent the soil condition and services, given the large vertical and horizontal variability of soil characteristics. Thus, large set of data as well as modelling work is required to better represent the spatial variability (Le Noë et al. 2023).

By facing these challenges, the international scientific community developed suitable tools at global level, such as SoilGrids, GlobalSoilMap.net, FAO SOIL PORTAL etc. as global sources for soil information. They are developed, based on different types of data and modern technologies. One of the main advantages of these systems is that they provide spatial distributions of soil properties across the globe. This makes the datasets suitable for analysis and spatial representation of the ecosystem condition and services under SEEA-EA. In addition, these tools have the potential to contribute to better estimates of the country-specific emission factors in the GHG Inventories under UNFCCC, especially when the available dataset does not provide enough measurements to average out the sampling error, when grouping soil measurements into climate regions and soil types.

SoilGrids is a system for digital soil mapping, based on global compilation of soil profile data and environmental layers. Digital mapping methods are constantly evolving and improving; particularly popular nowadays are machine-learning models and algorithms, which, according to Khaledian and Miller (2020)and Poggio et al. (2021), provide better results than most multiple linear regression methods. However, the application of these tools in regional and national assessments requires that model data be validated with independent information, despite their acceptable levels of accuracy (Poggio et al. 2021).

In this regards, the aim of the current study is to compare the modelled data from SoilGrids with regional soil information by presenting a methodological approach for verification of the suitability of SoilGrids data for the purpose of ecosystem reporting under the natural capital accounts or the GHG inventory under the UNFCCC regulations.

Materials and methods

Case study area

The study area covers the geographical scope of Rila Mountains – the highest mountain range in Southeast Europe (Mousala peak – 2925 m). The mountain has a massive, dome-like shape and is divided into four distinct parts – Northwest Rila, Central Rila, East Rila and Southwest Rila, which are formed between the valleys of the Rivers Beli Iskar, Levi Iskar, Rilska and Belishka. The area of the mountain range is around 263,000 ha. The mean altitude of the range is about 1573 m. Only 16% of its territory is below 1000 m a.s.l., whereas 67% of the area is between 1000 and 2200 m a.s.l. and about 17% of the territory is above 2200 m a.s.l. (Krastanov et al. 1985).

According to the IPCC classification schemes for the default climate regions, most of the territory of Rila Mountains has a cold temperate wet climate (Eggleston et al. 2006). The low altitudes of the south-western part of the mountain range are characterised by a warm temperate dry climate. The annual temperature amplitude ranges from 21.9°C at 400 m altitude to 15.7°C at Musala Peak, indicating distinct climatic characteristics in the different altitude belts. Winter temperatures are positive in the lower parts of Rila, while the middle and high parts of the mountains consistently maintain negative temperatures. In summer, positive temperatures prevail in the high and alpine parts, where monthly average air temperatures of 5 - 6°C are recorded, while in the lower parts of the mountains they fluctuate between 10 and 20°C.

Almost 70% of the study area is covered by forests. Coniferous forests predominate with a share of 75%, of which 11% are dwarf pine (Pinus mugo L.) stands, while deciduous forests occupy approximately 25% of the forested area. The most common tree species from conifers are Scots pine (Pinus sylvestris L.), Norway spruce (Picea abies (L) H. Karst.) and Dwarf pine (Pinus mugo L.). The dominant broad-leaved species are Beech (Fagus sylvatica L.) and Oak (Quercus spp. L.).

Different soil types are distributed across the study area, the most common being Cambisols, Umbrisols and Luvisols. The parent materials are products of the physical weathering of various silicate rocks – eluvium, proluvium and colluvium.

Methodological approach

The suitability of the SoilGrids data for ecosystem reporting has been verified against the independent set of data. The correspondence of the predicted values from SoilGrids has been compared to randomly distributed point observations from the dataset. The comparison has been done at plot level and then at sample average. The parameters which have been compared are pH, bulk density, coarse fraction, soil organic carbon (SOC) content, nitrogen content and soil organic carbon (SOC) stock.

The SoilGrids data are downloaded in a raster image format at a spatial resolution of 250 m for each of the studied indicators (pH, org. C, N, bulk density, coarse fraction, C stock) and by layers – 0-5, 5-15, 15-30 cm. SoilGrids data on soil C stock are presented on the web-based platform only for the 0-30 cm soil layer. By using a mask layer of the study area, the analysis is guided to the range of assessment. Raster calculator was used to derive data for each indicator for the 0-30 cm depth for subsequent analysis and processing (Fig. 1).

Figure 1.

Conceptual scheme of the study.

To verify the accuracy of the model data, we used an independent set of data from a few sources - ICP Forest plots in the region, published data from scientific study and our own observations (Suppl. material 1). The soil measurements from these sources differ in comprehensiveness and layers surveyed (Table 1). This makes validation by layer depth difficult. In order to reduce the error resulting from filling in missing information (e.g. for bulk density at depth and SOC content for 20-30 cm), the comparison of model results was performed for 0-30 cm depth and not by layers. The only exception was the bulk density, which was validated for 0-5 cm, as the model data correspond exactly to the depth analysed in the field using a ring-type metal probe.

Table 1.

Soil parameters by source of information and soil depth.

Depth 0-5 cm 5-15 cm 5-20 cm 0-10 cm 10-20 cm 15-30 cm 20-30 cm 0-30 cm

SoilGrids (raster data, 250 m resolution)

pH, CF, BD,SOC content, SOC stock, N content pH, CF, BD,SOC content, SOC stock, N content pH, CF, BD,SOC content, SOC stock, N content SOC stock (derived)

ICP Forest (soil profile data)

BD pH, CF, SOC content, N content pH, CF, SOC content, N content pH, CF, SOC content, N content SOC stock calculated according to IPCC GPG LULUCF (2003)

Eli Pavlova-Traykova (2018) (soil profile data)

pH, CF, BD,SOC content, N content pH, CF,SOC content, N content SOC stock calculated according to IPCC GPG LULUCF (2003)

Own observations (soil profile data)

BD pH, CF, SOC content, N content pH, CF, SOC content, N content pH, CF, SOC content, N content SOC stock calculated according to IPCC GPG LULUCF (2003)

We analysed the agreement between model data and field observations by parameters both at the sample plot level and as sample average. The averages of the two sets have been tested by two-tailed paired samples t-test and Wilcoxon signed-rank test.

To facilitate the comparative analysis further, ancillary materials, such as climatic data, soil types and terrain models, were incorporated into the estimation process to reflect stratification needs and improve the efficiency of the analysis. The forest vegetation zoning is used as a proxy for the combined effect of these factors. According to the Bulgarian classification on forest vegetation zones, there are three main vegetation belts in the region of Rila Mountains - The Lower Plains and Foothills oak woodland (0-700 m a.s.l.), The Mid-montane beech and conifer forests (701-2000 m a.s.l.) and the zone of High Mountain area (> 2000 m a.s.l.).

Data sources

SoilGrids

SoilGrids is a system for digital soil mapping at a global scale that uses geo-statistics combined with a machine-learning algorithm to generate the necessary spatial patterns (Poggio et al. 2021). It is a result of a large-scale international collaboration and is maintained by ISRIC – World Soil Information. The system produces maps of basic soil physical and chemical properties for the entire globe at six standard depth intervals with an average spatial resolution of 250 m. The SoilGrids prediction models are fitted to around 240,000 soil profile observations worldwide combined with over 400 global environmental covariate data describing vegetation, terrain morphology, climate, geology, hydrology etc. (Poggio et al. 2021). Soil properties are derived mostly from ISRIC's World Soil Information Service (WoSIS), which provides consistent, standardised data on soil profiles worldwide (Batjes et al. 2020). The WoSIS data for Bulgaria includes information from 177 profiles, three of which fall within the Rila Mountains.

Regional data

The dataset which was used to compare the SoilGrids predictions in the region of Rila Mountains consists of information from 39 sample plots from three different sources:

  1. a subset of the field observations from the National Forest Monitoring Network under the ICP-Forest Programme (http://icp-forests.net/) provided by the Executive Environment Agency in Bulgaria;
  2. point data published by Pavlova-Traykova et al. (2018);
  3. our own field observations from plots in the region.

The data from ICP-Forest programme in Rila Mountains consist of 14 sample plots in both coniferous and deciduous forests. The sample plots are mostly represented in the range of altitude between 1100 and 1600 m. The soil data are presented at 10 cm intervals between 0 and 30 cm depth. Most of the sample plots are in the north-western and south-eastern parts of the mountains.

The information on soil properties from Pavlova-Traykova et al. (2018) consists of 12 sample plots within the lower altitude of the mountain with a range between 600 and 800 m a.s.l. The sample plots are located mostly in coniferous plantations in areas affected by erosion processes. The climate in the area is influenced by the Mediterranean and, thus, it is characterised by hotter and drier conditions. The soil data are presented in two depths intervals 0-5 cm and 5-20 cm.

The soil information from our own observations consists of 13 sample plots, which have been created specifically for this study. The sample plots are established according to a square scheme with a centre. Soil samples are taken from each point in three depth intervals – 0-10 cm, 10-20 cm, 20-30 cm by using soil coring. The main criteria for choosing the appropriate location for each of the sample plots are altitude, exposition, soil type and vegetation. The experimental sites are located on the northern and eastern slopes of Rila Mountains at high altitude (1700-2000 m), where the available data are scarce. The sample plots are positioned across different soil types to capture the variety and diversity found in upland areas in Bulgaria.

Laboratory analysis

The bulk density (0-5 cm) is determined by Kachinsky method (Kachinsky 1965) after collection of a dedicated sample from 0-5 cm depth in two replicates as suggested by Cools and De Vos (2013). In-depth bulk density has been calculated using pedotransfer functions (De Vos et al. 2005). The following chemical properties are also analysed: pH in H2O (ISO 10390:2005), nitrogen content (Kjeldahl method) and carbon content (modified Turin method).

Calculating SOC stocks (0-30 cm)

The soil organic carbon stock is estimated according to the IPCC GPG LULUCF (Penman et al. 2003) as follows:

\(OC=\sum_{layer=1}^{layer=n} SOC_{layer} = \sum_{layer=1}^{layer=n}([SOC]*BulkDensity*Depth*(1-frag).10)_{layer}\)

Where,

SOC – soil organic carbon stock, tonnes C ha-1;

SOC layersoil organic carbon stock per layer, tonnes C ha-1;

[SOC] – carbon content per layer, g C (kg soil)-1;

Bulk Density, tonnes soil m-3 (equivalent to Mg m-3);

Depth, m;

frag – coarse fraction, %/100

Results

The analysis of the main physico-chemical properties of the soils in Rila Mountains, based on the data of SoilGrids, shows that the bulk density of soils ranges from 0.93 to 1.45 g/cm3. The mean value was determined to be 1.18 g/cm3 (± 0.1). Soil bulk density variation in the study area exhibits a clear trend of decreasing with increasing elevation (Fig. 2).

Figure 2.

Soil bulk density in the Rila Mountains for 0-30 cm depth. Source: SoilGrids. The Barplot is a histogram with x: bulk density (cg/cm3) and y: frequency.

In terms of coarse fraction, the soils in Rila Mountains are characterised as grainy (Fig. 3). The coarse fraction varies between 4.5% and 30%. Most often it is around 14.5% (± 4.4%). The higher the altitude, the greater is the coarse fraction. The value above 2000 m a.s.l. varies between 20 and 30%.

Figure 3.

Coarse fraction content of soils in the Rila Mountains for 0-30 cm depth. Source: SoilGrids. The Barplot is a histogram with x: coarse fraction (cm3/dm3) and y: frequency.

The pH (for 0-30 cm) is mostly slightly acidic to neutral (pH 5.5-6.5 and above 6.5). The mean pH (H2O) was determined to be 6.03 (±0.4) according to SoilGrids data. From (Fig. 4), it can be seen that there is a clear pattern of increasing soil acidity with increasing elevation.

Figure 4.

pH values of soils in Rila Mountains for 0-30 cm depth. Source: SoilGrids. The Barplot is a histogram with x: pH and y: frequency.

The content of the soil organic carbon (for 0-30 cm) in the soils of Rila Mountains varies between 5.1 and 96.7 g/kg (Fig. 5). Values between 20 and 60 g/kg predominate, with the mean content determined to be 38.5 g/kg (± 13.35). The nitrogen content of soils ranges from 1.44 g/kg to 5.60 g/kg (Fig. 6). The mean value of nitrogen (N) concentration is calculated to be 3.21 g/kg (± 0.9). In both parameters – C and N concentrations, the content of both SOC content and N is higher with an increase in altitude.

Figure 5.

Soil organic carbon (SOC) content in Rila Mountains for 0-30 cm depth. Source: SoilGrids. The Barplot is a histogram with x: SOC content (dg/kg) and y: frequency.

Figure 6.

Nitrogen content of soils in Rila Mountains for 0-30 cm depth. Source: SoilGrids. The Barplot is a histogram with x: N content (cg/kg) and y: frequency.

When comparing the information from SoilGrids with the data from field observations in the Rila Mountains, it is noticeable that the results from the field studies show a greater variation in soil parameters compared to those predicted by the model (Table 2, Fig. 7). This is to be expected taking into account that there is high variability in soil indicators even within relatively homogeneous soils and environmental factors (Wells and Case 1995). However, with respect to the mean values of the individual indicators, comparable results are reported for SOC content, coarse fraction, bulk density and satisfactory with respect to N content and pH values (H2 O) (Table 2).

Table 2.

Comparison table between model results and field observations of physico-chemical characteristics of soils in the Rila Mountains.

Data source SoilGrids 2.0 Field data
SP

SOC, g/kg

0-30 cm

N, g/kg

0-30 cm

pH

0-30 cm

Coarse fraction, %

0-30 cm

Bulk density g/cm3

0-5 cm

SOC, g/kg

0-30 cm

N, g/kg

0-30 cm

pH

0-30 cm

Coarse fraction, %

0-30 cm

Bulk density g/cm3

0-5 cm

ICP FOREST 3014 35.43 3.09 5.90 13.90 - 26.77 1.98 - 8.72 -
4512 31.00 2.79 6.10 9.40 - 32.75 1.75 - 2.33 -
4642 38.28 3.17 6.10 11.45 - 15.57 0.94 - 2.33 -
4701 42.38 3.78 5.60 15.08 - 17.04 1.06 - 21.71 -
5041 35.07 3.40 5.80 14.07 - 31.36 1.92 - 2.00 -
5058 46.38 3.89 5.58 16.03 - 25.62 1.20 - 20.71 -
6008 41.05 3.65 5.60 14.27 - 14.87 2.56 - 11.43 -
53 48.20 3.25 5.80 15.60 - 53.06 2.10 - 15.90 -
54 49.35 3.97 5.98 21.67 - 13.36 0.63 - 3.00 -
69 26.95 2.01 6.95 10.20 - 12.17 0.63 - 14.28 -
597 51.62 4.22 5.70 16.92 - 22.98 1.24 - 17.20 -
652 49.35 3.97 5.98 21.67 - 19.79 1.99 - 15.80 -
671 38.75 2.76 5.90 13.35 - 40.88 1.99 - 15.80 -
703 37.45 2.93 5.80 16.35 - 11.14 2.47 - 11.43 -
Case study plots SP1 61.00 4.04 5.75 17.77 0.98 57.84 1.46 4.62 6.41 1.23
SP2 50.82 4.00 5.55 15.67 1.05 25.75 0.44 4.56 8.94 1.15
SP3 51.43 3.67 5.55 14.47 1.01 109.26 3.21 4.50 2.33 0.91
SP4 56.77 4.65 5.20 16.03 0.95 49.85 1.12 4.76 8.92 1.37
SP5 41.95 3.73 5.83 13.22 0.99 98.37 2.41 4.63 7.17 1.12
SP6 36.47 3.22 5.80 9.70 1.06 36.06 0.88 4.48 5.81 1.10
SP7 32.80 3.19 5.80 9.00 1.10 50.87 2.16 4.98 1.85 1.11
SP8 51.65 4.33 5.50 18.05 0.96 74.33 2.57 3.97 3.28 0.84
SP9 39.13 4.02 6.20 15.33 1.05 61.74 2.91 3.93 8.80 0.91
SP10 41.87 4.47 5.60 16.68 1.00 23.56 1.76 3.88 4.79 1.01
SP11 39.02 4.00 5.70 14.13 1.10 100.35 3.83 4.06 24.64 0.61
SP12 45.00 3.55 5.70 17.37 1.03 48.21 1.56 4.51 14.90 0.97
SP13 37.35 3.39 5.80 13.75 1.06 40.80 2.14 4.59 21.28 0.99
Pavlova-Traykova (2017) SP14 21.97 2.12 6.55 7.60 1.19 18.30 1.05 5.59 22.72 1.17
SP15 23.97 2.15 6.70 11.98 1.24 11.78 0.70 5.68 21.17 1.12
SP16 20.97 2.20 6.50 12.63 1.21 49.40 1.98 6.20 21.78 1.01
SP17 22.90 2.07 6.60 12.18 1.20 34.95 2.03 6.45 33.09 0.97
SP18 20.68 2.04 6.60 8.35 1.19 9.47 0.95 5.84 30.46 1.20
SP19 22.58 2.23 6.45 9.33 1.15 18.13 1.02 5.45 20.43 1.28
SP20 24.22 2.05 6.65 13.78 1.19 22.90 1.33 6.69 14.70 0.85
SP21 23.13 2.10 6.65 11.02 1.18 13.97 1.22 5.95 9.01 1.11
SP22 22.03 1.94 6.80 11.17 1.23 18.12 1.28 5.43 27.85 1.30
SP23 23.02 2.17 6.60 12.63 1.21 13.93 1.33 5.25 33.80 1.05
SP24 22.75 2.14 6.58 12.48 1.21 20.37 1.73 6.29 48.65 1.08
SP25 20.10 2.09 6.70 9.05 1.23 26.85 1.93 5.72 17.03 1.12
Mean 36.53 3.12 6.05 13.67 1.11 35.19 1.68 5.12 14.93 1.06
Figure 7.

Distribution of values of the studied parameters by type of data.

The comparative analysis and the statistical test performed (Student t-test; paired, two tailed, p = 0.05) showed that, in terms of bulk density data, coarse fraction, SOC content and C stock, no statistically significant differences were found between the sample means from the field data and those from the models (Table 3). The tests show that there is a significant difference between the means of the field observations and the model for pH and N.

Table 3.

Paired t-test (p = 0.05): Bulk Density, Coarse Fraction, SOC content, SOC stock, N content, pH.

* indicating no significant difference in mean, ** indicating significant difference in mean.

Field data SoilGrids Field data SoilGrids Field data SoilGrids Field data SoilGrids Field data SoilGrids Field data SoilGrids

BD g/cm3

0-5 cm

BD, g/cm3

0-5 cm

CF, %

0-30 cm

CF, %

0-30 cm

SOC, g/kg

0-30 cm

SOC, g/kg

0-30 cm

SOC, tC/ha

0-30 cm

SOC, tC/ha

0-30 cm

N g/kg

0-30 cm

N g/kg

0-30 cm

pH (H2O)

0-30 cm

pH (H2O)

0-30 cm

Mean 1.06 1.11 14.93 13.68 35.19 36.53 58.54 55.38 1.679 3.139 5.120 6.135
Variance 0.03 0.01 111.94 11.31 647.77 137.54 755.89 166.35 0.567 0.729 0.702 0.253
Observ. 25 25 39 39 39 39 39 39 39 39 25 25
df 24 38 38 38 38 24
t Stat -1.309 0.663 -0.362 0.791 -9.347 -9.687
P(T<=t) one-tail 0.101* 0.256* 0.360* 0.217* 0.000** 0.000**
t Critical one-tail 1.711 1.686 1.686 1.686 1.686 1.711
P(T<=t) two-tail 0.203* 0.511* 0.720* 0.434* 0.000** 0.000**

t Critical two-tail

2.064 2.024 2.024 2.024 2.024 2.064

As the assumptions for normality of the data from the field studies are not completely met and there are also outliers in some of the analysed parameters, an additional non-parametric test was performed to check the statistical significance in means - the Wilcoxon Signed Rank test (paired, two tailed, p = 0.05). The results confirmed the outcomes from the t-test (Table 4).

Table 4.

Wilcoxon Signed Rank test (p-value = 0.05): Results.

* indicating no significant difference in mean, ** indicating significant difference in mean.

Parameters n p-value
Bulk Density 25 0.1538*
Coarse Fraction 39 0.8092*
Org C. content 39 0.1756*
SOC 39 0.7004*
N content 39 < 0.000**
pH 25 < 0.000**

Discussion

The SoilGrids data for the Rila Mountains represents well the main soil parameters in relation to their depth and dynamics in terms of altitudinal range. The analysed parameters take into account the known variability resulting from the complex influence of a number of climatic and biological factors (Franzluebbers et al. 2001, Sergeevna Kozun et al. 2022), which are closely correlated with changes in altitude and tree cover (Figs 2, 3, 4, 5). It is noticeable that the content of org. C and total N increase with increasing altitude (Table 5), since the accumulation and decomposition of organic matter are directly dependent on the hydrothermal regime of the soil and the vegetation and tree composition, which strongly differentiate with altitude. Soil acidity also varies with altitude from acidic to slightly acidic in the lowlands to strongly acidic in the mid-montane and high-montane zones. Bulk density decreases with increasing altitude because of the increase in soil porosity. This is due to accumulation of organic matter and formation of thick and soft forest litter, contributing to the formation of thick humus horizons. Coarse fraction increases with height, especially in the brown forest soil’s zone, which is generally characterised by high skeletal content (Koynov et al. 1998) and is significant in the alpine zone of the mountains, where there are mostly cliffs.

Table 5.

Correlation matrix. Data: field measurements.

Altitude Soil type SOC content Coarse fraction

Bulk density

SOC stock N content pH
Altitude 1.00
Soil type 0.76 1.00
SOC content 0.85 0.56 1.00
Coarse fraction 0.54 0.42 0.77 1.00
Bulk density -0.67 -0.17 -0.42 -0.08 1.00
SOC stock 0.80 0.53 0.75 0.53 -0.73 1.00
N content 0.91 0.57 0.92 0.73 -0.50 0.75 1.00
pH -0.57 -0.08 -0.34 0.02 0.94 -0.64 -0.44 1.00

With regard to nitrogen content, it is striking that model and field data have significant differences in values (Fig. 7, Tables 3, 4). More than half of the reported data from field observations fall outside the model predicted minimum nitrogen content values. In SoilGrids, nitrogen content includes total nitrogen (ammonia, organic and reduced nitrogen) as determined by Kjeldahl, plus the absorbable forms of nitrogen, nitrate and nitrite. The method we applied in the analysis of the field observations does not account for nitrate and nitrite forms of nitrogen, but their content in the forest soils is very small (Donov et al. 1974) and the difference cannot be explained by a difference in their accounting alone.

SoilGrids data predict pH changes well with increasing elevation and changing vegetation cover, with significantly more acidic soils in the conifer forest zone (Fig. 4). However, the comparative analysis shows that, overall, the model predicts less acidic soils with pH values mostly between 5.7 and 6.6, while field observations report pH values between 4.5 and 5.8 (Fig. 7). In the Rila Mountains, Cambisols are the most common soil type (> 50%). Data from several studies on these soil types in the country confirm the more acid solution reaction (< 6.0) (Zhiyanski 2018, Kirova 2019, Malinova et al. 2020, Hristov et al. 2021). The acidity of soils in the Rila Mountains is conditioned by the widespread occurrence of carbonate-free silicate rocks - granite-gneisses, on the one hand and the predominance of coniferous forest tree species on the other hand, which lead to soil acidification (Binkley and Sollins 1990, Zhiyanski 2018).

We suggest that the reason for the discrepancies between model and field data in terms of pH and nitrogen content is likely due to model imperfections associated with the input data. The data in SoilGrids are modelled on the basis of information from nearly 240,000 soil profiles from around the world, dominated by data from agricultural land. In addition, it should also be taken into account that more than 60% of these data were collected between 1960 and 2000, 34% have an unknown sampling date and only 16% of soil profiles were surveyed between 2001 and 2020 (Poggio et al. 2021). Batjes et al. (2020) note that the sampling period should be taken into account when mapping using environmental covariates, especially when this refers to highly variable features that may be affected by land-use changes or natural disturbances. However, Poggio et al. (2021) considered that the spatial variation of dynamic features is greater than their temporal variation and this was neglected in the creation of SoilGrids.

The analyses and comparisons with field data show that SoilGrids provides reliable values for soil organic carbon content, bulk density and coarse fraction, which are important parameters for determining the amount of carbon stored in the soil. The SOC stock is a dynamic characteristic that is influenced by land use (Zhiyanski et al. 2016, Menichetti et al. 2017) and silvicultural practices (Ameray et al. 2021) in addition to the soil potential itself. The analysis of the average soil organic carbon (SOC) stock by soil type across the Rila Mountains corresponds to the observed SOC stock in forest soils at national scale reported by Stoeva and Kirova (2021). According to SoilGrids data, intersected with the soil map of the area, the mean SOC stock in Cambisols in Rila Mountains is 57.92 tC/ha, while the average SOC stock in Luvisols is 53.00 tC/ha. Stoeva and Kirova (2021) reported mean SOC stock values in interval 51-56 tC/ha in Cambisols and 43-53 tC/ha for Luvisols. Both studies show that the SOC stocks are characterised with large variability. It is highest in brown and dark-coloured forest soils, as well as in upland-meadow soils, which are typical for mountainous regions in Bulgaria (Hristov and Filcheva 2017). The reason for the large dispersion in the values of soil parameters (including SOC) is explained by the diverse conditions under which soil formation processes take place. Brown and dark-coloured forest soils in Rila Mountains develop in a moderately wet and cool climate under the influence of woody vegetation - mostly moist beech and conifer forests. The relief is hilly to mountainous and, in places in the Rila Mountains, it is highly dissected, which determines the development of erosion processes. Depending on the altitude, the different types of forest vegetation and the geological composition, certain differences in the characteristics of individual brown soils are noticeable, which is also clearly visible from the large dispersion in the SOC content in brown and dark forest soils (Fig. 7). The mountain-meadow soils, on the other hand, are developed under harsh climatic conditions, contributing to the formation of a peaty horizon, water retention and slowing down of the mineralisation of organic matter. This leads to humus accumulation, delayed chemical weathering, strong leaching and acidification.

Regarding the SOC content, the most significant influence is that of vegetation and mineralisation processes. In this respect, woody debris is the main source of organic matter, nitrogen and ash elements in the soil beneath forest ecosystems. Climatic factors in turn determine the processes of mineralisation and transport of organic matter and ash elements in depth. The cycling of substances under beech, spruce and alpine shrub formations is different, which has implications for the properties and characteristics of soils in the mountains and can be clearly seen by looking at the soil indicators and their variation with height, which is followed by the variation in vegetation and climatic conditions. When comparing soil organic carbon stock (org. C) data from SoilGrids with different forest vegetation zones (according to Bulgarian classification), there is a clear difference in average values and spread across various altitudes. The lowest forest zone has low C stock, while the high mountain zone has higher C stock (Fig. 8). There is no great difference in average soil C stock between the sub-zones at High Mountain area (above 2001 m a.s.l.), but there is a noticeable difference within the average SOC stock values between the sub-zones of the other two forest vegetation zones – the one of the Lower Plains and Foothills oak woodland (0-700 m a.s.l.) and the Mid-montane beech and conifer forests (701-2000 m a.s.l.). The difference in the mean SOC values of the Oak woodland zones is explained mainly by the influence of the land use and its legacy effect. Most of the agricultural land is present at the lowest altitude 0-500 m a.s.l., where the lowest average soil C stock is estimated. Regarding the difference in means of the sub-zones of the Mid-montane beech and conifer forests, the variation is explained by diversity of the environmental factors within this zone, such as terrain, climate and vegetation. Cambisols are the main soil type distributed within this zone. Based on the SOC analysis by soil type, Cambisols have the highest dispersion in SOC values. The average SOC value within the sub-zones differentiates substantially. These findings are consistent with the observations of Kyuleva (1979), who proposed the division of brown forest soils into three subgroups. The first subgroup shows transitional characteristics with the grey and cinnamon forest soils (Luvisols), the second consists of the typical brown forest soils (Dystric-Eutric Cambisols) and the third shows transitional characteristics with the dark-coloured forest soils (Umbric Cambisols).

Figure 8.

SOC stock distribution and density by vegetation zones and its altitude ranges. SOC data: SoilGrids.

SoilGrids data show that the SOC stock is also heavily influenced by the local climate, specifically temperature and moisture. This is evident when combining the SOC data from SoilGrids with information on climate. The highest soil carbon storage is observed in the cool and humid climate zone, which prevails in the Rila Mountains. This climate promotes the accumulation of organic matter in the topsoil, particularly fulvic acids (Hristov and Filcheva 2017). The humidity is also a relevant factor for SOC stocks in the investigation area. Even within similar temperature ranges, there is a significant difference in the mean soil carbon stock between cool, dry areas and cool, humid areas (Fig. 9).

Figure 9.

Distribution of the SOC stock data by climate zones (IPCC). SOC data: SoilGrids.

In warm, dry climates, soil carbon storage is low and humic acids dominate the organic matter. This is exemplified by the eroded and leached forest soils, which are typical in these climate zones and have some of the lowest SOC levels in the Rila Mountains.

The analysis confirms the importance of considering climate regions, soil types and vegetation when deriving country- or region-specific SOC stocks and soil carbon stock changes for ecosystem accounting purposes, especially when calculating greenhouse gas inventories under the UNFCCC. To achieve this, stratification should be applied to group soil measurements by climate region and soil type, ensuring that each stratum has sufficient data to minimise sampling error. When using international databases such as SoilGrids, rather than averaging all pixels or plots within a region, more precise approaches, such as averaging plots with similar climatic and soil conditions - even from neighbouring regions or countries - or using statistical methods to identify the most comparable plots, can provide more accurate results (Bellassen et al. 2023).

Conclusion and recommendations

SoilGrids data represent a valuable resource for assessing key indicators of soil health and SOC stocks, providing critical information for different ecosystem reporting domains, such as the Ecosystem Accounts (SEEA-EA) and GHG inventories. One significant advantage of SoilGrids is its accessibility and spatially explicit data. This makes it useful for the representation of the soil parameters in the ecosystem accounts, which are usually represented by maps.

SoilGrids aims to provide soil information on a global scale. However, its application at regional or local scales requires careful consideration. Independent verification of SoilGrids data with field observation from Rila Mountains (Bulgaria) demonstrates that SoilGrids data can be effectively utilised for regional analysis and reporting in Bulgaria within the natural capital accounting framework or for other reporting needs. However, special consideration should be given to the prediction of the N content and pH in mountainous area, as the study also demonstrated.

SoilGrids is also a vital resource for research and analysis when soil information is scarce, offering a means to fill data gaps. However, as a model output, SoilGrids possesses inherent limitations and uncertainties. Therefore, instead of relying on specific point data, it is advisable to conduct analyses at a broader scale, focusing on areas with similar environmental conditions (e.g. climate, elevation, vegetation, soil type) to those lacking data. This approach helps mitigate the uncertainties associated with the model while still providing meaningful insights.

Acknowledgements

The research that obtained these results was supported by the LTER-BG infrastructure (Agreement No. DО1-320/30.11.2023), purchased under the National Roadmap for Research Infrastructure, financially coordinated by the Ministry of Education and Science of the Republic of Bulgaria.

Author contributions

LS: Conceptualisation, Methodology, Analysis, Visualisation, Writing original draft. MZ: Writing, review and editing.

Conflicts of interest

The authors have declared that no competing interests exist.

References

Supplementary material

Suppl. material 1: Description of sample plots 
Authors:  Lora Stoeva, Miglena Zhiyanski
Data type:  excel file
Brief description: 

Description of sample plots from field studies of forest soils in the Rila Mountains.

login to comment