One Ecosystem :
Research Article
|
Corresponding author: Thi Ha Thanh Nguyen (nguyenthihathanh@hus.edu.vn)
Academic editor: Artur Gil
Received: 10 Dec 2021 | Accepted: 06 Feb 2022 | Published: 14 Feb 2022
© 2022 Kinh Bac Dang, Thi Ha Thanh Nguyen, Huu Duy Nguyen, Quang Hai Truong, Thi Phuong Vu, Hanh Nguyen Pham, Thi Thuy Duong, Van Trong Giang, Duc Minh Nguyen, Thu Huong Bui, Benjamin Burkhard
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Dang KB, Nguyen THT, Nguyen HD, Truong QH, Vu TP, Pham HN, Duong TT, Giang VT, Nguyen DM, Bui TH, Burkhard B (2022) U-shaped deep-learning models for island ecosystem type classification, a case study in Con Dao Island of Vietnam. One Ecosystem 7: e79160. https://doi.org/10.3897/oneeco.7.e79160
|
The monitoring of ecosystem dynamics utilises time and resources from scientists and land-use managers, especially in wetland ecosystems in islands that have been affected significantly by both the current state of oceans and human-made activities. Deep-learning models for natural and anthropogenic ecosystem type classification, based on remote sensing data, have become a tool to potentially replace manual image interpretation. This study proposes a U-Net model to develop a deep learning model for classifying 10 island ecosystems with cloud- and shadow-based data using Sentinel-2, ALOS and NOAA remote sensing data. We tested and compared different optimiser methods with two benchmark methods, including support vector machines and random forests. In total, 48 U-Net models were trained and compared. The U-Net model with the Adadelta optimiser and 64 filters showed the best result, because it could classify all island ecosystems with 93 percent accuracy and a loss function value of 0.17. The model was used to classify and successfully manage ecosystems on a particular island in Vietnam. Compared to island ecosystems, it is not easy to detect coral reefs due to seasonal ocean currents. However, the trained deep-learning models proved to have high performances compared to the two traditional methods. The best U-Net model, which needs about two minutes to create a new classification, could become a suitable tool for island research and management in the future.
neural network, encoder, decoder, RAMSAR, Con Dao Island
Currently, more than 100,000 islands have 500 million residents in total, encompass 20% of global biodiversity and provide the according sustenance (
Improved earth observation and analytical skills have transformed our perspective on our world, allowing for a more global perspective (
When using artificial intelligence, machine learning (ML) classify information, based on stored knowledge and do the work without any further assistance. Numerous research projects have used deep-learning methods to identify vegetation clusters, emphasising coastal and wetland habitats rather than island ecosystems (
Additionally, the deep learning models for land-cover classification have been commonly designed for inland or coastal ecosystems (
This study aims to develop the most suited deep-learning models, based on the U-shape-based neural network for classifying and monitoring ten ecosystems on a particular island in Vietnam using Sentinel-2 images. This study addresses three issues related to deep-learning-based ecosystem type classification on a particular island in Vietnam:
A 4-band Sentinel-2 image (including red, green, blue and near-infrared) and digital elevation models (DEMs) were utilised as input data for the U-Net (basic) models to categorise different island ecosystems. Land covers on an island in Vietnam of about 20 km x 25 km were built as a mask for training deep-learning models (Sections "study area" and "input dataset preparation"). An accuracy comparison was made between the results obtained from the trained U-Net models and two benchmark techniques, namely Random Forest (RF) and Support Vector Machine (SVM). Lastly, the new Sentinel-2 images taken since 2017 were used to analyse changes in land cover on Con Dao Island, Vietnam.
The Con Dao Island, which is a district of the Ba Ria-Vung Tau Province in the southeast of Vietnam, approximately 187 km from Vung Tau City, was chosen as the study area (Fig.
In addition to the typical habitats comprising woods, rivers, streams, lakes, sandbanks and residential areas, the study area also includes specific ecosystems, such as corals, seagrasses, shallow seas and deep seas. The mangrove forest ecosystem on Con Dao Island is narrow, with approximately 30 ha, located primarily on three sub-islands (
Based on remote sensing and GIS technology, Tuan (
The deep-learning model is set up based on three steps (Fig.
Three main steps to develop a U-Net model for island ecosystem type classification are shown in Fig.
Regarding the tide level, the land-sea boundary can be identified differently on the Sentinel-2 image between high and low tide during a day. Due to the tides in the research area fluctuating from 0.5-3.5 m, the boundary between land and sea can be identified in the elevation data from -2 m to +2 m. It could be a large coastal area. Therefore, the tidal information is also collected to correct the boundary between inland and wetland ecosystems obtained from DEM and Sentinel-2. According to the metadata of the Sentinel-2 images, seven images were taken at about 3:00 am. Meanwhile, the local tide at that time is about 2.0-2.3 m. Therefore, it does not make a large change of coastline in the seven images.
In addition to the cliff separation, based on ALOS and NOAA DEM data, the Sentinel-2 image obtained in February 2019 was integrated with the field mission in January 2021 to identify nine other island ecosystem types with cloud and its shadow. The initial stage of classification was image segmentation, based on the pixel using eCognition software (
Fieldwork was carried out in January 2021 at Con Dao Island, Ba Ria-Vung Tau Province, to verify the visual interpretation that was done indoors. It is difficult to find a good-quality Sentinel-2 image on an island due to the effects of the cloud and its shadow, especially a suitable image in 2020. Therefore, the fieldwork has been done in January 2021 when the image obtained in April 2021 was not published. To improve the accuracy during the fieldwork, the authors worked with the National Park managers in Con Dao Island to identify the stable area of ten island ecosystems during three years and then used them as samples. The area with high changes in land cover was eliminated in the sampling. With this method, the authors can identify correct samples in 2019. With the inland ecosystems, the authors could access them easily. With the wetland ecosystems, the authors had to use both boats and diving equipment for observation and sampling. Twenty polygons for each island ecosystem types for image interpretation were randomly selected to assess the accuracy, based on fieldwork samples. Each polygon was limited by the circular plots with a radius of 40 m. In total, 180 polygons (20 polygons x 9 categories = 180 polygons) were checked in the fieldwork and compared with the visual interpretation results from the satellite image obtained on 07/02/2019 (Fig.
In the study area, it is challenging to distinguish mangroves and corals on the images from a pixel-based classification because their total area is so small and scattered. However, these types of ecosystems are easily accessible in the field. Therefore, these types were added to the outcome of the U-Net model after the fieldwork. For natural forests, the vegetation density is high, so the pixels in the image have a relatively uniform reflectance spectrum with the tone of natural colours and the forest edges often have irregular shapes. For residential areas, due to the appearance of many different objects, such as buildings, gardens, roads and parks, the reflection spectrum is not uniform, with relatively clear boundaries. The spatial arrangement of the residential area manifests itself in the orderly repetition of colour tones and similar structures.
Based on the main sample characteristics, the authors interpreted ecosystem types, based on their colours, structures and shape from the segmentation process in the eCognition software (
The basic U-Net architecture is a supervised learning algorithm, based on a Convolutional Neural Network (CNN) to identify the classes of interest by modifying the parameters of convolutional filters (
The structure of the U-Net model for island ecosystem type classification is presented in Fig.
Based on the deep-learning approach, various methods have been used to optimise a U-Net model, such as the changes of training size, optimiser functions and loss function (
Regarding the optimisation, various loss functions were considered in this study. In most cases, the loss function has been used to calculate the quantity that the model should attempt to minimise throughout the training process. The mean squared error function is the most frequently used loss function in regression models, while the “cross entropy” loss function is the most commonly used loss function in classification models, based on probability calculations (
\(Lossvalue = - log(\frac {e^{V_p}} {∑_i^Ce^{V_i}})\) (Formula 1)
where Vi denotes the net's estimated scores for each class in 11 island ecosystem types and Vp denotes the network's estimated score for the positive class.
Different optimiser approaches may be used to build neural networks in order to reduce their related costs (e.g. loss of data information, training time and uncertainty). In this study, four optimiser types were applied including Adaptive Moment Estimation (Adam), Adaptive Gradient Algorithm (Adagrad), Adadelta and Stochastic Gradient Descent algorithm (SGD) (Fig.
In order to assess the performance of all trained U-Net models for ecosystem classification, based on an object-based approach, two traditional models, based on a pixel-based approach were generated, including random forest (RF) and support vector machine (SVM). As these two models were made, based on the pixel-based approach, the optimiser parameters are also different from the U-Net models, as follows:
Random Forest (RF) is a powerful algorithm in a supervised-learning class, based on the predicted results of decision trees for resolving problems in classification and regression. This algorithm was firstly introduced by Breiman and his group in 2001 (
During the training process, the RF decreases the bias and increases the variance of the model. From that, it avoids the over-fitting problem by passing the average of predictions (
SVM, or Support Vector Machine, is a popular supervised-learning algorithm that was first proposed in the 1970s (
The performance of SVM highly depends on the selection of kernel functions because it increases the flexibility in creating the decision boundaries of a dataset (
All models were implemented in a workstation (Intel Xeon Silver 4112 2.6GHz; Ram: 128GB DDR4 3200 MHz; Graphics: Nvidia Quadro RTX5000, 16GB, 4DP) using Python programming language via TensorFlow and Scikit-Learn frameworks. After completing both SVM and RF models, the results were compared with the best U-Net model to check the improvement of the selected deep-learning models for the island ecosystem type classification.
Once the optimal U-Net model for the classification of island ecosystem types using Sentinel-2 and DEM data have been established, its primary purpose was then to identify ten island ecosystem types with cloud and its shadow on new images. This research project concentrated on ten habitats on the Con Dao Island. Six new Sentinel-2 images in the specified region were selected for new interpretation across a three year period (2017, 2019 and 2021). Additionally, as described in above sections, data collection and pre-processing were performed. As soon as the new picture was fed into the trained U-Net, the model made use of the previously learned parameters to convert the new images into particular spatial matrices, creating intermediate matrices and to interpret the appropriate classes for each pixel in the new image. All of these prediction methods are self-contained and do not need additional training data.
Based on the changes in the training size, the number of filters and optimiser methods, 48 U-Net models were trained. The total accuracy and loss function values were used to compare the performance of these U-Net models (Table
Model performance of 48 trained U-Net models for island ecosystem type prediction.
No. |
Optimiser |
Size |
No. filters |
Loss |
ACC |
No. |
Optimiser |
Size |
No. filters |
Loss |
ACC |
|
1 |
Adam |
64 |
8 |
0.566 |
76.86 |
25 |
Adadelta |
64 |
8 |
0.579 |
76.52 |
|
2 |
Adam |
64 |
16 |
0.491 |
77.85 |
26 |
Adadelta |
64 |
16 |
0.514 |
77.36 |
|
3 |
Adam |
64 |
32 |
0.521 |
77.32 |
27 |
Adadelta |
64 |
32 |
0.488 |
78.11 |
|
4 |
Adam |
64 |
64 |
0.496 |
77.97 |
28 |
Adadelta |
64 |
64 |
0.469 |
78.48 |
|
5 |
Adam |
128 |
8 |
0.578 |
73.32 |
29 |
Adadelta |
128 |
8 |
0.564 |
74.63 |
|
6 |
Adam |
128 |
16 |
0.535 |
75.23 |
30 |
Adadelta |
128 |
16 |
0.547 |
75.13 |
|
7 |
Adam |
128 |
32 |
0.481 |
76.91 |
31 |
Adadelta |
128 |
32 |
0.488 |
76.38 |
|
8 |
Adam |
128 |
64 |
0.497 |
76.11 |
32 |
Adadelta |
128 |
64 |
0.442 |
78.21 |
|
9 |
Adam |
256 |
8 |
0.509 |
76.19 |
33 |
Adadelta |
256 |
8 |
0.516 |
75.71 |
|
10 |
Adam |
256 |
16 |
0.427 |
79.51 |
34 |
Adadelta |
256 |
16 |
0.469 |
77.37 |
|
11 |
Adam |
256 |
32 |
0.374 |
81.95 |
35 |
Adadelta |
256 |
32 |
0.456 |
78.33 |
|
12 |
Adam |
256 |
64 |
0.436 |
77.78 |
36 |
Adadelta |
256 |
64 |
0.167 |
93.36 |
|
13 |
SGD |
64 |
8 |
0.621 |
76.24 |
37 |
Adagrad |
64 |
8 |
0.606 |
76.59 |
|
14 |
SGD |
64 |
16 |
0.623 |
76.15 |
38 |
Adagrad |
64 |
16 |
0.579 |
77.03 |
|
15 |
SGD |
64 |
32 |
0.596 |
76.51 |
39 |
Adagrad |
64 |
32 |
0.552 |
77.13 |
|
16 |
SGD |
64 |
64 |
0.574 |
76.92 |
40 |
Adagrad |
64 |
64 |
0.521 |
77.41 |
|
17 |
SGD |
128 |
8 |
0.657 |
73.67 |
41 |
Adagrad |
128 |
8 |
0.633 |
74.09 |
|
18 |
SGD |
128 |
16 |
0.646 |
73.95 |
42 |
Adagrad |
128 |
16 |
0.565 |
74.71 |
|
19 |
SGD |
128 |
32 |
0.573 |
74.64 |
43 |
Adagrad |
128 |
32 |
0.516 |
75.67 |
|
20 |
SGD |
128 |
64 |
0.585 |
74.61 |
44 |
Adagrad |
128 |
64 |
0.506 |
76.46 |
|
21 |
SGD |
256 |
8 |
0.598 |
74.11 |
45 |
Adagrad |
256 |
8 |
0.562 |
74.39 |
|
22 |
SGD |
256 |
16 |
0.588 |
74.55 |
46 |
Adagrad |
256 |
16 |
0.455 |
77.77 |
|
23 |
SGD |
256 |
32 |
0.561 |
74.99 |
47 |
Adagrad |
256 |
32 |
0.461 |
77.91 |
|
24 |
SGD |
256 |
64 |
0.551 |
75.41 |
48 |
Adagrad |
256 |
64 |
0.372 |
82.49 |
Three U-Net models had an accuracy higher than 80%: the UNet-Adam-256-32, UNet-Adadelta-256-64 and UNet-Adagrad-256-64 models. Especially, the UNet-Adadelta-256-64 model was assessed to have the highest performance with an accuracy of 93.36% and a loss function value of 0.16 (Fig.
The accuracy of the island ecosystem type classification on Con Dao Island, based on the interpretation of five trained models is shown in Fig.
The Cross-Validation of three trained U-Net models and two benchmark models for the Island ecosystem type classification.
Sample distribution |
Class accuracy |
|||||
Type |
No. Samples |
UNet-Adam-256-32 |
UNet-Adagrad-256-64 |
UNet-Adadelta-256-64 |
SVM |
RF |
Ecosystem types |
||||||
Deep sea |
997 |
97.4 |
97.8 |
99.4 |
98.5 |
97.2 |
Sandy dunes |
962 |
74.8 |
76.0 |
79.5 |
77.5 |
26.8 |
Seagrass |
983 |
85.8 |
88.3 |
93.4 |
85.4 |
85.0 |
Residential area |
939 |
80.9 |
80.4 |
90.4 |
76.3 |
19.2 |
Natural forest |
990 |
97.4 |
97.2 |
98.6 |
97.8 |
97.2 |
Coral reefs |
852 |
27.0 |
30.4 |
64.9 |
21.8 |
3.2 |
Shallow water area |
992 |
77.6 |
85.0 |
98.7 |
59.1 |
48.1 |
Deep water area |
990 |
38.1 |
61.3 |
95.3 |
4.4 |
0.0 |
Other types |
||||||
Cloud shadow |
943 |
65.6 |
66.9 |
58.5 |
63.4 |
50.1 |
Cloud |
982 |
85.8 |
86.3 |
87.8 |
83.1 |
75.3 |
Total |
9,630 |
Overall accuracy (%) |
||||
73.0 |
77.0 |
86.6 |
66.7 |
50.2 |
||
Kappa Coefficient |
||||||
0.7 |
0.8 |
0.9 |
0.6 |
0.5 |
The accuracy comparison between the three U-Net models and the two benchmark models with new predictions is shown in Table
Fig.
It is worthwhile to have a tool that suits the specific needs of different stakeholders (e.g. land managers). This research project developed different deep-learning models to interpret ten different inland and offshore ecosystem types on the famous Island of Con Dao, Vietnam. As island ecosystems are commonly affected by both local and global climates, especially by storms and waves, the land cover of all ecosystems can change rapidly during rainy and dry seasons. Previous studies have already developed classification models for inland and coastal wetland ecosystems; however, some island ecosystems, such as coral reefs and seagrass, were not identified. The addition of these two ecosystems in the trained models can meet the needs of island managers. In comparison, generating an island land-cover map using conventional interpretation techniques with actual field samples may take considerable time. Meanwhile, the UNet-Adadelta-256-64 model can effectively and quickly interpret ten different island ecosystem types, clouds and their shadows from recent satellite images using training weight and calibration results contained in the trained model.
In addition to the former inland ecosystems, clouds with their shadows and seven wetland ecosystem types, based on the RAMSAR and MONRE classification systems, were added to the trained models. The addition of seven wetland ecosystem types is the first difference in comparison to all other models that were developed in previous studies (
As the research area is a small island, where the training and testing samples were collected in one year, the U-Net models could not clearly detect coral reefs, mangroves or sandy dunes. All islands are affected by currents waves and annual storms, leading to partly dramatic changes in the offshore sediments and climate. In particular, coral reefs can develop in waters with temperatures ranging from 20-32°C. During the rainy season, they can easily vanish when a wave or current containing offshore sediments flows over them, converting them to shallow water cover. Meanwhile, the mangrove ecosystems commonly develop in coastal areas. Therefore, the areas of coral reefs and mangroves observed on islands are rather small. As a powerful function of deep-learning methods, all U-Net models enable developers to update trained models with new data in order to build more accurate models. When more samples are available, sophisticated models may predict more accurately the kind of island ecosystem and offer more management choices. The multi-temporal remote sensing data can be used in this step to optimise the total accuracy, as well as the accuracy of coral reef interpretation. As the area of mangrove ecosystems is too small in the research area, it is necessary to collect more mangrove samples in coastal areas. However, the addition of coastal mangroves can improve the variety between the island and coastal ecosystem types. Therefore, to improve this issue, we think the SAR data from Sentinel-1 or data related to sea surface topography, sea and land surface temperature and ocean and land surface colour, calculated from Sentinel-3, can improve the interpretation of mangrove and coral reefs. However, they all are new sensors and require more research in the future. Some application of SAR data for analysing climatic condition was also mentioned in different Data Cube in European and Asian countries and can correct the distribution of mangrove and coral reefs. However, the resolution of these data is still low. The high-resolution images obtained, for instance, from Lidar or unmanned aerial vehicles (UAVs) can also be used to monitor this specific ecosystem in the future.
The development of 48 U-Net models for island ecosystem categorisation is expensive and time-consuming. A CPU Intel (R) Xeon (R) CPU @ 2.6GHz with 32GB RAM and a GPU NVIDIA GeForce GTX1070 were built for this study. Each U-Net model took from 30 to 40 seconds to train each epoch. Additionally, each RF and SVM model takes 60 to 70 seconds to train, on average. Even though it takes a while to train a U-Net model, fresh data may be used to update a learned model. Future U-Net models may benefit from adopting other optimisation methods, such as evolutionary or swarm intelligence, in place of a traditional optimisation method; or using fresh multi-spectral satellite image data to gain additional knowledge. High-resolution data may be used with a supercomputer to quickly interpret all kinds of (island) ecosystem types.
This study demonstrated the benefits of combining deep-learning and remote-sensing data for monitoring island ecosystem types. Besides interpreting new satellite images in any coastal region at any moment, the UNet-Adadelta-256-64 model was developed to interpret the distribution of ten island ecosystem types, as well as clouds and their shadows. The accuracy of the model reached 93%, with a loss function value of 0.16. The best-trained U-Net model was utilised to effectively identify the island ecosystem types on Con Dao Island within six years using Sentinel-2 data. A total of 11 different ecosystem types was found on Con Dao Island. Besides comparably common ecosystem types, characteristic coral reefs and seagrass can be found surrounding the Island, whereas the distribution of the shallow water ecosystems depends on the season and currents. After five years, the mainland ecosystems have not changed, except for residential areas due to urbanisation. Land-use managers could use the data and approaches to monitor ecosystem dynamics on islands every season instead of using traditional methods that assess changes every five years. It may be possible to retrain the model with additional samples in the future and use it to categorise ecosystems on other islands.
The paper was conducted under the science and technology project of the Vietnamese Ministry of Natural Resources and Environment: "Research and Evaluate the Coastal Wetland Ecosystem of Con Dao National Park, Propose Solutions for Conservation and Sustainable Development," code TNMT.2021.562.07. The authors would like to thank Mrs. Angie Faust for language corrections of the manuscript.
The science and technology project of the Vietnamese Ministry of Natural Resources and Environment: "Research and Evaluate the Coastal Wetland Ecosystem of Con Dao National Park, Propose Solutions for Conservation and Sustainable Development," code TNMT.2021.562.07
Research and Evaluate the Coastal Wetland Ecosystem of Con Dao National Park, Propose Solutions for Conservation and Sustainable Development.
VNU Institute of Vietnamese Studies and Development Sciences, Vietnam National University.