NEW APPROACH FOR ANALYZING MARINE ECOSYSTEM STRUCTURE USING BAYESIAN NETWORKS
Аннотация и ключевые слова
Аннотация (русский):
Aquatic ecosystems of the Black Sea are complex multiparametric systems with a hierarchical structure. Thus, the main goal of our research was to investigate possibilities of using Bayesian networks to study the structure fo the natural systems in the Black Sea. We used CMEMS Black Sea environmental dataset, which consists of 7 different variables, that, in our opinion, can describe structural relations in the Black Sea ecosystem – sea surface temperature and salinity, concentrations of nitrates and phosphates, amount of chlorophyll-a and net primary production and also dissolved oxygen concentration. We think, that these variables can generally define interactions in water environment of the Black Sea, organisms, that live there and human activity. As a modelling result, we receive a structure of environmental variables interactions. At the top of this structure is a dissolved oxygen, as a final result of the ecosystem functioning. Further, we think it’s more appropriate to use Dynamic Bayesian networks for investigation of spatio-temporal changes to distinguish main drivers of changes and provide more balanced management of natural territories.

Ключевые слова:
machine learning, Black Sea, Bayesian networks, ecosystem analysis
Текст
Текст произведения (PDF): Читать Скачать

Introduction

Ecosystem analysis — is one of the most important scientific methods of receiving information about characteristics of any area [Bourlière and Hadley, 1973]. Main goal of the ecosystem analysis is studying and assignment of natural ecological systems, defining their functional potential, georeferencing, ecological and social position [Krivoguz and Borovskaya, 2020; Milns et al., 2010]. Also, it can give an informational basis for more objective evaluation of environmental and resource state for future optimization, predicting of their state, qualitative and quantitative environmental monitoring, etc. Due to the hierarchical structure, ecosystems are a good choice for modern modelling and analytical approaches using machine learning algorithms. Modelling is a key method of ecosystem analysis, which helps to investigate ecosystems in time and space. Aquatic ecosystems are a complex multiparameter systems, described by non-stationary and non-linear processes. Large amount of direct and opposite connections from all of the ecosystem elements, make its behavior unpredictable for researchers. That’s why prediction accuracy of some ecosystem elements wittingly too low [Lehikoinen et al., 2019]. In this case, ecosystem analysis using machine learning algorithms is a key for increasing predicting performance on modelling.

Objective-oriented approach, used in our paper is that analyzed ecosystem of the Black Sea consists from some variety of objects with some certain properties, which are interact with each other. For example, it can be some physical-chemical properties of water environment (salinity or temperature) which interact with biota or, in our case, phytoplankton, that defined by such factors, as chlorophyll-a concentration and net primary production values.

One of the most widespread use of Bayesian networks is in area of ecosystem services. For example, Landuyt D. [Landuyt et al., 2013] used data about different characteristics of trees and population density combining with Bayesian network to understand amount of wood production and extraction in future. Great example of understanding structure of ecosystem presented in [Milns et al., 2010], where they’re investigate inter-habitat networks for different spatial scales of Peak District National Park, United Kingdom. Some interesting implementations of Bayesian networks and aquatic

Figure 1: Seasonal zoning of the Black Sea. A — Winter; B — Spring; C — Summer; D — Autumn. [Krivoguz, 2020]

environment presented in [Havron et al., 2017], where using different environmental factors, such as amount of silt and sand, salinity, temperature, total organic carbon, nitrogen and other variables they’re mapped marine habitat suitability and uncertainty.

Thus, we think, that using a Bayesian networks can become an important part of more complex spatio-temporal prediction system in future. In this case, Bayesian network helps us to understand not only the structure of the system, but also connections of element’s interactions between each other. Further, with analysis of temporal patterns, this approach will help us to investigate the structure of natural systems on different stages of its spatiotemporal evolution. This will help us more accurate identify different factors, having the greatest impact and leading to her transforming.

Some research about investigation of ecosystem structure for fisheries were made by N. Trifonova [Trifonova et al., 2015, 2017]. In their study, they use a dynamic Bayesian network model with a hidden variable and spatial autocorrelation to explore the future of different fish and zooplankton species, given alternate scenarios, and across spatial scales within the North Sea.

Thus, the goal of our research is to investigate possibilities of using Bayesian networks, as a tool for studying the structure of the natural systems on the example of the Black Sea.

Materials and methods

Research area

Geographical conditions. Black sea is situated between 46◦380 and 40◦540N and 27◦210 and 41◦470E. It isn’t isolated from the World Ocean [Tamaychuk, 2017]. Through the Bosporus Strait

it’s connected with the Sea of Marmara, that is connected with Aegean Sea through the Dardanelle Strait. The averaged area of the Black Sea is 420,000 km2 and volume about 555,000 km3 [Barratt, 1993].

Geological and tectonic conditions. Black sea is situated in the depression between Anatolian peninsula and South-Eastern Europe. This depression was formed in Miocene with an active mountain building processes, that divided Tethys ocean into several separate waterbodies [Ozsoy and Unluata, 1997].

Rocks of the Black Sea consists from coarsegrained deposits, like pebbles, gravel and sand [Ivanov and Belokopytov, 2011]. Offshore sediments presented by fine-grained sands and silts, while in the North-Western part of the sea, by shell rocks.

Climate. Climate of the Black Sea can be characterized as continental. Shores of Caucasus and Southern Crimea is covered with mountain ranges

from cold northern winds. Atlantic Ocean has a critical impact on the climate of the Black Sea [Ivanov and Belokopytov, 2011]. Averaged air temperature in January in the North part of the Black Sea is about

— − −

1◦C to 5◦C, but rarely it can go lower to 10◦C. Areas in the southern part of the Black Sea is quite

warmer, with mild winters, when temperature don’t get lower then 5◦C [Ivanov and Belokopytov, 2011]. In June averaged air temperature in the North-

ern part of the Black Sea is 22–25◦C, but sometimes it can reach 35◦C [Ozsoy and Unluata, 1997]. The maximum amount of precipitation drops near

Caucasus shore of the Black Sea (about 1500 mm annually), while the minimum — on North-Western part (about 300 mm annually). Averaged sea surface temperature doesn’t fall lower then 7◦C [Ivanov and Belokopytov, 2011].

Natural zones. Studying water objects important to know the seasonal differentiation of zones, due to their very high rate of changing in time. Comparing with land areas, water objects doesn’t be stable in the large period of time and the spatial distribution of the different factors can change from season to season (Figure 1).

In winter season distribution of zones spread from East to West. The most important role plays the interaction with the Sea of Marmara, river flows in the Norther part of the Black Sea, Caucasus and areas near the Southern Crimean and Kerch peninsula shores.

In spring, zonation of the Black Sea comes from North to South. Main role here plays large amount of river flows from such rivers as Dnieper, Danube and Dniester in North-Western part of the Black Sea and inflow water from the Sea of Marmara. Due to the interactions of these two water masses, radically different by their properties, they form transitional zone between them from Kerch strait to Danube estuary.

Data

In this research we used monthly averaged data from Copernicus Marine Environmental Monitoring Service (CMEMS) — Black Sea Reanalysis, which contains 7 spatial variables of different environmental factors, describing the Black Sea ecosystem [Lima et al., 2020].

For building the Black Sea ecosystem structure using Bayesian network we used 7 environmental variables: sea surface temperature (SST), sea surface salinity (SSS), nitrates and phosphates, chlorophylla, net primary production (PPN) and dissolved oxygen.

Distribution of environmental variables of the Black Sea presented in Table 1. Concentration of nitrates ranges from 0 to 140.19 mmol/m−3, but usu-

ally it values varies around 3–4 mmol/m−3. Higher

values (about 140 mmol/m−3) can be explained by flows from the farmlands or other human activity. Concentration of phosphates in the Black Sea waters varies from 0.88 to 5.48 mmol/m−3. Statistical analysis showed that usually phosphates concentrations lays near its maximum, an reaches its minimum only sometimes. Amount of dissolved oxygen varies from 240.37 mmol/m−3 to 287.66 mmol/m−3 and its fluctuations mainly caused by seasonal dynamics. Amount of net primary production in the Black Sea varies from 0.40 mol/m−3day−1 to

27.76 mmol/m−3day−1, that with the concentra-

tion of the chlorophyll-a, which is in range from

0.01 mg/m3 to 15.07 mg/m3 explained by seasonal growth of phytoplankton in some areas of the Black Sea mostly in spring and summer. Variations of sea surface temperature and salinity are from 12.97◦C

to 16.94◦C and from 11.87 ‰ to 18.50 ‰ corre-

spondingly with the lowest values in the North-

Western part of the Black Sea, due to the high amount of water flows from big rivers, and maximum in the South-Eastern part of the sea.

2.3 Algorithm description

Bayesian networks often use in ecology, where they have two main directions of application. First, when we need to understand basic principles of ecosystems functions. In this case, the research bases on edges of Bayesian network to investigate functional relations in ecosystem or on rules, used for constructing conditional probabilities for node to investigate principles of factor’s interactions [Scutari, 2010]. The second way is to evaluate values of model and giving us an empirical information, that is useful and related to the key ecological predictors [Chen and Pollino, 2012].

Bayesian networks is a good tool for describing the complex systems and events with uncertainty. The main idea of using Bayesian networks is a decomposition of complex system into several simple elements [Bendtsen, 2017]. Bayesian network is a directed acyclic graph that represents the structure of nodes and edges, where nodes are responsible for some variables and edges represents relations between them [Margaritis, 2003]. Commonly, Bayesian network also called as «Deep network» due to its complicated structure [Cooper and Herskovits, 1992; Singh and Valtorta, 1995]. Presence of the edge between two nodes indicates, that there’s a statistical dependency between two variables [Koski and Noble, 2012]. In some cases, the direction of the edge interpreted as a presence of casual relationship between variables[Friedman et al., 1997]. Generally, Bayesian network represents as:

The node Xi is a parent of the node Xj, if they’re connected by edge from Xi to Xj. Respectively, the node Xj is a child to the node Xi. If the node Xi is

Table 1: Annual distribution of environmental variables in the Black Sea

where probability Xi depends from probability of corresponding parent node and can be represented by a random value.

The concept of Bayesian network mainly based on two things — conditional probability and joint probability. Conditional probability of event X — is a numeric value of probability, that event X will occur on a condition, that event Y had occurred. The probability for one event can be calculated as (2) [Marcot, 2012].

Figure 2: Simple representation of a Bayesian network with events X, Y and Z.

Fork patter is similar to chain but in this case, dependence is reverse — that X Y Z. Collider pattern is differed for others. In this case, the event Y is depend both from event X and from event Z or X Y Z [Marcot and Penman, 2019; Zhou et al., 2020].

P(X) = n(x)

N

(2)

Results and discussion

where n — investigated events, N — all possible events.

For two events, if and is a dependent events, probability calculates by (3) [Marcot, 2012].

Through the constructed Bayesian Network (Figure 3), we can see patterns, that shows dependency between main components of the Black Sea ecosystem. Sea surface salinity and sea surface tempera-

|

P(X Y) = P(X  Y)

P(Y)

(3)

ture is a basis of the network. Each of these factors are independent from other in ecosystem and all of

If X and Y are independent events, probability calculates by (4) [Marcot, 2012].

P(X|Y) = P(X) (4)

i.e. investigated event occuring is equally same to each other.

Joint probability is a definition of statistical measure for two or more events, that are occurring at the same point of time, i.e. if events X, Y and Z occurs together, their joint probability can be defined by (5) [Marcot, 2012].

P(X  Z) (5)

The structure of a simple Bayesian network can be described by three junction patterns: chain, fork and collider (Figure 2) [Margaritis, 2003].

Chain pattern means that event X depends of event Y, which depends of event Z or X → Y → Z.

the patterns starts from them.

We divided this system into 4 levels, according to location of analyzed factors in our network. Each of this level Ln has «in-factors» (some factors from level Ln 1, which are connected with any of the factor in the level Ln) and «out-factors» (some factors from level Ln, which are connected with any of the factor in the level Ln+1). Each of these factors describes some event or state of this factors according to other in same state of the time.

For building and learning the structure of this network we used R-package «bnlearn» on Black Sea environmental dataset [Nagarajan et al., 2013]. To create Bayesian network we loaded dataset with environmental variables and created undirected and unweighted structure using «hc» command. Next, we set directions of undirected edges of the network using «iamd». This means that direction of the edges will be chosen automaticaly, based on the input data, instead of the setting them manually.

Figure 3: Systematic connectivity of environmental factors in the Black Sea modeled by Bayes network.

After this step we fitted of our model using unweighted structure and directions of the edges to create visualization of the ecosystem structure using «bn.fit» command.

Sea surface temperature in one of the most important factors of the environment, defining not only a spatial distri bution of the ecosystems, but also their biodiversity. For example, water temperature can impact on metabolism speed level of marine organisms and speed of photosynthetic reactions for algae and flora. The main factor influencing the level of sea surface temperature is solar radiation.

Sea surface salinity is also an important factor of any marine ecosystem. Lifecycle of many marine organisms highly depends to the salinity level. This is due to the peculiarities of osmosis — the ability to penetrate into a living cell of the body and vice versa, depending on the concentration of substances dissolved in water, until equilibrium is reached. Main driver of the salinity level changing is an amount of precipitation and the level of evaporation from the sea surface.

Nitrates and phosphates situated higher then salinity and temperature due to their dependency from temperature and salinity and also to the sources. The main driver, defining the level of ni-

trates and phosphates in aquatic environment of the Black Sea is a human activity, through river flows.

Higher from phosphates and nitrates are situated chlorophyll-a and net primary production. Generally, these two different factors describing the amount of the phytoplankton in the Black Sea waters. Also, these factors are dependent from temperature, salinity, nitrates and phosphates.

At the top of the Bayesian network is dissolved oxygen. It’s a key factor, that is a result of interactions between other factors.

Figure 4: Out-factors layer of level-1.

Schemes of the first-level factors temperature and salinity is quite similar (Figure 4). The only

difference between them is an interaction with chlorophyll-a. The main cause of it is a strong impact of sea temperature on distribution of phytoplankton. This means that if there are any changes in sea surface temperature it will lead to changes in level of the chlorophyll-a amount. On the other hand, sea surface salinity doesn’t have any impact on chlorophyll-a amount directly, but mainly though more complex pattern by interactions with another factors.

Figure 5: In-factors layer of the level-2.

In-factors graph (Figure 5) shows some similarities between level-2 factors nitrates and phosphates. The main difference between them is some impact of the nitrates level and amount of phosphates. Due to the fact, that the source of these factors mainly is an anthropogenic activity, the nature of this dependence can be outside of the Black Sea ecosystem. This means that if the main source of the phosphates and nitrates here is a flow from the rivers of the Black Sea basin, this dependence forms on the stage, when phosphates and nitrates fall into them, not with the functioning of the ecosystem.

Figure 7: In-factors layer of the level-3.

duction. When its level decreases, then the amount of fixated carbon decreasing too, but it makes no effect into chlorophyll-a amount in aquatic environments. Phosphates and nitrates are well-known factors of phytoplankton growth, while temperature mainly impacts on their spatial distribution. Net primary production is a derivative of an amount of chlorophyll-a and correspondingly phytoplankton biomass.

Figure 8: Out-factors layer of the level-3

Out-factor graph of this ecosystem (Figure 8) level quite similar, cause mainly these 3 factors describes phytoplankton activity and extraction of by-product of their lifecycle — oxygen. This connection has a direct impact, so when biomass of phytoplankton decreases, then the amount of dissolved in water oxygen will decreasing too.

Figure 6: Out-factors layer of the level-2.

Out-factors graphs are quite different for nitrates and phosphates (Figure 6). While changing in phosphates level impact only on phytoplankton, nitrates has move wide interactions with ecosystem of the Black Sea. If we discard interaction between phosphates and nitrates, the only difference between them will be an interaction with the dissolved oxygen, as a top-level factor of our Bayesian network. The main cause of this is a participation of nitrates in different chemical reactions in aquatic environment with using oxygen, that contains in water.

In-factor graph of level-3 factors (Figure 7) chlorophyll-a and net primary production differs mainly by impact of sea surface salinity. Generally, salinity has a direct impact into net primary pro-

Figure 9: In-factors layer of the level-4

In-factor graph of the last 4th level (Figure 9) shows pattern of the dissolved oxygen dependency with different factors of the Black Sea ecosystem. These relations can be described with all of the analyzed factors except phosphates, which, as we think, has just an indirect impact. Mostly, these factors make part in different physic-chemical processes, that involves consumption or extraction of oxygen,

so any changes in levels of these factors will lead to increasing or decreasing in the amount of dissolved oxygen, contained in the Black Sea ecosystem.‌

Conclusion

In this paper we suggest using of Bayesian networks for investigation and analysis of natural systems structure. We used CMEMS Black Sea environmental dataset, which consists of 7 different variables, that, in our opinion, can describe structural relations in the Black Sea ecosystem — sea surface temperature and salinity, concentrations of nitrates and phosphates, amount of chlorophyll-a and net primary production and also dissolved oxygen concentration. We think, that these variables can generally define interactions in water environment of the Black Sea, organisms, that live there and human activity.‌‌‌

As a modelling result, we receive a structure of

environmental variables interactions. At the top of this structure is a dissolved oxygen, as a final result of the ecosystem functioning.

Further, we think it’s more appropriate to use Dynamic Bayesian networks for investigation of spatiotemporal changes to distinguish main drivers of changes and provide more balanced management of natural territories.

 

Список литературы

1. Barratt, L., Black sea oceanography, Reviews in Fish Biology and Fisheries, 3(2), 199-200, doihttps://doi.org/10.1007/BF00045240,1993.

2. Bendtsen, M., Gated Bayesian Networks, 212 pp., Linköping University, Linköping, 2017.

3. Bourlière, F., and M. Hadley, Ecosystem Analysis: Combination of Qualitative and Quantitative Approaches, in Ecological studies, pp. 1-6, Springer, Berlin, Heidelberg,doihttps://doi.org/10.1007/9783642855870_1, 1973.

4. Chen, S. H., and C. A. Pollino, Good practice in Bayesian network modelling, Environmental Modelling and Software, 37, 134-145, doihttps://doi.org/10.1016/j.envsoft.2012.03.012,2012.

5. Cooper, G. F., and E. Herskovits, A Bayesian method for the induction of probabilistic networks from data, Machine Learning, 9(4), 309-347,doihttps://doi.org/10.1007/bf00994110, 1992.

6. Friedman, N., D. Geiger, and M. Goldszmidt, Bayesian Network Classifiers, Machine Learning, 29(23), 131-163, doihttps://doi.org/10.1023/a:1007465528199, 1997.

7. Havron, A., C. Goldfinger, S. Henkel, B. G. Marcot,C. Romsos, and L. Gilbane, Mapping marine habitat suitability and uncertainty of Bayesian networks: a case study using Pacific benthic macrofauna, Ecosphere, 8(7),e01,859, doihttps://doi.org/10.1002/ecs2.1859, 2017.

8. Ivanov, V., and V. Belokopytov, Oceanography of the Black Sea, 212 pp., ECOSY Gidrofizika, Sevastopol, 2011.

9. Koski, T. J., and J. Noble, A Review of Bayesian Networks and Structure Learning, Mathematica Applicanda, 40(1),doihttps://doi.org/10.14708/ma.v40i1.278, 2012.

10. Krivoguz, D., Methodology of physiography zoning using machine learning: A case study of the Black Sea, Russian Journal of Earth Sciences, 20(1), 1-10,doihttps://doi.org/10.2205/2020ES000707, 2020.

11. Krivoguz, D., and R. Borovskaya, Methodological aspects of natural territories zoning using machine learning, Monitoring systems of environment, 39(1), 13-20,doihttps://doi.org/10.33075/22205861202011420, 2020.

12. Landuyt, D., S. Broekx, R. D’hondt, G. Engelen, J. Aertsens, and P. L. Goethals, A review of Bayesian belief networks in ecosystem service modelling, Environmental Modelling & Software, 46, 1-11,doihttps://doi.org/10.1016/j.envsoft.2013.03.011, 2013.

13. Lehikoinen, A., J. Olsson, L. Bergström, U. Bergström, A. Bryhn, R. Fredriksson, and L. Uusitalo, Evaluating complex relationships between ecological indicators and environmental factors in the Baltic Sea: A machine learning approach, Ecological Indicators, 101,117-125, doihttps://doi.org/10.1016/j.ecolind.2018.12.053, 2019.

14. Lima, L., S. Masina, S. A. Ciliberti, E. L. Peneva, S. Cretí, L. Stefanizzi, R. Lecci, F. Palermo, G. Coppini, N. Pinardi, and A. Palazov, Black Sea Physical Reanalysis (CMEMS BSCurrents),doihttps://doi.org/10.25423/cmcc/blksea_reanalysis_phys_007_004,2020.

15. Marcot, B. G., Metrics for evaluating performance and uncertainty of Bayesian network models, Ecological Modelling, 230, 50-62,doihttps://doi.org/10.1016/j.ecolmodel.2012.01.013, 2012.

16. Marcot, B. G., and T. D. Penman, Advances in Bayesian network modelling: Integration of modelling technologies, Environmental Modelling & Software, 111(1), 386-393, doihttps://doi.org/10.1016/j.envsoft.2018.09.016, 2019.

17. Margaritis, D., Learning Bayesian network model structure from data, Tech. rep., Carnegie Mellon Univ Pittsburgh Pa School of Computer Science, 2003.

18. Milns, I., C. M. Beale, and V. A. Smith, Revealing ecological networks using Bayesian network inference algorithms, Ecology, 91(7), 1892-1899, 2010.

19. Nagarajan, R., M. Scutari, and S. Lèbre, Bayesian Networks in R: with Applications in Systems Biology, 1-157 pp., Springer New York, doihttps://doi.org/10.1007/9781461464464,2013.

20. Ozsoy, E., and U. Unluata, Oceanography of the Black Sea: a review of some recent results, Earth Science Reviews,42, 231-272, 1997.

21. Scutari, M., Learning Bayesian networks with the bnlearnR Package, Journal of Statistical Software, 35(3), 1-22,doihttps://doi.org/10.18637/jss.v035.i03, 2010.

22. Singh, M., and M. Valtorta, Construction of Bayesian network structures from data: A brief survey and an efficient algorithm, International Journal of Approximate Reasoning, 12(2), 111-131, doihttps://doi.org/10.1016/0888613X(94)00016V, 1995.

23. Tamaychuk, A. N., The space heterogeneity of natural conditions and the division of the Black Sea, Regional Research of Russia, 149(2), 30-50, 2017.

24. Trifonova, N., A. Kenny, D. Maxwell, D. Duplisea,J. Fernandes, and A. Tucker, Spatiotemporal Bayesian network models with latent variables for revealing trophic dynamics and functional networks in fisheries ecology, Ecological Informatics, 30, 142-158,doihttps://doi.org/10.1016/j.ecoinf.2015.10.003, 2015.

25. Trifonova, N., D. Maxwell, J. Pinnegar, A. Kenny, and A. Tucker, Predicting ecosystem responses to changes in fisheries catch, temperature, and primary productivity with a dynamic Bayesian network model, ICES Journal of Marine Science, 74(5), 1334-1343,doihttps://doi.org/10.1093/icesjms/fsw231, 2017.

26. Zhou, J., P. G. Asteris, D. J. Armaghani, and B. T. Pham, Prediction of ground vibration induced by blasting operations through the use of the Bayesian Network and random forest models, Soil Dynamics and Earthquake Engineering, 139(1), 106,390,doihttps://doi.org/10.1016/j.soildyn.2020.106390, 2020.

Войти или Создать
* Забыли пароль?