Unrecognised water limitation is a main source of uncertainty for models of terrestrial photosynthesis

Abstract

Quantification of environmental controls on ecosystem photosynthesis is essential to understand the impacts of climate change and extreme events on the carbon cycle and the provisioning of ecosystem services. Machine learning models have become popular for simulating ecosystem terrestrial photosynthesis because of their predictive skill, but often do not consider temporal dependencies in the data, even though process understanding suggests that these should exist. Here, we investigate how models that account for temporal structure impact the prediction of ecosystem photosynthesis. Using time-series measurements of ecosystem fluxes paired with measurements of meteorological variables from a network of globally distributed sites (N=104) and remotely sensed vegetation indices, we train three different models to predict ecosystem gross primary production (GPP): a mechanistic, theory-based photosynthesis model, a memoryless multilayer perceptron (MLP) and a recurrent neural network (Long Short-Term Memory, LSTM). Through comparisons of patterns in model error, we assess the ability of these models to predict GPP across a wide diversity of ecosystems and climates, and to account for temporal dependencies, with a focus on effects by low rooting zone moisture and freezing air temperatures. While both deep learning models outperform the mechanistic model, we find their overall performance is similar, with an R2 of 0.79 spatial out-of-sample predictions for both models. Overall, model skill is consistently good across moist sites with strong seasonality. During periods affected by temporal patterns such as drought and frost, the LSTM shows lower model error than the MLP as well as an LSTM with shuffled input, showing that there is an advantage from learned temporal dependencies. Generalisation patterns reveal that the LSTM tends to be more successful than the (time-agnostic) MLP in simulating GPP in dry environments. However, a large variability in model skill across relatively dry sites remained. This was not resolved by the inclusion of additional earth observation data, although this improved overall model performance. Insufficient information on the exposure and response to water stress and related effects on GPP appear to be dominant sources of error for modelling ecosystem fluxes across the globe. With the increasing frequency of hydroclimatic extreme events, effects of water limitation are expected to become more prevalent, which calls for models that better represent its impact on ecosystem function.

Publication
Biogeosciences
Samantha Biegel
Samantha Biegel
PhD Student, co-hosted ETH AI-Center