The data-based mechanistic approach to the modelling, forecasting and control of environmental systems

https://doi.org/10.1016/j.arcontrol.2006.05.002Get rights and content

Abstract

The paper presents a unified approach to the modelling, forecasting and control of natural and man-made environmental systems. The modelling approach exploits the author’s Data-Based Mechanistic (DBM) modelling philosophy, combined with powerful methods of recursive statistical estimation. These provide the basis for two major stages of model building: first, the critical evaluation of the over-parametrized simulation models that are currently the most common vehicle used in environmental planning and management studies; and second, the adaptive, data-based estimation of parsimonious, ‘top-down’ models that can be used for adaptive forecasting and data assimilation, as well as operational control and management system design. The associated control system design methodology is based on the Non-Minimal State Space (NMSS) approach to the design of Proportional-Integral-Plus (PIP) control systems, based on the DBM models obtained at the previous modelling stage. The paper includes a case study concerned with the modelling and control of globally averaged levels of CO2 in the atmosphere.

Introduction

Throughout the 20th century, the hypothetico-deductive approach to scientific research, as extolled so lucidly by the philosopher Karl Popper (Popper, 1959), reigned supreme. The inductive approach, that had provided the cornerstone for scientific research in earlier centuries, became less attractive to the physicists and chemists who dominated science during the 20th century and whose well planned experimentation provided some of the stimulus for Popper’s views. Moreover, the rise of the computer, with its ability to construct and solve large mathematical simulation models, provided a magnificent engine for the implementation of the hypothetico-deductive approach. Today, such computer-based simulation modelling has become straightforward, almost simple, with the availability of iconographic software, such as Matlab/Simulink , where a complex model can be assembled quickly from a built-in and comprehensive library of simulation objects.

All this would appear to be good news for environmental scientists involved in the management and planning of environmental systems. But is it? While acknowledging the virtues of simulation modelling, particularly when it is carried out in stochastic terms, this paper will discuss the limitations of this approach when it is used in the context of environmental systems, where planned experimentation is difficult, if not impossible, and where uncertainty about the nature of the processes involved sits uncomfortably with the deterministic models that appear to dominate simulation modelling practice.

This paper will outline a Data-Based Mechanistic (DBM) approach to modelling, forecasting and control that often starts with the construction and evaluation of a simulation model which reflects the scientists’ perception of the physical, chemical and biological mechanisms that characterise the environmental system. However, these DBM studies are not simple exercises in simulation modelling: rather they constitute a critical evaluation of the model in both stochastic and response terms; an evaluation that marks only the beginning, not the end, of the modelling process.

Exploiting some of the tools of DBM modelling that are later applied to real data, this critical evaluation considers the simulation model as a natural extension of the thought processes and scientific speculation that resides in the mind of the model builder. And, by providing insight into the strengths and limitations of the simulation model, it provides a prelude to the exercises in DBM modelling from real data that becomes possible when data are available on the response of the environmental system to natural or anthropogenically-induced perturbations.

Of course, in this environmental context, the real data required for DBM modelling are most often the result of monitoring studies, rather than planned experimentation. As a result, such data may not be available or, as is often the case, they may provide an insufficient basis for DBM modelling. Even in this data-deficient situation, however, we will see that the DBM modelling methodology can provide valuable insight into the strengths and limitations of the simulation model; insight that can radically effect the way in which the model is used as a tool in planning and management.

Whatever model emerges from the DBM modelling process, it should be a model that is well suited to the objectives of the study team. These may range from ‘what-if’ simulation, where the complexity of the simulation model is a clear advantage, to exercises in forecasting and operational control, where the over-parameterization that normally characterizes the large simulation model is a definite disadvantage. This paper will argue, therefore, that the construction of a single model that suits all purposes is normally impossible and, in any case, undesirable. Rather the objectives of the study should be clearly defined and a well integrated suite of models should be constructed, each designed to satisfy the requirements of these objectives.

It is suggested that, wherever possible, the parametrically efficient (or ‘parsimonious’) DBM model should provide a description of the core mechanisms that dominate the observed behaviour of the environmental system under study. And it should also provide the basis for the final construction of a stochastic simulation model that reflects this core behaviour but may involve other, more speculative elements that are required for ‘what-if’ simulation and planning exercises. The advantage of this DBM ‘moderation’ of the simulation model is that the relative confidence in the historically validated DBM core (when sufficient data are available) can be balanced with the reduced confidence in the more poorly validated speculative elements. Thus the results of any analysis can be better evaluated with these relative uncertainties in mind. For instance, if the model contains nonlinearities that have not been well-validated during DBM modelling over the historical period, but are thought to be of potential importance in the future, then the large level of uncertainty in this regard must be reflected clearly in any predictive application of the model.

The DBM approach to modelling and forecasting has been applied to a number of environmental systems. These include the modelling and control of water quality in rivers (see e.g. Beck and Young, 1975, Young and Beck, 1974, Young et al., 1976, Wallis et al., 1989); the modelling of rainfall-flow processes, within the context of flood forecasting and warning (e.g. see Young, 2002 and the prior references therein); and the modelling and control of mass and energy transfer in agricultural buildings (Price, Young, Berckmans, Janssens, & Taylor, 1999). The present paper describes an environmental case study in a different environmental area which has involved most aspects of this DBM approach: namely, the modelling and control of globally averaged levels of atmospheric CO2 and the possible implications of this research on global warming.

Section snippets

The fundamentals of data-based mechanistic modelling

For many years, the author and his co-workers have attempted to draw attention to the limitations of large deterministic models of the environment (e.g. Young, 1978, Beck, 1983, Young, 1998b, Young, 1999a, Beck, 1983, Young and Lees, 1993, Young et al., 1996, Shackley et al., 1998). In order to alleviate these limitations, the DBM approach to modelling is built on the assumption that, wherever possible, the dynamic modelling of environmental systems should not be based solely on such simulation

Brief review of DBM methods

Since the computational methods used in DBM modelling have been described elsewhere in the cited references, only those of relevance to the case study, as described in the next Section 4, will be outlined in this section. However, these and other DBM modelling algorithms are available in the CAPTAIN Toolbox,1 a suite of algorithms developed at Lancaster for use within the Matlab/Simulink software environment. The primary tools in CAPTAIN, together

Case study: Global carbon cycle modelling, prediction and speculative emission control

This case study is a review of research carried out at Lancaster over the past few years on global carbon cycle dynamics. It is based on two separate studies that, taken together, illustrate well the DBM approach to modelling and operational control/management system design. The main objectives of this study are: (i) to understand better the nature of global carbon cycle dynamics, based on an evaluation of existing simulation models and the analysis of the available globally averaged data; and

Conclusions

This paper has outlined the Data-Based Mechanistic approach to the modelling, forecasting and control of environmental systems. The generic nature of this approach means that it has wide application potential (Young, 1998b). This is demonstrated by the global carbon cycle case study outlined in the paper, as well as previous published applications in various areas of study ranging from ecological (e.g. Young, 2000, Young, 2001b) and biological (e.g. Jarvis et al., 1999, Price et al., 2001),

Peter Young is Emeritus Professor of Environmental Systems at Lancaster University, UK, and Adjunct Professor of Environmental Systems in the Centre for Resource and Environmental Studies at the Australian National University, Canberra. He obtained B.Tech. and M.Sc. degrees at Loughborough University before moving to Cambridge University, where he obtained his Doctoral degree in 1970. Following two years as a civilian scientist, working for the U.S. Navy in California, he was appointed

References (67)

  • P.C. Young et al.

    The modelling and control of water quality in a river system

    Automatica

    (1974)
  • P.C. Young et al.

    The modelling and control of pollution in a river system

    Journal of Hydrology

    (1976)
  • H. Akaike

    A new look at statistical model identification

    IEEE Transactions on Automatic Control

    (1974)
  • M.B. Beck

    Uncertainty, system identification and the prediction of water quality

  • K.J. Beven et al.

    The future of distributed models: model calibration and uncertainty prediction

    Hydrological Processes

    (1992)
  • K.J. Beven et al.

    Comment on ‘Bayesian recursive parameter estimation for hydrologic models’ by M. Thiemann et al

    Water Resources Research

    (2003)
  • A. Dai et al.

    Can climate variability contribute to the missing CO2 sink?

    Global Biogeochem. Cycles

    (1993)
  • Enting, I. G., & Lassey, K. R. (1993). Projections of future CO2. Technical Report 27. Division of Atmospheric...
  • G.D. Farquhar et al.

    The carbon cycle and atmospheric carbon dioxide

  • G.C. Goodwin et al.

    Dynamic system identification: Experiment design and data analysis

    (1977)
  • C.W.J. Granger et al.

    Modelling nonlinear economic relationships

    (1993)
  • G.M. Hornberger et al.

    An approach to the analysis of behaviour and sensitivity in environmental systems analysis of environmental systems

  • R.J. Huggett

    Modelling the human impact on nature: Systems analysis of environmental problems

    (1993)
  • A.J. Jarvis et al.

    An analysis of the dynamic response of stomatal conductance to a reduction in humidity over leaves of Cedrella odorata

    Plant Cell and Environment

    (1999)
  • Jarvis, A. J., Young, P. C., Leedal, D.T., & Chotai, A. A sequential CO2 emissions strategy based on optimal control of...
  • H. Oeschger et al.

    A box diffusion model to study the carbon dioxide exchange in nature

    Tellus

    (1975)
  • N. Oreskes et al.

    Verification, validation, and confirmation of numerical models in the earth sciences

    Science

    (1994)
  • Parkinson, S. (1995). The application of stochastic modelling techniques to global climate change. PhD thesis....
  • S. Parkinson et al.

    Uncertainty and sensitivity in global carbon cycle modeling

    Climate Research

    (1998)
  • D.J. Pedregal et al.

    Statistical approaches to modelling and forecasting time series

  • D.A. Pierce

    Least squares estimation in dynamic disturbance time-series models

    Biometrika

    (1972)
  • K. Popper

    The logic of scientific discovery

    (1959)
  • Cited by (57)

    • Data-driven models for predicting microbial water quality in the drinking water source using E. coli monitoring and hydrometeorological data

      2022, Science of the Total Environment
      Citation Excerpt :

      In addition, a longer time horizon for prediction, e.g. several days, would be advantageous from the water management perspective. Furthermore, combining data-driven methods with process-based modelling of water quality (e.g. Dienus et al., 2016; Whitehead et al., 2018) is a promising approach (Young, 1992, 2006). An optimal forecasting model for managing river water intake should be easily interpreted and make predictions a couple of days ahead based on accessible online data.

    • Real-time modelling of individual weight response to feed supply for fattening pigs

      2019, Computers and Electronics in Agriculture
      Citation Excerpt :

      A data-based mechanistic approach is considered when the model structure and parameters inferred by the data-based model can be interpreted physically in meaningful terms. For instance, a Transfer Function technique that, while being based on a simple model structure, allows coping with non-linear characteristics of the process by estimating the model parameters in real-time, could be useful in this case, as long as the data-based model allows a mechanistic interpretation of its properties (Young, 2006). An essential component of precision feeding systems is the control algorithm and, in particular, the real-time prediction model of individual growth response to different feed compositions during the production cycle.

    • Dynamic modelling of lettuce transpiration for water status monitoring

      2018, Computers and Electronics in Agriculture
      Citation Excerpt :

      System identification is a data-driven modelling approach which is applied in modelling dynamic systems (Chen and Chang, 2008). It has been successfully applied in simplifying and modelling complex environmental and biological processes(Taylor et al., 2007; Young, 2006), predicting time-varying biological responses (Kirchsteiger et al., 2011; Quanten et al., 2006) and in many other irrigation decision support applications (Delgoda et al., 2016; Lozoya et al., 2016). It is extensively applied as part of the fault detection methodologies in the advanced process control industry (Young, 2006).

    View all citing articles on Scopus

    Peter Young is Emeritus Professor of Environmental Systems at Lancaster University, UK, and Adjunct Professor of Environmental Systems in the Centre for Resource and Environmental Studies at the Australian National University, Canberra. He obtained B.Tech. and M.Sc. degrees at Loughborough University before moving to Cambridge University, where he obtained his Doctoral degree in 1970. Following two years as a civilian scientist, working for the U.S. Navy in California, he was appointed University Lecturer in Engineering and a Fellow of Clare Hall, Cambridge University, in 1970. As a result of his novel research on environmental systems, he was invited to become Professorial Fellow and Head of the Systems Group in the new Centre for Resource and Environmental Studies at the Australian National University in 1975. Finally, he moved back to UK in 1981, where he was Head of the Department of Environmental Science for seven years, before becoming Director of the Centre for Research on Environmental Systems and Statistics. He is well known for his work on all aspects of recursive estimation and non-minimal state space methods of control system design. His most recent research has been concerned with data-based mechanistic modelling, adaptive forecasting, signal processing and control for nonstationary and state-dependent parameter, nonlinear stochastic systems. The applications of this methodological research are wide ranging, from the environment, through ecology, biology and engineering to business and macro-economics. Currently, he is leading the research program on real-time flood forecasting in the UK Flood Risk Management Research Consortium.

    View full text