Gaussian process emulation of an individual-based model simulation of microbial communities
Authored by S P Rushton, O K Oyebamiji, D J Wilkinson, P G Jayathilake, T P Curtis, B Li, P Gupta
Date Published: 2017
DOI: 10.1016/j.jocs.2017.08.006
Sponsors:
United Kingdom Engineering and Physical Sciences Research Council (EPSRC)
Platforms:
C++
LAMMPS
Model Documentation:
Other Narrative
Mathematical description
Model Code URLs:
https://github.com/nufeb/NUFEB/releases
Abstract
The ability to make credible simulations of open engineered biological
systems is an important step towards the application of scientific
knowledge to solve real-world problems in this challenging, complex
engineering domain. An important application of this type of knowledge
is in the design and management of wastewater treatment systems. One of
the crucial aspects of an engineering biology approach to wastewater
treatment study is the ability to run a simulation of complex biological
communities. However, the simulation of open biological systems is
challenging because they often involve a large number of bacteria that
ranges from order 10(12) (a baby's microbiome) to 10(18) (a wastewater
treatment plant) individual particles, and are physically complex. Since
the models are computationally expensive, and due to computing
constraints, the consideration of only a limited set of scenarios is
often possible. A simplified approach to this problem is to use a
statistical approximation of the simulation ensembles derived from the
complex models at a fine scale which will help in reducing the
computational burden. Our aim in this paper is to build a cheaper
surrogate of an individual-based (IS) model simulation of microbial
communities. The paper focuses on how to use an emulator as an effective
tool for studying and incorporating microscale processes in a
computationally efficient way into macroscale models. The main issue we
address is a strategy for emulating high-level summaries from the IB
model simulation data. We use a Gaussian process regression model for
the emulation. Under cross-validation, the percentage of variance
explained for the univariate emulator ranges from 83-99\% and 87-99\%
for the multivariate emulators, and for both biofilms and floc. Our
emulators show an approximately 220-fold increase in computational
efficiency. The sensitivity analyses indicated that substrate nutrient
concentration for nitrate, carbon, nitrite and oxygen as well as the
maximum growth rate for heterotrophic bacteria are the most important
parameters for the predictions. We observe that the performance of the
single step emulator depends hugely on the initial conditions and sample
size taken for the normal approximation. We believe that the development
of an emulator for an IB model is of strategic importance for using
microscale understanding to enable macroscale problem solving. (C) 2017
The Authors. Published by Elsevier B.V.
Tags
individual-based models
Design
Dynamics
calibration
Emulator
Sensitivity-analysis
Bayesian-approach
Biofilms
Multivariate gaussian process
Flocs
Computer codes
Kriging models
Multivariate
Outputs