A Comparison of Agent-Based Models and the Parametric G-Formula for Causal Inference
Authored by Miguel A Hernan, Kenneth A Freedberg, III George R Seage, Eleanor J Murray, James M Robins
Date Published: 2017
DOI: 10.1093/aje/kwx091
Sponsors:
United States National Institutes of Health (NIH)
Platforms:
No platforms listed
Model Documentation:
Other Narrative
Model Code URLs:
Model code not found
Abstract
Decision-making requires choosing from treatments on the basis of
correctly estimated outcome distributions under each treatment. In the
absence of randomized trials, 2 possible approaches are the parametric
g-formula and agent-based models (ABMs). The g-formula has been used
exclusively to estimate effects in the population from which data were
collected, whereas ABMs are commonly used to estimate effects in
multiple populations, necessitating stronger assumptions. Here, we
describe potential biases that arise when ABM assumptions do not hold.
To do so, we estimated 12-month mortality risk in simulated populations
differing in prevalence of an unknown common cause of mortality and a
time-varying confounder. The ABM and g-formula correctly estimated
mortality and causal effects when all inputs were from the target
population. However, whenever any inputs came from another population,
the ABM gave biased estimates of mortality-and often of causal effects
even when the true effect was null. In the absence of unmeasured
confounding and model misspecification, both methods produce valid
causal inferences for a given population when all inputs are from that
population. However, ABMs may result in bias when extrapolated to
populations that differ on the distribution of unmeasured outcome
determinants, even when the causal network linking variables is
identical.
Tags
Agent-based models
Epidemiology
mathematical models
Monte Carlo methods
Parametric g-formula
Causal inference
Decision analysis
Individual-level models
Medical decision making