A Comparison of Agent-Based Models and the Parametric G-Formula for Causal Inference

Authored by Miguel A Hernan, Kenneth A Freedberg, III George R Seage, Eleanor J Murray, James M Robins

Date Published: 2017

DOI: 10.1093/aje/kwx091

Sponsors: United States National Institutes of Health (NIH)

Platforms: No platforms listed

Model Documentation: Other Narrative

Model Code URLs: Model code not found

Abstract

Decision-making requires choosing from treatments on the basis of correctly estimated outcome distributions under each treatment. In the absence of randomized trials, 2 possible approaches are the parametric g-formula and agent-based models (ABMs). The g-formula has been used exclusively to estimate effects in the population from which data were collected, whereas ABMs are commonly used to estimate effects in multiple populations, necessitating stronger assumptions. Here, we describe potential biases that arise when ABM assumptions do not hold. To do so, we estimated 12-month mortality risk in simulated populations differing in prevalence of an unknown common cause of mortality and a time-varying confounder. The ABM and g-formula correctly estimated mortality and causal effects when all inputs were from the target population. However, whenever any inputs came from another population, the ABM gave biased estimates of mortality-and often of causal effects even when the true effect was null. In the absence of unmeasured confounding and model misspecification, both methods produce valid causal inferences for a given population when all inputs are from that population. However, ABMs may result in bias when extrapolated to populations that differ on the distribution of unmeasured outcome determinants, even when the causal network linking variables is identical.
Tags
Agent-based models Epidemiology mathematical models Monte Carlo methods Parametric g-formula Causal inference Decision analysis Individual-level models Medical decision making