Population Synthesis with Quasirandom Integer Sampling
Authored by Mark Birkin, Andrew P Smith, Robin Lovelace
Date Published: 2017
DOI: 10.18564/jasss.3550
Sponsors:
No sponsors listed
Platforms:
C++
R
Model Documentation:
Other Narrative
Pseudocode
Model Code URLs:
https://github.com/CatchDat/humanleague
Abstract
Established methods for synthesising a population from geographically
aggregated data are robust and well understood. However, most rely on
the potentially detrimental process of integerisation if a
whole-individual population is required, e.g. for use in agent-based
modelling (ABM). This paper describes and investigates the use of
quasirandom sequences to sample populations from known marginal
constraints whilst preserving those marginal distributions. We call this
technique Quasirandom Integer Without-replacement Sampling (QIWS) and
show that the statistical properties of quasirandomly sampled
populations to be superior to those of pseudorandomly sampled ones in
that they tend to yield entropies much closer to populations generated
using the entropy-maximising iterative proportional fitting (IPF)
algorithm. The implementation is extremely efficient, easily
outperforming common IPF implementations. It is freely available as an
open source R package called humanleague. Finally, we suggest how the
current limitations of the implementation can be overcome, providing a
direction for future work.
Tags
Microsimulation
Sampling
Microsynthesis
Quasirandom sequences
Sequence generator