The evolving categories multinomial distribution: introduction with applications to movement ecology and vote transfer
Ricardo Carrizo Vergara, Marc Kéry, Trevor Hefley
TL;DR
The paper introduces the evolving categories multinomial (ECM) distribution for multivariate counts observed over time with evolving category structures, plus the ECM-Poisson variant for unknown total abundance $N$. It develops two practical inference strategies—Gaussian pseudo-likelihood (MGLE) and pairwise composite likelihood (MCLE)—that rely only on first- and second-order moment information, enabling estimation when full likelihoods are intractable. The ECM framework is demonstrated in movement ecology (space-time counts and OU movement, and translocated lesser prairie-chickens) and sociological/ecological inference (vote transfer in Chile's 2021 election), including a Chilean case study using MGLE. Key findings show consistent estimation as the number of individuals or rate $\lambda$ grows, that MCLE often outperforms MGLE at smaller sample sizes, and that the ECM structure provides a principled link between movement, counts, and evolving categories with applicability beyond the illustrated domains.
Abstract
We introduce the evolving categories multinomial (ECM) distribution for multivariate count data taken over time. This distribution models the counts of individuals following iid stochastic dynamics among categories, with the number and identity of the categories also evolving over time. We specify the one-time and two-times marginal distributions of the counts and the first and second order moments. When the total number of individuals is unknown, placing a Poisson prior on it yields a new distribution (ECM-Poisson), whose main properties we also describe. Since likelihoods are intractable or impractical, we propose two estimating functions for parameter estimation: a Gaussian pseudo-likelihood and a pairwise composite likelihood. We show two application scenarios: the inference of movement parameters of animals moving continuously in space-time with irregular survey regions, and the inference of vote transfer in two-rounds elections. We give three illustrations: a simulation study with Ornstein-Uhlenbeck moving individuals, paying special attention to the autocorrelation parameter; the inference of movement and behavior parameters of lesser prairie-chickens; and the estimation of vote transfer in the 2021 Chilean presidential election.
