Table of Contents
Fetching ...

Linear type conditional specifications for multivariate count variables

Yang Lu, Wei Sun

TL;DR

The paper addresses the compatibility of linear conditional specifications for multivariate count data, focusing on two families—the compound autoregressive model and the random coefficient integer autoregressive model—and their extensions to arbitrary dimensions. It shows that, under full conditional specification, only a handful of non-trivial solutions exist, with a canonical high-dimensional solution given by Poisson–gamma conjugacy; when the requirement is relaxed to linear conditional expectations alone, a much larger semi-parametric class of solutions emerges. The results indicate that for spatial count data, semi-parametric specifications based on linear conditional expectations are more flexible and potentially preferable to latent-factor or Gaussian-based approaches. These findings have practical implications for modeling, estimation, and computation, suggesting that linear-type conditional specifications can be both tractable and sufficiently expressive, and they hint at future exploration of non-linear dependency networks in count data contexts.

Abstract

This paper investigates conditional specifications for multivariate count variables. Recently, the spatial count data literature has proposed several conditional models such that the conditional expectations are linear in the conditioning variables. These models are much easier to estimate than existing spatial count models based on Gaussian random field. However, whether or not such conditional specifications are compatible have not been addressed. We investigate two large families of conditional models, that are the compound autoregressive model and the random coefficient integer autoregressive model. We characterize all the solutions to these two families of models at arbitrary dimensions, and find that only a handful of them admit non-trivial solutions. We then show that if we focus on the linearity condition of the conditional expectations only, a considerable larger family of solutions can be obtained. This suggests that for spatial count data modeling, semi-parametric type specifications that impose the conditional expectation structure is preferable.

Linear type conditional specifications for multivariate count variables

TL;DR

The paper addresses the compatibility of linear conditional specifications for multivariate count data, focusing on two families—the compound autoregressive model and the random coefficient integer autoregressive model—and their extensions to arbitrary dimensions. It shows that, under full conditional specification, only a handful of non-trivial solutions exist, with a canonical high-dimensional solution given by Poisson–gamma conjugacy; when the requirement is relaxed to linear conditional expectations alone, a much larger semi-parametric class of solutions emerges. The results indicate that for spatial count data, semi-parametric specifications based on linear conditional expectations are more flexible and potentially preferable to latent-factor or Gaussian-based approaches. These findings have practical implications for modeling, estimation, and computation, suggesting that linear-type conditional specifications can be both tractable and sufficiently expressive, and they hint at future exploration of non-linear dependency networks in count data contexts.

Abstract

This paper investigates conditional specifications for multivariate count variables. Recently, the spatial count data literature has proposed several conditional models such that the conditional expectations are linear in the conditioning variables. These models are much easier to estimate than existing spatial count models based on Gaussian random field. However, whether or not such conditional specifications are compatible have not been addressed. We investigate two large families of conditional models, that are the compound autoregressive model and the random coefficient integer autoregressive model. We characterize all the solutions to these two families of models at arbitrary dimensions, and find that only a handful of them admit non-trivial solutions. We then show that if we focus on the linearity condition of the conditional expectations only, a considerable larger family of solutions can be obtained. This suggests that for spatial count data modeling, semi-parametric type specifications that impose the conditional expectation structure is preferable.

Paper Structure

This paper contains 29 sections, 14 theorems, 134 equations.

Key Result

Theorem 1

The two conditional distributions $\ell(x \mid y)$ and $\ell(y \mid x)$ are compatible if and only if $i)$ the domains of $\ell(x \mid y)$ and $\ell(y \mid x)$ are identical and $ii)$ there exist suitable functions $u(\cdot)$ and $v(\cdot)$ such that:

Theorems & Definitions (37)

  • Theorem 1: arnold2001conditionally, Theorem 4.1
  • Corollary 1
  • proof
  • Corollary 2
  • proof
  • Remark 1
  • Example 1: Poisson-gamma conjugacy
  • Theorem 2
  • proof
  • Remark 2
  • ...and 27 more