Table of Contents
Fetching ...

The Entropy and Crossentropy of Generalized Mallows Models

Marina Meilă

TL;DR

The paper addresses the challenge of information-theoretic analysis for the Generalized Mallows Model (GMM) over permutations by deriving tractable, closed-form-like expressions for entropy $H$, cross-entropy $\mathrm{XE}$, and KL divergence $D$ through the inversion matrix $Q$ and stagewise statistics $s_r$. It shows that the GMM forms an exponential family in the dispersion parameters $\vec{\theta}$ and that the discrete central permutation $\sigma$ can be handled via sufficient statistics, yielding a practical, polynomial-time recursion to compute necessary expectations $\bar{\mathbf Q}^{\rm up}_k(\sigma,\alpha_{1:r-1})$. The key contributions include a decomposition of $\mathrm{XE}$ into a sum over stages, an explicit expression for $H(\vec{\theta})$ independent of $\sigma$, and a recursive method to compute the averaged submatrices of $Q$, enabling efficient evaluation of divergence quantities. This work enables principled information-theoretic analysis and model comparison for ranking data within GMMs, with potential applications to ML and ranking-system design.

Abstract

The Generalized Mallows Model (GMM) is a well known family of models for ranking data. A GMM is a distribution over $\mathbb{S}_n$, the set of permutations of n objects, characterized by a location parameter $σ\in \mathbb{S}_n$, known as central permutation and a set of dispersion parameters $θ_{1:n-1}\in(0,1]$. The GMM shares many properties, such as having sufficient statistics, with exponential models, thus it can be seen as an exponential family with a discrete parameter $σ$. This paper shows that computing entropy, crossentropy and Kullback-Leibler divergence in the the class of GMM is tractable, paving the way for a better understanding of this exponential family.

The Entropy and Crossentropy of Generalized Mallows Models

TL;DR

The paper addresses the challenge of information-theoretic analysis for the Generalized Mallows Model (GMM) over permutations by deriving tractable, closed-form-like expressions for entropy , cross-entropy , and KL divergence through the inversion matrix and stagewise statistics . It shows that the GMM forms an exponential family in the dispersion parameters and that the discrete central permutation can be handled via sufficient statistics, yielding a practical, polynomial-time recursion to compute necessary expectations . The key contributions include a decomposition of into a sum over stages, an explicit expression for independent of , and a recursive method to compute the averaged submatrices of , enabling efficient evaluation of divergence quantities. This work enables principled information-theoretic analysis and model comparison for ranking data within GMMs, with potential applications to ML and ranking-system design.

Abstract

The Generalized Mallows Model (GMM) is a well known family of models for ranking data. A GMM is a distribution over , the set of permutations of n objects, characterized by a location parameter , known as central permutation and a set of dispersion parameters . The GMM shares many properties, such as having sufficient statistics, with exponential models, thus it can be seen as an exponential family with a discrete parameter . This paper shows that computing entropy, crossentropy and Kullback-Leibler divergence in the the class of GMM is tractable, paving the way for a better understanding of this exponential family.

Paper Structure

This paper contains 15 sections, 6 theorems, 35 equations, 1 algorithm.

Key Result

Proposition 1

where $\bar{\mathbf s}_r={\mathbb E}_{{\rm id},\vec{\alpha}}[s_r(\pi|\sigma)]$.

Theorems & Definitions (12)

  • Example 1
  • Example 2
  • Example 3
  • Proposition 1: Crossentropy as a function of average inversion counts
  • Example 4
  • Proposition 2: Conditional expression for $\bar{\mathbf s}_r$
  • Proposition 3: Matrix expression for $\bar{\mathbf s}_r$
  • Proposition 4: KL-divergence for $GMM$
  • Proposition 5
  • Corollary 6
  • ...and 2 more