Metric geometry for ranking-based voting: Tools for learning electoral structure
Moon Duchin, Kristopher Tapp
TL;DR
This work develops a metric geometry for ranking-based voting by extending the Kendall tau and Spearman footrule distances to incomplete ballots through two coordinate embeddings, the Borda embedding and the head-to-head embedding, which yield distances $d_B$ and $d_H$ that capture global ranking structure. It introduces ballot graphs and generalized ballot graphs that realize these distances as path metrics, enabling both coordinate-based and graph-based analyses of partial ballots and slate-level preferences. The authors show how to identify voter blocs and candidate slates by clustering in the induced spaces, validate the approach on synthetic elections and on real-world Scottish elections, and connect polarization and proportionality metrics to the learned clusters. The resulting structure is interpretable and robust across embeddings and clustering methods, and open data and code are provided for replication.
Abstract
In this paper, we develop the metric geometry of ranking statistics, proving that the two major permutation distances in the statistics literature -- Kendall tau and Spearman footrule -- extend naturally to incomplete rankings with both coordinate embeddings and graph realizations. This gives us a unifying framework that allows us to connect popular topics in computational social choice: metric preferences (and metric distortion), polarization, and proportionality. As an important application, the metric structure enables efficient identification of blocs of voters and slates of their preferred candidates. Since the definitions work for partial ballots, we can execute the methods not only on synthetic elections, but on a suite of real-world elections. This gives us robust clustering methods that often produce an identical grouping of voters -- even though one family of methods is based on a Condorcet-consistent ranking rule while the other is not.
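For orientation, the two classical permutation distances named above can be sketched for complete rankings as follows. This is a minimal illustration under stated assumptions, not the authors' code: it covers only full ballots (every candidate ranked, no ties) and does not implement the extension to partial ballots or the distances $d_B$ and $d_H$. The function names and the list-of-candidates ballot format are hypothetical choices made for this example.

```python
from itertools import combinations


def positions(ranking):
    """Map each candidate to its 0-based position in a complete ranking."""
    return {cand: i for i, cand in enumerate(ranking)}


def kendall_tau(r1, r2):
    """Kendall tau distance: number of candidate pairs ordered oppositely.

    Assumes r1 and r2 are complete rankings of the same candidates.
    """
    p1, p2 = positions(r1), positions(r2)
    return sum(
        1
        for a, b in combinations(r1, 2)
        if (p1[a] - p1[b]) * (p2[a] - p2[b]) < 0
    )


def spearman_footrule(r1, r2):
    """Spearman footrule: total absolute displacement of candidate positions.

    Assumes r1 and r2 are complete rankings of the same candidates.
    """
    p1, p2 = positions(r1), positions(r2)
    return sum(abs(p1[c] - p2[c]) for c in r1)


if __name__ == "__main__":
    ballot_a = ["A", "B", "C", "D"]
    ballot_b = ["B", "A", "D", "C"]  # two adjacent swaps away from ballot_a
    print(kendall_tau(ballot_a, ballot_b))        # 2
    print(spearman_footrule(ballot_a, ballot_b))  # 4
```

On complete rankings the two distances are related by the classical Diaconis-Graham inequality, Kendall tau ≤ footrule ≤ 2 × Kendall tau, which the example values (2 and 4) satisfy.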
