LORE: Jointly Learning the Intrinsic Dimensionality and Relative Similarity Structure From Ordinal Data
Vivek Anand, Alec Helbling, Mark Davenport, Gordon Berman, Sankar Alagapan, Christopher Rozell
TL;DR
LORE addresses the problem of learning subjective perceptual spaces from ordinal triplets by jointly inferring the embedding and its intrinsic dimensionality. It couples a smoothed triplet loss with a nonconvex Schatten-$p$ quasi-norm regularizer, optimized via an iteratively reweighted scheme that provably converges to a stationary point. Across synthetic, simulated, and real crowdsourced data, LORE achieves high triplet accuracy while recovering a low, interpretable intrinsic rank, outperforming baselines in rank recovery and producing semantically meaningful axes. This approach enables scalable, data-efficient perceptual modeling with interpretable latent structure for psychophysics and related domains.
Abstract
Learning the intrinsic dimensionality of subjective perceptual spaces such as taste, smell, or aesthetics from ordinal data is a challenging problem. We introduce LORE (Low Rank Ordinal Embedding), a scalable framework that jointly learns both the intrinsic dimensionality and an ordinal embedding from noisy triplet comparisons of the form, "Is A more similar to B than C?". Unlike existing methods that require the embedding dimension to be set apriori, LORE regularizes the solution using the nonconvex Schatten-$p$ quasi norm, enabling automatic joint recovery of both the ordinal embedding and its dimensionality. We optimize this joint objective via an iteratively reweighted algorithm and establish convergence guarantees. Extensive experiments on synthetic datasets, simulated perceptual spaces, and real world crowdsourced ordinal judgements show that LORE learns compact, interpretable and highly accurate low dimensional embeddings that recover the latent geometry of subjective percepts. By simultaneously inferring both the intrinsic dimensionality and ordinal embeddings, LORE enables more interpretable and data efficient perceptual modeling in psychophysics and opens new directions for scalable discovery of low dimensional structure from ordinal data in machine learning.
