Learning Smooth Distance Functions via Queries
Akash Kumar, Sanjoy Dasgupta
TL;DR
This work studies learning smooth distance functions from triplet queries, introducing two approximation regimes: an $\omega$-additive and a $(1+\omega)$-multiplicative notion of triplet-equivalence. It develops a global-cover approach for additive accuracy and a hybrid global-local strategy using local Mahalanobis surrogates to achieve multiplicative accuracy, with formal guarantees on query complexity that scale with cover sizes and ambient dimension. The framework covers non-Mahalanobis smooth distances by leveraging smoothness (via Hessians) and a finite-cover extension, and it provides concrete algorithms for learning both general distance functions and Mahalanobis distances from triplet feedback. The results highlight a principled path to interactive metric learning applicable to personalized retrieval, embedding construction, and geometry inference under limited feedback. Overall, the paper advances query-efficient methods for learning distance structures by combining global covers, local linearizations, and careful complexity analyses to handle both additive and multiplicative approximation goals in high-dimensional spaces.
Abstract
In this work, we investigate the problem of learning distance functions within the query-based learning framework, where a learner is able to pose triplet queries of the form: ``Is $x_i$ closer to $x_j$ or $x_k$?'' We establish formal guarantees on the query complexity required to learn smooth, but otherwise general, distance functions under two notions of approximation: $ω$-additive approximation and $(1 + ω)$-multiplicative approximation. For the additive approximation, we propose a global method whose query complexity is quadratic in the size of a finite cover of the sample space. For the (stronger) multiplicative approximation, we introduce a method that combines global and local approaches, utilizing multiple Mahalanobis distance functions to capture local geometry. This method has a query complexity that scales quadratically with both the size of the cover and the ambient space dimension of the sample space.
