Lower Bound Techniques in the Comparison-Query Model and Inversion Minimization on Trees
Ivan Hu, Dieter van Melkebeek, Andrew Morgan
TL;DR
The paper studies the fundamental query-complexity limits for inversion minimization on rooted trees under the comparison-query model, showing that for many trees the problem is nearly as hard as sorting. It introduces two novel lower-bound techniques—the Sensitivity Lemma and the Connectivity Lemma—framed via the permutahedron and adjacent-rank transpositions, and derives tight bounds that depend on tree shape, notably a Gamma-function based bound for general trees and near-sorting bounds for binary and certain bounded-degree trees. It further analyzes the Mann–Whitney cross-inversion statistic, proving Omega(n log n) behavior in balanced cases and providing detailed distributional bounds for cross inversions via the characteristic function, Gaussian polynomials, and a pole-reduction method. The study extends to randomized and bounded-error settings, showing how average sensitivity translates into robust lower bounds, and links these insights to practical tasks like cross-inversion counting and Wilcoxon-type tests. Overall, the work unifies several lower-bound techniques for a class of optimization problems on trees, clarifying when inversion-minimization is information-theoretically as hard as sorting and when structural constraints yield markedly lower query complexity.
Abstract
Given a rooted tree and a ranking of its leaves, what is the minimum number of inversions of the leaves that can be attained by ordering the tree? This variation of the problem of counting inversions in arrays originated in mathematical psychology, with the evaluation of the Mann--Whitney statistic for detecting differences between distributions as a special case. We study the complexity of the problem in the comparison-query model, used for problems like sorting and selection. For many types of trees with $n$ leaves, we establish lower bounds close to the strongest known in the model, namely the lower bound of $\log_2(n!)$ for sorting $n$ items. We show: (a) $\log_2((α(1-α)n)!) - O(\log n)$ queries are needed whenever the tree has a subtree that contains a fraction $α$ of the leaves. This implies a lower bound of $\log_2((\frac{k}{(k+1)^2}n)!) - O(\log n)$ for trees of degree $k$. (b) $\log_2(n!) - O(\log n)$ queries are needed in case the tree is binary. (c) $\log_2(n!) - O(k \log k)$ queries are needed for certain classes of trees of degree $k$, including perfect trees with even $k$. The lower bounds are obtained by developing two novel techniques for a generic problem $Π$ in the comparison-query model and applying them to inversion minimization on trees. Both techniques can be described in terms of the Cayley graph of the symmetric group with adjacent-rank transpositions as the generating set. Consider the subgraph consisting of the edges between vertices with the same value under $Π$. We show that the size of any decision tree for $Π$ must be at least: (i) the number of connected components of this subgraph, and (ii) the factorial of the average degree of the complementary subgraph, divided by $n$. Lower bounds on query complexity then follow by taking the base-2 logarithm.
