Table of Contents
Fetching ...

Lower Bound Techniques in the Comparison-Query Model and Inversion Minimization on Trees

Ivan Hu, Dieter van Melkebeek, Andrew Morgan

TL;DR

The paper studies the fundamental query-complexity limits for inversion minimization on rooted trees under the comparison-query model, showing that for many trees the problem is nearly as hard as sorting. It introduces two novel lower-bound techniques—the Sensitivity Lemma and the Connectivity Lemma—framed via the permutahedron and adjacent-rank transpositions, and derives tight bounds that depend on tree shape, notably a Gamma-function based bound for general trees and near-sorting bounds for binary and certain bounded-degree trees. It further analyzes the Mann–Whitney cross-inversion statistic, proving Omega(n log n) behavior in balanced cases and providing detailed distributional bounds for cross inversions via the characteristic function, Gaussian polynomials, and a pole-reduction method. The study extends to randomized and bounded-error settings, showing how average sensitivity translates into robust lower bounds, and links these insights to practical tasks like cross-inversion counting and Wilcoxon-type tests. Overall, the work unifies several lower-bound techniques for a class of optimization problems on trees, clarifying when inversion-minimization is information-theoretically as hard as sorting and when structural constraints yield markedly lower query complexity.

Abstract

Given a rooted tree and a ranking of its leaves, what is the minimum number of inversions of the leaves that can be attained by ordering the tree? This variation of the problem of counting inversions in arrays originated in mathematical psychology, with the evaluation of the Mann--Whitney statistic for detecting differences between distributions as a special case. We study the complexity of the problem in the comparison-query model, used for problems like sorting and selection. For many types of trees with $n$ leaves, we establish lower bounds close to the strongest known in the model, namely the lower bound of $\log_2(n!)$ for sorting $n$ items. We show: (a) $\log_2((α(1-α)n)!) - O(\log n)$ queries are needed whenever the tree has a subtree that contains a fraction $α$ of the leaves. This implies a lower bound of $\log_2((\frac{k}{(k+1)^2}n)!) - O(\log n)$ for trees of degree $k$. (b) $\log_2(n!) - O(\log n)$ queries are needed in case the tree is binary. (c) $\log_2(n!) - O(k \log k)$ queries are needed for certain classes of trees of degree $k$, including perfect trees with even $k$. The lower bounds are obtained by developing two novel techniques for a generic problem $Π$ in the comparison-query model and applying them to inversion minimization on trees. Both techniques can be described in terms of the Cayley graph of the symmetric group with adjacent-rank transpositions as the generating set. Consider the subgraph consisting of the edges between vertices with the same value under $Π$. We show that the size of any decision tree for $Π$ must be at least: (i) the number of connected components of this subgraph, and (ii) the factorial of the average degree of the complementary subgraph, divided by $n$. Lower bounds on query complexity then follow by taking the base-2 logarithm.

Lower Bound Techniques in the Comparison-Query Model and Inversion Minimization on Trees

TL;DR

The paper studies the fundamental query-complexity limits for inversion minimization on rooted trees under the comparison-query model, showing that for many trees the problem is nearly as hard as sorting. It introduces two novel lower-bound techniques—the Sensitivity Lemma and the Connectivity Lemma—framed via the permutahedron and adjacent-rank transpositions, and derives tight bounds that depend on tree shape, notably a Gamma-function based bound for general trees and near-sorting bounds for binary and certain bounded-degree trees. It further analyzes the Mann–Whitney cross-inversion statistic, proving Omega(n log n) behavior in balanced cases and providing detailed distributional bounds for cross inversions via the characteristic function, Gaussian polynomials, and a pole-reduction method. The study extends to randomized and bounded-error settings, showing how average sensitivity translates into robust lower bounds, and links these insights to practical tasks like cross-inversion counting and Wilcoxon-type tests. Overall, the work unifies several lower-bound techniques for a class of optimization problems on trees, clarifying when inversion-minimization is information-theoretically as hard as sorting and when structural constraints yield markedly lower query complexity.

Abstract

Given a rooted tree and a ranking of its leaves, what is the minimum number of inversions of the leaves that can be attained by ordering the tree? This variation of the problem of counting inversions in arrays originated in mathematical psychology, with the evaluation of the Mann--Whitney statistic for detecting differences between distributions as a special case. We study the complexity of the problem in the comparison-query model, used for problems like sorting and selection. For many types of trees with leaves, we establish lower bounds close to the strongest known in the model, namely the lower bound of for sorting items. We show: (a) queries are needed whenever the tree has a subtree that contains a fraction of the leaves. This implies a lower bound of for trees of degree . (b) queries are needed in case the tree is binary. (c) queries are needed for certain classes of trees of degree , including perfect trees with even . The lower bounds are obtained by developing two novel techniques for a generic problem in the comparison-query model and applying them to inversion minimization on trees. Both techniques can be described in terms of the Cayley graph of the symmetric group with adjacent-rank transpositions as the generating set. Consider the subgraph consisting of the edges between vertices with the same value under . We show that the size of any decision tree for must be at least: (i) the number of connected components of this subgraph, and (ii) the factorial of the average degree of the complementary subgraph, divided by . Lower bounds on query complexity then follow by taking the base-2 logarithm.
Paper Structure (42 sections, 35 theorems, 117 equations, 18 figures, 2 algorithms)

This paper contains 42 sections, 35 theorems, 117 equations, 18 figures, 2 algorithms.

Key Result

Theorem 5

Let $T$ be a tree with $n$ leaves, and $v$ a node with $\lvert\mathop{\mathrm{L}}\nolimits(T_v)\rvert = \ell$. The query complexity of inversion minimization on $T$ is at least $\log_2(\Gamma( \frac{\ell(n-\ell)}{n}+1 ))$. In particular, the complexity is at least $\log_2(\Gamma(\frac{k}{(k+1)^2} \c

Figures (18)

  • Figure 1: Classification of body parts
  • Figure 2: Mann--Whitney instance
  • Figure 3: Rank instance
  • Figure 4: Permutahedron for $n=4$ items
  • Figure 5: Ranking encoding
  • ...and 13 more figures

Theorems & Definitions (106)

  • Definition 1: ranking, inversion, $\mathop{\mathrm{Inv}}\nolimits_{\cdot}(\cdot)$
  • Definition 2: cross inversions, $\mathop{\mathrm{XInv}}\nolimits_{\cdot}(\cdot,\cdot)$
  • Definition 3: inversion minimization on trees, $\mathop{\mathrm{MinInv}}\nolimits(\cdot,\cdot)$, $\Pi_{\cdot}$
  • Definition 4: leaf set, $\mathop{\mathrm{L}}\nolimits(\cdot)$, and subtree
  • Theorem 5: lower bound for general trees
  • Theorem 6: lower bound for binary trees
  • Definition 7: leaf child set, $\mathop{\mathrm{LC}}\nolimits(\cdot)$
  • Theorem 8: lower bound for restricted classes
  • Theorem 9: lower bound for counting cross inversions
  • Definition 10: computational problem and algorithm in the comparison-query model
  • ...and 96 more