Table of Contents
Fetching ...

Symmetric Linear Bandits with Hidden Symmetry

Nam Phuong Tran, The Anh Ta, Debmalya Mandal, Long Tran-Thanh

TL;DR

This work studies high-dimensional linear stochastic bandits with hidden symmetry, where the reward is invariant under a hidden subgroup $\mathcal{G}\le\mathcal{S}_d$ acting by coordinate permutations. It shows that merely knowing the existence of a symmetry offers no benefit unless structure is restricted, and then casts learning the symmetry as model selection over a structured collection of low-dimensional fixed-point subspaces. The proposed Explore-Models-then-Commit (EMC) algorithm achieves regret $O\big(d_0^{2/3}T^{2/3}\log d\big)$, and, under a well-separated partition assumption, improves to $O\big(d_0\sqrt{T}\log d\big)$, by leveraging a subspace lattice and restricted isometry properties. Experiments demonstrate competitive performance across sparsity and partition-structured regimes, highlighting the approach’s ability to break the curse of dimensionality when hidden symmetries can be learned online. The results provide a new perspective on leveraging symmetry as an inductive bias in sequential decision-making, with potential impact on large-scale, structured learning problems.

Abstract

High-dimensional linear bandits with low-dimensional structure have received considerable attention in recent studies due to their practical significance. The most common structure in the literature is sparsity. However, it may not be available in practice. Symmetry, where the reward is invariant under certain groups of transformations on the set of arms, is another important inductive bias in the high-dimensional case that covers many standard structures, including sparsity. In this work, we study high-dimensional symmetric linear bandits where the symmetry is hidden from the learner, and the correct symmetry needs to be learned in an online setting. We examine the structure of a collection of hidden symmetry and provide a method based on model selection within the collection of low-dimensional subspaces. Our algorithm achieves a regret bound of $ O(d_0^{2/3} T^{2/3} \log(d))$, where $d$ is the ambient dimension which is potentially very large, and $d_0$ is the dimension of the true low-dimensional subspace such that $d_0 \ll d$. With an extra assumption on well-separated models, we can further improve the regret to $ O(d_0\sqrt{T\log(d)} )$.

Symmetric Linear Bandits with Hidden Symmetry

TL;DR

This work studies high-dimensional linear stochastic bandits with hidden symmetry, where the reward is invariant under a hidden subgroup acting by coordinate permutations. It shows that merely knowing the existence of a symmetry offers no benefit unless structure is restricted, and then casts learning the symmetry as model selection over a structured collection of low-dimensional fixed-point subspaces. The proposed Explore-Models-then-Commit (EMC) algorithm achieves regret , and, under a well-separated partition assumption, improves to , by leveraging a subspace lattice and restricted isometry properties. Experiments demonstrate competitive performance across sparsity and partition-structured regimes, highlighting the approach’s ability to break the curse of dimensionality when hidden symmetries can be learned online. The results provide a new perspective on leveraging symmetry as an inductive bias in sequential decision-making, with potential impact on large-scale, structured learning problems.

Abstract

High-dimensional linear bandits with low-dimensional structure have received considerable attention in recent studies due to their practical significance. The most common structure in the literature is sparsity. However, it may not be available in practice. Symmetry, where the reward is invariant under certain groups of transformations on the set of arms, is another important inductive bias in the high-dimensional case that covers many standard structures, including sparsity. In this work, we study high-dimensional symmetric linear bandits where the symmetry is hidden from the learner, and the correct symmetry needs to be learned in an online setting. We examine the structure of a collection of hidden symmetry and provide a method based on model selection within the collection of low-dimensional subspaces. Our algorithm achieves a regret bound of , where is the ambient dimension which is potentially very large, and is the dimension of the true low-dimensional subspace such that . With an extra assumption on well-separated models, we can further improve the regret to .
Paper Structure (38 sections, 19 theorems, 65 equations, 6 figures, 4 algorithms)

This paper contains 38 sections, 19 theorems, 65 equations, 6 figures, 4 algorithms.

Key Result

Proposition 0

There is a bijection $\mathbf{H}$ between $\mathcal{P}_d$ and $\mathcal{F}_{\mathcal{S}_d}$.

Figures (6)

  • Figure 1: Partition that respects the underlying ordered tree.
  • Figure 2: Regret of EMC (Algorithm \ref{['alg: Explore Model Selection then Commit']}) and of ESTC proposed in Hao2020_SparseLinBandit_PoorRegime, in cases of sparsity, non-crossing partitions, and non-nesting partitions.
  • Figure 3: Regret of EMC (Algorithm \ref{['alg: Explore Model Selection then Commit']}) and of ESTC proposed in Hao2020_SparseLinBandit_PoorRegime, in cases of sparsity, non-crossing partitions, and non-nesting partitions, with $d = 40,\; d_0 = 4$.
  • Figure 4: Regret of EMC (Algorithm \ref{['alg: Explore Model Selection then Commit']}) and of ESTC proposed in Hao2020_SparseLinBandit_PoorRegime, in cases of sparsity, non-crossing partitions, and non-nesting partitions, with $d = 80,\; d_0 = 10$.
  • Figure : Explore Models then Commit
  • ...and 1 more figures

Theorems & Definitions (38)

  • Proposition 0
  • Proposition 1: Bodi2009_FixedPointSubspaceProperties's Theorem 14
  • Proposition 1
  • Remark 2
  • Remark 5: Equivalence between sparsity and interval partition
  • Definition 6: Exploratory distribution
  • Theorem 7: Regret upper bound
  • Lemma 7
  • Remark 8: Non-triviality of Lemma \ref{['Lem: Exploration Error']}
  • Proposition 8
  • ...and 28 more