Symmetric Linear Bandits with Hidden Symmetry

Nam Phuong Tran; The Anh Ta; Debmalya Mandal; Long Tran-Thanh

Symmetric Linear Bandits with Hidden Symmetry

Nam Phuong Tran, The Anh Ta, Debmalya Mandal, Long Tran-Thanh

TL;DR

This work studies high-dimensional linear stochastic bandits with hidden symmetry, where the reward is invariant under a hidden subgroup $\mathcal{G}\le\mathcal{S}_d$ acting by coordinate permutations. It shows that merely knowing the existence of a symmetry offers no benefit unless structure is restricted, and then casts learning the symmetry as model selection over a structured collection of low-dimensional fixed-point subspaces. The proposed Explore-Models-then-Commit (EMC) algorithm achieves regret $O\big(d_0^{2/3}T^{2/3}\log d\big)$, and, under a well-separated partition assumption, improves to $O\big(d_0\sqrt{T}\log d\big)$, by leveraging a subspace lattice and restricted isometry properties. Experiments demonstrate competitive performance across sparsity and partition-structured regimes, highlighting the approach’s ability to break the curse of dimensionality when hidden symmetries can be learned online. The results provide a new perspective on leveraging symmetry as an inductive bias in sequential decision-making, with potential impact on large-scale, structured learning problems.

Abstract

High-dimensional linear bandits with low-dimensional structure have received considerable attention in recent studies due to their practical significance. The most common structure in the literature is sparsity. However, it may not be available in practice. Symmetry, where the reward is invariant under certain groups of transformations on the set of arms, is another important inductive bias in the high-dimensional case that covers many standard structures, including sparsity. In this work, we study high-dimensional symmetric linear bandits where the symmetry is hidden from the learner, and the correct symmetry needs to be learned in an online setting. We examine the structure of a collection of hidden symmetry and provide a method based on model selection within the collection of low-dimensional subspaces. Our algorithm achieves a regret bound of $ O(d_0^{2/3} T^{2/3} \log(d))$, where $d$ is the ambient dimension which is potentially very large, and $d_0$ is the dimension of the true low-dimensional subspace such that $d_0 \ll d$. With an extra assumption on well-separated models, we can further improve the regret to $ O(d_0\sqrt{T\log(d)} )$.

Symmetric Linear Bandits with Hidden Symmetry

TL;DR

This work studies high-dimensional linear stochastic bandits with hidden symmetry, where the reward is invariant under a hidden subgroup

acting by coordinate permutations. It shows that merely knowing the existence of a symmetry offers no benefit unless structure is restricted, and then casts learning the symmetry as model selection over a structured collection of low-dimensional fixed-point subspaces. The proposed Explore-Models-then-Commit (EMC) algorithm achieves regret

, and, under a well-separated partition assumption, improves to

, by leveraging a subspace lattice and restricted isometry properties. Experiments demonstrate competitive performance across sparsity and partition-structured regimes, highlighting the approach’s ability to break the curse of dimensionality when hidden symmetries can be learned online. The results provide a new perspective on leveraging symmetry as an inductive bias in sequential decision-making, with potential impact on large-scale, structured learning problems.

Abstract

, where

is the ambient dimension which is potentially very large, and

is the dimension of the true low-dimensional subspace such that

. With an extra assumption on well-separated models, we can further improve the regret to

Paper Structure (38 sections, 19 theorems, 65 equations, 6 figures, 4 algorithms)

This paper contains 38 sections, 19 theorems, 65 equations, 6 figures, 4 algorithms.

Introduction
Related Work
Problem Setting
Group and group action.
Impossibility Result of Learning with General Hidden Subgroups
Fixed Point Subspace and Partition
Fixed-point subspaces.
Set partition.
Impossibility Result
Problem with known symmetry.
From the setting with hidden subgroup to the setting with hidden set partition.
The Case of Hidden Subgroups with Subexponential Size
The Explore-Models-then-Commit Algorithm
Regret Analysis
Assumptions
...and 23 more sections

Key Result

Proposition 0

There is a bijection $\mathbf{H}$ between $\mathcal{P}_d$ and $\mathcal{F}_{\mathcal{S}_d}$.

Figures (6)

Figure 1: Partition that respects the underlying ordered tree.
Figure 2: Regret of EMC (Algorithm \ref{['alg: Explore Model Selection then Commit']}) and of ESTC proposed in Hao2020_SparseLinBandit_PoorRegime, in cases of sparsity, non-crossing partitions, and non-nesting partitions.
Figure 3: Regret of EMC (Algorithm \ref{['alg: Explore Model Selection then Commit']}) and of ESTC proposed in Hao2020_SparseLinBandit_PoorRegime, in cases of sparsity, non-crossing partitions, and non-nesting partitions, with $d = 40,\; d_0 = 4$.
Figure 4: Regret of EMC (Algorithm \ref{['alg: Explore Model Selection then Commit']}) and of ESTC proposed in Hao2020_SparseLinBandit_PoorRegime, in cases of sparsity, non-crossing partitions, and non-nesting partitions, with $d = 80,\; d_0 = 10$.
Figure : Explore Models then Commit
...and 1 more figures

Theorems & Definitions (38)

Proposition 0
Proposition 1: Bodi2009_FixedPointSubspaceProperties's Theorem 14
Proposition 1
Remark 2
Remark 5: Equivalence between sparsity and interval partition
Definition 6: Exploratory distribution
Theorem 7: Regret upper bound
Lemma 7
Remark 8: Non-triviality of Lemma \ref{['Lem: Exploration Error']}
Proposition 8
...and 28 more

Symmetric Linear Bandits with Hidden Symmetry

TL;DR

Abstract

Symmetric Linear Bandits with Hidden Symmetry

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (6)

Theorems & Definitions (38)