A Unified Regularization Approach to High-Dimensional Generalized Tensor Bandits
Jiannan Li, Yiyang Yang, Yao Wang, Shaojie Tang
TL;DR
This work tackles high-dimensional, high-order tensor bandits under generalized linear rewards by introducing G-ELTC, a unified framework that regularizes parameter estimation with weakly decomposable norms tailored to tensor structures. Leveraging Tucker decomposition, the authors derive structure-aware regret bounds that improve over existing tensor methods, covering tensor low-rankness, slice sparsity, and extensions to other structures. A parameter-estimation mechanism under GLMs connects estimation error to regret, with a novel analysis approach based on generic chaining. The framework is validated through experiments demonstrating sublinear regret and favorable comparisons to baselines across multiple structured settings, highlighting its practicality for complex, high-dimensional decision problems.
Abstract
Modern decision-making scenarios often involve data that is both high-dimensional and rich in higher-order contextual information, where existing bandits algorithms fail to generate effective policies. In response, we propose in this paper a generalized linear tensor bandits algorithm designed to tackle these challenges by incorporating low-dimensional tensor structures, and further derive a unified analytical framework of the proposed algorithm. Specifically, our framework introduces a convex optimization approach with the weakly decomposable regularizers, enabling it to not only achieve better results based on the tensor low-rankness structure assumption but also extend to cases involving other low-dimensional structures such as slice sparsity and low-rankness. The theoretical analysis shows that, compared to existing low-rankness tensor result, our framework not only provides better bounds but also has a broader applicability. Notably, in the special case of degenerating to low-rank matrices, our bounds still offer advantages in certain scenarios.
