Autonomy-Aware Clustering: When Local Decisions Supersede Global Prescriptions
Amber Srivastava, Salar Basiri, Srinivasa Salapaka
TL;DR
The paper addresses clustering when data points exhibit local autonomy that can override policy-prescribed cluster assignments. It couples a maximum-entropy deterministic annealing (DA) framework with a reinforcement-learning (RL) approach to jointly learn assignment policies and cluster representatives, with the DA formulation covering known autonomy models and the RL extension handling unknown ones. A key component is the Adaptive Distance Estimation Network (ADEN), a transformer-based architecture that predicts autonomy-aware distances $d_{\text{avg}}(x_i,y_j)$ and enables knowledge transfer across problem instances as well as online operation. Empirically, the autonomy-aware framework reaches solutions within $\sim$3–4% of the ground truth, whereas autonomy-ignoring baselines show gaps of $\sim$35–40%; real-world applicability is demonstrated via UAV placement in decentralized sensing. Together, the DA phase-transition insights, RL-based learning, and the ADEN model provide a scalable and flexible toolkit for autonomy-aware clustering in dynamic, uncertain environments.
Abstract
Clustering arises in a wide range of problem formulations, yet most existing approaches assume that the entities under clustering are passive and strictly conform to their assigned groups. In reality, entities often exhibit local autonomy, overriding prescribed associations in ways not fully captured by feature representations. Such autonomy can substantially reshape clustering outcomes (altering cluster compositions, geometry, and cardinality), with significant downstream effects on inference and decision-making. We introduce autonomy-aware clustering, a reinforcement learning (RL) framework that learns and accounts for the influence of local autonomy without requiring prior knowledge of its form. Our approach integrates RL with a Deterministic Annealing (DA) procedure: while determining the underlying clusters, DA naturally promotes exploration in the early stages of annealing and transitions to exploitation later. We also show that the annealing procedure exhibits phase transitions that enable the design of efficient annealing schedules. To further enhance adaptability, we propose the Adaptive Distance Estimation Network (ADEN), a transformer-based attention model that learns dependencies between entities and cluster representatives within the RL loop, accommodates variable-sized inputs and outputs, and enables knowledge transfer across diverse problem instances. Empirical results show that our framework closely aligns with underlying data dynamics: even without explicit autonomy models, it achieves solutions close to the ground truth (gap ~3-4%), whereas ignoring autonomy leads to substantially larger gaps (~35-40%). The code and data are publicly available at https://github.com/salar96/AutonomyAwareClustering.
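To make the interplay between annealing and autonomy concrete, the following is a minimal sketch of deterministic annealing with a known autonomy model. It is not the authors' implementation and it omits the RL and ADEN components entirely. Several things are assumptions made for illustration only: the squared Euclidean distance, the form $d_{\text{avg}}(x_i,y_j)=\sum_k p(k\mid j)\,d(x_i,y_k)$ for the autonomy-aware distance, the hypothetical `uniform_override` model (with probability `eps` an entity ignores its prescription and picks a cluster uniformly at random), and all function names, parameter values, and toy data.

```python
import math
import random

def sq_dist(x, y):
    return sum((a - b) ** 2 for a, b in zip(x, y))

def expected_distances(x, centers, override):
    """Assumed form: d_avg(x, y_j) = sum_k override[j][k] * d(x, y_k)."""
    d = [sq_dist(x, y) for y in centers]
    return [sum(p_jk * d_k for p_jk, d_k in zip(row, d)) for row in override]

def gibbs_policy(points, centers, override, beta):
    """Maximum-entropy prescription p(j|i) proportional to exp(-beta * d_avg)."""
    policy = []
    for x in points:
        d_avg = expected_distances(x, centers, override)
        d_min = min(d_avg)  # shift exponents for numerical stability
        w = [math.exp(-beta * (d - d_min)) for d in d_avg]
        z = sum(w)
        policy.append([v / z for v in w])
    return policy

def update_centers(points, policy, override, centers):
    """Weighted means under realized memberships q_i(c) = sum_j p(j|i) * override[j][c]."""
    k, dim = len(centers), len(points[0])
    new_centers = []
    for c in range(k):
        q = [sum(p[j] * override[j][c] for j in range(k)) for p in policy]
        mass = sum(q)
        if mass == 0.0:  # keep an unused center where it is
            new_centers.append(centers[c])
            continue
        new_centers.append(tuple(
            sum(qi * x[a] for qi, x in zip(q, points)) / mass for a in range(dim)))
    return new_centers

def anneal(points, k, override, beta=1e-3, beta_max=50.0, growth=1.5,
           inner_iters=30, seed=0):
    """Raise the inverse temperature beta geometrically (hypothetical schedule)."""
    rng = random.Random(seed)
    dim = len(points[0])
    mean = tuple(sum(x[a] for x in points) / len(points) for a in range(dim))
    centers = [mean] * k  # at high temperature all centers coincide
    while beta <= beta_max:
        # a small perturbation lets coincident centers split at a phase transition
        centers = [tuple(v + rng.uniform(-1e-3, 1e-3) for v in y) for y in centers]
        for _ in range(inner_iters):
            policy = gibbs_policy(points, centers, override, beta)
            centers = update_centers(points, policy, override, centers)
        beta *= growth
    return sorted(centers)

def uniform_override(k, eps):
    """Hypothetical autonomy model: override[j][c] = P(entity ends in c | prescribed j)."""
    return [[(1 - eps) * (j == c) + eps / k for c in range(k)] for j in range(k)]

# Toy data: two well-separated groups of four points each.
points = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (1.0, 1.0),
          (10.0, 10.0), (10.0, 11.0), (11.0, 10.0), (11.0, 11.0)]
passive = anneal(points, 2, uniform_override(2, 0.0))     # entities obey prescriptions
autonomous = anneal(points, 2, uniform_override(2, 0.2))  # 20% chance of overriding
print(passive)
print(autonomous)
```

In this toy setup the passive run should place the representatives at the two group means, while the run with `eps=0.2` should pull them toward each other, since each representative must also serve entities that override their prescription. The geometric schedule here is deliberately naive; the phase-transition analysis mentioned in the abstract is what would inform a more efficient choice of annealing steps.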
