Conditional Gumbel-Softmax for constrained feature selection with application to node selection in wireless sensor networks
Thomas Strypsteen, Alexander Bertrand
TL;DR
This work tackles constrained feature/node selection by introducing Conditional Gumbel-Softmax (CGS), which factors the joint selection distribution $p(Z)$ through conditioning each selected feature on a predecessor, enabling adherence to pairwise distance constraints in wireless sensor networks. The method supports end-to-end learning by replacing discrete selections with differentiable relaxed samples and introduces a polytree-based sampling scheme that masks infeasible transitions to satisfy the constraints. Applied to a constrained Wireless EEG Sensor Network (WESN) for motor execution, CGS is evaluated against a mutual-information heuristic and a vanilla unconstrained Gumbel-Softmax, demonstrating competitive performance under realistic energy-distance constraints while highlighting areas of high gradient variance and occasional suboptimal minima. The approach is general and can be extended to other constrained feature selection problems and network design tasks beyond wearable BCIs, with potential refinements in energy modeling and global metrics like latency.
Abstract
In this paper, we introduce Conditional Gumbel-Softmax as a method to perform end-to-end learning of the optimal feature subset for a given task and deep neural network (DNN) model, while adhering to certain pairwise constraints between the features. We do this by conditioning the selection of each feature in the subset on another feature. We demonstrate how this approach can be used to select the task-optimal nodes composing a wireless sensor network (WSN) while ensuring that none of the nodes that require communication between one another have too large of a distance between them, limiting the required power spent on this communication. We validate this approach on an emulated Wireless Electroencephalography (EEG) Sensor Network (WESN) solving a motor execution task. We analyze how the performance of the WESN varies as the constraints are made more stringent and how well the Conditional Gumbel-Softmax performs in comparison with a heuristic, greedy selection method. While the application focus of this paper is on wearable brain-computer interfaces, the proposed methodology is generic and can readily be applied to node deployment in wireless sensor networks and constrained feature selection in other applications as well.
