Neuroscheduling for Remote Estimation
Marcos M. Vasconcelos, Yifei Zhang
TL;DR
This work addresses remote estimation where a scheduler reveals only one of two Gaussian sources to a remote estimator, formalizing the objective $\mathbb{E}[ (X_1-\hat X_1)^2+(X_2-\hat X_2)^2 ]$ under a max-scheduling policy. It introduces neuroscheduling, a data-driven approach that uses linear function approximation to model nonlinear estimators via $\eta_j(x)=w_j^T\boldsymbol{\Phi}(x)$ and employs a convex-concave procedure to efficiently train the weights. A key theoretical contribution is a necessary-and-sufficient condition on the basis functions to avoid over-fitting, which explains why simple ReLU-based bases can fail, and numerical results show the optimal scheduler/estimator pair is nonlinear for Gaussian sources. Empirically, soft-plus and polynomial bases yield stable, accurate approximations, suggesting practical pathways to implement adaptive, data-driven scheduling in multi-source remote estimation with performance guarantees against over-fitting.
Abstract
Many modern distributed systems consist of devices that generate more data than what can be transmitted via a communication link in near real time with high-fidelity. We consider the scheduling problem in which a device has access to multiple data sources, but at any moment, only one of them is revealed in real-time to a remote receiver. Even when the sources are Gaussian, and the fidelity criterion is the mean squared error, the globally optimal data selection strategy is not known. We propose a data-driven methodology to search for the elusive optimal solution using linear function approximation approach called neuroscheduling and establish necessary and sufficient conditions for the optimal scheduler to not over fit training data. Additionally, we present several numerical results that show that the globally optimal scheduler and estimator pair to the Gaussian case are nonlinear.
