Optimizing Decentralized Online Learning for Supervised Regression and Classification Problems
J. M. Diederik Kruijssen, Renata Valieva, Steven N. Longmore
TL;DR
The paper presents a systematic calibration framework for key parameters in decentralized online learning, focusing on how historical performance maps to weights and how performance translates to rewards. Using an Allora-like simulator, it separates the optimization into the slope of the regret-to-weight mapping ($p$), the historical window via EMA ($\alpha$), and the slope of reward mappings ($p_i,p_f,p_r$), comparing regression and classification tasks. It shows that optimal $p$ is 3 for regression and 5 for classification, that $\alpha \\approx 0.1$ balances memory and adaptability, and that $p_i= p_f= p_r=1$–3-1–3 defaults minimize reward-spread while remaining effective across network compositions. The findings offer a practical recipe for tuning decentralized inference systems and suggest that these defaults generalize to networks solving a range of inference-synthesis problems beyond the Allora design.
Abstract
Decentralized learning networks aim to synthesize a single network inference from a set of raw inferences provided by multiple participants. To determine the combined inference, these networks must adopt a mapping from historical participant performance to weights, and to appropriately incentivize contributions they must adopt a mapping from performance to fair rewards. Despite the increased prevalence of decentralized learning networks, there exists no systematic study that performs a calibration of the associated free parameters. Here we present an optimization framework for key parameters governing decentralized online learning in supervised regression and classification problems. These parameters include the slope of the mapping between historical performance and participant weight, the timeframe for performance evaluation, and the slope of the mapping between performance and rewards. These parameters are optimized using a suite of numerical experiments that mimic the design of the Allora Network, but have been extended to handle classification tasks in addition to regression tasks. This setup enables a comparative analysis of parameter tuning and network performance optimization (loss minimization) across both problem types. We demonstrate how the optimal performance-weight mapping, performance timeframe, and performance-reward mapping vary with network composition and problem type. Our findings provide valuable insights for the optimization of decentralized learning protocols, and we discuss how these results can be generalized to optimize any inference synthesis-based, decentralized AI network.
