Context-Aware Inference via Performance Forecasting in Decentralized Learning Networks
Joel Pfeffer, J. M. Diederik Kruijssen, Clément Gossart, Mélanie Chevance, Diego Campo Millan, Florian Stecker, Steven N. Longmore
TL;DR
The paper tackles the slow adaptation of weightings in decentralized learning by introducing context-aware performance forecasting that predicts per-epoch model performance. Using Allora-inspired networks, forecasting workers predict losses or derived regrets (and regret $z$-scores), which are converted through a sigmoid gate to forecast-implied weights for combining inferences. Across synthetic benchmarks and live Allora data, forecasts based on regret and regret $z$-scores consistently outperform naive, history-based network inferences, with per-inferer models typically offering the strongest context awareness. The findings highlight the importance of target choice, feature design, and training horizon, and suggest that performance forecasting can be a practical, generalizable approach for predictive weighting in dynamic ensembles. This work provides a concrete framework and open-source baseline for deploying context-aware forecasting in decentralized prediction tasks and potentially broader model-combination problems.
Abstract
In decentralized learning networks, predictions from many participants are combined to generate a network inference. While many studies have demonstrated performance benefits of combining multiple model predictions, existing strategies using linear pooling methods (ranging from simple averaging to dynamic weight updates) face a key limitation. Dynamic prediction combinations that rely on historical performance to update weights are necessarily reactive. Due to the need to average over a reasonable number of epochs (with moving averages or exponential weighting), they tend to be slow to adjust to changing circumstances (phase or regime changes). In this work, we develop a model that uses machine learning to forecast the performance of predictions by models at each epoch in a time series. This enables `context-awareness' by assigning higher weight to models that are likely to be more accurate at a given time. We show that adding a performance forecasting worker in a decentralized learning network, following a design similar to the Allora network, can improve the accuracy of network inferences. Specifically, we find forecasting models that predict regret (performance relative to the network inference) or regret z-score (performance relative to other workers) show greater improvement than models predicting losses, which often do not outperform the naive network inference (historically weighted average of all inferences). Through a series of optimization tests, we show that the performance of the forecasting model can be sensitive to choices in the feature set and number of training epochs. These properties may depend on the exact problem and should be tailored to each domain. Although initially designed for a decentralized learning network, using performance forecasting for prediction combination may be useful in any situation where predictive rather than reactive model weighting is needed.
