Universal Discrete Filtering with Lookahead or Delay
Pumiao Yan, Jiwon Jeong, Naomi Sagan, Tsachy Weissman
TL;DR
This work extends universal discrete filtering to allow fixed lookahead and delay, showing that any universal SPA combined with an invertible memoryless channel can asymptotically achieve Bayes-optimal filtering. It develops tight excess-loss bounds for causal, delayed, and lookahead settings, linking performance to KL divergence, mutual information, and causally conditioned entropy. The authors instantiate the framework with LZ78-based SPAs, proving their universality and practicality, and implement a complete filtering pipeline with training, Monte Carlo estimation for delay, and lookahead-aware noncausal filtering. Through experiments on a Markov-source scenario, the LZ78-based universal filters approach the Bayesian Bayes limit and outperform a Wiener baseline while offering favorable computational efficiency. The results demonstrate the viability of universal SPAs for real-time, resource-constrained filtering in discrete-sequence environments and lay groundwork for broader noise models and comparisons with neural methods.
Abstract
We consider the universal discrete filtering problem, where an input sequence generated by an unknown source passes through a discrete memoryless channel, and the goal is to estimate its components based on the output sequence with limited lookahead or delay. We propose and establish the universality of a family of schemes for this setting. These schemes are induced by universal Sequential Probability Assignments (SPAs), and inherit their computational properties. We show that the schemes induced by LZ78 are practically implementable and well-suited for scenarios with limited computational resources and latency constraints. In passing, we use some of the intermediate results to obtain upper and lower bounds that appear to be new, in the purely Bayesian setting, on the optimal filtering performance in terms, respectively, of the mutual information between the noise-free and noisy sequence, and the entropy of the noise-free sequence causally conditioned on the noisy one.
