Directed Information: Estimation, Optimization and Applications in Communications and Causality
Dor Tsur, Oron Sabag, Navin Kashyap, Haim Permuter, Gerhard Kramer
TL;DR
This work introduces directed information (DI) as a rigorous, causality-aware measure of information flow between stochastic processes, linking it to channel capacity with memory and feedback. It develops the theoretical foundations of causal conditioning, multivariate DI, and the DI rate, and relates DI to transfer entropy, Granger causality, and Pearl’s do-calculus. The monograph surveys a spectrum of DI estimation methods—ranging from plug-in and CTW to neural estimators like DINE—and discusses optimization of estimated DI for capacity problems, including NDT-based approaches for continuous inputs and RL-based strategies for discrete settings. A major portion focuses on feedback capacity of finite-state channels, presenting MD P formulations, Bellman equations, Q-graph and dual bounds, and reinforcement-learning-based algorithms (DDPG, POU) for capacity evaluation, with exact solutions in notable cases like the Ising channel. Collectively, the work provides a comprehensive toolkit for analyzing, estimating, and optimizing directional information flow in complex communication systems with memory and feedback, with broad implications for causal inference and control in data-driven contexts.
Abstract
Directed information (DI) is an information measure that attempts to capture directionality in the flow of information from one random process to another. It is closely related to other causal influence measures, such as transfer entropy, Granger causality, and Pearl's causal framework. This monograph provides an overview of DI and its main application in information theory, namely, characterizing the capacity of channels with feedback and memory. We begin by reviewing the definitions of DI, its basic properties, and its relation to Shannon's mutual information. Next, we provide a survey of DI estimation techniques, ranging from classic plug-in estimators to modern neural-network-based estimators. Considering the application of channel capacity estimation, we describe how such estimators numerically optimize DI rate over a class of joint distributions on input and output processes. A significant part of the monograph is devoted to techniques to compute the feedback capacity of finite-state channels (FSCs). The feedback capacity of a strongly connected FSC involves the maximization of the DI rate from the channel input process to the output process. This maximization is performed over the class of causal conditioned probability input distributions. When the FSC is also unifilar, i.e., the next state is given by a time-invariant function of the current state and the new input-output symbol pair, the feedback capacity is the optimal average reward of an appropriately formulated Markov decision process (MDP). This MDP formulation has been exploited to develop several methods to compute exactly, or at least estimate closely, the feedback capacity of a unifilar FSC. This monograph describes these methods, starting from the value iteration algorithm, to Q-graph methods, and reinforcement learning algorithms that can handle large input and output alphabets.
