Function-Coherent Gambles with Non-Additive Sequential Dynamics
Gregory Wheeler
TL;DR
The paper addresses the ergodicity problem in sequential decision making with multiplicative dynamics by relaxing linear utility and introducing function-coherence. It defines a log-domain nonlinear operator $f \oplus g = (1+f)(1+g) - 1$ with $L(f)=\log(1+f)$, proving a representation theorem that preserves coherence in the transformed space and aligns sequential aggregation with geometric growth. By generalizing to $f \oplus_u g = u^{-1}(u(f)+u(g))$, it provides conditions for well-behaved operators, a taxonomy of utility classes (power, exponential, logarithmic), and induced risk measures that capture multiplicative risk and time-average growth. The framework unifies ergodicity considerations, risk assessment, and non-stationary reward dynamics, with practical implications for portfolio management and long-horizon decisions where volatility drag and asymmetric gains/losses matter. Overall, it offers a principled, mathematically rigorous bridge between expectation values and time averages in dynamic, non-linear contexts.
Abstract
The desirable gambles framework provides a rigorous foundation for imprecise probability theory but relies heavily on linear utility via its coherence axioms. In our related work, we introduced function-coherent gambles to accommodate non-linear utility. However, when repeated gambles are played over time -- especially in intertemporal choice where rewards compound multiplicatively -- the standard additive combination axiom fails to capture the appropriate long-run evaluation. In this paper we extend the framework by relaxing the additive combination axiom and introducing a nonlinear combination operator that effectively aggregates repeated gambles in the log-domain. This operator preserves the time-average (geometric) growth rate and addresses the ergodicity problem. We prove the key algebraic properties of the operator, discuss its impact on coherence, risk assessment, and representation, and provide a series of illustrative examples. Our approach bridges the gap between expectation values and time averages and unifies normative theory with empirically observed non-stationary reward dynamics.
