Value Under Ignorance in Universal Artificial Intelligence
Cole Wyeth, Marcus Hutter
TL;DR
The paper broadens universal AI by allowing arbitrary continuous utilities beyond simple rewards and reframes semimeasure loss as either death or model-ignorance within an imprecise-probability framework. It develops a rigorous semimeasure-extension method to define integration and shows that Choquet-integral expected utilities align with traditional recursive value functions, while also enabling a credal-set interpretation that can improve computability. The authors prove existence of optimal policies under continuity assumptions and analyze computability properties, highlighting potential benefits of the imprecise-probability approach for general utility AIs. The work offers a foundation for decision-theoretic, history-based universal agents with richer goal structures and discusses implications for AI alignment and the semantics of termination.
Abstract
We generalize the AIXI reinforcement learning agent to admit a wider class of utility functions. Assigning a utility to each possible interaction history forces us to confront the ambiguity that some hypotheses in the agent's belief distribution only predict a finite prefix of the history, which is sometimes interpreted as implying a chance of death equal to a quantity called the semimeasure loss. This death interpretation suggests one way to assign utilities to such history prefixes. We argue that it is as natural to view the belief distributions as imprecise probability distributions, with the semimeasure loss as total ignorance. This motivates us to consider the consequences of computing expected utilities with Choquet integrals from imprecise probability theory, including an investigation of their computability level. We recover the standard recursive value function as a special case. However, our most general expected utilities under the death interpretation cannot be characterized as such Choquet integrals.
