Alignment in Time: Peak-Aware Orchestration for Long-Horizon Agentic Systems
Hanjing Shi, Dominic DiFranzo
TL;DR
The paper tackles sustained reliability in long-horizon AI workflows by reframing alignment as a trajectory-level control problem. It introduces APEMO, a runtime, orthogonal overlay that reallocates compute to peak and ending segments under a fixed budget, guided by temporal-affective signals rather than model-weight changes. Through ABM, single-agent LLM, and multi-agent evaluations, it shows consistent improvements in trajectory-level quality and reuse, with quantified trade-offs in coordination cost. The work highlights trajectory-level control as a practical, scalable pathway to robust agentic systems and invites further study on human–AI interaction, adaptive scheduling, and scaling to larger models.
Abstract
Traditional AI alignment primarily focuses on individual model outputs; however, autonomous agents in long-horizon workflows require sustained reliability across entire interaction trajectories. We introduce APEMO (Affect-aware Peak-End Modulation for Orchestration), a runtime scheduling layer that optimizes computational allocation under fixed budgets by operationalizing temporal-affective signals. Instead of modifying model weights, APEMO detects trajectory instability through behavioral proxies and targets repairs at critical segments, such as peak moments and endings. Evaluation across multi-agent simulations and LLM-based planner--executor flows demonstrates that APEMO consistently enhances trajectory-level quality and reuse probability over structural orchestrators. Our results reframe alignment as a temporal control problem, offering a resilient engineering pathway for the development of long-horizon agentic systems.
