Enforcing Monotonic Progress in Legal Cross-Examination: Preventing Long-Horizon Stagnation in LLM-Based Inquiry
Hsien-Jyh Liao
TL;DR
This paper addresses long-horizon legal inquiry and the risk of procedural stagnation in LLM-based cross-examinations. It introduces Soft-FSM, a neuro-symbolic architecture that externalizes procedural state control to enforce monotonic information gain over a DAG of KIUs, ensuring completion even when language models favor local coherence. Empirical results on three real-world Taiwanese homicide cases show that Soft-FSM achieves >97% completeness with minimal redundancy, while baseline methods collapse as task depth increases due to the Complexity Cliff. The work demonstrates that reliable long-horizon task completion in procedurally constrained domains requires explicit external state enforcement rather than relying on emergent LLM behavior, with implications for designing robust legal AI systems.
Abstract
Large language models (LLMs) exhibit impressive linguistic fluency but struggle to reliably complete long-horizon tasks under explicit procedural constraints. In legal cross-examination, purely proba-bilistic generation often maintains behavioral coherence while failing to ensure procedural advancement. We characterize this failure as procedural stagnation and propose Soft-FSM, a neuro-symbolic architecture that enforces monotonic progress over accumulated Key Information Units (KIUs) via an external deterministic state controller. Experiments on three real-world Taiwanese criminal homicide cases show that baseline methods collapse below 40% completeness, while Soft-FSM consistently achieves over 97% with near-zero redundancy. These results suggest that, in such domains, reliable task completion cannot be guaranteed by emergent LLM behavior alone, and can be reliably enforced through explicit and verifiable external state control.
