Human-in-the-Loop Optimization with Model-Informed Priors
Yi-Chi Liao, João Belo, Hee-Seung Moon, Jürgen Steimle, Anna Maria Feit
TL;DR
This work tackles the cold-start challenge in human-in-the-loop optimization by pretraining optimizers on large-scale synthetic user data generated from predictive models. It introduces HOMI, a framework that uses meta-BO trained with synthetic users to learn robust adaptation strategies before real-user deployment, and presents Neural Acquisition Function$^+$ (NAF$^+$), a neural, meta-learned acquisition function augmented with a novelty detector for out-of-distribution robustness and dynamic multi-objective weighting. Through synthetic tests and a real user study on mid-air keyboard adaptation, the approach demonstrates faster early convergence and robust performance across diverse user profiles compared with standard BO, TAF, and ConBO baselines. The work suggests a scalable path toward personalized, efficient interface optimization by combining model-informed priors with in situ optimization, potentially generalizable to a wide range of interactive systems. Overall, HOMI offers a principled method to leverage synthetic user models as training resources, enabling rapid, adaptive interface personalization in VR and beyond.
Abstract
Human-in-the-loop optimization identifies optimal interface designs by iteratively observing user performance. However, it often requires numerous iterations due to the lack of prior information. While recent approaches have accelerated this process by leveraging previous optimization data, collecting user data remains costly and often impractical. We present a conceptual framework, Human-in-the-Loop Optimization with Model-Informed Priors (HOMI), which augments human-in-the-loop optimization with a training phase where the optimizer learns adaptation strategies from diverse, synthetic user data generated with predictive models before deployment. To realize HOMI, we introduce Neural Acquisition Function+ (NAF+), a Bayesian optimization method featuring a neural acquisition function trained with reinforcement learning. NAF+ learns optimization strategies from large-scale synthetic data, improving efficiency in real-time optimization with users. We evaluate HOMI and NAF+ with mid-air keyboard optimization, a representative VR input task. Our work presents a new approach for more efficient interface adaptation by bridging in situ and in silico optimization processes.
