Modeling Rational Adaptation of Visual Search to Hierarchical Structures
Saku Sourulahti, Christian P Janssen, Jussi PP Jokinen
TL;DR
The paper tackles how visual search efficiency can be enhanced by exploiting hierarchical structure under human memory limits. It introduces a reinforcement-learning–driven, computationally rational model that learns to search within visual hierarchies via a POMDP framework, without hard-coded strategies. Empirical data from a human experiment show that structured layouts reduce search times, and the model’s predictions align closely with human performance (R^2 ≈ 0.90, RMSE ≈ 0.38 s), especially at larger set sizes. The work advances understanding of adaptive visual search in structured environments and suggests practical guidelines for designing visually organized information spaces in HCI contexts. It highlights the potential of hierarchical memory-based strategies to inform UI layout optimization and eye-movement prediction, while outlining limitations and paths for extending memory structure and perceptual grouping in future work.
Abstract
Efficient attention deployment in visual search is limited by human visual memory, yet this limitation can be offset by exploiting the environment's structure. This paper introduces a computational cognitive model that simulates how the human visual system uses visual hierarchies to prevent refixations in sequential attention deployment. The model adopts computational rationality, positing behaviors as adaptations to cognitive constraints and environmental structures. In contrast to earlier models that predict search performance for hierarchical information, our model does not include predefined assumptions about particular search strategies. Instead, our model's search strategy emerges as a result of adapting to the environment through reinforcement learning algorithms. In an experiment with human participants we test the model's prediction that structured environments reduce visual search times compared to random tasks. Our model's predictions correspond well with human search performance across various set sizes for both structured and unstructured visual layouts. Our work improves understanding of the adaptive nature of visual search in hierarchically structured environments and informs the design of optimized search spaces.
