Generating Computational Cognitive Models using Large Language Models

Milena Rmus; Akshay K. Jagadish; Marvin Mathony; Tobias Ludwig; Eric Schulz

Generating Computational Cognitive Models using Large Language Models

Milena Rmus, Akshay K. Jagadish, Marvin Mathony, Tobias Ludwig, Eric Schulz

TL;DR

This work introduces GeCCo, a guided-generation pipeline that uses open-source Large Language Models to generate executable cognitive models as Python functions, iteratively refining them with held-out data feedback. Across decision making, learning, planning, and working memory, GeCCo-produced models match or exceed domain-specific baselines in predictive accuracy ($BIC$) and posterior predictive checks, while capturing substantial explainable variance comparable to a foundation model. The approach relies on in-context learning, a hybrid LLM-plus-optimization loop, and robust control analyses to demonstrate the scalability, interpretability, and domain generalizability of LLM-driven cognitive-model discovery. The findings suggest LLMs can democratize cognitive-model generation, accelerate theory development, and inspire new theoretical insights by revealing compact, empirically competitive models across diverse task domains.

Abstract

Computational cognitive models, which formalize theories of cognition, enable researchers to quantify cognitive processes and arbitrate between competing theories by fitting models to behavioral data. Traditionally, these models are handcrafted, which requires significant domain knowledge, coding expertise, and time investment. However, recent advances in machine learning offer solutions to these challenges. In particular, Large Language Models (LLMs) have demonstrated remarkable capabilities for in-context pattern recognition, leveraging knowledge from diverse domains to solve complex problems, and generating executable code that can be used to facilitate the generation of cognitive models. Building on this potential, we introduce a pipeline for Guided generation of Computational Cognitive Models (GeCCo). Given task instructions, participant data, and a template function, GeCCo prompts an LLM to propose candidate models, fits proposals to held-out data, and iteratively refines them based on feedback constructed from their predictive performance. We benchmark this approach across four different cognitive domains -- decision making, learning, planning, and memory -- using three open-source LLMs, spanning different model sizes, capacities, and families. On four human behavioral data sets, the LLM generated models that consistently matched or outperformed the best domain-specific models from the cognitive science literature. Taken together, our results suggest that LLMs can generate cognitive models with conceptually plausible theories that rival -- or even surpass -- the best models from the literature across diverse task domains.

Generating Computational Cognitive Models using Large Language Models

TL;DR

Abstract

Generating Computational Cognitive Models using Large Language Models

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (14)