Automated discovery of symbolic laws governing skill acquisition from naturally occurring data
Sannyuya Liu, Qing Li, Xiaoxuan Shen, Jianwen Sun, Zongkai Yang
TL;DR
The paper tackles understanding skill acquisition by learning governing laws from large-scale, naturally occurring training data. It introduces a two-stage Auto-Discovered Model (ADM) that first uses a Transformer-like deep regressor to estimate latent cognitive states and then applies symbolic distillation to extract algebraic governing laws via symbolic regression. Across simulated and Lumosity real-world data, the approach accurately recovers preset laws under noise, yields interpretable laws that often outperform classical fits, and reveals novel patterns such as logarithmic and inverse relationships. This data-driven, interpretable framework advances cognitive science by providing scalable, evidence-based rules for skill learning with potential applications in education and personalized training.
Abstract
Skill acquisition is a key area of research in cognitive psychology as it encompasses multiple psychological processes. The laws discovered under experimental paradigms are controversial and lack generalizability. This paper aims to unearth the laws of skill learning from large-scale training log data. A two-stage algorithm was developed to tackle the issues of unobservable cognitive states and algorithmic explosion in searching. Initially a deep learning model is employed to determine the learner's cognitive state and assess the feature importance. Subsequently, symbolic regression algorithms are utilized to parse the neural network model into algebraic equations. Experimental results show the algorithm can accurately restore preset laws within a noise range in continuous feedback settings. When applied to Lumosity training data, the method outperforms traditional and recent models in fitness terms. The study reveals two new forms of skill acquisition laws and reaffirms some previous findings.
