Abrupt and spontaneous strategy switches emerge in simple regularised neural networks

Anika T. Löwe; Léo Touzo; Paul S. Muhle-Karbe; Andrew M. Saxe; Christopher Summerfield; Nicolas W. Schuck

Abrupt and spontaneous strategy switches emerge in simple regularised neural networks

Anika T. Löwe, Léo Touzo, Paul S. Muhle-Karbe, Andrew M. Saxe, Christopher Summerfield, Nicolas W. Schuck

TL;DR

The study shows that insight-like, abrupt strategy switches can emerge from simple gradient-descent learning with L1-regularised input gating and gradient noise, without dedicated insight mechanisms. By comparing humans and two-input neural networks on a perceptual decision task with a hidden regularity, the authors observe sudden, selective, and delayed switches in a subset of both groups, aligning with classic aha moments. Mechanistically, regularisation suppresses irrelevant inputs (gating) until latent knowledge (colour weights) becomes decisive, and Gaussian gradient noise enables transitions, producing a spectrum of switch timings. These results suggest that seemingly conscious insights can arise from ordinary gradual learning dynamics, with implications for understanding cognitive flexibility and the design of artificial systems that capitalize on hidden regularities.

Abstract

Humans sometimes have an insight that leads to a sudden and drastic performance improvement on the task they are working on. Sudden strategy adaptations are often linked to insights, considered to be a unique aspect of human cognition tied to complex processes such as creativity or meta-cognitive reasoning. Here, we take a learning perspective and ask whether insight-like behaviour can occur in simple artificial neural networks, even when the models only learn to form input-output associations through gradual gradient descent. We compared learning dynamics in humans and regularised neural networks in a perceptual decision task that included a hidden regularity to solve the task more efficiently. Our results show that only some humans discover this regularity, whose behaviour was marked by a sudden and abrupt strategy switch that reflects an aha-moment. Notably, we find that simple neural networks with a gradual learning rule and a constant learning rate closely mimicked behavioural characteristics of human insight-like switches, exhibiting delay of insight, suddenness and selective occurrence in only some networks. Analyses of network architectures and learning dynamics revealed that insight-like behaviour crucially depended on a regularised gating mechanism and noise added to gradient updates, which allowed the networks to accumulate "silent knowledge" that is initially suppressed by regularised (attentional) gating. This suggests that insight-like behaviour can arise naturally from gradual learning in simple neural networks, where it reflects the combined influences of noise, gating and regularisation.

Abrupt and spontaneous strategy switches emerge in simple regularised neural networks

TL;DR

Abstract

Paper Structure (17 sections, 15 equations, 4 figures)

This paper contains 17 sections, 15 equations, 4 figures.

Baseline Task Performance (Training and Motion Phases)
Task Performance After Correlation Onset (Motion and Colour Phase)
Insight Classification
Behavioural Differences Between Participants With and Without Insights
Delay of Insights
Effects of Feedback and Low Coherence
Architecture
Learning Algorithm
Network Training
Deep Neural Networks
Overall Network Performance
Insight-like Behavioural Characteristics of Network Behaviour
Spontaneous Strategy Switches in Deep Neural Networks
Insight Networks Immediately Learn More About Colour Once It Becomes Predictive
"Silent" Colour Knowledge Precedes Insight-like Behaviour
...and 2 more sections

Figures (4)

Figure 1: Stimuli, task design and insight classification procedure (A) Stimuli and stimulus-response mapping: dot clouds were either coloured in orange or purple and moved to one of the four directions NW, NE, SE, SW with varying coherence. A left response key, "X", corresponded to the NW/SE motion directions, while a right response key "M" corresponded to NE/SW directions. (B) Task structure of the two-alternative forced choice task for humans and neural networks: each block consisted of 100 trials. A first training block only for humans contained only 100% motion coherence trials to familiarise subjects with the S-R mapping. The remaining training blocks contained only high coherence (0.2, 0.3, 0.45) trials. In the motion phase, colour changed randomly and was not predictive and all motion coherence levels were included. Colour was predictive of correct choices and correlated with motion directions as well as correct response buttons only in the last five blocks (motion and colour phase). Participants were instructed to use colour before the very last block 9, which served as sanity check (data shown only in SI). (C) Trial structure: a fixation cue is shown for a duration that is shuffled between 400, 600, 800 and 1000 ms. The random dot cloud stimulus is displayed for 2000 ms. A response can be made during these entire 2000 ms, but a central feedback cue will replace the fixation cue immediately after a response. (D) Schematic of the neural network with regularised gate modulation used to model insights. (E) Insight classification procedure: We fitted a sigmoid model to data from the lowest motion coherence condition data of both the Experimental Group and a Control Group (where colour never becomes predictive), and derived the distributions of the slope steepness at the estimated switch point (inflection point $t_s$ of sigmoid function). We then asked which participants from the Experimental group had fitted slopes that were steeper than the 100th percentile of the Control Group. The resulting purely behavioural classification of insights versus no insight agreed to 79.6% with verbal insight reports from a post-task questionnaire and predicted a number of behavioural features (see text). Importantly, using this method allowed us to apply the same procedure to neural networks.
Figure 2: Humans: task performance and insight-like strategy switches (A) Accuracy (% correct) during the motion phase increases with increasing motion coherence. N = 99, error bars signify standard error of the mean (SEM). (B) Accuracy (% correct) over the course of the experiment for all motion coherence levels. First dashed vertical line marks the onset of the colour predictiveness (motion and colour phase), second dashed vertical line the "instruction" about colour predictiveness. Blocks shown are halved task blocks (50 trials each). N = 99, error shadows signify SEM. (C) Switch point-aligned accuracy on lowest motion coherence level for insight (49/99) and no-insight (50/99) subjects. Blocks shown are halved task blocks (50 trials each). Error shadow signifies SEM. (D) Trial-wise switch-aligned smoothed binary responses on lowest motion coherence level for an example insight subject. (E) Illustration of the sigmoid function for different slope steepness parameters. (F) Difference between BICs of the linear and sigmoid function for each human subject. N = 99. (G) Distributions of fitted slope steepness at inflection point parameter for control experiment and classified insight and no-insight groups. (H) Distribution of switch points. Dashed vertical line marks onset of colour predictiveness. Blocks shown are halved task blocks (50 trials each).
Figure 3: L1-regularised neural networks: task performance and insight-like strategy switches (A) Accuracy (% correct) during the motion phase increases with increasing motion coherence. N = 99, error bars signify SEM. Grey line is human data for comparison. (B) Accuracy (% correct) over the course of the experiment for all motion coherence levels. First dashed vertical line marks the onset of the colour predictiveness (motion and colour phase), second dashed vertical line the "instruction" about colour predictiveness. Blocks shown are halved task blocks (50 trials each). N = 99, error shadows signify SEM. (C) Switch point-aligned accuracy on lowest motion coherence level for insight (46/99) and no-insight (53/99) networks. Blocks shown are halved task blocks (50 trials each). Error shadow signifies SEM. (D) Trial-wise switch-aligned continuous outputs on lowest motion coherence level for an example insight network. (E) Switch point-aligned accuracy on lowest motion coherence level for insight (18/99) and no-insight (81/99) hidden layer networks. Blocks shown are halved task blocks (50 trials each). Error shadow signifies SEM. (F) Difference between BICs of the linear model and sigmoid function for each network. (G) Distributions of fitted slope steepness at inflection point parameter for control networks and classified insight and no-insight groups. (H) Distribution of switch points. Dashed vertical line marks onset of colour predictiveness. Blocks shown are halved task blocks (50 trials each).
Figure 4: Influence of gradient noise $\sigma_{\xi}$ and regularisation $\lambda$ on insight-like switches (A) Influence of gradient noise (standard deviation $\sigma_{\xi}$) on the frequency of absolute numbers of insight-like networks. The frequency of insight-like switches increases gradually with $\sigma_{\xi}$ until it reaches a ceiling around $\sigma_{\xi}$ = .05. Error bars are SD. Results from 10 simulations of 99 networks each. (B) Effects of gradient noise added only to either all weights ($\sigma_{\xi_{w}}$, light purple dashed line), all gates ($\sigma_{\xi_{g}}$, solid purple line), all motion parameters (i.e. motion weight and motion gates, $\sigma_{\xi_{g_m,w_m}}$, light purple solid line) and all colour parameters ($\sigma_{\xi_{g_c,w_c}}$, dark purple solid line) on the frequency of insight-like switches. While only small amounts of noise in the colour parameters $\sigma_{\xi_{g_c,w_c}}$ suffice to induce 100% insight-like strategy switches among networks (dark purple line), adding noise to the motion parameters $\sigma_{\xi_{g_c,w_c}}$ (light purple solid line) only had a a very minor effect. Furthermore, we find that adding noise specifically to the weights ($\sigma_{\xi_{w}}$, dashed purple line), has a much larger effect than adding noise to the gates ($\sigma_{\xi_{g}}$, solid purple line). Error bars are SD. Results from 10 simulations of 99 networks each. Colour scheme as in Fig. 1B (C) The frequency of insight-like switches declines with increasing $\lambda$. (D) The average switch point occurs later in the task with increasing $\lambda$. Error bars signify SEM.

Abrupt and spontaneous strategy switches emerge in simple regularised neural networks

TL;DR

Abstract

Abrupt and spontaneous strategy switches emerge in simple regularised neural networks

Authors

TL;DR

Abstract

Table of Contents

Figures (4)