Gain modulation of actions selection without synaptic relearning
Elif Köksal-Ersöz, Pascal Chossat, Frédéric Lavigne
TL;DR
Problem: how to change goal selection after punishment without changing synaptic weights. Approach: gain modulation in a branching network of excitatory populations with inhibition, where learned patterns $\xi^k$ form a state-space and punishment reduces the gain $\gamma$ of active units, biasing transitions among branches without altering $J_{ij}$. Findings: punishment-induced gain changes reweight path probabilities, producing exploitation, exploration, or avoidance; effects persist into the next trial $T+1$ and can occur even when punishment targets distant units. Significance: demonstrates a complementary, fast-acting memory mechanism that encodes experience via $\gamma$ rather than synaptic changes, potentially reducing harmful errors while preserving learned structure.
Abstract
Adaptation of behavior requires the brain to change goals in a changing environment. Synaptic learning has demonstrated its effectiveness in changing the probability of selecting actions based on their outcome. In the extreme case, it is vital not to repeat an action to a given goal that led to harmful punishment. The present model proposes a simple neural mechanism of gain modulation that makes possible immediate changes in the probability of selecting a goal after punishment of variable intensity. Results show how gain modulation determine the type of elementary navigation process within the state space of a network of neuronal populations of excitatory neurons regulated by inhibition. Immediately after punishment, the system can avoid the punished populations by going back or by jumping to unpunished populations. This does not require particular credit assignment at the `choice' population but only gain modulation of neurons active at the time of punishment. Gain modulation does not require statistical relearning that may lead to further errors, but can encode memories of past experiences without modification of synaptic efficacies. Therefore, gain modulation can complements synaptic plasticity.
