Inferring response times of perceptual decisions with Poisson variational autoencoders

Hayden R. Johnson; Anastasia N. Krouglova; Hadi Vafaii; Jacob L. Yates; Pedro J. Gonçalves

Inferring response times of perceptual decisions with Poisson variational autoencoders

Hayden R. Johnson, Anastasia N. Krouglova, Hadi Vafaii, Jacob L. Yates, Pedro J. Gonçalves

TL;DR

This work introduces an image-computable perceptual decision model, PVAE-RT, that jointly learns efficient spiking representations of high-dimensional stimuli via a Poisson variational autoencoder and executes Bayesian evidence accumulation through a task-optimized decoder. An entropy-based stopping rule yields response times that capture key psychophysical regularities, including stochastic variability, right-skewed distributions, Hick’s law, and speed–accuracy trade-offs, demonstrated on MNIST. By linking efficient sensory coding with probabilistic decision dynamics under biological constraints, the approach provides a principled framework for rendering temporal aspects of perception in neural models and evaluating rapid decision behavior in complex visual tasks.

Abstract

Many properties of perceptual decision making are well-modeled by deep neural networks. However, such architectures typically treat decisions as instantaneous readouts, overlooking the temporal dynamics of the decision process. We present an image-computable model of perceptual decision making in which choices and response times arise from efficient sensory encoding and Bayesian decoding of neural spiking activity. We use a Poisson variational autoencoder to learn unsupervised representations of visual stimuli in a population of rate-coded neurons, modeled as independent homogeneous Poisson processes. A task-optimized decoder then continually infers an approximate posterior over actions conditioned on incoming spiking activity. Combining these components with an entropy-based stopping rule yields a principled and image-computable model of perceptual decisions capable of generating trial-by-trial patterns of choices and response times. Applied to MNIST digit classification, the model reproduces key empirical signatures of perceptual decision making, including stochastic variability, right-skewed response time distributions, logarithmic scaling of response times with the number of alternatives (Hick's law), and speed-accuracy trade-offs.

Inferring response times of perceptual decisions with Poisson variational autoencoders

TL;DR

Abstract

Inferring response times of perceptual decisions with Poisson variational autoencoders

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (7)