ZAPBench: A Benchmark for Whole-Brain Activity Prediction in Zebrafish
Jan-Matthis Lueckmann, Alexander Immer, Alex Bo-Yuan Chen, Peter H. Li, Mariela D. Petkova, Nirmala A. Iyer, Luuk Willem Hesselink, Aparna Dev, Gudrun Ihrke, Woohyun Park, Alyson Petruncio, Aubrey Weigel, Wyatt Korff, Florian Engert, Jeff W. Lichtman, Misha B. Ahrens, Michał Januszewski, Viren Jain
TL;DR
ZAPBench establishes a rigorous benchmark for predicting whole-brain neural activity at cellular resolution in a vertebrate, using a 4D light-sheet zebrafish dataset and an evolving connectome. It formalizes horizon-based forecasting with $H=32$ steps and context lengths $C=4$ or $256$, supporting both time-series traces and volumetric video inputs, and evaluates predictions with MAE across multiple stimulus conditions. The authors provide multiple baselines and representative models (time-series and U-Net-based volumetric forecasting) and report that while models outperform naive baselines, there is substantial room for improvement, especially in leveraging cross-neuron information and integrating structural data. They highlight directions like graph-based and latent-variable models, probabilistic forecasting, and connectome-informed approaches, and release code, data, and interactive visualization tools to catalyze future advances. Overall, ZAPBench aims to accelerate predictive neuroscience by offering a scalable, open platform for brain-wide activity forecasting and method development.
Abstract
Data-driven benchmarks have led to significant progress in key scientific modeling domains including weather and structural biology. Here, we introduce the Zebrafish Activity Prediction Benchmark (ZAPBench) to measure progress on the problem of predicting cellular-resolution neural activity throughout an entire vertebrate brain. The benchmark is based on a novel dataset containing 4d light-sheet microscopy recordings of over 70,000 neurons in a larval zebrafish brain, along with motion stabilized and voxel-level cell segmentations of these data that facilitate development of a variety of forecasting methods. Initial results from a selection of time series and volumetric video modeling approaches achieve better performance than naive baseline methods, but also show room for further improvement. The specific brain used in the activity recording is also undergoing synaptic-level anatomical mapping, which will enable future integration of detailed structural information into forecasting methods.
