Coarse-to-Fine Learning for Multi-Pipette Localisation in Robot-Assisted In Vivo Patch-Clamp
Lan Wei, Gema Vera Gonzalez, Phatsimo Kgwarae, Alexander Timms, Denis Zahorovsky, Simon Schultz, Dandan Zhang
TL;DR
This work tackles automated, real-time localisation of multiple pipettes in in vivo patch-clamp experiments using two-photon imaging. A heatmap-augmented coarse-to-fine pipeline integrates CycleGAN-based background elimination, a Vision Transformer encoder for coarse pipette heatmaps, and a ResNet-18-based fine regression to predict precise tip coordinates, with Hungarian matching guiding robust training. The method achieves high localization accuracy, reporting > $98\%$ within $10\,\mu$m and > $89\%$ within $5\,\mu$m, with an average MSE of $2.52\,\mu$m, and operates in real time across CPU and GPU hardware. This approach substantially enhances automation and scalability for robot-assisted in vivo multi-pipette patch-clamp, enabling more efficient investigations of cellular interactions in intact brains.
Abstract
In vivo image-guided multi-pipette patch-clamp is essential for studying cellular interactions and network dynamics in neuroscience. However, current procedures mainly rely on manual expertise, which limits accessibility and scalability. Robotic automation presents a promising solution, but achieving precise real-time detection of multiple pipettes remains a challenge. Existing methods focus on ex vivo experiments or single pipette use, making them inadequate for in vivo multi-pipette scenarios. To address these challenges, we propose a heatmap-augmented coarse-to-fine learning technique to facilitate multi-pipette real-time localisation for robot-assisted in vivo patch-clamp. More specifically, we introduce a Generative Adversarial Network (GAN)-based module to remove background noise and enhance pipette visibility. We then introduce a two-stage Transformer model that starts with predicting the coarse heatmap of the pipette tips, followed by the fine-grained coordination regression module for precise tip localisation. To ensure robust training, we use the Hungarian algorithm for optimal matching between the predicted and actual locations of tips. Experimental results demonstrate that our method achieved > 98% accuracy within 10 μm, and > 89% accuracy within 5 μm for the localisation of multi-pipette tips. The average MSE is 2.52 μm.
