From Tissue Plane to Organ World: A Benchmark Dataset for Multimodal Biomedical Image Registration using Deep Co-Attention Networks
Yifeng Wang, Weipeng Li, Thomas Pearce, Haohan Wang
TL;DR
The paper addresses the challenging problem of histology-to-organ registration by turning it into a learning task that localizes a 2D tissue slice within a 3D organ volume. It introduces ATOM, an automated pipeline that generates diverse 2D–3D image–position pairs from multi-institution data, and RegisMCAN, a deep modular co-attention model that treats the 2D slice as a question and the 3D volume as an answer, predicting the originating block via patch-level co-attention. Key contributions include the ATOM dataset, a tailored RegisMCAN architecture with patch embeddings and an asymmetric loss to handle imbalanced block labels, and extensive experiments across data types, labels, splits, and slice views. The work advances automated, end-to-end tissue-organ registration with potential to improve multimodal disease understanding and non-invasive radiology analyses, while highlighting practical considerations like resolution constraints and memory demands.
Abstract
Correlating neuropathology with neuroimaging findings provides a multiscale view of pathologic changes in the human organ spanning the meso- to micro-scales, and is an emerging methodology expected to shed light on numerous disease states. To gain the most information from this multimodal, multiscale approach, it is desirable to identify precisely where a histologic tissue section was taken from within the organ in order to correlate with the tissue features in exactly the same organ region. Histology-to-organ registration poses an extra challenge, as any given histologic section can capture only a small portion of a human organ. Making use of the capabilities of state-of-the-art deep learning models, we unlock the potential to address and solve such intricate challenges. Therefore, we create the ATOM benchmark dataset, sourced from diverse institutions, with the primary objective of transforming this challenge into a machine learning problem and delivering outstanding outcomes that enlighten the biomedical community. The performance of our RegisMCAN model demonstrates the potential of deep learning to accurately predict where a subregion extracted from an organ image was obtained from within the overall 3D volume. The code and dataset can be found at: https://github.com/haizailache999/Image-Registration/tree/main
