MPBD-LSTM: A Predictive Model for Colorectal Liver Metastases Using Time Series Multi-phase Contrast-Enhanced CT Scans
Xueyang Li, Han Xiao, Weixiang Weng, Xiaowei Xu, Yiyu Shi
TL;DR
This work addresses early CRLM detection from time-series 5D CECT data, introducing MPBD-LSTM, a phase-aware, multi-plane extension of E3D-LSTM that processes arterial and portal venous phases in parallel while employing bi-directional temporal modeling. The authors curate a 5D CECT CRLM dataset from two hospitals, build a dedicated MPBD-LSTM architecture with two planes and two bi-directional 3D-LSTM stacks per plane, and demonstrate superior predictive performance (AUC 0.790) over several baselines. Ablation studies confirm the benefits of bi-directional connections and parallel multi-phase processing, while timestamp and phase analyses reveal that combining all timestamps across A and V phases yields the strongest signal, with the V phase contributing more than the A phase. The results highlight the potential of phase-aware spatiotemporal modeling for CRLM prediction, though the authors acknowledge remaining challenges from inter-patient variability and 5D feature fusion that warrant further research.
Abstract
Colorectal cancer is a prevalent form of cancer, and many patients develop colorectal cancer liver metastasis (CRLM) as a result. Early detection of CRLM is critical for improving survival rates. Radiologists usually rely on a series of multi-phase contrast-enhanced computed tomography (CECT) scans done during follow-up visits to perform early detection of the potential CRLM. These scans form unique five-dimensional data (time, phase, and axial, sagittal, and coronal planes in 3D CT). Most of the existing deep learning models can readily handle four-dimensional data (e.g., time-series 3D CT images) and it is not clear how well they can be extended to handle the additional dimension of phase. In this paper, we build a dataset of time-series CECT scans to aid in the early diagnosis of CRLM, and build upon state-of-the-art deep learning techniques to evaluate how to best predict CRLM. Our experimental results show that a multi-plane architecture based on 3D bi-directional LSTM, which we call MPBD-LSTM, works best, achieving an area under curve (AUC) of 0.79. On the other hand, analysis of the results shows that there is still great room for further improvement.
