Combiner and HyperCombiner Networks: Rules to Combine Multimodality MR Images for Prostate Cancer Localisation
Wen Yan, Bernard Chiu, Ziyi Shen, Qianye Yang, Tom Syer, Zhe Min, Shonit Punwani, Mark Emberton, David Atkinson, Dean C. Barratt, Yipeng Hu
TL;DR
The paper addresses localising clinically significant prostate cancer from multiparametric MRI by modelling radiologist-style modality fusion rules. It introduces Combiner and HyperCombiner networks that use linear ($Z=\sum_{\tau} \alpha_{\tau} Y^{\tau}$) and nonlinear stacking ($Z=\sigma(\sum_{\tau} \beta_{\tau}Y^{\tau}+\beta_{0})$) formulations to weight modality-specific predictions, and a hypernetwork $\tilde{\boldsymbol{\theta}}=h(\boldsymbol{\alpha};\boldsymbol{\phi})$ to enable inference-time rule conditioning. The approach facilitates rule discovery and interpretation, including PI-RADS-based encoding of conditions with zone-specific decisions and hyperparameter-guided grid searches, while maintaining competitive segmentation performance on a sizeable clinical mpMR dataset (651 patients with all three modalities; 751 total cases; 500/124/127 train/val/test). HyperCombiner networks provide efficient exploration of alternative combining rules and quantify modality importance via odds ratios and statistical measures, offering practical insights for modality availability and decision rule optimization in clinical workflows. Overall, the work demonstrates that low-dimensional, interpretable rule models can match end-to-end performance while increasing transparency and enabling rule discovery in multimodal prostate cancer localisation.
Abstract
One of the distinct characteristics in radiologists' reading of multiparametric prostate MR scans, using reporting systems such as PI-RADS v2.1, is to score individual types of MR modalities, T2-weighted, diffusion-weighted, and dynamic contrast-enhanced, and then combine these image-modality-specific scores using standardised decision rules to predict the likelihood of clinically significant cancer. This work aims to demonstrate that it is feasible for low-dimensional parametric models to model such decision rules in the proposed Combiner networks, without compromising the accuracy of predicting radiologic labels: First, it is shown that either a linear mixture model or a nonlinear stacking model is sufficient to model PI-RADS decision rules for localising prostate cancer. Second, parameters of these (generalised) linear models are proposed as hyperparameters, to weigh multiple networks that independently represent individual image modalities in the Combiner network training, as opposed to end-to-end modality ensemble. A HyperCombiner network is developed to train a single image segmentation network that can be conditioned on these hyperparameters during inference, for much improved efficiency. Experimental results based on data from 850 patients, for the application of automating radiologist labelling multi-parametric MR, compare the proposed combiner networks with other commonly-adopted end-to-end networks. Using the added advantages of obtaining and interpreting the modality combining rules, in terms of the linear weights or odds-ratios on individual image modalities, three clinical applications are presented for prostate cancer segmentation, including modality availability assessment, importance quantification and rule discovery.
