Finding Reproducible and Prognostic Radiomic Features in Variable Slice Thickness Contrast Enhanced CT of Colorectal Liver Metastases
Jacob J. Peoples, Mohammad Hamghalam, Imani James, Maida Wasim, Natalie Gangai, Hyunseon Christine Kang, X. John Rong, Yun Shin Chun, Richard K. G. Do, Amber L. Simpson
TL;DR
This paper investigates how radiomic features extracted from contrast-enhanced CT of colorectal liver metastases (CRLM) behave under variable slice thickness and across multiple feature-extractor settings, assessing both reproducibility and prognostic value. Using a prospective 81-patient cohort for reproducibility and an independent 197-patient cohort for survival analysis, the authors extract 93 features under eight extractor configurations from two ROIs (the largest tumor and the liver parenchyma). They measure reproducibility with the concordance correlation coefficient (CCC) and survival performance with cross-validated Cox models. They find that reproducibility and prognostic utility depend on the ROI and the feature type, and that no single extractor setting is universally best. A data-driven approach that pools features across settings and retains only those meeting a reproducibility threshold (CCC ≥ 0.85) achieves prognostic performance equivalent to the best single-setting model (C-index 0.629 versus 0.630) while reducing overfitting. The results underscore the value of integrating reproducibility metrics into feature selection and support using diverse extraction settings to build robust radiomic signatures for CRLM prognosis. Noted limitations include a fixed bin count and anisotropic voxel sizes. The work advances the field toward reproducible, prognostic radiomic biomarkers in CT for CRLM and provides a practical framework for multi-parameter feature selection.
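For readers unfamiliar with the C-index, the following is a minimal sketch of Harrell's concordance index in plain Python. It is illustrative only: the function name `harrell_c_index` and the toy data are hypothetical, and the C-index variant used in the paper's cross-validated Cox analysis is not stated in this summary, so this should not be read as the authors' implementation.

```python
def harrell_c_index(times, events, risks):
    """Harrell's C-index for right-censored survival data (illustrative sketch).

    times  : observed follow-up times
    events : 1 if death was observed, 0 if the patient was censored
    risks  : predicted risk scores (higher means shorter expected survival)
    """
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        if not events[i]:
            continue  # a censored patient cannot anchor a comparable pair
        for j in range(n):
            # Pair is comparable when patient i died before patient j's
            # observed time; pairs with tied times are skipped.
            if times[i] < times[j]:
                comparable += 1
                if risks[i] > risks[j]:
                    concordant += 1.0
                elif risks[i] == risks[j]:
                    concordant += 0.5  # tied predictions count as half
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


# Hypothetical toy data: four patients, the second one censored.
toy_times = [5, 10, 15, 20]
toy_events = [1, 0, 1, 1]
toy_risks = [0.9, 0.4, 0.1, 0.6]
toy_c = harrell_c_index(toy_times, toy_events, toy_risks)
```

A value of 0.5 corresponds to random ranking and 1.0 to perfect ranking, which puts the reported values of about 0.63 in context as modest but better-than-chance discrimination.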
Abstract
Establishing the reproducibility of radiomic signatures is a critical step on the path to clinical adoption of quantitative imaging biomarkers; however, radiomic signatures must also be meaningfully related to an outcome of clinical importance to be of value for personalized medicine. In this study, we analyze both the reproducibility and prognostic value of radiomic features extracted from the liver parenchyma and largest liver metastases in contrast-enhanced CT scans of patients with colorectal liver metastases (CRLM). A prospective cohort of 81 patients from two major US cancer centers was used to establish the reproducibility of radiomic features extracted from images reconstructed with different slice thicknesses. A publicly available, single-center cohort of 197 preoperative scans from patients who underwent hepatic resection for treatment of CRLM was used to evaluate the prognostic value of features and models to predict overall survival. A standard set of 93 features was extracted from all images under each of eight different extractor settings. The feature extraction settings producing the most reproducible feature values, as well as those producing the most prognostically discriminative feature values, were highly dependent on both the region of interest and the specific feature in question. The best overall predictive model (C-index = 0.630 (0.603--0.649)) was produced using features extracted with a particular setting, without accounting for reproducibility. However, an equivalent-performing model (C-index = 0.629 (0.605--0.645)) was produced by pooling features from all extraction settings and excluding features with low reproducibility (retaining only those with $\mathrm{CCC} \geq 0.85$) prior to feature selection. Our findings support a data-driven approach to feature extraction and selection, preferring the inclusion of many features, and narrowing feature selection based on reproducibility when relevant data is available.
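The reproducibility filter described above can be sketched as follows. This is a minimal illustration, assuming Lin's concordance correlation coefficient computed with population (1/n) moments on paired feature values from two reconstructions of the same scans. The function names, feature names, and numbers are hypothetical and are not taken from the study.

```python
import math
from statistics import fmean


def concordance_correlation(x, y):
    """Lin's CCC between paired measurements x and y (population moments)."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must be paired and have at least 2 values")
    mx, my = fmean(x), fmean(y)
    sxx = sum((a - mx) ** 2 for a in x) / n
    syy = sum((b - my) ** 2 for b in y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y)) / n
    denom = sxx + syy + (mx - my) ** 2
    if denom == 0:
        return math.nan  # both series constant and equal: CCC is undefined
    return 2.0 * sxy / denom


def reproducible_features(recon_a, recon_b, threshold=0.85):
    """Keep features whose CCC across two reconstructions meets the threshold.

    recon_a, recon_b: dicts mapping feature name -> per-patient values,
    with patients in the same order in both dicts.
    """
    return [
        name
        for name in recon_a
        if concordance_correlation(recon_a[name], recon_b[name]) >= threshold
    ]


# Hypothetical values for five patients under two slice thicknesses.
thin = {"feat_a": [1.0, 2.0, 3.0, 4.0, 5.0], "feat_b": [1.0, 2.0, 3.0, 4.0, 5.0]}
thick = {"feat_a": [1.1, 2.0, 2.9, 4.1, 5.0], "feat_b": [3.0, 1.0, 4.0, 2.0, 5.0]}
kept = reproducible_features(thin, thick)
```

In this toy example `feat_a` is retained (CCC close to 1) and `feat_b` is dropped (CCC of 0.5). Features with an undefined CCC are also dropped, because a comparison against NaN is false.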
