Developing Performance-Guaranteed Biomarker Combination Rules with Integrated External Information under Practical Constraint
Albert Osom, Camden Lopez, Ashley Alexander, Suresh Chari, Ziding Feng, Ying-Qi Zhao
TL;DR
The paper tackles learning biomarker-based clinical decision rules under a clinically meaningful PPV constraint, aiming to maximize true positives while controlling the screening burden. It introduces a PPV-constrained benefit function and develops two practical methods: DOOLR, which guarantees optimality within the linear rule class using a smooth surrogate, and IT-DOOLR, which can safely incorporate external risk information through a tunable penalty. Theoretical guarantees show asymptotic consistency and PPV control, and extensive simulations demonstrate favorable finite-sample performance, including robustness to external information mis-specification. The methods are applied to PDAC screening in new-onset diabetes (PANDORA data) with external ENDPAC risk information, yielding interpretable linear rules that achieve high sensitivity while maintaining a targeted PPV, illustrating tangible clinical utility for risk-guided screening programs.
Abstract
In clinical practice, there is significant interest in integrating novel biomarkers with existing clinical data to construct interpretable and robust decision rules. Motivated by the need to improve decision-making for early disease detection, we propose a framework for developing an optimal biomarker-based clinical decision rule that is both clinically meaningful and practically feasible. Specifically, our procedure constructs a linear decision rule designed to achieve optimal performance among class of linear rules by maximizing the true positive rate while adhering to a pre-specified positive predictive value constraint. Additionally, our method can adaptively incorporate individual risk information from external source to enhance performance when such information is beneficial. We establish the asymptotic properties of our proposed estimator and compare to the standard approach used in practice through extensive simulation studies. Results indicate that our approach offers strong finite-sample performance. We also apply the proposed methods to develop biomarker-based screening rules for pancreatic ductal adenocarcinoma (PDAC) among new-onset diabetes (NOD) patients.
