Benchmarking drug-drug interaction prediction methods: a perspective of distribution changes
Zhenqian Shen, Mingyang Zhou, Yongqi Zhang, Quanming Yao
TL;DR
Emerging drug–drug interaction prediction is hindered by distribution changes between known and new drugs, which are not captured by traditional i.i.d. evaluations. The paper introduces DDI-Ben, a benchmarking framework with a distribution-change surrogate and cluster-based drug splits that reflect real-world drug development, and benchmarks ten diverse methods on DrugBank and TWOSIDES. Results show substantial performance degradation under distribution changes for most methods, while LLM-based approaches (e.g., DDI-GPT, TextDDI) leveraging textual drug information exhibit greater robustness. The work provides benchmark datasets with simulated distribution changes and highlights directions for future research, including language-model–driven methods and domain adaptation to improve resilience in emerging DDI prediction.
Abstract
Motivation: Emerging drug-drug interaction (DDI) prediction is crucial for new drugs but is hindered by distribution changes between known and new drugs in real-world scenarios. Current evaluation often neglects these changes, relying on unrealistic i.i.d. split due to the absence of drug approval data. Results: We propose DDI-Ben, a benchmarking framework for emerging DDI prediction under distribution changes. DDI-Ben introduces a distribution change simulation framework that leverages distribution changes between drug sets as a surrogate for real-world distribution changes of DDIs, and is compatible with various drug split strategies. Through extensive benchmarking on ten representative methods, we show that most existing approaches suffer substantial performance degradation under distribution changes. Our analysis further indicates that large language model (LLM) based methods and the integration of drug-related textual information offer promising robustness against such degradation. To support future research, we release the benchmark datasets with simulated distribution changes. Overall, DDI-Ben highlights the importance of explicitly addressing distribution changes and provides a foundation for developing more resilient methods for emerging DDI prediction. Availability and implementation: Our code and data are available at https://github.com/LARS-research/DDI-Bench.
