Validating Interpretability in siRNA Efficacy Prediction: A Perturbation-Based, Dataset-Aware Protocol
Zahra Khodagholi, Niloofar Yousefi
TL;DR
This work introduces a pre-synthesis gate that validates saliency-based design guidance for siRNA efficacy predictors by measuring the perturbation sensitivity of top-attribution positions against composition-matched baselines, yielding a pass/fail criterion. It reveals two transfer failure modes—faithful-but-wrong and inverted saliency—demonstrating that explanations may not generalize across protocols, and shows that cross-dataset shifts can silently undermine deployment. To bolster interpretability, the authors propose BioPrior, a biology-informed regularizer that aligns model gradients with canonical siRNA design principles, improving saliency faithfulness with modest predictive gains. Across four benchmarks, 19/20 fold–dataset combinations pass the faithfulness test, and the approach yields actionable guidance for sequence edits when faithfulness holds. The work underscores the necessity of dataset-specific interpretability validation for explanation-guided therapeutic design and provides an open-source protocol to operationalize this practice.
Abstract
Saliency maps are increasingly used as \emph{design guidance} in siRNA efficacy prediction, yet attribution methods are rarely validated before motivating sequence edits. We introduce a \textbf{pre-synthesis gate}: a protocol for \emph{counterfactual sensitivity faithfulness} that tests whether mutating high-saliency positions changes model output more than composition-matched controls. Cross-dataset transfer reveals two failure modes that would otherwise go undetected: \emph{faithful-but-wrong} (saliency valid, predictions fail) and \emph{inverted saliency} (top-saliency edits less impactful than random). Strikingly, models trained on mRNA-level assays collapse on a luciferase reporter dataset, demonstrating that protocol shifts can silently invalidate deployment. Across four benchmarks, 19/20 fold instances pass; the single failure shows inverted saliency. A biology-informed regularizer (BioPrior) strengthens saliency faithfulness with modest, dataset-dependent predictive trade-offs. Our results establish saliency validation as essential pre-deployment practice for explanation-guided therapeutic design. Code is available at https://github.com/shadi97kh/BioPrior.
