CONFIDE: Hallucination Assessment for Reliable Biomolecular Structure Prediction and Design
Zijun Gao, Mutian He, Shijia Sun, Hanqun Cao, Jingjie Zhang, Zihao Luo, Xiaorui Wang, Xiaojun Yao, Chang-Yu Hsieh, Chunbin Gu, Pheng Ann Heng
TL;DR
This work tackles hallucinations in state-of-the-art diffusion-based structure predictors by introducing CODE, a self-evaluating metric that tracks topological frustration via diffusion-embedding trajectories, and CONFIDE, an integrated score combining topological and energetic perspectives. CODE correlates strongly with folding kinetics, while CONFIDE outperforms pLDDT across diverse benchmarks, including ternary complexes, flexible proteins, and PDB/CASP15 datasets, and shows practical gains in binder design, enzyme-site mapping, resistance prediction, and aptamer screening. By providing unsupervised, interpretable tools that capture both energy and topology constraints, CODE and CONFIDE offer a robust framework to improve the reliability and applicability of biomolecular structure predictions in drug discovery and structural biology. The methods demonstrate broad applicability across design, screening, and functional annotation tasks, suggesting a new paradigm for unsupervised evaluation of diffusion-based biomolecular models. These contributions enable more reliable structure predictions, enhanced design strategies, and accelerated discovery workflows in computational biology.
Abstract
Reliable evaluation of protein structure predictions remains challenging, as metrics like pLDDT capture energetic stability but often miss subtle errors such as atomic clashes or conformational traps reflecting topological frustration within the protein folding energy landscape. We present CODE (Chain of Diffusion Embeddings), a self evaluating metric empirically found to quantify topological frustration directly from the latent diffusion embeddings of the AlphaFold3 series of structure predictors in a fully unsupervised manner. Integrating this with pLDDT, we propose CONFIDE, a unified evaluation framework that combines energetic and topological perspectives to improve the reliability of AlphaFold3 and related models. CODE strongly correlates with protein folding rates driven by topological frustration, achieving a correlation of 0.82 compared to pLDDT's 0.33 (a relative improvement of 148\%). CONFIDE significantly enhances the reliability of quality evaluation in molecular glue structure prediction benchmarks, achieving a Spearman correlation of 0.73 with RMSD, compared to pLDDT's correlation of 0.42, a relative improvement of 73.8\%. Beyond quality assessment, our approach applies to diverse drug design tasks, including all-atom binder design, enzymatic active site mapping, mutation induced binding affinity prediction, nucleic acid aptamer screening, and flexible protein modeling. By combining data driven embeddings with theoretical insight, CODE and CONFIDE outperform existing metrics across a wide range of biomolecular systems, offering robust and versatile tools to refine structure predictions, advance structural biology, and accelerate drug discovery.
