Rethinking Performance Measures of RNA Secondary Structure Problems
Frederic Runge, Jörg K. H. Franke, Daniel Fertmann, Frank Hutter
TL;DR
The paper tackles the misalignment between conventional RNA secondary structure evaluation metrics and the structural biology they aim to capture. It proposes the Weisfeiler-Lehman graph kernel as a graph-based distance metric to assess RNA predictions, addressing gaps left by F1 and MCC. Through benchmark evaluation, structural shift and bulge migration analyses, and RNA design experiments, WL is shown to provide more informative, sequence-aware similarity assessments and to guide design improvements. While WL has limitations (e.g., not capturing base-stacking), extending it with edge weights and leveraging graph neural network surrogates could further enhance differentiable training and evaluation in RNA biology.
Abstract
Accurate RNA secondary structure prediction is vital for understanding cellular regulation and disease mechanisms. Deep learning (DL) methods have surpassed traditional algorithms by predicting complex features like pseudoknots and multi-interacting base pairs. However, traditional distance measures can hardly deal with such tertiary interactions and the currently used evaluation measures (F1 score, MCC) have limitations. We propose the Weisfeiler-Lehman graph kernel (WL) as an alternative metric. Embracing graph-based metrics like WL enables fair and accurate evaluation of RNA structure prediction algorithms. Further, WL provides informative guidance, as demonstrated in an RNA design experiment.
