Comprehensive benchmarking of large language models for RNA secondary structure prediction

L. I. Zablocki; L. A. Bugnon; M. Gerard; L. Di Persia; G. Stegmayer; D. H. Milone

Comprehensive benchmarking of large language models for RNA secondary structure prediction

L. I. Zablocki, L. A. Bugnon, M. Gerard, L. Di Persia, G. Stegmayer, D. H. Milone

TL;DR

A comprehensive experimental and comparative analysis of pretrained RNA-LLM that have been recently proposed are presented and it is shown that two LLMs clearly outperform the other models, and revealed significant challenges for generalization in low-homology scenarios.

Abstract

Inspired by the success of large language models (LLM) for DNA and proteins, several LLM for RNA have been developed recently. RNA-LLM uses large datasets of RNA sequences to learn, in a self-supervised way, how to represent each RNA base with a semantically rich numerical vector. This is done under the hypothesis that obtaining high-quality RNA representations can enhance data-costly downstream tasks. Among them, predicting the secondary structure is a fundamental task for uncovering RNA functional mechanisms. In this work we present a comprehensive experimental analysis of several pre-trained RNA-LLM, comparing them for the RNA secondary structure prediction task in an unified deep learning framework. The RNA-LLM were assessed with increasing generalization difficulty on benchmark datasets. Results showed that two LLM clearly outperform the other models, and revealed significant challenges for generalization in low-homology scenarios.

Comprehensive benchmarking of large language models for RNA secondary structure prediction

TL;DR

Abstract

Comprehensive benchmarking of large language models for RNA secondary structure prediction

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (3)