Exploring Adversarial Robustness in Classification tasks using DNA Language Models

Hyunwoo Yoo; Haebin Shin; Kaidi Xu; Gail Rosen

Exploring Adversarial Robustness in Classification tasks using DNA Language Models

Hyunwoo Yoo, Haebin Shin, Kaidi Xu, Gail Rosen

TL;DR

DNA language models for sequence classification face robustness challenges under real-world sequencing noise. The authors apply multi-granularity adversarial perturbations—nucleotide-level with magnitude $\epsilon$, codon-level, and backtranslation—to three models (DNABERT2, GROVER, Nucleotide Transformer) and evaluate on AMR drug classification and promoter detection, demonstrating substantial degradation especially from nucleotide-level attacks. They additionally assess adversarial training as a defense, finding it can improve robustness and, in some cases, overall accuracy, with effectiveness dependent on attack type and dataset. The work highlights the need to integrate robustness considerations into bioinformatics workflows to ensure reliable genomic analyses in practical settings.

Abstract

DNA Language Models, such as GROVER, DNABERT2 and the Nucleotide Transformer, operate on DNA sequences that inherently contain sequencing errors, mutations, and laboratory-induced noise, which may significantly impact model performance. Despite the importance of this issue, the robustness of DNA language models remains largely underexplored. In this paper, we comprehensivly investigate their robustness in DNA classification by applying various adversarial attack strategies: the character (nucleotide substitutions), word (codon modifications), and sentence levels (back-translation-based transformations) to systematically analyze model vulnerabilities. Our results demonstrate that DNA language models are highly susceptible to adversarial attacks, leading to significant performance degradation. Furthermore, we explore adversarial training method as a defense mechanism, which enhances both robustness and classification accuracy. This study highlights the limitations of DNA language models and underscores the necessity of robustness in bioinformatics.

Exploring Adversarial Robustness in Classification tasks using DNA Language Models

TL;DR

, codon-level, and backtranslation—to three models (DNABERT2, GROVER, Nucleotide Transformer) and evaluate on AMR drug classification and promoter detection, demonstrating substantial degradation especially from nucleotide-level attacks. They additionally assess adversarial training as a defense, finding it can improve robustness and, in some cases, overall accuracy, with effectiveness dependent on attack type and dataset. The work highlights the need to integrate robustness considerations into bioinformatics workflows to ensure reliable genomic analyses in practical settings.

Abstract

Paper Structure (25 sections, 1 equation, 9 figures, 6 tables)

This paper contains 25 sections, 1 equation, 9 figures, 6 tables.

Introduction
Related Work
Adversarial Attacks in Text Classification
Adversarial Attacks in DNA Sequence Classification
Adversarial Training
Adversarial Attack on DNA Language Models
Nucleotide-Level Attack
Codon-Level Attack
Backtranslation Attack
Results
Attack Effectiveness and Context Preservation
Model Robustness Comparison
Comparative Analysis of Classification Tasks
Performance Against Defense Method
Conclusion
...and 10 more sections

Figures (9)

Figure 1: Adversarial examples with nucleotides and codon change, which will be misclassified by a non-promoter to a promoter.
Figure 2: Anti-Microbial Resistance (AMR) gene Classification and Promoter Detection Results
Figure 3: Comparison of success rates and accuracies between adversarial training and standard training in AMR gene Classification across increasing iterations of backtranslation attacks
Figure 4: Overview of Adversarial Training(Fine-tuning) for DNA Language Models and Testing Adversarial Attacks
Figure 5: Comparison of Adversarial Training and Standard Training in Promoter Detection with Increasing Iterations
...and 4 more figures

Exploring Adversarial Robustness in Classification tasks using DNA Language Models

TL;DR

Abstract

Exploring Adversarial Robustness in Classification tasks using DNA Language Models

Authors

TL;DR

Abstract

Table of Contents

Figures (9)