A mechanistically interpretable neural network for regulatory genomics
Alex M. Tseng, Gokcen Eraslan, Tommaso Biancalani, Gabriele Scalia
TL;DR
This work introduces ARGMINN, a mechanistically interpretable neural network for regulatory genomics that encodes motifs and their grammar directly into its architecture. It comprises a motif-scanner module that learns de novo, non-redundant motifs and a syntax-builder module that uses memory-stream–based attention to model interactions between motif instances, yielding an interpretable readout of motif instances and rules in any sequence. Theoretical results prove ARGMINN’s expressivity for first-order logic–defined motif configurations, while extensive experiments show improved motif discovery and motif-instance calling, robust performance under sequence variation, and the novel ability to perform interpretable sequence design. Together, these contributions offer a scalable, readable alternative to post hoc interpretability methods with potential impact on disease genomics, genome design, and regulatory biology.
Abstract
Deep neural networks excel in mapping genomic DNA sequences to associated readouts (e.g., protein-DNA binding). Beyond prediction, the goal of these networks is to reveal to scientists the underlying motifs (and their syntax) which drive genome regulation. Traditional methods that extract motifs from convolutional filters suffer from the uninterpretable dispersion of information across filters and layers. Other methods which rely on importance scores can be unstable and unreliable. Instead, we designed a novel mechanistically interpretable architecture for regulatory genomics, where motifs and their syntax are directly encoded and readable from the learned weights and activations. We provide theoretical and empirical evidence of our architecture's full expressivity, while still being highly interpretable. Through several experiments, we show that our architecture excels in de novo motif discovery and motif instance calling, is robust to variable sequence contexts, and enables fully interpretable generation of novel functional sequences.
