Inferring genotype-phenotype maps using attention models
Krishna Rijal, Caroline M. Holmes, Samantha Petti, Gautam Reddy, Michael M. Desai, Pankaj Mehta
TL;DR
This work addresses genotype-phenotype mapping by moving beyond additive linear models to attention-based architectures that learn context-dependent epistasis and gene-environment interactions. The authors develop a single-environment model with genotype embeddings and three stacked attention layers, plus a multi-environment extension that uses environment tokens to capture cross-environment information and enable transfer learning. Through simulations and empirical budding yeast QTL data, the attention-based approach shows superior out-of-sample predictions in epistatic regimes and the ability to transfer predictions to new environments with limited data. The approach preserves linear information while uncovering subtle interactions, offering a scalable framework for modeling complex genotype-phenotype landscapes across environments.
Abstract
Predicting phenotype from genotype is a central challenge in genetics. Traditional approaches in quantitative genetics typically analyze this problem using methods based on linear regression. These methods generally assume that the genetic architecture of complex traits can be parameterized in terms of an additive model, where the effects of loci are independent, plus (in some cases) pairwise epistatic interactions between loci. However, these models struggle to analyze more complex patterns of epistasis or subtle gene-environment interactions. Recent advances in machine learning, particularly attention-based models, offer a promising alternative. Initially developed for natural language processing, attention-based models excel at capturing context-dependent interactions and have shown exceptional performance in predicting protein structure and function. Here, we apply attention-based models to quantitative genetics. We analyze the performance of this attention-based approach in predicting phenotype from genotype using simulated data across a range of models with increasing epistatic complexity, and using experimental data from a recent quantitative trait locus mapping study in budding yeast. We find that our model demonstrates superior out-of-sample predictions in epistatic regimes compared to standard methods. We also explore a more general multi-environment attention-based model to jointly analyze genotype-phenotype maps across multiple environments and show that such architectures can be used for "transfer learning" - predicting phenotypes in novel environments with limited training data.
