IgCraft: A versatile sequence generation framework for antibody discovery and engineering
Matthew Greenig, Haowen Zhao, Vladimir Radenkovic, Aubin Ramon, Pietro Sormanni
TL;DR
IgCraft introduces a unified Bayesian Flow Network framework for generating paired human antibody sequences, enabling unconditional sampling, sequence inpainting, inverse folding, and CDR grafting within a single model. It combines a two-track transformer with a structure encoder and uses staged training to leverage both sequence and structure information, achieving competitive performance across tasks and state-of-the-art results in CDR grafting under structural conditioning. The approach improves developability metrics such as humanness and solubility while preserving functional features, demonstrating practical utility for antibody discovery and engineering. Overall, IgCraft provides a scalable, versatile platform for sampling human antibody sequences across diverse design contexts with flexible conditioning.
Abstract
Designing antibody sequences to better resemble those observed in natural human repertoires is a key challenge in biologics development. We introduce IgCraft: a multi-purpose model for paired human antibody sequence generation, built on Bayesian Flow Networks. IgCraft presents one of the first unified generative modeling frameworks capable of addressing multiple antibody sequence design tasks with a single model, including unconditional sampling, sequence inpainting, inverse folding, and CDR motif scaffolding. Our approach achieves competitive results across the full spectrum of these tasks while constraining generation to the space of human antibody sequences, exhibiting particular strengths in CDR motif scaffolding (grafting) where we achieve state-of-the-art performance in terms of humanness and preservation of structural properties. By integrating previously separate tasks into a single scalable generative model, IgCraft provides a versatile platform for sampling human antibody sequences under a variety of contexts relevant to antibody discovery and engineering. Model code and weights are publicly available at https://github.com/mgreenig/IgCraft.
