Ctrl-DNA: Controllable Cell-Type-Specific Regulatory DNA Design via Constrained RL
Xingyu Chen, Shihao Ma, Runsheng Lin, Jiecong Lin, Bo Wang
TL;DR
This work addresses the problem of designing CREs with high target-cell expression and controlled off-target activity. It formulates CRE design as a constrained Markov decision process and solves it with constrained reinforcement learning using Lagrangian multipliers, maximizing $J_0(\theta)$ while enforcing $J_i(\theta)\le \delta_i$ for off-target cell types and computing policy gradients from batch-normalized rewards to avoid separate value models. A TFBS-based regularization term $R_{TFBS}(X)=\text{Corr}(q_{gen},q_{real})$ aligns generated motifs with biologically realistic distributions. Empirical results on human enhancer and promoter MPRA datasets across six cell types show Ctrl-DNA achieves higher target activity and better constraint satisfaction than baselines, while preserving motif plausibility and diversity. This constraint-aware RL approach provides a scalable pathway for cell-type-specific CRE design with potential impact in gene therapy and synthetic biology.
Abstract
Designing regulatory DNA sequences that achieve precise cell-type-specific gene expression is crucial for advancements in synthetic biology, gene therapy and precision medicine. Although transformer-based language models (LMs) can effectively capture patterns in regulatory DNA, their generative approaches often struggle to produce novel sequences with reliable cell-specific activity. Here, we introduce Ctrl-DNA, a novel constrained reinforcement learning (RL) framework tailored for designing regulatory DNA sequences with controllable cell-type specificity. By formulating regulatory sequence design as a biologically informed constrained optimization problem, we apply RL to autoregressive genomic LMs, enabling the models to iteratively refine sequences that maximize regulatory activity in targeted cell types while constraining off-target effects. Our evaluation on human promoters and enhancers demonstrates that Ctrl-DNA consistently outperforms existing generative and RL-based approaches, generating high-fitness regulatory sequences and achieving state-of-the-art cell-type specificity. Moreover, Ctrl-DNA-generated sequences capture key cell-type-specific transcription factor binding sites (TFBS), short DNA motifs recognized by regulatory proteins that control gene expression, demonstrating the biological plausibility of the generated sequences.
