ProtPainter: Draw or Drag Protein via Topology-guided Diffusion
Zhengxi Lu, Shizhuo Cheng, Yuru Jiang, Yan Zhang, Min Zhang
TL;DR
ProtPainter addresses the challenge of flexible, precise topology control in protein backbone generation by conditioning diffusion-based synthesis on 3D curves. It introduces a two-stage pipeline: CurveEncoder-based curve sketching that annotates curves with SSE labels, followed by sketch-guided backbone sampling using a DDPM with Helix-Gating to modulate fusion strength according to helix content, aided by RoseTTAFold guidance for translational consistency. The work contributes a CurveEncoder, a retraining-free guided sampling strategy, and a topology-focused benchmark including the Protein Restoration Task and the scTF metric, demonstrating superior topology fidelity and designability (scTM>0.5, scTF>0.8 in many cases) and enabling drawing/dragging operations for curve-driven design. Across experiments, ProtPainter outperforms unconditional and prior topology-conditioned baselines on topology fidelity and designability, while enabling curve-based editing, hinge designs, and motif scaffolding with practical downstream relevance. The approach promises more natural topology-space navigation for protein design, with implications for binder design and multi-state engineering, albeit with current inference-time limitations to be addressed in future work.$
Abstract
Recent advances in protein backbone generation have achieved promising results under structural, functional, or physical constraints. However, existing methods lack the flexibility for precise topology control, limiting navigation of the backbone space. We present ProtPainter, a diffusion-based approach for generating protein backbones conditioned on 3D curves. ProtPainter follows a two-stage process: curve-based sketching and sketch-guided backbone generation. For the first stage, we propose CurveEncoder, which predicts secondary structure annotations from a curve to parametrize sketch generation. For the second stage, the sketch guides the generative process in Denoising Diffusion Probabilistic Modeling (DDPM) to generate backbones. During this process, we further introduce a fusion scheduling scheme, Helix-Gating, to control the scaling factors. To evaluate, we propose the first benchmark for topology-conditioned protein generation, introducing Protein Restoration Task and a new metric, self-consistency Topology Fitness (scTF). Experiments demonstrate ProtPainter's ability to generate topology-fit (scTF > 0.8) and designable (scTM > 0.5) backbones, with drawing and dragging tasks showcasing its flexibility and versatility.
