An open-source diffusion model for generating and designing protein structures, including binders, symmetric oligomers, and motif-scaffolded proteins.
RFdiffusion is an open-source diffusion model for generating and designing protein structures. It solves the problem of creating novel proteins with desired functions, such as binding to specific targets or scaffolding functional motifs, by using a denoising process that starts from random noise and iteratively refines it into biologically plausible structures.
Computational biologists, protein engineers, and researchers in structural biology who need to design novel proteins for therapeutic, catalytic, or materials applications.
Developers choose RFdiffusion because it outperforms previous protein design methods in silico and experimentally, offers a wide range of design capabilities (binders, symmetric oligomers, motifs), and is open-source with extensive documentation and community support.
Code for running RFdiffusion
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.
Supports multiple protein design tasks like motif scaffolding, binder design, and symmetric oligomer generation, with detailed examples and protocols in the README.
Modular framework that integrates with tools like ProteinMPNN and AlphaFold2, enabling end-to-end experimental validation pipelines as described in the documentation.
Outperforms previous methods such as Constrained Hallucination and RFjoint Inpainting in tasks like motif scaffolding and binder design, based on the cited paper.
Provides extensive guides, Google Colab notebooks, Docker images, and community resources to facilitate setup and usage, as highlighted in the installation section.
Installation requires managing Conda environments, SE3-Transformer dependencies, and GPU driver compatibility, with notes on customizing for different CUDA versions, which can be error-prone.
Only produces protein backbones; sequence design necessitates external tools like ProteinMPNN, adding extra steps and complexity to the workflow, as admitted in the binder design section.
Runtime scales O(N^2) with residue count, making large targets slow and computationally expensive, and requires high-performance GPUs, as warned in the practical considerations.