Question 1

How to install EvoDiff on a Linux system?

Accepted Answer

Create a conda environment with Python 3.9, install PyTorch, then run 'pip install evodiff'. Follow the examples notebook for step-by-step guidance, and note that torch-scatter may take time to compile.

Question 2

EvoDiff vs RFDiffusion for protein design?

Accepted Answer

EvoDiff focuses on sequence-space generation and excels at designing disordered regions, while RFDiffusion is structure-based and better for precise 3D scaffolding. Choose EvoDiff for sequence-first flexibility, especially if IDRs are needed.

Question 3

How to generate a protein with a specific functional motif?

Accepted Answer

Use the conditional generation script with --cond-task scaffold, specifying PDB codes and motif indices. Provide start and end indexes for motifs, and the code will download PDB files, but care is needed for incomplete files.

Question 4

What datasets are required for EvoDiff conditional tasks?

Accepted Answer

For tasks like IDR generation or scaffolding, you need specific datasets: IDR data from Reverse Homology GitHub, and for MSA tasks, the OpenFold dataset. Unconditional generation doesn't require datasets, but they're needed for full functionality.

Question 5

Can EvoDiff run on CPUs or only GPUs?

Accepted Answer

The README recommends GPU usage for efficiency, as models like 640M parameter versions are resource-intensive. The Docker setup includes --gpus all flag, but CPU might be possible with performance trade-offs.

Question 6

How to evaluate the foldability of generated sequences?

Accepted Answer

Use the self-consistency analysis pipeline with tools like Omegafold and ESM-IF, as detailed in the analysis scripts. This requires installing these external tools separately, adding to setup time.

EvoDiff

What is EvoDiff?

Overview

Use Cases

Best For

Related Projects

Found a gem we're missing?

Not Ideal For

Pros & Cons

Pros

Cons

Frequently Asked Questions