A deep learning model for protein sequence design that generates amino acid sequences for given protein backbones.
ProteinMPNN is a deep learning model for protein sequence design that generates amino acid sequences compatible with given protein backbone structures. It solves the inverse protein folding problem, enabling researchers to design novel proteins with desired structural properties. The model is trained to predict sequences that fold into specified backbones, supporting both full-atom and CA-only representations.
Computational biologists, protein engineers, and researchers working on de novo protein design or protein optimization who need to generate sequences for specified backbone scaffolds.
ProteinMPNN offers a robust, fast, and user-friendly open-source alternative to proprietary protein design tools, with flexible controls for fixing residues, adding biases, and incorporating evolutionary information via PSSM profiles.
Code for the ProteinMPNN paper
Offers full-backbone and CA-only models, plus weights trained exclusively on soluble proteins, so inputs with different levels of structural detail and different design scenarios (e.g., avoiding membrane-protein-like sequences) are supported.
Supports fixing residues, tying positions for symmetry, adding amino acid biases, and incorporating PSSM profiles via JSONL files, allowing precise sequence customization as shown in the helper scripts.
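As a hedged illustration of the JSONL controls mentioned above, the sketch below writes a fixed-positions file in the general shape produced by the repository's helper scripts (e.g., make_fixed_positions_dict.py): a mapping from PDB name to per-chain residue lists. The exact schema and the PDB name used here are assumptions and should be checked against the repository's examples.

```python
import json

# Hypothetical fixed-positions control file for ProteinMPNN.
# Maps a PDB name (assumed here to be the filename stem) to, per chain,
# the residue positions that should NOT be redesigned.
fixed_positions = {
    "my_backbone": {      # placeholder PDB name
        "A": [1, 2, 10],  # keep these chain-A residues fixed
        "B": [],          # chain B is fully designable
    }
}

# Helper scripts emit one JSON object per line (JSONL).
with open("fixed_positions.jsonl", "w") as f:
    f.write(json.dumps(fixed_positions) + "\n")
```

The same one-object-per-line pattern applies to the other control files (tied positions, amino acid biases, PSSM profiles), each consumed via its own input flag.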
Provides scores, probabilities, and uncertainty metrics like global_score and seq_recovery in outputs, enabling researchers to assess design quality and reliability.
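ProteinMPNN reports these metrics as comma-separated key=value fields in the headers of its output FASTA records. A minimal, hedged parser for that header style is sketched below; the example header is illustrative, not real output.

```python
def parse_header(header: str) -> dict:
    """Extract key=value metric fields from a ProteinMPNN-style FASTA header."""
    fields = {}
    for part in header.lstrip(">").split(","):
        part = part.strip()
        if "=" not in part:
            continue
        key, value = part.split("=", 1)
        try:
            fields[key.strip()] = float(value)  # score, global_score, seq_recovery, ...
        except ValueError:
            fields[key.strip()] = value.strip()  # non-numeric fields pass through
    return fields

# Illustrative header, not actual tool output:
example = ">T=0.1, sample=1, score=0.9, global_score=1.1, seq_recovery=0.45"
metrics = parse_header(example)
```

Lower scores (negative log-probabilities) generally indicate sequences the model considers more compatible with the backbone, which makes these fields convenient for ranking designs.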
Can generate multiple sequences per target with configurable sampling temperatures, facilitating exploration of diverse sequence variants, as indicated by the --num_seq_per_target flag.
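A hedged sketch of such an invocation follows, assembling the command for the repository's main script. The flag names match the README (--num_seq_per_target, --sampling_temp); all paths are placeholders.

```python
# Hypothetical invocation of ProteinMPNN drawing several sequences per
# backbone. Paths are placeholders; flag names follow the repo README.
cmd = [
    "python", "protein_mpnn_run.py",
    "--pdb_path", "inputs/backbone.pdb",  # placeholder input structure
    "--out_folder", "outputs/",           # placeholder output folder
    "--num_seq_per_target", "8",          # draw 8 candidate sequences
    "--sampling_temp", "0.1",             # low temperature -> conservative designs
    "--seed", "37",
]
# import subprocess; subprocess.run(cmd, check=True)  # uncomment to run
```

Higher sampling temperatures yield more diverse (and typically lower-scoring) sequences, so sweeping the temperature is a common way to trade reliability for diversity.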
With over 30 input flags and JSONL control files for most customizations, setup and usage can be daunting for users new to the tool.
While it runs on CPU, inference with the deep learning models is computationally intensive and markedly slower without a GPU; the documented batch-size adjustments for fitting GPU memory reflect that the tool is designed with GPU use in mind.
Requires pre-defined backbone structures in PDB format, so it cannot perform de novo structure prediction or sequence-only design; its scope is limited to backbone-conditioned sequence design.