EvoPro

Genetic algorithm-based protein binder optimization using AlphaFold2 and ProteinMPNN

400
Configure input settings on the left, then click "Submit"

Related tools

BindCraft

BindCraft

Design de novo protein binders using AlphaFold2 backpropagation, ProteinMPNN sequence optimization, and PyRosetta relaxation. BindCraft generates novel protein sequences that bind to user-specified target surfaces.

PepMLM

PepMLM

Design linear peptide binders for target proteins using a target sequence-conditioned masked language model. PepMLM generates peptide sequences optimized to bind specific protein targets based on ESM-2 protein language modeling.

BoltzGen

BoltzGen

BoltzGen is a state-of-the-art AI model for designing protein and peptide binders against any biomolecular target. Using generative diffusion models, it creates novel binders (proteins, peptides, nanobodies) with nanomolar-level binding affinity.

mBER

mBER

Design VHH nanobody binders using AlphaFold-Multimer with structure templates and sequence conditioning. mBER (Manifold Binder Engineering and Refinement) generates novel VHH antibody sequences that bind to user-specified target proteins.

Humatch

Humatch

Humatch is an antibody humanization tool that transforms non-human antibody sequences into humanized variants. Uses three lightweight CNNs to identify optimal human V-genes and generate paired heavy and light chain sequences with minimal edits while maintaining functionality.

IgDesign

IgDesign

Design antibody CDR sequences via inverse folding. Generates complementarity-determining region (CDR) sequences for antibodies targeting therapeutic antigens using deep learning. Optimizes CDR loops (HCDR1, HCDR2, HCDR3) based on antibody-antigen complex structures.

IgGM

IgGM

IgGM is a generative foundation model for antibody and nanobody design against a target antigen. Supports CDR design, affinity maturation, inverse design, and framework design. Requires an antigen structure (PDB) and antibody sequences with "X" marking positions to design.

ODesign

ODesign

All-atom generative AI for designing protein binders. Specify target binding sites and generate diverse binding proteins with fine-grained control over interaction parameters.

PocketFlow

PocketFlow

PocketFlow is a structure-based molecular generative model that designs novel drug-like molecules within protein binding pockets. It uses autoregressive flow modeling with chemical knowledge to generate 100% chemically valid, highly drug-like compounds.

ProFam

ProFam

ProFam-1 is a protein family language model for family-conditioned sequence generation. Provide a protein family FASTA/MSA and generate new sequences with model likelihood scores for downstream ranking and screening.

What is EvoPro?

EvoPro is a genetic algorithm-based pipeline for designing protein binders through in silico evolution. Developed by the Kuhlman Lab at the University of North Carolina, it combines iterative structure prediction with AlphaFold2 and sequence design with ProteinMPNN to evolve protein sequences that bind tightly to a target protein.

The approach differs from traditional computational design methods by allowing backbone plasticity during optimization. As sequences evolve across generations, their predicted structures can undergo conformational changes favorable for binding—something difficult to encode in physics-based design methods like Rosetta.

In published work, EvoPro generated autoinhibitory domains for a PD-L1 antagonist, with four designs achieving sub-150 nM binding affinity and the best reaching 0.9 nM without any experimental optimization.

How does EvoPro work?

EvoPro runs a genetic algorithm that maintains a population of candidate binder sequences and evolves them through repeated cycles:

  1. Scoring: Each sequence is evaluated by predicting its structure bound to the target using AlphaFold-Multimer
  2. Selection: Sequences are ranked by fitness; the bottom half is discarded
  3. Diversification: New sequences fill the population through mutation, crossover, or ProteinMPNN redesign
  4. Iteration: The cycle repeats for the specified number of generations

Fitness function

The fitness score combines three components from AlphaFold2 predictions:

ComponentWhat it measures
Placement confidenceInterface quality based on sidechain contacts weighted by PAE (predicted aligned error)
Fold confidenceBinder stability from average pLDDT across the designed protein
Conformational stabilityRMSD between bound and unbound structures to minimize binding-induced changes

Lower scores indicate better designs. The conformational stability term encourages rigid binders with fast association kinetics.

Sequence diversification

New sequences are generated through two strategies:

  • Random mutagenesis + crossover: Introduces point mutations (~12.5% of residues) and recombines sequences from surviving parents
  • ProteinMPNN optimization: Applied periodically (every ~10 generations), redesigns sequences using AlphaFold-predicted backbones as templates

How to use EvoPro online

ProteinIQ provides GPU-accelerated EvoPro runs without local installation, making binder design accessible through a browser interface.

Inputs

InputDescription
Target ProteinPDB file or RCSB PDB ID for the protein to design binders against
Starting ScaffoldOptional starting structure or sequence. If omitted, EvoPro generates scaffolds automatically

Evolution parameters

SettingRangeDefaultDescription
Population size20–500100Candidates per generation. Larger populations explore more diversity but increase runtime
Number of generations5–20050Evolutionary cycles. More generations improve optimization at the cost of time
Mutation rate0.05–0.50.15Per-residue mutation probability. Higher rates increase diversity but may slow convergence

Design settings

SettingDescription
Mutable residuesRestrict which positions can mutate. Use A* for all residues in chain A, or specify positions like A10,A15,A20. Leave empty to allow all positions
Enable ProteinMPNNToggle ProteinMPNN sequence design during evolution (recommended)

Quality thresholds

SettingRangeDefaultDescription
pLDDT threshold50–9570Minimum structure confidence. Higher values filter to more confident predictions
PAE threshold5–3015Maximum interface error. Lower values require higher confidence in the binding interface

Results

EvoPro returns a ranked list of designed binders with:

ColumnDescription
RankPosition in the ranked output (1 = best)
Binding ScoreComposite fitness score (lower = better)
pLDDTAverage structure confidence (0–100)
SequenceDesigned amino acid sequence
DownloadPDB file of the predicted complex

The 3D viewer displays selected designs bound to the target protein.

Interpreting binding scores

Score rangeInterpretation
< -50Excellent candidate, high confidence interface
-50 to -30Good candidate, worth experimental validation
-30 to -10Moderate, may require optimization
> -10Weak prediction, likely needs redesign

Scores depend heavily on target protein and starting scaffolds. Compare designs relative to each other rather than using absolute thresholds.

Limitations

  • Runtime: Full evolutionary trajectories take 30–60 minutes depending on population size and generations
  • Target size: Very large target proteins increase AlphaFold prediction time per generation
  • No explicit binding energy: Fitness scores correlate with but don't directly predict experimental binding affinity
  • Single binding mode: The algorithm optimizes for one binding interface; alternative binding sites aren't explored