Input
Output
CDR optimization
Full antibody redesign

Design antibody CDR sequences via inverse folding. Generates complementarity-determining region (CDR) sequences for antibodies targeting therapeutic antigens using deep learning. Optimizes CDR loops (HCDR1, HCDR2, HCDR3) based on antibody-antigen complex structures.

Design thermostable protein sequences using ProteinMPNN trained on hyperthermophilic organism structures. Generates sequences optimized for improved thermal stability without requiring ligands or additional context.

Design protein sequences with atomic context from ligands, metals, and nucleotides. Achieves 63.3% sequence recovery at binding sites, significantly outperforming ProteinMPNN (50.5%).

Design protein sequences for given backbone structures using deep learning. Fast and accurate inverse folding with state-of-the-art sequence recovery (52.4%).

Specialized model for soluble protein sequence design. Trained exclusively on soluble proteins for optimized performance on cytoplasmic and extracellular proteins.

Humatch is an antibody humanization tool that transforms non-human antibody sequences into humanized variants. Uses three lightweight CNNs to identify optimal human V-genes and generate paired heavy and light chain sequences with minimal edits while maintaining functionality.

Reasoning-guided antibody CDR co-design for antibody-antigen complexes. Proteo-R1 identifies residue-level functional decisions and uses conditional diffusion to generate ranked designed structures with confidence metrics.

Design VHH nanobody binders using AlphaFold-Multimer with structure templates and sequence conditioning. mBER (Manifold Binder Engineering and Refinement) generates novel VHH antibody sequences that bind to user-specified target proteins.

Structure-based de novo antibody and nanobody design pipeline combining antibody-tuned RFdiffusion, ProteinMPNN sequence design, and antibody-tuned RoseTTAFold2 filtering.

IgGM is a generative foundation model for antibody and nanobody design against a target antigen. Supports CDR design, affinity maturation, inverse design, and framework design. Requires an antigen structure (PDB) and antibody sequences with "X" marking positions to design.
AntiFold is an inverse folding model specialized for antibody variable domains. Given an antibody structure, it predicts amino acid sequences that would fold into that structure—the reverse of traditional structure prediction.
Developed by the Oxford Protein Informatics Group, AntiFold builds on Meta's ESM-IF1 model but is fine-tuned specifically on antibody structures from SAbDab (Structural Antibody Database) and predicted structures from OAS (Observed Antibody Space). This antibody-specific training dramatically improves sequence recovery for antibody CDR loops compared to general-purpose inverse folding tools.
The model uses IMGT numbering, the international standard for immunoglobulin sequences, enabling precise targeting of specific antibody regions during design.
AntiFold takes a 3D antibody structure and predicts the probability distribution of amino acids at each position. The model learns structural constraints—which residues are compatible with a given backbone geometry—from millions of antibody structures.
During fine-tuning from ESM-IF1, several strategies improved CDR3 recovery: span masking to learn regional context, weighted masking that emphasizes CDR residues (3:1 ratio over frameworks), layer-wise learning rate decay, and augmentation with OAS predicted structures. These modifications increased CDRH3 amino acid recovery from 43% (ESM-IF1 baseline) to 60%.
For each position, the model outputs:
From these probabilities, AntiFold samples sequences using temperature-controlled multinomial sampling. Lower temperatures produce conservative sequences closer to the probability maximum; higher temperatures explore more diverse sequence space.
ProteinIQ provides GPU-accelerated AntiFold without installation. The tool keeps AntiFold's native chain-handling behavior: paired VH/VL structures can use any valid chain IDs, and nanobody/VHH inputs run in a dedicated nanobody mode.
| Input | Description |
|---|---|
Antibody Structure | PDB or mmCIF file containing the antibody variable domains. Structures can be uploaded directly or fetched by PDB ID. |
The structure should contain paired heavy and light chains (VH/VL) or a single-domain antibody (VHH/nanobody). Chain IDs do not need to be literal H and L; AntiFold accepts any valid structure chain labels when the correct chains are specified. An optional antigen chain may be included but should be kept small for optimal performance.
AntiFold is not a general-purpose inverse folding tool for arbitrary proteins. For non-antibody proteins, use ProteinMPNN or LigandMPNN instead.
| Setting | Description |
|---|---|
Input mode | Choose Paired antibody (VH/VL) for standard antibodies or Nanobody / VHH for single-domain antibodies. |
Heavy chain ID | Optional paired-antibody heavy chain. Can be any valid chain label such as A, B, H, or L. Leave blank to infer the first chain. |
Light chain ID | Optional paired-antibody light chain. Can be any valid chain label such as A, B, H, or L. Leave blank to infer the second chain. |
Nanobody chain ID | Chain ID for nanobody/VHH inputs. If the uploaded structure contains a single chain, AntiFold can infer it automatically. |
Number of sequences | Sequence variants to generate per structure (1–100, default: 10). |
Sampling temperature | Controls sequence diversity (0.0–1.5, default: 0.2). Values 0.1–0.3 produce conservative designs; 0.7–1.5 explores diverse sequence space. |
| Setting | Description |
|---|---|
IMGT regions to design | Target specific regions: All regions, CDRs only, individual CDRs (CDR1, CDR2, CDR3), or Frameworks only. |
Complementarity-determining regions (CDRs) form the antigen binding site. CDR1, CDR2, and CDR3 loops on both heavy and light chains determine specificity. Framework regions provide structural scaffolding. Nanobody mode applies the same region controls to the single VHH chain.
| Setting | Description |
|---|---|
Random seed | For reproducible results (default: 42). |
Include per-residue scores | Output log-likelihoods and perplexity at each position. |
Results include designed sequences in FASTA format with associated metrics:
| Column | Description |
|---|---|
score | Average log-likelihood over the designed region |
global_score | Average log-likelihood over all residues |
seq_recovery | Fraction of positions matching the original sequence |
When per-residue scoring is enabled, a CSV file provides position-by-position analysis including perplexity values and individual amino acid log-likelihoods.
Perplexity reflects how many amino acids are structurally compatible at a position:
| Perplexity | Interpretation |
|---|---|
| 1–3 | Highly constrained; few substitutions tolerated |
| 4–8 | Moderately flexible; several alternatives possible |
| 10–14 | Structurally tolerant; typical for exposed CDR positions |
| > 14 | Very permissive; likely surface-exposed or disordered |
Lower perplexity at a position suggests that the backbone geometry strongly constrains the amino acid identity—useful for identifying structurally critical residues.
ProteinMPNN is a general-purpose inverse folding model trained on diverse protein structures. While powerful, it occasionally produces artifacts problematic for antibodies: chain reordering, gaps in IMGT-numbered structures, and suboptimal CDR3 predictions.
AntiFold addresses these issues with antibody-specific training. On experimental structures, it achieves 60% CDRH3 amino acid recovery versus 43% for ESM-IF1 and 56% for the antibody-adapted AbMPNN. For the notoriously variable CDRH3 loop, AntiFold constrains perplexity to 4–8 amino acids (versus 6–10 for AbMPNN), producing more focused designs that better preserve backbone geometry.