IgGM

Design antibodies using a generative foundation model.

5
Configure input settings on the left, then click "Submit"

Related tools

Humatch

Humatch

Humatch is an antibody humanization tool that transforms non-human antibody sequences into humanized variants. Uses three lightweight CNNs to identify optimal human V-genes and generate paired heavy and light chain sequences with minimal edits while maintaining functionality.

IgDesign

IgDesign

Design antibody CDR sequences via inverse folding. Generates complementarity-determining region (CDR) sequences for antibodies targeting therapeutic antigens using deep learning. Optimizes CDR loops (HCDR1, HCDR2, HCDR3) based on antibody-antigen complex structures.

mBER

mBER

Design VHH nanobody binders using AlphaFold-Multimer with structure templates and sequence conditioning. mBER (Manifold Binder Engineering and Refinement) generates novel VHH antibody sequences that bind to user-specified target proteins.

RFantibody

RFantibody

Structure-based de novo antibody and nanobody design pipeline combining antibody-tuned RFdiffusion, ProteinMPNN sequence design, and antibody-tuned RoseTTAFold2 filtering.

AntiFold

AntiFold

Inverse folding for antibody variable domains and nanobodies. Predicts amino acid sequences compatible with antibody structures using IMGT numbering while preserving upstream AntiFold chain handling and structural constraints.

BioPhi

BioPhi

Antibody humanization and humanness evaluation platform from Merck. Sapiens mode uses deep learning trained on the Observed Antibody Space (OAS) to humanize antibody sequences, while OASis mode evaluates humanness using 9-mer peptide search against human antibody databases.

BoltzGen

BoltzGen

BoltzGen is a state-of-the-art AI model for designing protein and peptide binders against any biomolecular target. Using generative diffusion models, it creates novel binders (proteins, peptides, nanobodies) with nanomolar-level binding affinity.

DiffAb

DiffAb

AI-powered antibody CDR design using equivariant diffusion models. Generates optimized complementarity-determining region (CDR) sequences and structures for antibodies targeting specific antigens. Supports single CDR, multi-CDR co-design, and fixed-backbone sequence design modes.

PocketFlow

PocketFlow

PocketFlow is a structure-based molecular generative model that designs novel drug-like molecules within protein binding pockets. It uses autoregressive flow modeling with chemical knowledge to generate 100% chemically valid, highly drug-like compounds.

PepMLM

PepMLM

Design linear peptide binders for target proteins using a target sequence-conditioned masked language model. PepMLM generates peptide sequences optimized to bind specific protein targets based on ESM-2 protein language modeling.

What is IgGM?

IgGM (Immunoglobulin Generative Model) is a generative foundation model for designing antibodies and nanobodies against target antigens. Developed by Tencent AI4S and published at ICLR 2025, it can generate sequence proposals and, for some tasks, structural outputs as well.

The model excels at designing complementarity-determining regions (CDRs), the hypervariable loops responsible for antigen recognition. On benchmark tests, IgGM achieved a 36% sequence recovery rate for CDR-H3 (the most flexible and challenging region), representing a 22% improvement over prior state-of-the-art methods.

IgGM supports both conventional antibodies (heavy + light chain) and single-domain nanobodies. It won a top-three prize in the AIntibody competition, an experimentally validated antibody design challenge.

How does IgGM work?

IgGM combines three components:

  1. Pre-trained language model (ESM-PPI): Extracts evolutionary and structural features from protein sequences. ESM-PPI extends ESM2 with improved handling of inter-chain relationships in multi-chain complexes.
  2. Feature learning module: Processes 3D coordinate information through structure encoding and inter-chain interaction modules, integrating spatial relationships between antibody and antigen.
  3. Diffusion-based prediction module: Uses SO(3) diffusion to model rotational degrees of freedom in protein structure. The system samples initial translations from a Gaussian distribution and rotations from the standard SO(3) group, then iteratively refines both sequence and structure.

Training proceeds in two phases. First, a diffusion model learns structure prediction while preserving original sequence information. Then a consistency model is distilled from the diffusion model, enabling fast generation from arbitrary noise levels. This two-phase approach proved essential for model performance.

How to use IgGM online

ProteinIQ provides GPU-accelerated IgGM without local installation or the complexity of managing PyTorch Geometric dependencies.

Inputs

InputDescription
Heavy Chain (VH)Antibody heavy chain variable region sequence. Mark positions to design with X characters.
Light Chain (VL)Light chain variable region (optional for nanobodies).
Antigen StructureRequired. PDB file of the target antigen. Upload directly or fetch by PDB ID.
Antigen SequenceOptional sequence for epitope-guided design. If omitted, extracted from PDB.
Original SequenceReference sequence for affinity maturation comparisons.

The X character indicates positions for the model to redesign. For CDR design, mask the CDR residues while providing the framework sequence.

Design tasks

TaskDescription
CDR DesignDesign CDR loops to bind the target epitope. Provide framework sequence with X at CDR positions.
Inverse DesignGenerate a sequence compatible with an existing antibody structure.
Framework DesignRedesign framework regions while preserving CDRs. Useful for humanization.
Affinity MaturationOptimize an existing antibody for improved binding. Provide original sequence for comparison.

Settings

SettingDescription
Number of designsSamples to generate (1–100, default 10). More samples provide diversity but increase runtime.
Epitope residuesTarget binding site positions on the antigen (e.g., 45,46,47,50-55). If omitted, calculated automatically from the antigen structure.
Apply PyRosetta relaxationEnergy minimization to improve structural quality. Increases runtime.
Random seedFor reproducible results. Leave empty for random seed.

Outputs

IgGM can produce:

  • Designed sequences in FASTA format for each sample
  • Predicted structures as PDB files when the selected task/run emits structure outputs
  • Statistics (CSV) and visualizations (PNG) for tasks that emit analysis artifacts

Sequence-only and epitope-only runs are shown in task-appropriate result tabs instead of the 3D viewer. All emitted files remain downloadable as a ZIP archive.

Epitope specification

The epitope—the antigen region the antibody should bind—guides design. IgGM requires epitope information and handles it three ways:

  1. Provide comma-separated residue numbers (e.g., 417,449,453,455-456). Numbers correspond to sequence positions, not PDB residue IDs.
  2. For multi-chain PDB files (e.g., an existing antibody-antigen complex), IgGM identifies interface residues using a 5Å distance cutoff.
  3. For single-chain antigen structures, surface-exposed residues are identified by solvent-accessible surface area (SASA > 25 Ų).

Epitope size is limited to 50 residues to manage computational cost.

Limitations

  • Antigen structure required: Unlike some tools that accept sequence-only input, IgGM needs a 3D antigen structure for epitope-guided design.
  • Memory constraints: Large antigens may need truncation (the max_antigen_size parameter, 2000 residues by default in ProteinIQ unless overridden).
  • Binding affinity not predicted: IgGM generates plausible sequences but does not estimate binding strength. Experimental validation remains necessary.
  • Rigid docking: The antigen structure is treated as fixed during design.