Boltz-2

AI-powered structure and binding affinity prediction with near-FEP accuracy

Input

Job name

Add your input molecules to get started

50 credits

Output

Configure input settings on the left, then click "Submit"orTry an example (it's free)

Human KRAS G12C protein with covalent ligand

A transcription factor and DNA complex

What is Boltz-2?

Boltz-2 is a biomolecular foundation model that jointly predicts molecular complex structures and protein-ligand binding affinities. Developed by MIT Jameel Clinic and Recursion and released in June 2025, it extends Boltz-1—the most widely used open-source alternative to AlphaFold3 across academia and industry.

The model achieves binding affinity predictions approaching the accuracy of physics-based free-energy perturbation (FEP) calculations while running over 1000x faster. Traditional FEP methods cost hundreds of dollars per prediction and require 6–12 hours to complete; Boltz-2 achieves comparable accuracy in approximately 20 seconds on a single GPU.

Boltz-2 supports multi-component biomolecular complexes including proteins, small molecule ligands, DNA, and RNA. This broad coverage makes it particularly valuable for drug discovery workflows requiring rapid screening of compound libraries before experimental synthesis.

How to use Boltz-2 online

ProteinIQ hosts Boltz-2 on GPU infrastructure, delivering structure and affinity predictions through a browser interface without local installation or command-line configuration.

Inputs

Boltz-2 accepts multiple molecule types that combine into a single prediction job. Chain IDs (A, B, C...) are assigned automatically and displayed in the interface—these identifiers are needed when defining constraints.

Input	Description
`Protein`	PDB/CIF structure files, FASTA sequences, or RCSB PDB IDs (e.g., `1ABC`). Up to 10 chains.
`Ligand (SMILES)`	SMILES strings, SDF/MOL/MOL2 files, or PubChem compound IDs. Up to 10 ligands.
`Ligand (CCD)`	Standard codes from the PDB Chemical Component Dictionary (e.g., `ATP`, `NAD`, `HEM`).
`DNA`	FASTA sequences supplied as text or file input. Up to 10 chains.
`RNA`	FASTA sequences supplied as text or file input. Up to 10 chains.
`Template`	Known structures used to bias prediction toward a homologous fold. PDB/CIF files or RCSB structure IDs. Up to 5 templates.
`Pre-computed MSA`	A3M alignment files. Optional—Boltz-2 generates MSAs automatically when enabled.

Settings

Prediction parameters

Setting	Description
`Number of samples`	Structure predictions to generate (1–20, default 5). More samples capture conformational diversity but increase runtime.
`Confidence threshold`	Minimum confidence score for predictions (0.0–1.0, default 0.5). Higher values filter low-confidence results.

MSA generation

Multiple sequence alignment (MSA) aligns a target protein sequence against thousands of evolutionarily related sequences. By analyzing coevolution patterns—which amino acid positions change together across evolution—the model infers spatial relationships in the 3D structure.

Setting	Description
`Generate MSA`	Enables automatic MSA generation via ColabFold. Disable only for quick screening or synthetic proteins lacking natural homologs.
`MSA depth`	Search thoroughness: `Shallow` (2048 seqs, fast), `Normal` (8192 seqs, default), or `Deep` (16384 seqs, slow but thorough).
`MSA pairing strategy`	How MSAs from different chains combine: `Greedy` (default) or `Complete` (slower, better for obligate heterodimers).
`Max MSA sequences`	Maximum aligned sequences (512–16384, default 8192). Lower values are faster; higher values may improve accuracy.
`Subsample MSA`	Reduces a large alignment to a smaller working set for faster inference on long proteins.

Advanced options

Boltz-2 uses diffusion-based structure generation with iterative refinement ("recycling"). These parameters control that process.

Setting	Description
`Recycling steps`	Refinement iterations (1–10, default 3). More steps improve accuracy at increased runtime.
`Sampling steps`	Diffusion denoising steps (50–500, default 200). Higher values produce more refined structures.
`Step scale`	Temperature controlling sampling diversity (1.0–2.0, default 1.5). Higher values explore more conformational space.
`Output format`	`CIF` (recommended) or `PDB` (legacy compatibility).
`MW-corrected affinity`	Applies molecular weight correction when comparing ligands of different sizes.
`Affinity sampling steps`	Diffusion steps for affinity prediction (50–500, default 200).

Constraints

Constraints guide predictions toward specific structural features when prior knowledge exists about the system—from crystallography, mutagenesis studies, or crosslinking mass spectrometry data.

Setting	Description
`Enforce constraints`	Must be enabled before defining constraints below. Activates inference-time potentials.
`Pocket constraints`	Forces ligand binding near specified residues. Format: `binder
`Covalent bonds`	Defines covalent bonds between atoms. Format: `chain:residue:atom,chain:residue:atom` (e.g., `A:12:SG,C:1:C1`).
`Contact constraints`	Forces residues within a specified distance. Format: `chain:residue,chain:residue

Template options

Templates can be used either as soft structural priors or as stronger geometric constraints for cases where a homologous backbone is already known.

Setting	Description
`Enforce template backbone`	Keeps the predicted backbone close to the supplied template instead of treating the template as loose guidance.
`Template deviation threshold (Å)`	Maximum allowed backbone deviation from the template when enforcement is enabled.
`Template chain mapping`	Assigns template indices to specific query chains. Format: `template_index:chain_ids`.

Modifications

Setting	Description
`Residue modifications`	Post-translational modifications using CCD codes. Format: `chain:position:CCD_code`. Common codes: `SEP` (phosphoserine), `TPO` (phosphothreonine), `MLY` (methylated lysine).
`Cyclic chains`	Chains with head-to-tail cyclization. Enter one chain ID per line.

Output

Boltz-2 returns ranked predictions, each with downloadable structure files and a data table summarizing structure and affinity outputs. Optional diagnostic matrices can also be saved from the advanced settings.

Column	Description
`Rank`	Prediction ranking by confidence.
`Affinity (log10 IC50)`	Quantitative binding strength in log10(IC50) μM. More negative = stronger binding.
`Confidence`	Structural prediction confidence (0–1).
`Structure file`	Predicted complex saved as `CIF` or `PDB`, depending on the selected output format.
`PAE/PDE files`	Optional error matrices written when `Save PAE matrix` or `Save PDE matrix` is enabled.

Interpreting affinity predictions

Boltz-2 outputs two affinity metrics serving different purposes:

Affinity probability (0–1): Binary classifier confidence that the molecule binds. Values above 0.7 indicate strong predicted binders; below 0.5 suggests unlikely binding. Best used for initial screening to separate hits from non-binders.

Affinity value (log10 IC50 in μM): Quantitative binding strength. More negative values indicate stronger binding.

Value	Interpretation	Typical use case
< −8	Very strong (nM)	Clinical candidates
−8 to −6	Strong (low μM)	Lead compounds
−6 to −4	Moderate	Hit compounds
> −4	Weak	Likely inactive

Interpreting confidence scores

Confidence reflects certainty about the predicted structure, not the affinity estimate. Low confidence does not mean the prediction is wrong—it indicates the result warrants closer examination.

Score	Interpretation
> 0.7	High confidence—prediction likely reliable
0.5–0.7	Moderate—visual inspection recommended
< 0.5	Low—may indicate a difficult target, missing MSA data, or unusual binding mode

How does Boltz-2 work?

Boltz-2 extends Boltz-1 with an affinity prediction module trained on millions of binding measurements. The result is a single model predicting both 3D structures and binding strength.

Architecture

The model processes input in two stages. First, the structure module generates 3D coordinates using diffusion—starting from noise and iteratively refining toward a physically plausible structure. This approach parallels image generation models but operates on molecular geometry.

Second, the affinity module takes the predicted structure and outputs binding predictions: both a binary "does it bind?" probability and a quantitative IC50 estimate.

Training data

Boltz-2 was trained on approximately 5 million binding affinity measurements (Kd and IC50 values) from public assays, plus molecular dynamics simulations capturing protein flexibility.

The training data spans diverse complex types: protein-ligand, protein-DNA, protein-RNA, and protein-protein interactions. This breadth accounts for Boltz-2's effectiveness on multi-molecule complexes.

Benchmark performance

For structure prediction, Boltz-2 matches or slightly outperforms AlphaFold3, Chai-1, and Boltz-1 on recent PDB deposits (2024–2025), with particular strength on RNA, DNA-protein complexes, and antibody-antigen interactions.

For affinity prediction, Boltz-2 achieves Pearson correlation r ≈ 0.62 on the FEP+ benchmark—matching physics-based free energy perturbation methods that require hours instead of seconds. In the CASP16 affinity challenge, it outperformed all submitted methods across 140 complexes.

When to use Boltz-2

Boltz-2 excels in scenarios requiring both structure and affinity prediction in a single workflow.

Scenario	Recommendation
Need structure + quantitative affinity	Boltz-2
Multi-component complexes (protein + DNA + ligand)	Boltz-2
Already have protein structure, need ligand poses only	DiffDock (faster, no affinity)
High-throughput screening with known binding site	AutoDock Vina or AutoDock-GPU
CNN-enhanced scoring on traditional docking	GNINA

Comparison with other methods

Feature	Boltz-2	DiffDock	AutoDock Vina	FEP+
Predicts structure	Yes	No (pose only)	No (pose only)	No
Predicts affinity	Yes (quantitative)	No	Yes (scoring)	Yes (quantitative)
Affinity accuracy	`r ≈ 0.62`	N/A	`r ≈ 0.3–0.4`	`r ≈ 0.6–0.7`
Speed	~20 sec	~30 sec	~1 min	6–12 hours
Molecule types	Protein, DNA, RNA, ligand

Limitations

Maximum system size: The ProteinIQ deployment accepts up to 5,000 total residues across all sequence-based chains, but very large assemblies remain substantially slower and more memory-intensive than smaller jobs.
Memory scaling: Memory requirements increase quadratically with sequence length.
MSA dependency: Prediction accuracy correlates with MSA depth (Spearman r ≈ 0.35–0.37). Synthetic or designed proteins lacking natural homologs may yield lower-confidence results.
Stereochemistry sensitivity: Ligand binding is highly stereospecific. SMILES strings must correctly represent the intended enantiomer.
At least one foldable biopolymer is required: ProteinIQ requires at least one protein, DNA, or RNA chain in the job; ligands alone are not a valid Boltz-2 input.

Structure prediction

AlphaFold 2: Higher residue limits, no affinity prediction
Chai-1: Alternative multi-molecule structure prediction
ESMFold: Fast single-chain prediction without MSA
OpenFold 3: Open-source AlphaFold3 implementation
Protenix: ByteDance's AlphaFold3-style model
RoseTTAFold 3: Baker lab structure prediction

Molecular docking

DiffDock: Diffusion-based blind docking
AutoDock Vina: Classical scoring function
AutoDock-GPU: GPU-accelerated AutoDock
GNINA: CNN-enhanced docking and scoring
DynamicBind: Flexible receptor docking

Binding site analysis

Fpocket: Geometry-based pocket detection
PocketFlow: Deep learning pocket prediction
ScanNet: Binding site prediction from sequence

Boltz-2

Input

Prediction parameters

MSA generation

Advanced options

Constraints

Template options

Modifications

Output

What is Boltz-2?

How to use Boltz-2 online

Inputs

Settings

Prediction parameters

MSA generation

Advanced options

Constraints

Template options

Modifications

Output

Interpreting affinity predictions

Interpreting confidence scores

How does Boltz-2 work?

Architecture

Training data

Benchmark performance

When to use Boltz-2

Comparison with other methods

Limitations

Related tools

Structure prediction

Molecular docking

Binding site analysis

Input

Prediction parameters

MSA generation

Advanced options

Constraints

Template options

Modifications

Output

What is Boltz-2?

How to use Boltz-2 online

Inputs

Settings

Prediction parameters

MSA generation

Advanced options

Constraints

Template options

Modifications

Output

Interpreting affinity predictions

Interpreting confidence scores

How does Boltz-2 work?

Architecture

Training data

Benchmark performance

When to use Boltz-2

Comparison with other methods

Limitations

Related tools

Structure prediction

Molecular docking

Binding site analysis