Add your input molecules to get started
Human KRAS G12C protein with covalent ligand
A transcription factor and DNA complex

Chai-1 is a multi-modal foundation model for molecular structure prediction. Predicts 3D structures for proteins, ligands, DNA, RNA, and multi-component complexes with high accuracy.

OpenFold-3 is an open-source AI model for biomolecular structure prediction, aiming to reproduce AlphaFold3. Predicts 3D structures for proteins, RNA, DNA, and small molecule ligands with high accuracy.

Controllable biomolecular structure prediction model for proteins, ligands, DNA, RNA, and multi-component complexes. IntelliFold 2 supports fast v2-Flash inference, optional MSA generation, and ranked confidence outputs.

Open-source AlphaFold 3 implementation by ByteDance for biomolecular structure prediction. Predicts 3D structures for proteins, RNA, DNA, and small molecule ligands with high accuracy.

Open-source structure prediction neural network for proteins, nucleic acids, and small molecules. State-of-the-art accuracy with multi-chain support.

Generate protein conformational ensembles using flow matching. Produces multiple diverse structures showing protein flexibility and dynamics.

AlphaFold2 via ColabFold for high-accuracy protein structure prediction. Uses MMSeqs2 API for MSA generation with no local databases required. Supports monomer and multimer prediction.

ESMfold is a fast, single-sequence protein structure predictor from Meta AI. Predicts 3D protein structures directly from amino acid sequences without requiring multiple sequence alignments (MSA), making it significantly faster than AlphaFold while automatically scaling GPU resources for larger proteins.

ImmuneBuilder predicts 3D structures of immune receptor proteins including antibodies, nanobodies, and T-cell receptors. It uses ABodyBuilder2, NanoBodyBuilder2, and TCRBuilder2/TCRBuilder2+ to generate structures with per-residue error estimates and optional ensemble artifacts.

MiniFold is a fast single-sequence protein structure predictor that is 10-20x faster than ESMFold. It predicts 3D protein structures directly from amino acid sequences without requiring multiple sequence alignments (MSA), making it ideal for rapid structure prediction.
Boltz-2 is a biomolecular foundation model that jointly predicts molecular complex structures and protein-ligand binding affinities. Developed by MIT Jameel Clinic and Recursion and released in June 2025, it extends Boltz-1—the most widely used open-source alternative to AlphaFold3 across academia and industry.
The model achieves binding affinity predictions approaching the accuracy of physics-based free-energy perturbation (FEP) calculations while running over 1000x faster. Traditional FEP methods cost hundreds of dollars per prediction and require 6–12 hours to complete; Boltz-2 achieves comparable accuracy in approximately 20 seconds on a single GPU.
Boltz-2 supports multi-component biomolecular complexes including proteins, small molecule ligands, DNA, and RNA. This broad coverage makes it particularly valuable for drug discovery workflows requiring rapid screening of compound libraries before experimental synthesis.
ProteinIQ hosts Boltz-2 on GPU infrastructure, delivering structure and affinity predictions through a browser interface without local installation or command-line configuration.
Boltz-2 accepts multiple molecule types that combine into a single prediction job. Chain IDs (A, B, C...) are assigned automatically and displayed in the interface—these identifiers are needed when defining constraints.
| Input | Description |
|---|---|
Protein | PDB/CIF structure files, FASTA sequences, or RCSB PDB IDs (e.g., 1ABC). Up to 10 chains. |
Ligand (SMILES) | SMILES strings, SDF/MOL/MOL2 files, or PubChem compound IDs. Up to 10 ligands. |
Ligand (CCD) | Standard codes from the PDB Chemical Component Dictionary (e.g., ATP, NAD, HEM). |
DNA | FASTA sequences supplied as text or file input. Up to 10 chains. |
RNA | FASTA sequences supplied as text or file input. Up to 10 chains. |
Template | Known structures used to bias prediction toward a homologous fold. PDB/CIF files or RCSB structure IDs. Up to 5 templates. |
Pre-computed MSA | A3M alignment files. Optional—Boltz-2 generates MSAs automatically when enabled. |
| Setting | Description |
|---|---|
Number of samples | Structure predictions to generate (1–20, default 5). More samples capture conformational diversity but increase runtime. |
Confidence threshold | Minimum confidence score for predictions (0.0–1.0, default 0.5). Higher values filter low-confidence results. |
Multiple sequence alignment (MSA) aligns a target protein sequence against thousands of evolutionarily related sequences. By analyzing coevolution patterns—which amino acid positions change together across evolution—the model infers spatial relationships in the 3D structure.
| Setting | Description |
|---|---|
Generate MSA | Enables automatic MSA generation via ColabFold. Disable only for quick screening or synthetic proteins lacking natural homologs. |
MSA depth | Search thoroughness: Shallow (2048 seqs, fast), Normal (8192 seqs, default), or Deep (16384 seqs, slow but thorough). |
MSA pairing strategy | How MSAs from different chains combine: Greedy (default) or Complete (slower, better for obligate heterodimers). |
Max MSA sequences | Maximum aligned sequences (512–16384, default 8192). Lower values are faster; higher values may improve accuracy. |
Subsample MSA | Reduces a large alignment to a smaller working set for faster inference on long proteins. |
Boltz-2 uses diffusion-based structure generation with iterative refinement ("recycling"). These parameters control that process.
| Setting | Description |
|---|---|
Recycling steps | Refinement iterations (1–10, default 3). More steps improve accuracy at increased runtime. |
Sampling steps | Diffusion denoising steps (50–500, default 200). Higher values produce more refined structures. |
Step scale | Temperature controlling sampling diversity (1.0–2.0, default 1.5). Higher values explore more conformational space. |
Output format | CIF (recommended) or PDB (legacy compatibility). |
MW-corrected affinity | Applies molecular weight correction when comparing ligands of different sizes. |
Affinity sampling steps | Diffusion steps for affinity prediction (50–500, default 200). |
Affinity samples |
Constraints guide predictions toward specific structural features when prior knowledge exists about the system—from crystallography, mutagenesis studies, or crosslinking mass spectrometry data.
| Setting | Description |
|---|---|
Enforce constraints | Must be enabled before defining constraints below. Activates inference-time potentials. |
Pocket constraints | Forces ligand binding near specified residues. Format: `binder |
Covalent bonds | Defines covalent bonds between atoms. Format: chain:residue:atom,chain:residue:atom (e.g., A:12:SG,C:1:C1). |
Contact constraints | Forces residues within a specified distance. Format: `chain:residue,chain:residue |
Templates can be used either as soft structural priors or as stronger geometric constraints for cases where a homologous backbone is already known.
| Setting | Description |
|---|---|
Enforce template backbone | Keeps the predicted backbone close to the supplied template instead of treating the template as loose guidance. |
Template deviation threshold (Å) | Maximum allowed backbone deviation from the template when enforcement is enabled. |
Template chain mapping | Assigns template indices to specific query chains. Format: template_index:chain_ids. |
| Setting | Description |
|---|---|
Residue modifications | Post-translational modifications using CCD codes. Format: chain:position:CCD_code. Common codes: SEP (phosphoserine), TPO (phosphothreonine), MLY (methylated lysine). |
Cyclic chains | Chains with head-to-tail cyclization. Enter one chain ID per line. |
Boltz-2 returns ranked predictions, each with downloadable structure files and a data table summarizing structure and affinity outputs. Optional diagnostic matrices can also be saved from the advanced settings.
| Column | Description |
|---|---|
Rank | Prediction ranking by confidence. |
Affinity (log10 IC50) | Quantitative binding strength in log10(IC50) μM. More negative = stronger binding. |
Confidence | Structural prediction confidence (0–1). |
Structure file | Predicted complex saved as CIF or PDB, depending on the selected output format. |
PAE/PDE files | Optional error matrices written when Save PAE matrix or Save PDE matrix is enabled. |
Boltz-2 outputs two affinity metrics serving different purposes:
Affinity probability (0–1): Binary classifier confidence that the molecule binds. Values above 0.7 indicate strong predicted binders; below 0.5 suggests unlikely binding. Best used for initial screening to separate hits from non-binders.
Affinity value (log10 IC50 in μM): Quantitative binding strength. More negative values indicate stronger binding.
| Value | Interpretation | Typical use case |
|---|---|---|
| < −8 | Very strong (nM) | Clinical candidates |
| −8 to −6 | Strong (low μM) | Lead compounds |
| −6 to −4 | Moderate | Hit compounds |
| > −4 | Weak | Likely inactive |
Confidence reflects certainty about the predicted structure, not the affinity estimate. Low confidence does not mean the prediction is wrong—it indicates the result warrants closer examination.
| Score | Interpretation |
|---|---|
| > 0.7 | High confidence—prediction likely reliable |
| 0.5–0.7 | Moderate—visual inspection recommended |
| < 0.5 | Low—may indicate a difficult target, missing MSA data, or unusual binding mode |
Boltz-2 extends Boltz-1 with an affinity prediction module trained on millions of binding measurements. The result is a single model predicting both 3D structures and binding strength.
The model processes input in two stages. First, the structure module generates 3D coordinates using diffusion—starting from noise and iteratively refining toward a physically plausible structure. This approach parallels image generation models but operates on molecular geometry.
Second, the affinity module takes the predicted structure and outputs binding predictions: both a binary "does it bind?" probability and a quantitative IC50 estimate.
Boltz-2 was trained on approximately 5 million binding affinity measurements (Kd and IC50 values) from public assays, plus molecular dynamics simulations capturing protein flexibility.
The training data spans diverse complex types: protein-ligand, protein-DNA, protein-RNA, and protein-protein interactions. This breadth accounts for Boltz-2's effectiveness on multi-molecule complexes.
For structure prediction, Boltz-2 matches or slightly outperforms AlphaFold3, Chai-1, and Boltz-1 on recent PDB deposits (2024–2025), with particular strength on RNA, DNA-protein complexes, and antibody-antigen interactions.
For affinity prediction, Boltz-2 achieves Pearson correlation r ≈ 0.62 on the FEP+ benchmark—matching physics-based free energy perturbation methods that require hours instead of seconds. In the CASP16 affinity challenge, it outperformed all submitted methods across 140 complexes.
Boltz-2 excels in scenarios requiring both structure and affinity prediction in a single workflow.
| Scenario | Recommendation |
|---|---|
| Need structure + quantitative affinity | Boltz-2 |
| Multi-component complexes (protein + DNA + ligand) | Boltz-2 |
| Already have protein structure, need ligand poses only | DiffDock (faster, no affinity) |
| High-throughput screening with known binding site | AutoDock Vina or AutoDock-GPU |
| CNN-enhanced scoring on traditional docking | GNINA |
| Feature | Boltz-2 | DiffDock | AutoDock Vina | FEP+ |
|---|---|---|---|---|
| Predicts structure | Yes | No (pose only) | No (pose only) | No |
| Predicts affinity | Yes (quantitative) | No | Yes (scoring) | Yes (quantitative) |
| Affinity accuracy | r ≈ 0.62 | N/A | r ≈ 0.3–0.4 | r ≈ 0.6–0.7 |
| Speed | ~20 sec | ~30 sec | ~1 min | 6–12 hours |
| Molecule types | Protein, DNA, RNA, ligand | Protein, ligand |
r ≈ 0.35–0.37). Synthetic or designed proteins lacking natural homologs may yield lower-confidence results.Subsampled sequences | Number of sequences retained after subsampling (256–4096, default 1024). Used only when Subsample MSA is enabled. |
MSA server URL | Optional custom endpoint for external ColabFold-compatible MSA services. |
MSA username | Optional basic-auth username for a private MSA server. Must be paired with MSA password. |
MSA password | Optional basic-auth password for a private MSA server. |
MSA API key header | Custom header name for token-based authentication to an MSA service. |
MSA API key value | Token value sent with the configured API key header. |
| Independent affinity predictions to average (1–20, default 5). More samples reduce variance. |
Save PAE matrix | Exports the full predicted aligned error matrix for interface and domain-quality analysis. |
Save PDE matrix | Exports the predicted distance error matrix for downstream structural inspection. |
Random seed | Fixes stochastic sampling for reproducible reruns. Leaving the field empty uses a fresh random seed. |
Model version | Chooses Boltz-2 for joint structure and affinity prediction or Boltz-1 for legacy structure-only inference. |
| Protein, ligand |
| Protein, ligand |