PoseBusters

Validate generated and docked ligand poses with structure, chemistry, and reference-pose checks.

5
Configure input settings on the left, then click "Submit"

Related tools

DockQ

DockQ

Assess docking model quality by comparing predicted complexes against native references. DockQ v2.1.3 supports protein, nucleic-acid, and supported small-molecule interfaces with faithful upstream metrics.

IPSAE

IPSAE

Scoring function for interprotein interactions in AlphaFold2, AlphaFold3 and Boltz predictions. Calculates ipSAE, ipTM, pDockQ, pDockQ2, and LIS scores to assess protein-protein interface quality.

MolProbity

MolProbity

Validate protein structure quality with all-atom contact analysis, Ramachandran plots, rotamer assessment, and geometry checks.

RMSD calculator

RMSD calculator

Calculate Root Mean Square Deviation (RMSD) between protein structures. Compare a reference PDB against multiple structures with automatic Kabsch alignment.

AllMetal3D

AllMetal3D

Predict metal and water binding sites in protein structures using 3D convolutional neural networks (AllMetal3D + Water3D).

DSSP

DSSP

Assign protein secondary structure using the DSSP algorithm. The gold standard for hydrogen bond-based structure assignment from coordinates.

Molecular descriptors

Molecular descriptors

Compute 200+ RDKit molecular descriptors, drug-likeness rule violations, and structural fingerprints for QSAR, virtual screening, and ML workflows

Radius of gyration

Radius of gyration

Calculate the radius of gyration (Rg) for protein structures from PDB files. Supports multiple chains and atom selection options.

SASA calculator

SASA calculator

Calculate Solvent Accessible Surface Area (SASA) for protein structures using the Shrake-Rupley algorithm.

ScanNet

ScanNet

Geometric deep learning model for predicting protein binding sites directly from 3D structure. Identifies where proteins interact with other proteins, antibodies, or disordered proteins with high accuracy, including for novel protein folds.

What is PoseBusters?

PoseBusters checks whether generated or docked ligand poses are chemically sane, geometrically plausible, and compatible with an optional protein binding site. The method was introduced by Buttenschoen, Morris, and Deane after showing that low RMSD alone can hide broken chemistry, impossible bond geometry, ligand self-clashes, and receptor overlaps in modern docking benchmarks.

The main question is not "did the pose score well?" It is "could this pose exist as a molecule in this pocket?" That makes PoseBusters useful after docking with GNINA, DiffDock, DynamicBind, or other pose generation tools, especially when comparing many ranked conformations.

How to use PoseBusters online

Run PoseBusters online by uploading a predicted ligand pose, optionally adding a reference ligand and protein receptor, then choosing the validation mode. ProteinIQ returns a downloadable table of pass/fail checks for ligand chemistry, 3D geometry, receptor clashes, and reference-pose agreement when reference inputs are supplied.

Inputs

PoseBusters can run with just a ligand pose, but the most informative checks require the files that define the intended docking context.

InputRequiredAccepted formatsDescription
Predicted/generated moleculeYesSDF, MOL, MOL2, PDBThe ligand pose or generated 3D molecule to validate. SDF is preferred for multiple poses because conformers and molecule records are handled cleanly.
True/reference ligandNoSDF, MOL, MOL2, PDBThe crystallographic or trusted ligand pose used for identity, stereochemistry, and RMSD-style checks. The file should contain 3D coordinates.
Conditioning protein/receptorNoPDB, SDF, MOL, MOL2, RCSB PDB fetchThe receptor or protein structure used for protein-ligand distance, pocket placement, and volume-overlap checks. PDB is recommended for protein inputs because residue metadata helps classify protein atoms, cofactors, metals, and waters.

The predicted molecule file can contain several poses. In that case, PoseBusters reports one row per molecule or conformer unless Top poses to validate limits the run.

Settings

SettingDescription
Validation modeSelects the PoseBusters check set. Auto-select from inputs chooses the appropriate mode from the provided predicted, reference, and receptor files.
Output formatControls the PoseBusters report style: Short, Long, or CSV. ProteinIQ displays the parsed table and keeps the results downloadable.
Full reportAdds numeric diagnostic columns beyond pass/fail status, such as bond outlier counts, clash counts, ring planarity distances, and energy ratios.
Top poses to validateRestricts validation to the first N poses in the predicted molecule file. Leave blank to validate every pose.

Validation modes

Auto-select from inputs is usually the safest choice. Manual modes are useful when the files are available but the intended evaluation task is narrower than the full input set.

ModeBest forInputs normally usedMain checks
Molecule onlyGenerated conformers, molecule generators, or ligand cleanup QCPredicted moleculeLoading, sanitization, InChI conversion, connectivity, radicals, bond lengths, bond angles, internal clashes, ring flatness, double-bond flatness, internal energy
DockDocked ligands or newly generated ligands placed into a receptor pocketPredicted molecule, receptorMolecule-only checks plus protein-ligand distance and volume-overlap checks against protein atoms, cofactors, metals, and waters
RedockRe-docking benchmark cases where a crystallographic ligand is availablePredicted molecule, reference ligand, receptorDock checks plus molecular identity, bond/stereochemistry agreement, and RMSD within 2 Angstrom reference agreement
RegenerateGenerated 3D conformers for a known molecule with a reference ligandPredicted molecule, reference ligandMolecule identity and geometry checks without receptor-pocket checks
GenerateDe novo molecule generation conditioned on a receptorPredicted molecule, receptorChemistry, geometry, and receptor compatibility without requiring identity to a reference ligand

Fast modes run the corresponding check set with speed-oriented settings. They are useful for triage on large SDF files, followed by the full mode on the best candidates.

Results

The results table starts with file and molecule identifiers, then reports Boolean check columns. True means the pose passed that check. False marks a specific failure worth inspecting before using the pose for ranking, rescoring, molecular dynamics, or free-energy calculations.

Column groupCommon columnsMeaning
Loadingmol_pred_loaded, mol_true_loaded, mol_cond_loadedWhether each supplied structure could be read. A failed loading check makes later checks unreliable or unavailable.
Chemical validitysanitization, inchi_convertible, all_atoms_connected, no_radicalsRDKit-based checks for chemically valid molecule graphs. Failures often point to malformed files, disconnected fragments, invalid valence, radicals, or structures that need repair before docking analysis.
Identity agreementmolecular_formula, molecular_bonds, double_bond_stereochemistry, tetrahedral_chiralityChecks used when a reference ligand is supplied. Failures mean the predicted pose may not represent the same ligand, even if the coordinates look close.
Intramolecular geometrybond_lengths, bond_angles, internal_steric_clash, aromatic_ring_flatness, non-aromatic_ring_non-flatness, double_bond_flatness, internal_energyTests for impossible or strained conformations inside the ligand. These catch poses with distorted bonds, atoms sitting on top of each other, twisted aromatic systems, or unusually high conformer energy.
Receptor placementprotein-ligand_maximum_distance, minimum_distance_to_protein, volume_overlap_with_protein, cofactor and water overlap columnsChecks whether the ligand is placed in a plausible protein environment rather than far from the pocket or clashing with receptor atoms.
Reference agreementRMSD within 2 AngstromIndicates whether the predicted ligand pose is within the common 2 Angstrom RMSD success criterion when a reference ligand is available.

Full report adds diagnostic measurements that explain failures. For example, number_clashes helps separate one local contact from a badly collapsed conformer, while energy_ratio highlights conformations that pass simple bond checks but remain energetically strained.

Interpreting PoseBusters results

A good docking result should pass both physical plausibility checks and task-specific accuracy checks. In redocking, a pose within 2 Angstrom RMSD but with failed stereochemistry or volume overlap should not be counted as a clean success. The pose may be close to the crystal ligand by coordinate RMSD while still representing the wrong stereoisomer or colliding with protein atoms.

Failures have different practical meanings:

  • File loading failures: Check file format first. SDF with explicit 3D coordinates is the least ambiguous ligand format for multi-pose validation.
  • Sanitization or InChI failures: Repair the ligand before interpreting docking quality. Ligand Fixer can help standardize small-molecule files before rerunning validation.
  • Bond length, angle, or internal clash failures: Treat the pose as physically invalid unless a known unusual chemistry explains the geometry. These failures are common when generative models output coordinates without enough chemical constraints.
  • Ring flatness and double-bond flatness failures: Inspect conjugated systems manually. Aromatic and double-bond systems should not be arbitrarily twisted to fit a pocket.
  • Protein overlap failures: The ligand is colliding with receptor atoms, cofactors, metals, or waters. Receptor preparation and protonation may be involved, but severe volume overlap usually means the pose is unusable.
  • Reference identity failures: In redocking benchmarks, RMSD should be interpreted only after confirming formula, bond pattern, and stereochemistry are unchanged.

Passing PoseBusters does not prove that a pose binds strongly. It proves that the pose clears a set of chemistry and geometry sanity checks. Binding affinity, ranking, induced fit, waters, protonation states, and assay relevance remain separate questions.

How PoseBusters works

PoseBusters runs a suite of RDKit-based tests against the predicted ligand and any optional context structures. The checks combine molecule graph validation, 3D geometry validation, receptor-contact analysis, and optional reference comparison.

The paper emphasized a practical benchmarking problem: many AI docking methods were evaluated mainly by ligand RMSD, but some low-RMSD poses contained nonphysical chemistry. PoseBusters adds a second axis to pose evaluation. A method should recover native-like binding modes and generate structures that pass basic chemistry and steric criteria.

The check set changes with the validation mode:

  • Molecule checks validate that the ligand can be parsed, sanitized, converted through InChI, kept connected, and represented without radicals or impossible geometry.
  • Reference checks compare the predicted and true ligand graphs, formulas, bond patterns, double-bond stereochemistry, tetrahedral chirality, and reference-pose RMSD.
  • Receptor checks measure whether the ligand is reasonably positioned with respect to the protein and whether volumes overlap with protein atoms, cofactors, inorganic cofactors, or waters.

PoseBusters is stricter than a visual inspection because it catches subtle graph and geometry problems that can be hard to see in a 3D viewer. It is also narrower than a physics simulation: it does not relax the pose, compute binding free energy, model receptor flexibility, or decide whether the ligand is a true binder.

When to use PoseBusters vs alternatives

ToolBest questionTypical use
PoseBustersIs this ligand pose chemically and geometrically plausible?Quality control after docking, molecule generation, or benchmarking.
MolProbityIs this protein structure geometrically sound?Protein model validation, clashscore, rotamer, Ramachandran, and all-atom contact checks.
DockQHow close is a predicted protein complex to a reference complex?Protein-protein docking benchmark evaluation.
GNINAWhat ligand pose and CNN score does a docking model predict?Protein-ligand docking and CNN rescoring before PoseBusters validation.
DiffDockWhat poses does a diffusion docking model generate quickly?Generating candidate ligand poses that can then be checked with PoseBusters.
PDB FixerIs the receptor structure ready for downstream modeling?Adding missing atoms, repairing residues, and preparing PDB structures before docking or validation.

For ligand-pose workflows, a common sequence is docking first, PoseBusters second, and deeper analysis third. For example, candidate poses from GNINA or DiffDock can be filtered for chemistry and receptor clashes before expensive molecular dynamics or free-energy calculations with GROMACS or OpenFE.

Practical caveats

PoseBusters results depend on the structure files supplied. Missing hydrogens, unusual protonation, metal coordination, alternate locations, covalent ligands, or incomplete receptor preparation can create failures that need chemical review rather than automatic rejection.

For benchmark reporting, keep the mode consistent across methods. Comparing one docking method with Redock checks and another with Molecule only checks mixes different definitions of success. Report both the physical pass rate and the pose-recovery metric when a reference ligand is available.