DFMDock

Dock protein-protein complexes with diffusion-based sampling and confidence ranking.

30 credits
Configure input settings on the left, then click "Submit"

Related tools

EquiDock

EquiDock

EquiDock is an SE(3)-equivariant graph neural network for rigid protein-protein docking. It predicts a binding pose for a protein-protein complex from unbound structures using geometric deep learning, with DIPS and DB5 pretrained checkpoints from the upstream release.

ColabDock

ColabDock

ColabDock is a protein-protein docking framework that uses AlphaFold2 to predict complex structures guided by experimental restraints from cross-linking mass spectrometry, NMR, or other sources.

HADDOCK3

HADDOCK3

HADDOCK (High Ambiguity Driven protein-protein DOCKing) is an integrative modeling platform for biomolecular complexes. It uses experimental data and bioinformatic predictions to guide the docking process, generating accurate protein-protein complex structures.

LightDock

LightDock

LightDock is a protein-protein, protein-peptide, and protein-DNA docking framework using Glowworm Swarm Optimization (GSO). It predicts macromolecular binding modes and interfaces for biological complexes.

ParaSurf

ParaSurf

ParaSurf is a state-of-the-art surface-based deep learning model for predicting interactions between antibodies and antigens. It identifies paratope binding sites on antibody structures with high accuracy across multiple benchmark datasets.

SurfDock

SurfDock

SurfDock is a surface-informed diffusion generative model for protein-ligand docking, published in Nature Methods 2024. It leverages protein surface geometry to guide a diffusion process for reliable and accurate protein-ligand complex prediction.

DiffDock-L

DiffDock-L

DiffDock-L is a state-of-the-art molecular docking tool that uses diffusion models to predict how small molecule ligands bind to protein targets. It generates multiple binding poses with confidence scores.

DynamicBind

DynamicBind

DynamicBind is an AI-powered protein-ligand binding prediction tool that recovers ligand-induced conformational changes from unbound protein structures. It predicts both ligand binding poses and protein conformational changes.

GNINA

GNINA

GNINA is a molecular docking tool that combines traditional physics-based docking with deep learning CNN scoring for protein-small-molecule complexes. It provides accurate binding predictions with confidence scores, optimized for high-throughput virtual screening.

SigmaDock

SigmaDock

SigmaDock is a fragment-based molecular docking tool using SE(3) equivariant diffusion models to predict how small molecule ligands bind to protein targets. Presented at ICLR 2026, it generates multiple binding poses with Vinardo scoring.

What is DFMDock?

Protein-protein interfaces are hard to dock when no restraints, cross-links, or known interface residues are available. DFMDock addresses that case with a rigid docking model that generates candidate complexes and ranks them with the same learned energy landscape.

The method was developed in the Gray Lab as Denoising Force Matching Dock. It starts from two unbound protein structures, receptor and ligand, and searches rigid-body translations and rotations with a reverse diffusion process. No multiple sequence alignment is required, which makes it useful for fast structure-only screening. The tradeoff is the same one shared by other rigid docking methods: if binding depends on major backbone rearrangement, loop opening, or induced fit, the top pose can still look reasonable while missing the true interface.

How to use DFMDock online

Run DFMDock online by uploading two protein structures, or by entering two RCSB PDB IDs, then choosing how many poses to sample and how many reverse diffusion steps to use. ProteinIQ returns ranked complex PDB files, the cleaned receptor and ligand inputs, learned energy values, clash counts, and pose-level diagnostic metrics in a spreadsheet and 3D viewer.

Inputs

InputAccepted formatDescription
Receptor (larger protein).pdb, .ent, or PDB IDFirst docking partner. The wrapper requires protein ATOM records and limits each partner to 1000 residues and 50 MB.
Ligand (smaller protein).pdb, .ent, or PDB IDSecond docking partner moved during docking. The same protein-only, 1000-residue, and 50 MB limits apply.

Settings

SettingDefaultDescription
Number of poses1Number of sampled complexes returned. Range 1 to 20. Higher values broaden the search over alternative interfaces.
SDE steps40Number of reverse diffusion denoising steps. Range 10 to 200. More steps give the sampler more refinement iterations but increase runtime.
Use clash forcefalseAdds repulsive guidance during sampling to discourage severe steric overlap between receptor and ligand.
Random seed42Controls reproducibility. The same structures, settings, and seed reproduce the same run.

Results

ProteinIQ returns two reference files, receptor_input.pdb and ligand_input.pdb, plus one PDB file for each sampled complex. The spreadsheet is sorted by energy, lowest first.

ColumnDescription
RankPose rank after sorting by DFMDock energy.
FileDownloadable PDB file for the predicted complex.
EnergyDFMDock learned energy. Lower values rank ahead of higher values within the same run.
DockQDiagnostic score returned by the current wrapper. Useful for comparing poses within one job, not as a benchmark-quality estimate across unrelated jobs.
iRMSDInterface RMSD-style diagnostic reported by the wrapper. Lower values indicate less movement under that diagnostic.
LRMSDLigand RMSD-style diagnostic reported by the wrapper. Lower values indicate less ligand displacement under that calculation.
cRMSDComplex RMSD-style diagnostic reported by the wrapper.
FnatContact-overlap style diagnostic reported by the wrapper. Higher values indicate more preserved contacts under that calculation.
ClashesNumber of steric clashes detected in the final pose. Lower is better.

How does DFMDock work?

DFMDock treats docking as denoising in rigid-body pose space. The ligand is randomized in orientation and translation, then a reverse stochastic differential equation updates that pose over a series of steps. At each step, the model predicts how the ligand should move relative to the receptor.

The network is SE(3)-equivariant, so rotation and translation of the input coordinates do not change the learned physical relationship. Residues are encoded with amino acid identity, backbone geometry, and ESM-2 embeddings. From those features, DFMDock learns two coupled signals:

  • Denoising force: Translational and rotational updates that move the ligand toward plausible interfaces during sampling.
  • Energy: A scalar landscape used to score and rank the sampled poses after generation.

That single-model setup is what makes DFMDock distinctive. Classical docking often separates search from rescoring. DFMDock uses one learned representation for both.

The current ProteinIQ wrapper keeps the workflow thin:

  • It cleans each uploaded structure down to standard protein ATOM records.
  • It samples the requested number of rigid docking poses with the chosen SDE steps.
  • It can add optional clash-force guidance during sampling.
  • It ranks the sampled complexes by learned energy.
  • It returns the complex PDB files, the cleaned inputs, and the pose metrics emitted by the inference script.

The wrapper does not model backbone flexibility, cofactors, glycans, membrane context, or non-protein partners. Inputs with multiple models are reduced to the first model, and only standard protein atoms are kept.

Understanding the results

Energy is the main ranking signal. DFMDock was trained to make that value useful for selecting among poses sampled in the same run, so it should be read as a within-job score rather than an absolute affinity estimate. A large gap between the top pose and the rest is usually more meaningful than the raw value itself.

Clashes is the fastest physical sanity check. A pose with a slightly worse energy but far fewer clashes is often the better candidate for inspection or follow-up refinement.

The remaining columns, DockQ, iRMSD, LRMSD, cRMSD, and Fnat, deserve careful reading. In the current ProteinIQ wrapper, these are emitted directly by the inference script as pose diagnostics even though a true native reference complex is not supplied at submission time. They are best used for relative comparison among poses from the same job, not as standalone claims that a complex is CAPRI-high-quality or experimentally near-native.

For practical triage, the most useful pattern is usually:

  • Prefer low Energy
  • Prefer low Clashes
  • Use the remaining metrics as secondary diagnostics when several poses have similar energy

When to use DFMDock vs alternatives

DFMDock is a good fit when both unbound structures are already available and the goal is to screen plausible rigid interfaces without building restraints first.

ToolBest fitWhy choose it
DFMDockFast structure-only protein-protein dockingLearned sampling and ranking in one model, no MSA required
HADDOCK3Interface residues, cross-links, mutagenesis, or other restraints are availableRestraint-guided docking is usually the stronger option when experimental evidence exists
LightDockBroader exploratory sampling is neededSwarm-based search can be useful when interface hypotheses are still wide open
EquiDockOne fast deep-learning pose is enoughDeterministic and lightweight when pose diversity is less important
ColabDockAlphaFold-style docking with external restraints is preferredBetter aligned with workflows built around AlphaFold2-derived complex prediction

DFMDock is especially attractive for MSA-poor systems and early exploratory docking runs. It is less attractive when binding requires major conformational change or when strong experimental restraints are already available, because those cases favor flexible or data-driven docking workflows.

  • HADDOCK3: Restraint-guided macromolecular docking when interface evidence is available.
  • LightDock: Swarm-based protein docking for broader blind-search exploration.
  • EquiDock: Fast equivariant rigid docking for protein pairs.
  • ColabDock: AlphaFold2-based docking with restraint support.
  • DockQ: Evaluate a predicted complex against a known reference complex.

Based on the Gray Lab DFMDock repository and preprint, plus the current ProteinIQ DFMDock wrapper implementation.