OpenFE

Alchemical free energy calculations for binding affinity prediction in drug discovery

5
Configure input settings on the left, then click "Submit"

Related tools

OpenMM

OpenMM

Run GPU-accelerated molecular dynamics simulations using OpenMM. Simulate protein dynamics, study conformational changes, and analyze stability with industry-standard force fields (AMBER, CHARMM).

PandaDock

PandaDock

Open-source molecular docking platform using physics-based scoring functions. CPU-optimized algorithms achieve sub-angstrom accuracy (0.014A RMSD) without GPU requirements.

SMINA

SMINA

SMINA is a fork of AutoDock Vina with enhanced scoring functions, custom scoring support, and 10-20x faster minimization. Ideal for scoring function development, pose refinement, and high-performance docking workflows.

Boltz-2

Boltz-2

Boltz-2 is a biomolecular foundation model for structure and binding affinity prediction. Supports proteins, ligands, DNA, and RNA in multi-component complexes. Automatically scales GPU resources for large complexes. Predicts binding affinity with near-FEP accuracy at 1000x faster speed.

gmx_MMPBSA

gmx_MMPBSA

Calculate binding free energies using MM/PBSA and MM/GBSA methods for protein-ligand, protein-protein, and protein-DNA complexes. Provides detailed energy decomposition and per-residue contributions.

LMI4Boltz

LMI4Boltz

LMI4Boltz is a low-memory fork of Boltz for biomolecular structure and binding affinity prediction. It preserves Boltz inference behavior while reducing VRAM use with in-place pair updates, CPU offload, reduced precision pair representation, and aggressive chunking.

GROMACS

GROMACS

Run molecular dynamics simulations using the GROMACS engine with classical force fields (AMBER, CHARMM, GROMOS, OPLS). Study protein dynamics, conformational flexibility, and structural stability with production-grade MD methodology.

PPAP

PPAP

PPAP (Protein-Protein Affinity Predictor) predicts binding affinity (ΔG and Kd) between interacting protein chains using deep learning with ESM2-3B embeddings. Requires a PDB with 2+ protein chains. Note: This tool is for protein-protein interactions only, not protein-ligand binding.

AutoDock-GPU

AutoDock-GPU

GPU-accelerated molecular docking using the AutoDock4 force field. Up to 56x faster than serial AutoDock via CUDA parallelization of the Lamarckian Genetic Algorithm.

AutoDock Vina

AutoDock Vina

AutoDock Vina is a widely-used molecular docking tool that predicts protein-ligand binding modes using physics-based force fields. Fast, reliable, and the gold standard for structure-based drug discovery.

What is OpenFE?

OpenFE (Open Free Energy) is an open-source Python framework for alchemical free energy calculations, a physics-based computational method used in drug discovery to predict how strongly molecules interact with their environment or a protein target. Rather than scoring a single static pose like molecular docking, alchemical methods simulate the thermodynamics of molecular transformations, yielding quantitative binding affinity estimates in kcal/mol with statistical uncertainty.

The framework supports two main calculation types:

  • Absolute Hydration Free Energy (AHFE): The free energy change when a molecule is transferred from vacuum into water. AHFE quantifies how much a compound prefers aqueous solution over the gas phase, a property directly linked to solubility and membrane permeability.
  • Relative Binding Free Energy (RBFE): The difference in binding affinity between two ligands at a protein target. RBFE is the workhorse of free energy methods in pharmaceutical lead optimization, predicting which chemical modifications improve or weaken binding without synthesizing every candidate.

OpenFE is developed by a consortium of pharmaceutical companies and academic groups, with GPU-accelerated simulations powered by OpenMM.

How alchemical free energy calculations work

In classical thermodynamics, measuring a binding free energy directly would require simulating the full association and dissociation of a ligand, an event that happens on timescales far beyond what molecular dynamics can reach. Alchemical methods sidestep this by exploiting the fact that free energy is a state function: the path between two states does not matter, only the endpoints.

Instead of physically pulling a ligand out of a binding pocket, the calculation gradually "switches off" the ligand's interactions with its surroundings through a series of unphysical intermediate states. A coupling parameter λ\lambda controls this transformation, varying from 0 (full interactions) to 1 (fully decoupled). At each λ\lambda value, a short molecular dynamics simulation samples the local thermodynamics.

Lambda windows

The number of λ\lambda intermediates is called the lambda window count. More windows provide smoother overlap between adjacent states and more reliable free energy estimates, at the cost of additional simulation time. A typical AHFE calculation uses 11-14 windows; RBFE calculations may need more depending on the size of the chemical transformation.

AHFE thermodynamic cycle

For hydration free energies, the ligand is decoupled in two environments independently:

  1. In solvent: electrostatic interactions are annihilated, then van der Waals interactions are decoupled
  2. In vacuum: the same decoupling is performed without solvent

The hydration free energy is the difference: ΔGhydration=ΔGsolventΔGvacuum\Delta G_{\text{hydration}} = \Delta G_{\text{solvent}} - \Delta G_{\text{vacuum}}.

RBFE thermodynamic cycle

For relative binding free energies, ligand A is alchemically transformed into ligand B in two legs:

  1. In complex: A is morphed into B while bound to the protein
  2. In solvent: the same A-to-B transformation in water alone

The relative binding free energy is: ΔΔGbind=ΔGcomplexΔGsolvent\Delta\Delta G_{\text{bind}} = \Delta G_{\text{complex}} - \Delta G_{\text{solvent}}.

Both legs require the ligands to share a common molecular scaffold so that the alchemical transformation is tractable.

Analysis

Free energies are extracted from the simulation data using MBAR (Multistate Bennett Acceptance Ratio), which simultaneously analyzes energy differences across all λ\lambda windows for statistically optimal estimates.

How to use OpenFE online

ProteinIQ provides cloud-hosted OpenFE calculations on GPU infrastructure, handling all environment setup, force field parameterization, and simulation orchestration automatically.

Inputs

InputDescription
Protein StructurePDB file, mmCIF file, or RCSB PDB ID (e.g., 3HTB). Required for RBFE calculations, not used for AHFE.
LigandSMILES string or SDF/MOL/MOL2 file describing the small molecule.

For AHFE calculations, only the ligand is needed. For RBFE, both protein structure and ligand are required.

Calculation settings

SettingDescription
Calculation TypeAbsolute Hydration Free Energy runs an AHFE calculation (ligand in water vs vacuum). Relative Binding Free Energy runs an RBFE calculation against a protein target.
Simulation LengthDuration per lambda window. Short (0.5 ns) for quick estimates, Medium (2 ns) for reasonable convergence, Long (10 ns) for publication-quality results.
Number of RepeatsIndependent repeat calculations. 1 is faster; 3 is recommended for reliable uncertainty estimates.

Advanced settings

SettingDescription
Lambda windowsNumber of alchemical intermediates (5-24, default 11). More windows improve accuracy at increased cost.
Small molecule force fieldOpenFF Sage 2.2.0 (recommended), OpenFF Sage 2.1.0, or GAFF 2.11 for ligand parameterization.
Water modelTIP3P (fast, well-validated) or TIP4P-Ew (more accurate solvation thermodynamics, slower).

Results

Output is a table of free energy estimates in kcal/mol with associated uncertainties. For AHFE, the result is the hydration free energy. For RBFE, the result is the relative binding free energy difference between the ligand and a reference state.

Interpreting results

AHFE values

Hydration free energies are typically negative for polar, water-soluble compounds and near zero or positive for hydrophobic molecules. Experimental AHFE values for druglike molecules generally range from +2+2 to 15-15 kcal/mol. Well-converged OpenFE AHFE calculations typically achieve accuracy within 1 kcal/mol of experimental values.

RBFE values

A negative ΔΔG\Delta\Delta G indicates the ligand binds more strongly than the reference compound. In lead optimization, differences of 1-2 kcal/mol correspond to roughly 10-fold changes in binding affinity, significant enough to guide compound selection.

ΔΔG\Delta\Delta G (kcal/mol)Interpretation
< 2-2Substantially stronger binding than reference
1-1 to 2-2Moderate improvement (~10-100x affinity gain)
0.5-0.5 to 0.50.5Similar affinity to reference
>1> 1Weaker binding than reference

Uncertainty and convergence

Results include statistical uncertainty from MBAR analysis. If uncertainty exceeds 1 kcal/mol, the calculation may not be converged. Increasing simulation length, adding lambda windows, or running more repeats can improve convergence.

Limitations

  • RBFE requires a common scaffold: The two ligands being compared must share a core structure. Large R-group transformations or scaffold hops may fail or produce unreliable results.
  • Protein flexibility is limited: While sidechains near the binding site can rearrange during simulation, large-scale conformational changes (loop movements, domain motions) may not be captured within typical simulation timescales.
  • Force field dependence: Results are only as accurate as the underlying force field. Molecules with unusual functional groups may be poorly parameterized.
  • Convergence is not guaranteed: Short simulations with few lambda windows can produce results with small reported uncertainties that are nonetheless systematically wrong. When accuracy matters, use longer simulations with 3 repeats.