OpenFE (Open Free Energy) is an open-source Python framework for alchemical free energy calculations, a physics-based computational method used in drug discovery to predict how strongly molecules interact with their environment or a protein target. Rather than scoring a single static pose like molecular docking, alchemical methods simulate the thermodynamics of molecular transformations, yielding quantitative binding affinity estimates in kcal/mol with statistical uncertainty.
The framework supports two main calculation types:
OpenFE is developed by a consortium of pharmaceutical companies and academic groups, with GPU-accelerated simulations powered by OpenMM.
In classical thermodynamics, measuring a binding free energy directly would require simulating the full association and dissociation of a ligand, an event that happens on timescales far beyond what molecular dynamics can reach. Alchemical methods sidestep this by exploiting the fact that free energy is a state function: the path between two states does not matter, only the endpoints.
Instead of physically pulling a ligand out of a binding pocket, the calculation gradually "switches off" the ligand's interactions with its surroundings through a series of unphysical intermediate states. A coupling parameter controls this transformation, varying from 0 (full interactions) to 1 (fully decoupled). At each value, a short molecular dynamics simulation samples the local thermodynamics.
The number of intermediates is called the lambda window count. More windows provide smoother overlap between adjacent states and more reliable free energy estimates, at the cost of additional simulation time. A typical AHFE calculation uses 11-14 windows; RBFE calculations may need more depending on the size of the chemical transformation.
For hydration free energies, the ligand is decoupled in two environments independently:
The hydration free energy is the difference: .
For relative binding free energies, ligand A is alchemically transformed into ligand B in two legs:
The relative binding free energy is: .
Both legs require the ligands to share a common molecular scaffold so that the alchemical transformation is tractable.
Free energies are extracted from the simulation data using MBAR (Multistate Bennett Acceptance Ratio), which simultaneously analyzes energy differences across all windows for statistically optimal estimates.
ProteinIQ provides cloud-hosted OpenFE calculations on GPU infrastructure, handling all environment setup, force field parameterization, and simulation orchestration automatically.
| Input | Description |
|---|---|
Protein Structure | PDB file, mmCIF file, or RCSB PDB ID (e.g., 3HTB). Required for RBFE calculations, not used for AHFE. |
Ligand | SMILES string or SDF/MOL/MOL2 file describing the small molecule. |
For AHFE calculations, only the ligand is needed. For RBFE, both protein structure and ligand are required.
| Setting | Description |
|---|---|
Calculation Type | Absolute Hydration Free Energy runs an AHFE calculation (ligand in water vs vacuum). Relative Binding Free Energy runs an RBFE calculation against a protein target. |
Simulation Length | Duration per lambda window. Short (0.5 ns) for quick estimates, Medium (2 ns) for reasonable convergence, Long (10 ns) for publication-quality results. |
Number of Repeats | Independent repeat calculations. 1 is faster; 3 is recommended for reliable uncertainty estimates. |
| Setting | Description |
|---|---|
Lambda windows | Number of alchemical intermediates (5-24, default 11). More windows improve accuracy at increased cost. |
Small molecule force field | OpenFF Sage 2.2.0 (recommended), OpenFF Sage 2.1.0, or GAFF 2.11 for ligand parameterization. |
Water model | TIP3P (fast, well-validated) or TIP4P-Ew (more accurate solvation thermodynamics, slower). |
Output is a table of free energy estimates in kcal/mol with associated uncertainties. For AHFE, the result is the hydration free energy. For RBFE, the result is the relative binding free energy difference between the ligand and a reference state.
Hydration free energies are typically negative for polar, water-soluble compounds and near zero or positive for hydrophobic molecules. Experimental AHFE values for druglike molecules generally range from to kcal/mol. Well-converged OpenFE AHFE calculations typically achieve accuracy within 1 kcal/mol of experimental values.
A negative indicates the ligand binds more strongly than the reference compound. In lead optimization, differences of 1-2 kcal/mol correspond to roughly 10-fold changes in binding affinity, significant enough to guide compound selection.
| (kcal/mol) | Interpretation |
|---|---|
| < | Substantially stronger binding than reference |
| to | Moderate improvement (~10-100x affinity gain) |
| to |
Results include statistical uncertainty from MBAR analysis. If uncertainty exceeds 1 kcal/mol, the calculation may not be converged. Increasing simulation length, adding lambda windows, or running more repeats can improve convergence.
| Similar affinity to reference |
| Weaker binding than reference |