Related tools

OpenMM
Run GPU-accelerated molecular dynamics simulations using OpenMM. Simulate protein and protein-ligand complex dynamics with industry-standard force fields (AMBER, CHARMM) and OpenFF ligand parameterization.

OpenFE
Run alchemical free energy calculations for drug discovery using Open Free Energy. Supports Absolute Hydration Free Energy (AHFE) and Relative Binding Free Energy (RBFE) calculations with GPU-accelerated OpenMM simulations.

MD Trajectory Analysis
Analyze molecular dynamics trajectories using a ProteinIQ wrapper pinned to MDAnalysis 2.9.0. Calculate RMSD, residue-aggregated RMSF, radius of gyration, distance tracking, and additional trajectory observables from standard topology and trajectory files.

MDGen
MDGen is a generative AI model for molecular dynamics trajectory generation. Generate physically plausible conformational ensembles from a single protein structure, enabling rapid exploration of protein dynamics without expensive MD simulations.

ORB v3
ORB v3 is a universal interatomic potential (machine learning force field) that predicts energies, forces, and stress tensors for atomic systems. Supports both molecular and materials structures with geometry optimization using conservative and direct model variants.

gmx_MMPBSA
Calculate binding free energies using MM/PBSA and MM/GBSA methods for protein-ligand, protein-protein, and protein-DNA complexes. Provides detailed energy decomposition and per-residue contributions.

Aggrescan3D
Faithful static-mode Aggrescan3D wrapper for per-residue aggregation propensity analysis from a single protein structure.

Aliphatic Index
Calculate the aliphatic index of protein sequences. A measure of the relative volume occupied by aliphatic side chains, indicating thermostability.

Protein charge plot
Plot net charge vs pH for protein sequences. Visualize how protein charge changes across pH 0-14 and identify the isoelectric point (pI) where the net charge crosses zero.

Chou-Fasman
Predict protein secondary structure using the classic Chou-Fasman algorithm based on amino acid propensities
What is GROMACS?
GROMACS (GROningen MAchine for Chemical Simulations) is one of the most widely used molecular dynamics engines in computational biology. Originally developed at the University of Groningen in 1991, it has grown into a community-maintained open-source project known for exceptional performance on both CPUs and GPUs.
Molecular dynamics (MD) simulates how atoms move over time by integrating Newton's equations of motion. For proteins, this reveals conformational flexibility, loop motions, domain rearrangements, and stability under different conditions. GROMACS handles the full pipeline: system setup, solvation, energy minimization, equilibration, production dynamics, and trajectory analysis.
How does GROMACS work?
A GROMACS simulation progresses through several stages, each building on the previous one.
Energy minimization
The input structure often contains steric clashes or suboptimal bond geometries from crystallography or homology modeling. Steepest-descent minimization relaxes these high-energy contacts before dynamics begins. The algorithm iteratively adjusts atomic positions to reduce the total potential energy until convergence or a step limit is reached.
NVT equilibration
With clashes resolved, the system is heated to the target temperature. During NVT (constant Number, Volume, Temperature) equilibration, position restraints hold the protein backbone in place while the solvent and ions settle around it. The V-rescale thermostat couples the system to a heat bath, generating a canonical ensemble with correct kinetic energy fluctuations.
NPT equilibration
Once the temperature is stable, the system transitions to NPT (constant Number, Pressure, Temperature) equilibration. A Berendsen barostat adjusts the simulation box volume until the density converges to the target pressure. Position restraints remain active during this phase to prevent premature structural drift while the box dimensions adjust.
Production MD
Restraints are released and the system evolves freely. The Parrinello-Rahman barostat replaces Berendsen for production runs because it produces a true isothermal-isobaric ensemble, which is important for accurate thermodynamic sampling. Coordinates are saved at regular intervals to form the trajectory.
Trajectory analysis
After production, MDTraj computes structural metrics across all saved frames: backbone RMSD, per-residue RMSF, radius of gyration, secondary structure assignment (DSSP), solvent-accessible surface area, hydrogen bond networks, Ramachandran angles, and inter-residue contact maps.
How to use GROMACS online
ProteinIQ runs GROMACS on cloud infrastructure with automated system preparation, solvation, and multi-stage equilibration. No command-line setup or force field file management is needed.
Input
| Input | Description |
|---|---|
Protein Structure | PDB file upload, mmCIF file, or RCSB PDB ID (e.g., 1UBQ). Maximum 50 MB. |
The structure should contain standard amino acid residues. Non-standard residues, modified amino acids, or HETATMs not recognized by the selected force field will cause topology generation to fail. Pre-process structures with PDBFixer if needed.
Simulation settings
| Setting | Description |
|---|---|
Simulation Duration | Production MD length in nanoseconds (1-200 ns, default 10). Longer runs capture slower motions but cost more. |
Force Field | Parameterization for interatomic interactions. See force field comparison below. Default: AMBER99SB-ILDN. |
Water Model | Explicit solvent model. TIP3P (default), SPC/E, or SPC. See water model comparison below. |
Environment settings
| Setting | Description |
|---|---|
Temperature | Simulation temperature (200-400 K, default 300 K). Physiological temperature is 310 K. |
Pressure | Target pressure (0.5-2.0 bar, default 1.0 bar). |
Ionic Strength | NaCl concentration in molar (0-0.5 M, default 0.15 M). Ions neutralize the system charge plus add excess salt. |
Advanced settings
| Setting | Description |
|---|---|
Timestep | Integration step size. 2 fs is standard with LINCS hydrogen-bond constraints. 4 fs enables GROMACS hydrogen mass repartitioning and requires bond constraints. |
Minimization steps | Steepest-descent steps before dynamics (500-5000, default 1000). |
Equilibration | Combined NVT + NPT equilibration time (0.1-2.0 ns, default 0.5 ns). Split equally between the two phases. |
Bond constraints | H-bonds only (default, required for 2 fs timestep), All bonds, or None. |
Save interval | Trajectory frame spacing. 10 ps (detailed), 50 ps (standard), or 100 ps (compact). |
Remove water from output | Strip solvent from the output trajectory to reduce file size. Enabled by default. |
Analysis level | Full preserves all analysis CSVs, Standard returns core energy/RMSD/RMSF/Rg/DSSP summaries faster, and None skips post-run analysis while still returning raw GROMACS artifacts. |
Compute backend | Auto routes larger or longer jobs to GPU when useful, CPU forces CPU execution, and GPU requests A10G GPU offload with CPU fallback if CUDA offload is unavailable. |
Output
The simulation produces:
- Final structure: PDB file of the last frame, centered and processed
- Trajectory: XTC file containing all saved frames (protein-only if water removal is enabled)
- Analysis CSVs: Structural metrics computed across the trajectory (see interpreting results below)
Force fields
Each force field represents a different parameterization philosophy. The choice affects secondary structure propensities, sidechain dynamics, and solvation behavior.
| Force field | Type | Strengths |
|---|---|---|
AMBER99SB-ILDN | All-atom | Well-validated for protein folding and dynamics. Improved sidechain torsions for Ile, Leu, Asp, Asn. Default choice for most applications. |
CHARMM27 | All-atom | Reliable for proteins and nucleic acids. Tends to slightly favor helical conformations. |
GROMOS96 54a7 | United-atom | Non-polar hydrogens are implicit, reducing atom count and speeding up simulations. Parameterized against thermodynamic data (heats of vaporization, solvation free energies). |
OPLS-AA/L | All-atom | Optimized for liquid-state properties and small-molecule interactions. Useful when solvent thermodynamics matter. |
For general protein dynamics, AMBER99SB-ILDN is the most common choice in recent literature. GROMOS96 54a7 offers faster simulations at the cost of losing explicit non-polar hydrogen detail.
Water models
All simulations use explicit solvation in a dodecahedral box with 1.0 nm padding around the solute.
| Model | Sites | Notes |
|---|---|---|
TIP3P | 3 | Standard choice, compatible with all force fields. Diffusion coefficient is higher than experiment. |
SPC/E | 3 | Better self-diffusion and dielectric constant than TIP3P. Includes a polarization correction. |
SPC | 3 | Simpler parameterization. Fastest of the three, but less accurate for dynamical properties. |
The water model choice has little effect on folded protein structure but can influence unfolded-state conformational sampling and solvation dynamics.
Interpreting results
RMSD
Root mean square deviation of backbone atoms from the initial structure, measured in nanometers. A plateau in the RMSD trace indicates the protein has reached a stable conformation. Continuously rising RMSD suggests the simulation may need more time, or that the protein is undergoing a conformational transition.
| RMSD (nm) | Interpretation |
|---|---|
| < 0.15 | Very stable, minimal deviation from starting structure |
| 0.15 - 0.3 | Typical for a well-folded, equilibrated protein |
| 0.3 - 0.5 | Significant conformational change or flexible regions |
| > 0.5 | Major structural rearrangement or unfolding |
RMSF
Root mean square fluctuation per residue captures local flexibility. High RMSF values correspond to mobile loops, termini, and disordered regions. Low RMSF indicates rigid core residues or those involved in stable secondary structure elements.
Radius of gyration
Measures overall compactness. A stable radius of gyration indicates the protein maintains its fold. A sharp increase suggests partial unfolding or domain separation.
Secondary structure (DSSP)
Per-residue secondary structure assignment across all frames. Tracks helix-to-coil transitions, beta-sheet stability, and transient structure formation. The summary reports percentage of helix, sheet, and coil content per frame.
Potential energy
Should decrease during minimization and stabilize during production. Large fluctuations during production may indicate simulation instability.
Additional analyses
- Hydrogen bonds: Persistent intramolecular H-bonds (frequency ) and per-frame H-bond counts
- Ramachandran angles: / dihedral distributions per residue across all frames
- SASA: Per-residue solvent-accessible surface area over time (Shrake-Rupley algorithm)
- Contact map: - distance-based contacts ( angstrom cutoff) with occupancy frequencies
- B-factors: Crystallographic temperature factors estimated from RMSF ()
Limitations
- Only standard amino acid residues are supported. Non-standard residues, post-translational modifications, and ligands require manual parameterization that is not available through the web interface.
- Simulations run on CPUs. For the same wall-clock budget, GPU-accelerated engines like OpenMM can sample longer timescales.
- The maximum simulation duration is 200 ns, which may be insufficient for slow conformational transitions, protein folding, or large-scale domain motions that occur on microsecond timescales.
- Protein backbone is treated as fully flexible from the start of production MD. There is no option for partial restraints during production.