Click or drag files to upload (.pdb, .ent, .cif)
GROMACS (GROningen MAchine for Chemical Simulations) is one of the most widely used molecular dynamics engines in computational biology. Originally developed at the University of Groningen in 1991, it has grown into a community-maintained open-source project known for exceptional performance on both CPUs and GPUs.
Molecular dynamics (MD) simulates how atoms move over time by integrating Newton's equations of motion. For proteins, this reveals conformational flexibility, loop motions, domain rearrangements, and stability under different conditions. GROMACS handles the full pipeline: system setup, solvation, energy minimization, equilibration, production dynamics, and trajectory analysis.
A GROMACS simulation progresses through several stages, each building on the previous one.
The input structure often contains steric clashes or suboptimal bond geometries from crystallography or homology modeling. Steepest-descent minimization relaxes these high-energy contacts before dynamics begins. The algorithm iteratively adjusts atomic positions to reduce the total potential energy until convergence or a step limit is reached.
With clashes resolved, the system is heated to the target temperature. During NVT (constant Number, Volume, Temperature) equilibration, position restraints hold the protein backbone in place while the solvent and ions settle around it. The V-rescale thermostat couples the system to a heat bath, generating a canonical ensemble with correct kinetic energy fluctuations.
Once the temperature is stable, the system transitions to NPT (constant Number, Pressure, Temperature) equilibration. A Berendsen barostat adjusts the simulation box volume until the density converges to the target pressure. Position restraints remain active during this phase to prevent premature structural drift while the box dimensions adjust.
Restraints are released and the system evolves freely. The Parrinello-Rahman barostat replaces Berendsen for production runs because it produces a true isothermal-isobaric ensemble, which is important for accurate thermodynamic sampling. Coordinates are saved at regular intervals to form the trajectory.
After production, MDTraj computes structural metrics across all saved frames: backbone RMSD, per-residue RMSF, radius of gyration, secondary structure assignment (DSSP), solvent-accessible surface area, hydrogen bond networks, Ramachandran angles, and inter-residue contact maps.
ProteinIQ runs GROMACS on cloud infrastructure with automated system preparation, solvation, and multi-stage equilibration. No command-line setup or force field file management is needed.
| Input | Description |
|---|---|
Protein Structure | PDB file upload, mmCIF file, or RCSB PDB ID (e.g., 1UBQ). Maximum 50 MB. |
The structure should contain standard amino acid residues. Non-standard residues, modified amino acids, or HETATMs not recognized by the selected force field will cause topology generation to fail. Pre-process structures with PDBFixer if needed.
| Setting | Description |
|---|---|
Simulation Duration | Production MD length in nanoseconds (1-200 ns, default 10). Longer runs capture slower motions but cost more. |
Force Field | Parameterization for interatomic interactions. See force field comparison below. Default: AMBER99SB-ILDN. |
Water Model | Explicit solvent model. TIP3P (default), SPC/E, or SPC. See water model comparison below. |
| Setting | Description |
|---|---|
Temperature | Simulation temperature (200-400 K, default 300 K). Physiological temperature is 310 K. |
Pressure | Target pressure (0.5-2.0 bar, default 1.0 bar). |
Ionic Strength | NaCl concentration in molar (0-0.5 M, default 0.15 M). Ions neutralize the system charge plus add excess salt. |
pH | Target pH for protonation state assignment (4.0-10.0, default 7.0). Affects histidine, aspartate, and glutamate protonation. |
| Setting | Description |
|---|---|
Timestep | Integration step size. 2 fs (standard with LINCS hydrogen bond constraints) or 4 fs (hydrogen mass repartitioning). |
Minimization steps | Steepest-descent steps before dynamics (500-5000, default 1000). |
Equilibration | Combined NVT + NPT equilibration time (0.1-2.0 ns, default 0.5 ns). Split equally between the two phases. |
Bond constraints | H-bonds only (default, required for 2 fs timestep), All bonds, or None. |
Save interval |
The simulation produces:
Each force field represents a different parameterization philosophy. The choice affects secondary structure propensities, sidechain dynamics, and solvation behavior.
| Force field | Type | Strengths |
|---|---|---|
AMBER99SB-ILDN | All-atom | Well-validated for protein folding and dynamics. Improved sidechain torsions for Ile, Leu, Asp, Asn. Default choice for most applications. |
CHARMM27 | All-atom | Reliable for proteins and nucleic acids. Tends to slightly favor helical conformations. |
GROMOS96 54a7 | United-atom | Non-polar hydrogens are implicit, reducing atom count and speeding up simulations. Parameterized against thermodynamic data (heats of vaporization, solvation free energies). |
OPLS-AA/L | All-atom | Optimized for liquid-state properties and small-molecule interactions. Useful when solvent thermodynamics matter. |
For general protein dynamics, AMBER99SB-ILDN is the most common choice in recent literature. GROMOS96 54a7 offers faster simulations at the cost of losing explicit non-polar hydrogen detail.
All simulations use explicit solvation in a dodecahedral box with 1.0 nm padding around the solute.
| Model | Sites | Notes |
|---|---|---|
TIP3P | 3 | Standard choice, compatible with all force fields. Diffusion coefficient is higher than experiment. |
SPC/E | 3 | Better self-diffusion and dielectric constant than TIP3P. Includes a polarization correction. |
SPC | 3 | Simpler parameterization. Fastest of the three, but less accurate for dynamical properties. |
The water model choice has little effect on folded protein structure but can influence unfolded-state conformational sampling and solvation dynamics.
Root mean square deviation of backbone atoms from the initial structure, measured in nanometers. A plateau in the RMSD trace indicates the protein has reached a stable conformation. Continuously rising RMSD suggests the simulation may need more time, or that the protein is undergoing a conformational transition.
| RMSD (nm) | Interpretation |
|---|---|
| < 0.15 | Very stable, minimal deviation from starting structure |
| 0.15 - 0.3 | Typical for a well-folded, equilibrated protein |
| 0.3 - 0.5 | Significant conformational change or flexible regions |
| > 0.5 | Major structural rearrangement or unfolding |
Root mean square fluctuation per residue captures local flexibility. High RMSF values correspond to mobile loops, termini, and disordered regions. Low RMSF indicates rigid core residues or those involved in stable secondary structure elements.
Measures overall compactness. A stable radius of gyration indicates the protein maintains its fold. A sharp increase suggests partial unfolding or domain separation.
Per-residue secondary structure assignment across all frames. Tracks helix-to-coil transitions, beta-sheet stability, and transient structure formation. The summary reports percentage of helix, sheet, and coil content per frame.
Should decrease during minimization and stabilize during production. Large fluctuations during production may indicate simulation instability.
Trajectory frame spacing. 10 ps (detailed), 50 ps (standard), or 100 ps (compact). |
Remove water from output | Strip solvent from the output trajectory to reduce file size. Enabled by default. |