MD Trajectory Analysis

Calculate RMSD, RMSF, and Rg from MD trajectories

1
Configure input settings on the left, then click "Submit"

Related tools

ORB v3

ORB v3

ORB v3 is a universal interatomic potential (machine learning force field) that predicts energies, forces, and stress tensors for atomic systems. Supports both molecular and materials structures with geometry optimization using conservative and direct model variants.

Aggrescan3D

Aggrescan3D

Faithful static-mode Aggrescan3D wrapper for per-residue aggregation propensity analysis from a single protein structure.

PROPKA 3

PROPKA 3

Predict pKa values of ionizable groups in proteins and protein-ligand complexes from 3D structure. PROPKA calculates environment-driven pKa shifts for standard ionizable residues, terminal groups, and supported ligand atom types.

GROMACS

GROMACS

Run molecular dynamics simulations using the GROMACS engine with classical force fields (AMBER, CHARMM, GROMOS, OPLS). Study protein dynamics, conformational flexibility, and structural stability with production-grade MD methodology.

OpenFE

OpenFE

Run alchemical free energy calculations for drug discovery using Open Free Energy. Supports Absolute Hydration Free Energy (AHFE) and Relative Binding Free Energy (RBFE) calculations with GPU-accelerated OpenMM simulations.

OpenMM

OpenMM

Run GPU-accelerated molecular dynamics simulations using OpenMM. Simulate protein dynamics, study conformational changes, and analyze stability with industry-standard force fields (AMBER, CHARMM).

Aliphatic Index

Aliphatic Index

Calculate the aliphatic index of protein sequences. A measure of the relative volume occupied by aliphatic side chains, indicating thermostability.

Protein charge plot

Protein charge plot

Plot net charge vs pH for protein sequences. Visualize how protein charge changes across pH 0-14 and identify the isoelectric point (pI) where the net charge crosses zero.

Extinction coefficient calculator

Extinction coefficient calculator

Calculate the molar extinction coefficient of protein sequences at 280 nm. Used for protein concentration determination by UV spectroscopy.

FindPept

FindPept

Match experimental peptide masses against theoretical digest fragments of a protein sequence. Identify peptides from mass spectrometry data by peptide mass fingerprinting.

What is MD trajectory analysis?

MD trajectory analysis turns a simulation trajectory into measurements that can actually be interpreted: structural drift, residue flexibility, compactness, contact persistence, secondary-structure content, collective motions, and solvent-facing behavior. The raw frames coming out of GROMACS, AMBER, CHARMM, NAMD, or OpenMM do not answer those questions on their own.

This ProteinIQ tool wraps a workflow pinned to MDAnalysis 2.9.0. It is not a single upstream command-line program. It is a curated analysis surface built from MDAnalysis modules and a small amount of wrapper logic that standardizes inputs, runs multiple observables in one job, and returns structured outputs for charting and download.

The practical value is in combining observables that answer different failure modes:

  • RMSD: Detects whether the selected atoms stay close to a chosen reference structure.
  • RMSF: Shows which residues are mobile after aggregating atom-level fluctuations by residue.
  • Radius of gyration: Tracks compaction or expansion over time.
  • Distance tracking: Follows a specific geometric separation, often between domains, termini, ligands, or catalytic motifs.
  • Hydrogen bonds and salt bridges: Measures persistence of stabilizing interactions.
  • PCA and clustering: Describes dominant motions and recurring conformational states.
  • DSSP, Ramachandran, chi angles, helix analysis: Connects motion to secondary structure and torsional states.
  • SASA, RDF, density, water dynamics: Relates structural changes to solvent exposure and local environment.

Applications

  • Fold stability checks: Compare early and late frames after simulations initialized from AlphaFold 2, Boltz-2, or ESMFold.
  • Mutation analysis: Test whether a variant shifts flexibility, compaction, hydrogen-bond persistence, or helix geometry relative to the wild type.
  • Binding-site monitoring: Track contact counts, hydrogen bonds, or a distance restraint around a ligand or interface during refinement.
  • Ensemble characterization: Use PCA and clustering to summarize whether the trajectory samples one basin or several distinct states.
  • Solvent-facing transitions: Use SASA, RDF, density, and water dynamics to examine collapse, exposure, or hydration changes.

How to use MD trajectory analysis online

Upload a topology file and a matching trajectory file, choose one or more analyses, and run the job to receive tabular and chart-ready outputs for each selected observable. The result page returns structural, interaction, conformational, environmental, and secondary-structure summaries with downloadable data for follow-up analysis.

Inputs

InputDescriptionAccepted formatsMax size
Topology/Structure fileDefines atom identities, residue assignments, connectivity, and metadata needed to interpret trajectory coordinates..pdb, .gro, .psf, .prmtop, .top, .tpr50 MB
Trajectory fileStores coordinates for each sampled frame. The topology and trajectory must describe the same system and atom ordering..xtc, .trr, .dcd, .nc, .tng500 MB

Common pairings include:

  • .gro + .xtc for GROMACS
  • .tpr + .xtc when topology metadata from GROMACS should be preserved
  • .prmtop + .nc for AMBER
  • .psf + .dcd for CHARMM or NAMD

How does MD trajectory analysis work?

The wrapper loads the uploaded files into an MDAnalysis Universe, applies the selected atom selection, resolves the reference mode, optionally aligns sampled frames to that reference, and then dispatches the requested observables.

Three details matter because they change interpretation:

  • Reference mode: first, last, and average are not cosmetic options. average computes an average reference over the sampled frames instead of aliasing to the first frame.
  • Frame sampling: step subsamples the trajectory globally for the audited analysis paths, which reduces runtime and directly changes temporal resolution.
  • Alignment: When enabled, sampled frames are superimposed onto the chosen reference before RMSD, RMSF, PCA, clustering, and cross-correlation calculations. This removes rigid-body translation and rotation and makes those outputs more about internal motion than bulk drift.

Some analyses rely on additional assumptions from the underlying MDAnalysis modules or the wrapper:

  • Hydrogen bonds: The wrapper uses a donor-acceptor cutoff of 3.0 Å and angle cutoff of 150°, matching the audited MDAnalysis default. Explicit hydrogens are still required for meaningful hydrogen-bond counts.
  • RMSF: MDAnalysis computes atom-level fluctuations. The wrapper then aggregates them by residue so the output is residue-oriented even when the selection contains more than one atom per residue.
  • DSSP: Secondary-structure assignment uses MDAnalysis.analysis.dssp when available. If that path fails, the wrapper falls back to a simpler internal estimate rather than aborting the whole job.
  • Water dynamics: This mode is available in the current wrapper, but it depends on water naming conventions and upstream MDAnalysis functionality that is being deprecated on the path to MDAnalysis 3.0.

Understanding the settings

Core settings

SettingDescriptionDefault
Analyses to runSelects which observables are calculated. Multiple analyses can run in one job.RMSD, RMSF, Radius of gyration
Atom selectionMDAnalysis selection string applied to the primary analysis group. Examples include protein, protein and name CA, backbone, and resid 1-100.protein and name CA
Reference frameReference for RMSD-aligned analyses. Average structure computes the mean coordinate set over the sampled frames.first

Advanced settings

SettingDescriptionDefault
Frame stepSamples every nth frame. A value of 10 processes frames 0, 10, 20, ... instead of every saved frame.1
Align trajectoryAligns sampled frames to the chosen reference before RMSD, RMSF, PCA, clustering, and cross-correlation analysis.true
Second atom selectionSecondary MDAnalysis selection used for distance tracking and RDF.resname SOL and name OW
PCA componentsNumber of principal components returned in PCA projections.3
ClustersNumber of k-means clusters used for conformational clustering.5
Density axisAxis used for one-dimensional density profiles.z

Selection examples

Selection stringMeaning
proteinAll atoms in standard amino-acid residues
protein and name CAOne alpha carbon per residue, often used for coarse-grained structural tracking
backboneBackbone atoms N, CA, C, O
resid 45-70Residues 45 through 70
protein and not name H*Heavy atoms only
resname ATPAll atoms in ATP residues, useful for ligand-focused measurements

Understanding the results

The result page groups outputs into tabs. Not every selected analysis appears in every tab, so the most useful way to read the output is by question rather than by file format.

Structural outputs

AnalysisOutput fieldsWhat it answers
RMSDframe, time_ps, rmsd_angstromIs the selected structure drifting away from the reference?
RMSFresidue_id, residue_name, chain, rmsf_angstromWhich residues fluctuate the most over the sampled frames?
Radius of gyrationframe, time_ps, rg_angstromIs the selected structure compacting or expanding?
End-to-end distanceframe, time_ps, distance_angstromAre termini or endpoints moving apart or together?
Asphericityframe, time_ps, asphericityIs the shape becoming more elongated or more spherical?
Moment of inertiaframe, time_ps, I1, I2, I3How is mass distribution changing along principal axes?

Interaction outputs

AnalysisOutput fieldsWhat it answers
Hydrogen bondsframe, time_ps, hbond_count; pair table with donor, acceptor, occupancyAre stabilizing hydrogen-bond networks persistent?
Salt bridgesframe, time_ps, salt_bridge_count; pair table with residue ids and occupancyAre charged interactions forming or breaking over time?
Contactsframe, time_ps, n_contacts; pair table with occupancyIs an interface tightening, loosening, or rearranging?
Distance trackingframe, time_ps, distance_angstromIs a specific geometric feature moving toward or away from another one?
H-bond lifetimelifetime distribution and summary statisticsAre hydrogen bonds brief, recurrent events or long-lived contacts?

Conformational outputs

AnalysisOutput fieldsWhat it answers
PCAframe, time_ps, pc1, pc2, pc3 plus explained variance arraysWhat are the dominant collective motions?
Clusteringper-frame cluster assignments and cluster statisticsHow many recurring conformational states are populated?
MSDtime-dependent displacement summary and diffusion coefficientIs there translational diffusion or large-scale wandering?
Dynamic cross-correlationsignificant residue pairs and correlation coefficientsWhich residue motions are coupled or anticorrelated?
Persistence lengthestimated persistence length and correlation decayHow stiff is a chain-like segment across the sampled trajectory?

Secondary structure and environment outputs

AnalysisOutput fieldsWhat it answers
DSSPper-frame helix, sheet, and coil percentagesIs secondary structure stable, gained, or lost?
Ramachandranresidue-level mean and standard deviation for phi and psiWhich residues move into strained or alternative backbone states?
Chi anglesresidue-level mean and standard deviation for side-chain dihedralsAre rotamers stable or switching between states?
Helix analysishelix ranges, rise, twist, and variabilityAre helices remaining canonical or distorting?
SASAper-frame total SASA and per-residue SASA summaryIs the structure exposing or burying surface area?
RDFradial bins and g(r) valuesHow is local solvent or particle organization distributed around a selection?
Densityposition bins and density values along one axisIs mass or solvent redistributed along a membrane or box axis?
Water dynamicsresidence-time summary statisticsHow long do hydration waters remain near the selected region?

Interpreting results

RMSD and RMSF

RMSD is easiest to misread when alignment and atom selection are not considered together.

  • Low, stable RMSD after alignment usually indicates internal structural stability of the selected atoms.
  • Low RMSD without alignment disabled can hide large rigid-body motion if the entire selection rotates or translates together.
  • High RMSF in loops or termini is often expected.
  • High RMSF inside a buried core helix or beta strand usually deserves inspection, especially when it appears alongside SASA growth or DSSP loss.

Distance, contacts, and hydrogen bonds

Distance tracking is strongest when paired with contacts or hydrogen bonds.

  • Distance increases while contact count drops often indicates real separation rather than internal breathing.
  • Stable distance with fluctuating contacts can mean the interface remains nearby but repacks locally.
  • Hydrogen bonds near zero with an explicit warning usually means the uploaded structure lacks hydrogens, not necessarily that the system has no hydrogen bonding in reality.

PCA and clustering

PCA answers whether motion is dominated by a few directions. Clustering answers whether those motions produce recurring states.

  • High PC1 variance with one dominant cluster suggests one major breathing or hinge motion inside one broad state.
  • Several populated clusters with separated PC projections suggests discrete conformations rather than one continuous fluctuation cloud.
  • Poorly separated clusters often mean the trajectory is either short, noisy, or not naturally state-separated under the chosen atom selection.

DSSP, SASA, and compactness

These metrics become more informative when read together.

  • Rg decreases while SASA decreases usually indicates compaction.
  • DSSP helix or sheet content falls while RMSF rises suggests local unfolding or disordering.
  • SASA rises without major Rg change often reflects side-chain or surface rearrangement rather than global unfolding.

When to use MD trajectory analysis vs alternatives

MD trajectory analysis on ProteinIQ is a post-simulation interpretation tool. It is most useful after a trajectory already exists.

ToolBest use caseNot the right choice when
MD Trajectory AnalysisSummarizing structure, interaction, and solvent observables from an existing MD runNo trajectory exists yet
OpenMMRunning or setting up molecular dynamics simulationsThe simulation is already complete and only analysis is needed
gmx_MMPBSAEstimating binding free energy from MD snapshotsThe goal is structural interpretation rather than free-energy estimation
Ramachandran PlotInspecting backbone geometry from a single structureTime-resolved trajectory behavior is needed
RMSD CalculatorComparing one structure against anotherA full time series across many frames is needed

The main decision point is whether the question is dynamic or static:

  • For one structure or one pairwise comparison, a simpler structure-analysis tool is usually enough.
  • For questions about persistence, transitions, convergence, flexibility, or hydration over time, trajectory analysis is the correct layer.

Caveats that matter in practice

  • Sampling frequency still limits interpretation: A frame saved every 10 ns cannot recover fast side-chain switching, no matter how many downstream analyses are run.
  • Selection choice changes the science: protein and name CA is useful for global fold tracking, but it can miss side-chain rearrangements and interaction chemistry.
  • Hydrogen-dependent analyses depend on protonation and topology quality: Missing hydrogens or inconsistent naming can suppress hydrogen-bond signals.
  • Subsampling trades speed for detail: Larger step values reduce runtime and noise, but short-lived events may disappear.
  • Water dynamics is not future-proof upstream: The current wrapper still exposes it, but the underlying MDAnalysis path is being deprecated on the route to version 3.0.