ProteinIQ

RNAdistance

RNA structure distance comparison

What is RNAdistance?

RNAdistance compares RNA secondary structures by computing distance metrics that quantify structural differences. Part of the ViennaRNA Package, it calculates tree edit distance and base pair distance between two structures represented in dot-bracket notation. These metrics help assess how similar or different two RNA conformations are, which is essential for studying structural evolution, evaluating prediction accuracy, and analyzing the effects of mutations on RNA folding.

How to use RNAdistance online

ProteinIQ provides browser-based access to RNAdistance with no command-line setup required.

Inputs

InputDescription
Structure 1First RNA secondary structure in dot-bracket notation (e.g., ((((....)))))
Structure 2Second RNA secondary structure in dot-bracket notation (e.g., ((((.....)))))

Structures must use standard dot-bracket notation where dots represent unpaired bases and matched parentheses represent base pairs.

Output

ColumnDescription
Structure 1The first input structure
Structure 2The second input structure
Tree Edit DistanceMinimum cost to transform one structure's tree representation into the other
Base Pair DistanceNumber of base pairs present in one structure but not the other

How RNAdistance works

RNAdistance implements two complementary distance measures:

Base pair distance

The simplest metric counts base pairs that exist in one structure but not the other. If structure A has the pair (i,j) and structure B does not, that contributes to the distance. The total equals the number of base pair insertions and deletions needed to convert one structure into the other.

For equal-length structures on identical sequences, base pair distance provides an intuitive measure: a distance of zero means identical folding, while higher values indicate greater structural divergence.

Tree edit distance

RNA secondary structures can be represented as ordered trees, where each node corresponds to a structural element. Tree edit distance computes the minimum-cost sequence of node insertions, deletions, and replacements needed to transform one tree into another.

The algorithm assigns costs to operations:

  • Inserting or deleting an unpaired base: 1
  • Inserting or deleting a base pair: 2
  • Replacing a base pair with an unpaired base: 1

Tree editing captures structural relationships more faithfully than string-based metrics because it respects the hierarchical nesting of helices, loops, and bulges.

Applications

  • Mutation analysis: Quantify how single nucleotide changes affect secondary structure
  • Prediction benchmarking: Compare predicted structures against experimentally determined ones
  • Structural clustering: Group similar conformations from suboptimal structure ensembles
  • Evolutionary studies: Track structural conservation across homologous RNAs
  • RNAfold: Predict secondary structures to compare
  • RNAsubopt: Generate suboptimal structures for clustering analysis
  • RNAeval: Calculate free energy of specific structures
  • RNAalifold: Predict consensus structures from aligned sequences
  • ViennaRNA: Access all 14 ViennaRNA methods through a unified interface