RAxML-NG

Infer maximum-likelihood phylogenetic trees from aligned protein or DNA sequences with bootstrap support.

20
Configure input settings on the left, then click "Infer Tree"

Related tools

DockQ

DockQ

Assess docking model quality by comparing predicted complexes against native references. DockQ v2.1.3 supports protein, nucleic-acid, and supported small-molecule interfaces with faithful upstream metrics.

HMMER

HMMER

Sensitive sequence homology search using profile hidden Markov models. More accurate than BLAST for detecting remote homologs, ideal for finding evolutionarily distant protein family members.

MAFFT

MAFFT

Perform multiple sequence alignment using MAFFT (Multiple Alignment using Fast Fourier Transform). Supports multiple algorithms from fast progressive to highly accurate iterative methods.

MMseqs2

MMseqs2

Ultra-fast sequence search and clustering. 10,000x faster than BLAST for database searches, with powerful sequence clustering capabilities for proteins and nucleotides.

MUSCLE5

MUSCLE5

Perform multiple sequence alignment using MUSCLE5 (MUltiple Sequence Comparison by Log-Expectation). Uses the PPP algorithm for high-quality alignments with support for ensemble generation.

RNAdistance

RNAdistance

RNAdistance compares RNA secondary structures using the selected upstream ViennaRNA distance representation and comparison mode.

AF-Cluster

AF-Cluster

Cluster Multiple Sequence Alignments to predict alternative protein conformations with AlphaFold2. Uses DBSCAN clustering to identify sequence subgroups.

MSA Viewer

MSA Viewer

Interactive viewer for multiple sequence alignments with color-coded residues and consensus sequence

ORF Finder

ORF Finder

Find all Open Reading Frames (ORFs) in DNA sequences. Searches all six reading frames and supports multiple genetic codes.

DR-BERT

DR-BERT

DR-BERT is a compact protein language model that predicts intrinsically disordered regions (IDRs) in proteins. It outputs per-residue disorder probability scores (0–1) from amino acid sequences, enabling fast and accurate annotation of disordered regions without structural data.

What is RAxML-NG?

RAxML-NG infers phylogenetic trees from an existing multiple sequence alignment using maximum likelihood. It is the current-generation rewrite of RAxML and ExaML, built for faster likelihood calculations, more stable optimization, and large alignments where branch support still matters.

The practical point of RAxML-NG is not just tree building. It is tree building under explicit evolutionary models, with branch lengths, likelihood scores, and bootstrap support that can stand up in downstream comparative analyses. That makes it a better fit than approximate methods when the topology will be interpreted biologically or used in a manuscript.

This tool expects an alignment, not raw sequences. If the sequences are not already aligned, start with MAFFT, MUSCLE5, or Clustal Omega before running tree inference.

How does RAxML-NG work?

RAxML-NG searches tree space for the topology and branch lengths that maximize the likelihood of the observed alignment under a substitution model. In practice, that means comparing many candidate trees, optimizing branch lengths and model parameters, and keeping the trees that improve the score.

The rewrite improved several pieces of the original RAxML search procedure. The published implementation fixes missed topological moves from earlier versions, improves optimization for models such as LG4X, adds transfer bootstrap expectation (TBE), and reports terraces in tree space when the data structure implies many equally scoring topologies. It also incorporates site-repeat optimizations and parallelization improvements that matter on taxon-rich datasets.

One consequence of maximum likelihood inference is that alignment quality dominates the final result. RAxML-NG can optimize a tree very efficiently, but it cannot rescue a poor alignment, mixed paralogs, or a nucleotide alignment forced into a protein model.

How to use RAxML-NG online

RAxML-NG runs on ProteinIQ from a pre-aligned protein or DNA dataset in FASTA or PHYLIP format. Submit the alignment, choose whether to run ML search, bootstrapping, or both, and the job returns Newick tree files, bootstrap outputs, support-annotated trees, model information, and the execution log produced by RAxML-NG.

Inputs

InputDescription
AlignmentOne aligned protein or DNA dataset in FASTA, PHYLIP, or plain text containing one of those formats. At least 2 sequences are required.
Job nameOptional label for the submitted job in the ProteinIQ interface.

Settings

SettingDescription
Analysis modeML tree search, Bootstrapping only, or ML search + bootstrapping. The combined mode is the default.
Sequence typeAuto-detect from alignment, Protein, or DNA. Auto-detection works when the alignment alphabet is unambiguous. Short nucleotide alignments made only of A, C, G, and N may need manual selection.
Model modeAutomatic (AA or DNA) uses the upstream model family shorthand, AA or DNA. Custom model string passes an explicit RAxML-NG model such as GTR+G or LG+G8+F.
Custom modelRequired only when Model mode is set to custom. Useful when a manuscript or lab workflow needs a specific model string instead of the family default.
Starting treeauto, pars{N}, or rand{N}. pars{10} starts from 10 parsimony trees, rand{10} starts from 10 random trees. This only affects ML search workflows.
Random seedPositive integer used for reproducibility. The same alignment and same settings should reproduce the same random starting conditions.
CPU threadsNumber of CPU threads used by the hosted run. More threads can reduce wall-clock time on larger alignments.
Bootstrap replicatesNumber of bootstrap trees when bootstrap analysis is enabled. 100 is often enough for exploratory work. Publication workflows often use several hundred to one thousand replicates.
Bootstrap support metricFBP, TBE, or RBS. FBP is the classic bootstrap proportion. TBE is often more stable on large trees with a few unstable taxa.
Output prefixPrefix used for downloadable filenames such as raxml.bestTree.newick.

Outputs

Output fileWhen it appearsMeaning
prefix.bestTree.newickML search or combined modeBest maximum-likelihood tree in Newick format.
prefix.bestTreeCollapsed.newickML search or combined modeBest tree with near-zero branches collapsed, useful when tiny branches clutter interpretation.
prefix.bootstraps.newickBootstrap or combined modeAll bootstrap replicate trees in one Newick file.
prefix.support.newick or metric-specific support treesBootstrap or combined modeSupport-annotated tree. Depending on the chosen metric, the wrapper can return files such as supportFBP, supportTBE, or supportRBS.
prefix.bestModel.txtWhen emitted by RAxML-NGOptimized model parameters and final model details.
prefix.mlTrees.newickML search or combined modeCandidate ML trees from the starting-tree searches. Useful for auditing the search rather than for final reporting.
prefix.raxml.logAlways when the run succeedsFull RAxML-NG log, including command details and summary statistics such as final log-likelihood, AIC, and BIC when available.

Understanding the results

RAxML-NG results are easiest to interpret in three layers: topology, branch lengths, and branch support.

Topology

The branching pattern is the inferred evolutionary hypothesis. Internal nodes define clades. A change in topology is biologically meaningful, so any region of the tree with weak support should be treated as unresolved rather than overinterpreted.

Branch lengths

Branch lengths are reported in substitutions per site. Longer branches indicate more inferred evolutionary change. Extremely long terminal branches often signal problematic sequences, alignment issues, contamination, or highly divergent taxa.

Support values

Support values only appear when bootstrap analysis is run. Lower values indicate that small changes in the resampled alignment often alter that split.

Support valueInterpretation
>= 95Strong support for that clade in most datasets
80-94Reasonable support, often usable but worth checking against alignment quality
70-79Weak to moderate support, commonly treated with caution
< 70Unstable split, usually not strong evidence for that relationship

Those cutoffs are conventions, not laws. Deep trees with uneven taxon sampling can show lower classical bootstrap support even when the overall signal is meaningful. That is one reason TBE exists.

FBP vs TBE vs RBS

MetricWhat it emphasizesWhen it is useful
FBPExact recovery of the same bipartition across replicatesStandard reporting and direct comparison with older phylogenetics literature
TBESimilarity between splits rather than exact matchingLarge trees where a few unstable taxa would otherwise deflate support for deep branches
RBSRapid bootstrap workflowFaster support estimation when runtime matters more than strict comparability with classical bootstrap

When to use RAxML-NG vs alternatives

RAxML-NG occupies the middle ground between very fast approximate tree builders and feature-rich ML packages with their own model-selection ecosystems.

ToolBest use caseTradeoff
RAxML-NGMaximum-likelihood inference when a solid ML search and standard bootstrap workflow are the priorityRequires a pre-aligned dataset and is slower than approximate methods
IQ-TREEAnalyses where built-in model selection and ultrafast bootstrap are centralDifferent search heuristics and support framework, often more feature-focused on model testing
FastTreeVery large alignments where speed matters more than exact ML optimizationMuch faster, but approximate

RAxML-NG is a strong default when the alignment is already prepared and the goal is a conventional ML tree with explicit bootstrap support. IQ-TREE is often the better choice when systematic model testing is part of the analysis plan. FastTree is better for exploratory work on very large datasets where approximate topology is good enough.

  • MAFFT: Builds multiple sequence alignments before phylogenetic inference.
  • MUSCLE5: Alternative MSA method for protein or nucleotide datasets.
  • Clustal Omega: Scalable alignment tool that fits naturally before tree building.
  • IQ-TREE: Another maximum-likelihood phylogeny tool with strong model-selection features.
  • FastTree: Approximate maximum-likelihood trees for very large alignments.

Based on the RAxML-NG paper by Kozlov et al. (2019) in Bioinformatics and the official RAxML-NG GitHub repository.