
Glycosylation site finder
Find potential N-linked glycosylation sites (NX[S/T] sequons) in protein sequences.

Calculate the aliphatic index of protein sequences. A measure of the relative volume occupied by aliphatic side chains, indicating thermostability.

Calculate the molar extinction coefficient of protein sequences at 280 nm. Used for protein concentration determination by UV spectroscopy.

Calculate the GRAVY (Grand Average of Hydropathy) score of protein sequences. Positive values indicate hydrophobic proteins, negative values indicate hydrophilic proteins.

Calculate the instability index of protein sequences. Values above 40 indicate an unstable protein with a short half-life in vitro.

Calculate protein molecular weight (MW) from amino acid sequences in Daltons and kilodaltons. Supports FASTA and CSV sequence input, average or monoisotopic masses, initiator Met removal, and disulfide correction.

Calculate the theoretical isoelectric point (pI) of protein sequences. The pI is the pH at which a protein carries no net electrical charge.

Analyze amino acid composition of protein sequences. The tool accepts FASTA sequences and outputs the percentage of each amino acid in the sequence.

Scan protein sequences for biologically important motifs including glycosylation sites, phosphorylation sites, nuclear localization signals, prenylation motifs, and more.

Faithful static-mode Aggrescan3D tool for per-residue aggregation propensity analysis from a single protein structure.

Plot net charge vs pH for protein sequences. Visualize how protein charge changes across pH 0-14 and identify the isoelectric point (pI) where the net charge crosses zero.
N-linked glycosylation is one of the most common and important post-translational modifications (PTMs) of proteins. It involves the attachment of oligosaccharide chains (glycans) to the nitrogen atom of asparagine (Asn, N) side chains at specific sequence motifs called sequons. This modification occurs co-translationally in the endoplasmic reticulum and plays critical roles in protein folding, stability, cell signaling, and immune recognition.
The consensus motif for N-linked glycosylation is N-X-S/T, where:
The exclusion of proline at the X position is due to its rigid cyclic structure, which prevents the asparagine from adopting the conformation required for glycan attachment by the oligosaccharyltransferase (OST) enzyme.
This tool scans protein sequences for potential N-linked glycosylation sites by identifying all N-X-S/T sequons where X is not proline. It accepts input in FASTA, plain text, or PDB format and supports batch processing of multiple sequences.
| Input | Description |
|---|---|
Protein Sequences | One or more amino acid sequences in FASTA format, plain text, or PDB file. Files can be uploaded as .fasta, .fa, .txt, or .pdb formats. |
Output is a spreadsheet with glycosylation site data for each input sequence.
| Column | Description |
|---|---|
Protein ID | Sequence identifier extracted from the FASTA header. |
Amino Acids | Total number of amino acid residues in the sequence. |
Total Sites | Number of N-X-S/T sequons identified in the sequence. |
Density | Percentage of residues that are part of glycosylation sites (sites / sequence length x 100). |
Sites | List of identified sites showing position number and sequon triplet (e.g., 42:NVT, 158:NAS). |
The tool performs a linear scan of each protein sequence, examining every tripeptide window for the N-X-S/T consensus motif. The algorithm works as follows:
This is a motif-based prediction, not a machine learning model. It identifies all positions in the sequence that match the consensus motif. Not all sequons are necessarily glycosylated in vivo -- actual glycosylation depends on additional factors including protein folding, accessibility to OST, and cellular context.
The density of glycosylation sites varies substantially across protein types:
The presence of an N-X-S/T sequon is necessary but not sufficient for glycosylation. Additional factors that influence whether a sequon is actually glycosylated include: