What's the largest protein? It's PKZILLA-1

PKZILLA-1, discovered in 2024, holds the record at 4.7 megadaltons with 45,212 amino acids.

Category
Statistics
Author
Dr. Matic Broz
Read time
5 min
Updated
Dec 29, 2025
Share
What's the largest protein? It's PKZILLA-1

What is the largest protein?

The largest known protein is PKZILLA-1, a giant enzyme containing 45,212 amino acids with a molecular mass of 4.7 megadaltons (MDa). Discovered in August 2024 by researchers at UC San Diego's Scripps Institution of Oceanography, this protein surpasses titin by 25% in molecular weight.

PKZILLA-1 was identified in the golden alga Prymnesium parvum, where it functions as a polyketide synthase enzyme. The protein contains 140 enzyme domains arranged in sequence, making it an extraordinarily complex molecular machine.

"This is the Mount Everest of proteins," noted Bradley Moore, the study's senior author at Scripps Oceanography.

FeaturePKZILLA-1Titin (previous record)
Mass4.7 MDa3.7–3.8 MDa
Amino acids45,21234,350
Enzyme domains140N/A (structural)
FunctionPolyketide synthesisMuscle elasticity
OrganismPrymnesium parvum (algae)Human muscle
Discovery20241979

How big is titin, the previous record holder?

Titin consists of 34,350 amino acids in its canonical human isoform, with a molecular weight of approximately 3.8 MDa. The protein stretches over 1 micrometer in length, spanning the entire length of the muscle sarcomere from Z-disk to M-line.

Titin functions as a molecular spring in striated muscle, providing passive elasticity that allows muscles to stretch and return to their resting state. According to RCSB PDB-101, titin consists of more than 34,000 amino acids organized into several hundred modular domains, including Ig-like domains, fibronectin-like domains, and a disordered PEVK region.

The mouse homologue is slightly larger, comprising 35,213 amino acids with a molecular weight of 3.9 MDa. Titin constitutes approximately 10% of muscle mass, making it the third most abundant protein in muscle after actin and myosin.

How big is PKZILLA-1 compared to typical proteins?

PKZILLA-1 is approximately 100 times larger than an average protein. According to Bionumbers, eukaryotic proteins average 472 amino acids, while bacterial proteins average 320 amino acids. Using the standard residue weight of 110 Da per amino acid, this translates to approximately 33–55 kDa for a typical protein.

Protein categoryAmino acidsMolecular weight
Typical protein300–50033–55 kDa
Large protein1,000–5,000100–500 kDa
Giant protein (titin)34,3503,700–3,800 kDa
Largest protein (PKZILLA-1)45,2124,700 kDa

PKZILLA-1 reaches approximately 1 micrometer in length. Despite its enormous size, it remains far smaller than most cells; a typical human cell measures 10–100 micrometers in diameter.

Largest proteins by molecular weight
Largest proteins by molecular weight

What does PKZILLA-1 do?

PKZILLA-1 and its companion enzyme PKZILLA-2 (3.2 MDa, 99 enzyme domains) work together to produce prymnesin, a potent ichthyotoxin responsible for massive fish kills during harmful algal blooms. According to the Science publication, these giant enzymes catalyze 239 sequential chemical reactions to assemble the complex toxin molecule.

This discovery has two important implications for us:

  1. Detecting PKZILLA genes in water samples could enable early warning systems for toxic algal blooms. As co-author Timothy Fallon noted, "Monitoring for the genes instead of the toxin could allow us to catch blooms before they start."
  2. Understanding how nature assembles such complex molecules could inform synthesis of new compounds for medical applications. According to Moore, "Understanding how nature has evolved its chemical wizardry gives us as scientific practitioners the ability to apply those insights to creating useful products."

What are the top 10 largest proteins?

Based on molecular weight, here are the largest known proteins. You can calculate the molecular weight and protein parameters for any sequence using our tools.

RankProteinMolecular weightAmino acidsFunction
1PKZILLA-14,700 kDa45,212Polyketide synthesis (algae)
2Titin3,000–3,800 kDa27,000–34,350Muscle elasticity
3PKZILLA-23,200 kDa~30,000Polyketide synthesis (algae)
4Versican~1,000 kDaVariableExtracellular matrix
5Obscurin720–900 kDa~7,000Muscle organization
6Nebulin600–900 kDa~6,700Thin filament regulation
7AHNAK~700 kDa5,890Membrane organization
8Ryanodine receptor (RyR1)~565 kDa (monomer)5,038Calcium release channel
9Apolipoprotein B-100~515 kDa4,536Lipid transport
10Plectin~500 kDa~4,500Cytoskeletal crosslinking

Many of the largest proteins are found in muscle tissue (titin, obscurin, nebulin, dystrophin), reflecting the complex structural requirements of the contractile apparatus.

Why are some proteins so large?

Giant proteins serve specialized functions that require their extraordinary size:

  • Titin spans the entire muscle sarcomere (~1 μm), physically connecting the Z-disk to the M-line. This requires a protein large enough to bridge this distance while maintaining elastic properties.
  • PKZILLA-1 contains 140 enzyme domains that work in sequence. Rather than relying on 140 separate enzymes finding each other through diffusion, consolidating them into a single protein ensures efficient, sequential processing of chemical reactions.
  • Large muscle proteins like titin, nebulin, and obscurin act as molecular rulers that define sarcomere dimensions and organize other proteins during muscle development.

Jon Clardy, a biological chemist at Harvard, noted that these PKZILLA proteins likely approach the upper size limits for proteins because of the sheer workload of expressing and assembling something that large.

How many different proteins exist?

While PKZILLA-1 represents the extreme upper end of protein size, the total number of known protein sequences exceeds 465 million (UniRef100, 2025). The human genome encodes approximately 19,000–20,000 protein-coding genes, which produce roughly 70,000 distinct protein isoforms through alternative splicing.

Matic Broz

Matic Broz

Founder & CEO, ProteinIQ

Matic founded ProteinIQ to make computational biology accessible to every researcher. He builds code-free bioinformatics tools used by thousands of scientists worldwide for protein analysis, molecular docking, and drug discovery.