PROPKA is an empirical method for predicting the pKa values of ionizable amino acid residues in proteins based on their three-dimensional structure. Developed by Jan H. Jensen and colleagues at the University of Copenhagen, PROPKA calculates how the protein environment shifts the pKa of each ionizable group away from its intrinsic (model) value.
The pKa of an ionizable group determines at what pH that group gains or loses a proton. In solution, amino acids have characteristic pKa values, but within a folded protein, factors such as burial from solvent, hydrogen bonding, and electrostatic interactions with nearby charged residues can shift these values substantially. PROPKA quantifies these perturbations using structure-based empirical energy functions, enabling rapid prediction of protonation states at any given pH.
PROPKA 3 introduced consistent treatment of internal and surface residues through improved desolvation and dielectric response modeling. The method handles ASP, GLU, HIS, CYS, TYR, LYS, ARG residues and both N- and C-terminal groups. Benchmarks show root-mean-square deviations of 0.79 pKa units for Asp/Glu, 0.75 for Tyr, 0.65 for Lys, and 1.00 for His residues compared to experimental measurements.
ProteinIQ provides a web-based interface for running PROPKA without command-line installation. Upload a protein structure file or fetch one from RCSB, and receive predicted pKa values for all ionizable residues.
| Input | Description |
|---|---|
Protein Structures | The target protein structure. Upload a PDB file (.pdb or .ent) or enter a PDB ID to fetch from RCSB. Supports batch processing of up to 10 structures. |
The output is a spreadsheet listing all ionizable residues with their predicted and model pKa values.
| Column | Description |
|---|---|
Structure | The source structure identifier (PDB ID or filename). |
Residue | Three-letter amino acid code (ASP, GLU, HIS, CYS, TYR, LYS, ARG, N+, or C-). |
Position | Residue number in the protein sequence. |
Chain | Chain identifier from the PDB file. |
pKa | Predicted pKa value for this residue in the protein environment. |
Model pKa | Intrinsic pKa of the isolated amino acid in solution. |
Shift | Difference between predicted and model pKa (). |
PROPKA uses the following intrinsic pKa values as baselines:
| Residue | Model pKa |
|---|---|
| ASP (Aspartic acid) | 3.80 |
| GLU (Glutamic acid) | 4.50 |
| C-terminus | 3.20 |
| HIS (Histidine) | 6.50 |
| CYS (Cysteine) | 9.00 |
| TYR (Tyrosine) | 10.00 |
| LYS (Lysine) | 10.50 |
| ARG (Arginine) | 12.50 |
| N-terminus | 8.00 |
The Shift column indicates how the protein environment alters the intrinsic pKa:
To determine if a residue is protonated at a given pH:
At physiological pH (~7.4), residues with predicted pKa values near 7.4 may exist in mixed protonation states.
PROPKA computes pKa values by calculating a perturbation to the model (intrinsic) pKa caused by the protein environment. The predicted pKa is expressed as:
where DS represents desolvation effects, HB represents hydrogen bonding interactions, and CC represents Coulombic charge-charge interactions.
When an ionizable group is buried within a protein, it loses favorable solvation interactions with water. Desolvation destabilizes the charged form relative to the neutral form, shifting carboxylate pKa values upward (making them harder to deprotonate) and amine pKa values downward (making them harder to protonate).
PROPKA calculates the desolvation penalty from two factors:
The desolvation contribution scales continuously between surface and internal residues, avoiding the discontinuous behavior of earlier PROPKA versions that treated these as discrete categories.
Hydrogen bonds can stabilize either the charged or neutral form of an ionizable group, depending on the geometry and donor/acceptor relationship:
The hydrogen bonding contribution depends on both distance and angle, with optimal geometry producing the largest pKa shifts. PROPKA evaluates both side-chain and backbone hydrogen bond donors and acceptors.
Electrostatic interactions between ionizable groups affect their pKa values:
For buried residue pairs, PROPKA applies full Coulombic interactions. For surface-exposed pairs, the high dielectric of water screens these interactions substantially. The transition between these regimes uses the same smooth interpolation applied to desolvation effects.
When two ionizable groups interact strongly, their protonation states become coupled. Neither group can be assigned a single pKa value because the ionization of one depends on the protonation state of the other. PROPKA identifies these cases and marks them with an asterisk in detailed output.
Structures with missing atoms or poor resolution may produce unreliable results. Using PDB Fixer to repair structures before running PROPKA is recommended.