
Predict pKa values of ionizable groups in proteins based on 3D structure.
PROPKA 3 predicts the pKa values of ionizable amino acid residues in proteins based on their 3D structure. The pKa determines at what pH a residue will be protonated or deprotonated, which affects protein stability, enzyme activity, and ligand binding.
Amino acids in solution have well-characterized "model" pKa values. However, when buried inside a protein, the local environment can shift these values dramatically—sometimes by several pH units. PROPKA uses empirical rules to estimate these shifts from desolvation effects, hydrogen bonds, and electrostatic interactions.
Understanding pKa values is essential for structure preparation in molecular dynamics simulations, docking studies, and electrostatics calculations. For preparing structures for electrostatics workflows, consider using PDB2PQR, which uses PROPKA internally to assign protonation states.
PROPKA calculates pKa values using an empirical approach that starts with model pKa values and applies perturbations based on the protein environment.
The predicted pKa for each ionizable group is:
pKa=pKa,model+ΔpKa,desolv+ΔpKa,HB+ΔpKa,coulWhere:
PROPKA uses these reference pKa values for amino acids in solution:
| Residue | Model pKa |
|---|---|
| ASP (Aspartic acid) | 3.80 |
| GLU (Glutamic acid) | 4.25 |
| HIS (Histidine) | 6.50 |
| CYS (Cysteine) | 9.00 |
| TYR (Tyrosine) | 10.07 |
| LYS (Lysine) | 10.50 |
| ARG (Arginine) | 12.50 |
| N-terminus (N+) | 8.00 |
| C-terminus (C-) | 3.20 |
When a charged group is buried inside a protein, it loses favorable interactions with water. This desolvation penalty is the largest contributor to pKa shifts for buried residues.
PROPKA estimates desolvation based on how much solvent volume the protein excludes around the ionizable group. Deeply buried residues experience larger pKa shifts than surface-exposed ones.
PROPKA 3 achieves RMSD values of approximately 0.79 for ASP/GLU, 0.65 for LYS, 0.75 for TYR, and 1.00 for HIS residues when compared to experimental data.
Surface-exposed residues are predicted more accurately since their pKa values remain close to model values. Buried residues with large pKa shifts are more challenging to predict.
PROPKA outputs a table with one row per ionizable residue:
| Column | Description |
|---|---|
| Structure | PDB identifier (when processing multiple structures) |
| Residue | Three-letter amino acid code (ASP, GLU, HIS, etc.) |
| Position | Residue number in the sequence |
| Chain | Chain identifier |
| pKa | Predicted pKa value |
| Model pKa | Reference pKa in solution |
| Shift | Difference between predicted and model pKa (pKa−pKa,model) |
A positive shift means the residue is harder to deprotonate than in solution. For acidic residues (ASP, GLU), this indicates stabilization of the protonated form—often from burial or hydrogen bonding.
A negative shift means the residue is easier to deprotonate. For basic residues (LYS, ARG, HIS), a negative shift indicates destabilization of the protonated form.
Shifts greater than ±2 pH units suggest significant environmental perturbation and warrant careful examination of the local structure.
To determine protonation states at a specific pH, compare the predicted pKa to your target pH:
At physiological pH (7.4), ASP and GLU are typically deprotonated (negatively charged), while LYS and ARG are protonated (positively charged). HIS is near its pKa and may exist in either state.
PROPKA predictions help with several common workflows:
For calculating the overall isoelectric point (pI) of a protein from sequence, use our Isoelectric Point calculator. PROPKA provides residue-level detail from structure, while pI gives a single whole-protein value from sequence.
PROPKA works best for globular, well-folded proteins with reliable structures. Keep these limitations in mind:
We recommend using PDB Fixer to repair structures with missing atoms before running PROPKA.