fpocket is an open-source protein pocket detection algorithm that identifies ligand binding sites using Voronoi tessellation and alpha sphere geometry.
The software analyzes protein structures to locate and characterize cavities and clefts where small molecules can bind. The algorithm ranks detected pockets by druggability and provides geometric and physicochemical descriptors.
Traditional binding site detection methods often rely on grid-based approaches or require ligand-bound structures as templates. fpocket instead uses geometric principles to identify pockets directly from protein coordinates, making it applicable to both bound and unbound (apo) protein structures.
The algorithm achieves 94% detection accuracy within the top three ranked pockets while executing in under 3 seconds per structure.
ProteinIQ provides a web interface for running fpocket without command-line installation. You upload a protein structure or enter a PDB ID, and the tool returns ranked pockets with geometric and chemical descriptors.
| Input | Description |
|---|---|
Protein Structure | The target protein for pocket detection. Upload a PDB or mmCIF file, or enter a 4-character PDB ID (e.g., 1HSG) to fetch from RCSB. Maximum file size is 50 MB. |
fpocket returns a ranked table of detected pockets with quantitative descriptors.
| Column | Description |
|---|---|
Pocket | Pocket rank, where 1 represents the most druggable predicted site. |
Score | Overall pocket score derived from geometric and physicochemical descriptors. Higher values indicate more favorable binding characteristics. |
Drug Score | Druggability score estimating suitability for small molecule binding. Values range from 0 to 1, with values above 0.5 considered druggable. |
Volume (ų) | Pocket volume in cubic angstroms. Typical drug binding sites range from 200–800 ų. |
Apolar SASA | Solvent-accessible surface area of hydrophobic residues in Ų. Higher values suggest hydrophobic binding environments. |
Polar SASA | Solvent-accessible surface area of polar residues in Ų. |
Alpha Spheres |
The druggability score integrates multiple descriptors using a partial least squares model trained on known drug-binding sites:
Scores reflect both geometric properties (pocket depth, volume) and chemical features (hydrophobicity, polarity balance).
fpocket employs alpha sphere theory based on computational geometry. An alpha sphere is defined as a sphere that touches exactly four protein atoms on its boundary while containing no atoms in its interior. These spheres naturally concentrate in protein cavities and clefts, making them ideal markers for binding site detection.
The algorithm begins by computing a Voronoi decomposition of 3D space around the protein using the Qhull library. Voronoi vertices—points equidistant from four neighboring atoms—correspond to potential alpha sphere centers. fpocket filters these vertices by radius, discarding spheres too small (tight atomic packing in the protein core) or too large (solvent-exposed surface regions).
The default radius range is 3.0–6.0 Å, optimized for typical small molecule binding sites. This geometric criterion eliminates ~80% of candidate spheres before clustering.
fpocket groups neighboring alpha spheres into pockets using a three-pass clustering procedure:
This hierarchical approach handles irregular pocket geometries better than single-threshold methods. The algorithm leverages Qhull's neighbor lists to avoid pairwise distance calculations, achieving near-linear runtime scaling.
Each pocket receives a composite score derived from five weighted descriptors:
| Descriptor | Weight | Meaning |
|---|---|---|
| Normalized alpha sphere count | 0.3 | Pocket size indicator |
| Mean local hydrophobic density | 0.25 | Apolar residue concentration |
| Proportion of apolar spheres | 0.2 | Hydrophobic character |
| Polarity score | 0.15 | Polar residue presence |
| Alpha sphere density | 0.1 | Spatial compactness |
Weights derive from partial least squares regression against a training set of 48 known ligand-binding sites. The scoring function prioritizes deep, hydrophobic pockets with balanced polarity—characteristics of successful drug targets.
Beyond the scoring function, fpocket computes additional physicochemical properties:
These descriptors enable users to assess pockets beyond the druggability score alone.
Benchmark studies on the PocketPicker dataset (48 diverse proteins) demonstrate:
On the Astex Diverse set (85 high-quality pharmaceutical complexes):
The algorithm outperforms CAST, PASS, SURFNET, and LIGSITE at rank-3 while executing 10–100× faster than grid-based competitors. This speed advantage enables proteome-scale screening applications.
| Number of alpha spheres comprising the pocket. Larger values indicate more extensive cavities. |