
SPRINT (Structure-aware PRotein ligand INTeraction) predicts drug-target interactions using co-embedded protein and ligand representations. Screen thousands of compounds against a protein target in seconds.

Lipinski's Rule of Five predicts whether compounds will be orally bioavailable by evaluating molecular weight, LogP, hydrogen bond donors, and acceptors.

Predict protein solubility and usability for E. coli expression using ESM protein language models

Predict ADMET (Absorption, Distribution, Metabolism, Excretion, Toxicity) properties from SMILES strings using machine learning models trained on Therapeutics Data Commons datasets.

Predict 22 ADMET properties from SMILES strings with the upstream Admetica Chemprop models from Datagrok.

Identify toxic, reactive, and pharmacokinetically problematic molecular fragments using structural alert patterns

Predict toxicity and synthetic accessibility of small molecules using machine learning. eToxPred combines toxicity risk assessment with synthetic accessibility scoring to help prioritize drug candidates.

Screen for lead-like compounds using stricter molecular descriptor criteria than Lipinski or Veber rules for early-stage drug discovery

Screen compounds for Pan-Assay INterference patterns that cause false positives in biological assays

Quantitative estimate for protein-protein interaction inhibitor potential. Evaluates drug-likeness for compounds targeting PPIs.
fpocket is an open-source protein pocket detection algorithm that identifies ligand binding sites using Voronoi tessellation and alpha sphere geometry.
The software analyzes protein structures to locate and characterize cavities and clefts where small molecules can bind. The algorithm ranks detected pockets by druggability and provides geometric and physicochemical descriptors.
Traditional binding site detection methods often rely on grid-based approaches or require ligand-bound structures as templates. fpocket instead uses geometric principles to identify pockets directly from protein coordinates, making it applicable to both bound and unbound (apo) protein structures.
The algorithm achieves 94% detection accuracy within the top three ranked pockets while executing in under 3 seconds per structure.
ProteinIQ provides a web interface for running fpocket without command-line installation. You upload a protein structure or enter a PDB ID, and the tool returns ranked pockets with geometric and chemical descriptors.
| Input | Description |
|---|---|
Protein Structure | The target protein for pocket detection. Upload a PDB or mmCIF file, or enter a 4-character PDB ID (e.g., 1HSG) to fetch from RCSB. Maximum file size is 50 MB. |
fpocket returns a ranked table of detected pockets with the standard descriptors written to the upstream *_info.txt report.
| Column | Description |
|---|---|
Pocket | Pocket rank, where 1 is the top-ranked predicted site. |
Score | Overall fpocket pocket score derived from geometric and physicochemical descriptors. |
Drug Score | Druggability score estimating suitability for small-molecule binding. |
Volume (A^3) | Pocket volume in cubic angstroms. |
Apolar SASA (A^2) | Solvent-accessible surface area assigned to apolar pocket surface. |
Polar SASA (A^2) | Solvent-accessible surface area assigned to polar pocket surface. |
Alpha Spheres | Number of alpha spheres assigned to the pocket. |
Total SASA (A^2) |
The Files tab includes the native fpocket artifacts for the run, including the *_info.txt report, combined pocket structures, per-pocket coordinate files, visualization scripts, and a CSV export of the spreadsheet values.
The druggability score integrates multiple descriptors using a partial least squares model trained on known drug-binding sites:
Scores reflect both geometric properties (pocket depth, volume) and chemical features (hydrophobicity, polarity balance).
fpocket employs alpha sphere theory based on computational geometry. An alpha sphere is defined as a sphere that touches exactly four protein atoms on its boundary while containing no atoms in its interior. These spheres naturally concentrate in protein cavities and clefts, making them ideal markers for binding site detection.
The algorithm begins by computing a Voronoi decomposition of 3D space around the protein using the Qhull library. Voronoi vertices—points equidistant from four neighboring atoms—correspond to potential alpha sphere centers. fpocket filters these vertices by radius, discarding spheres too small (tight atomic packing in the protein core) or too large (solvent-exposed surface regions).
The default radius range is 3.0–6.0 Å, optimized for typical small molecule binding sites. This geometric criterion eliminates ~80% of candidate spheres before clustering.
fpocket groups neighboring alpha spheres into pockets using a three-pass clustering procedure:
This hierarchical approach handles irregular pocket geometries better than single-threshold methods. The algorithm leverages Qhull's neighbor lists to avoid pairwise distance calculations, achieving near-linear runtime scaling.
Each pocket receives a composite score derived from five weighted descriptors:
| Descriptor | Weight | Meaning |
|---|---|---|
| Normalized alpha sphere count | 0.3 | Pocket size indicator |
| Mean local hydrophobic density | 0.25 | Apolar residue concentration |
| Proportion of apolar spheres | 0.2 | Hydrophobic character |
| Polarity score | 0.15 | Polar residue presence |
| Alpha sphere density | 0.1 | Spatial compactness |
Weights derive from partial least squares regression against a training set of 48 known ligand-binding sites. The scoring function prioritizes deep, hydrophobic pockets with balanced polarity—characteristics of successful drug targets.
Beyond the scoring function, fpocket computes additional physicochemical properties:
These descriptors enable users to assess pockets beyond the druggability score alone.
Benchmark studies on the PocketPicker dataset (48 diverse proteins) demonstrate:
On the Astex Diverse set (85 high-quality pharmaceutical complexes):
The algorithm outperforms CAST, PASS, SURFNET, and LIGSITE at rank-3 while executing 10–100× faster than grid-based competitors. This speed advantage enables proteome-scale screening applications.
| Total solvent-accessible surface area of the pocket. |
Mean Local Hydrophobic Density | Average local hydrophobic density around pocket alpha spheres. |
Mean Alpha Sphere Radius | Mean radius of alpha spheres assigned to the pocket. |
Mean Alpha Sphere Solvent Access | Average solvent accessibility of pocket alpha spheres. |
Apolar Alpha Sphere Proportion | Fraction of alpha spheres classified as apolar. |
Hydrophobicity Score | Aggregate pocket hydrophobicity descriptor reported by fpocket. |
Volume Score | fpocket's internal volume-related score term. |
Polarity Score | Integer polarity descriptor based on lining residue properties. |
Charge Score | Integer net charge descriptor at pH 7 from lining residues. |
Proportion of Polar Atoms | Fraction of pocket atoms considered polar. |
Alpha Sphere Density | Pocket density descriptor based on alpha sphere spacing. |
Centroid-Alpha Sphere Max Dist | Distance from the pocket center of mass to the farthest alpha sphere. |
Flexibility | fpocket flexibility descriptor derived from B-factors. |