
Admetica
Admetica is an open-source ADMET prediction toolkit from Datagrok that uses Chemprop-based machine learning models for accurate pharmacokinetic property predictions. Ideal for drug design and lead optimization.
Admetica is an open-source ADMET prediction toolkit developed by Datagrok. It uses Chemprop message-passing neural networks to predict 23 pharmacokinetic and toxicity properties from SMILES strings. Unlike rule-based filters that check molecular descriptors against thresholds, Admetica learns structure-property relationships directly from experimental data.
The toolkit addresses a practical problem in drug discovery: evaluating absorption, distribution, metabolism, excretion, and toxicity (ADMET) early enough to avoid late-stage failures. Compounds that look promising in target assays often fail because they cannot reach the target, get metabolized too quickly, or cause toxicity. Predicting these properties computationally allows researchers to prioritize compounds before committing to expensive synthesis and testing.
Admetica builds on Chemprop, a graph neural network framework that represents molecules as directed graphs. Atoms become nodes; bonds become edges. The network passes information along bonds, learning to associate local structural features with global molecular properties.
Each ADMET property has its own model trained on curated datasets from ChEMBL, AstraZeneca, and academic sources. Training data ranges from 332 compounds (P-gp substrate) to nearly 10,000 compounds (solubility), with model performance validated against held-out test sets.
The regression models output continuous values (permeability coefficients, clearance rates, LD50), while classification models output probabilities for binary outcomes (CYP inhibitor yes/no, hERG liability yes/no).
ProteinIQ hosts Admetica on cloud infrastructure, eliminating the need to install Python environments or configure GPU compute.
| Format | Description |
|---|---|
| Plain SMILES | One compound per line |
| Tab-delimited | compound_name\tSMILES format to preserve identifiers |
| SDF file | Structure-data file with multiple molecules |
| CSV file | Comma-separated with SMILES column |
| PubChem fetch | Retrieve structures by compound name or CID |
Results appear as a spreadsheet with one row per compound. Columns are organized by ADMET category.
| Column | Description |
|---|---|
molecular_weight | Molecular weight in Daltons |
logp | Octanol-water partition coefficient |
tpsa | Topological polar surface area (Ų) |
hbd | Hydrogen bond donor count |
hba | Hydrogen bond acceptor count |
qed | Quantitative estimate of drug-likeness (0–1) |
| Column | Unit/Type | Description |
|---|---|---|
caco2_permeability | log cm/s | Permeability across intestinal epithelial cells |
lipophilicity | log ratio | Octanol-water partition (model-predicted) |
solubility | log mol/L | Aqueous solubility |
pgp_substrate | probability | Likelihood of P-glycoprotein efflux |
| Column | Unit/Type | Description |
|---|---|---|
ppbr | percentage | Plasma protein binding rate |
CYP enzymes metabolize the majority of small-molecule drugs. Inhibiting them causes drug-drug interactions; being a substrate affects clearance.
| Column | Type | Description |
|---|---|---|
cyp1a2_inhibitor | probability | CYP1A2 inhibition liability |
cyp2c9_inhibitor | probability | CYP2C9 inhibition liability |
cyp2c19_inhibitor | probability | CYP2C19 inhibition liability |
cyp2d6_inhibitor | probability | CYP2D6 inhibition liability |
cyp3a4_inhibitor | probability | CYP3A4 inhibition liability |
cyp2c9_substrate | probability | Metabolized by CYP2C9 |
cyp2d6_substrate |
| Column | Unit | Description |
|---|---|---|
clearance_hepatocyte | log mL/min/g | Clearance rate in hepatocyte assays |
clearance_microsome | log mL/min/g | Clearance rate in microsome assays |
| Column | Unit/Type | Description |
|---|---|---|
herg | probability | hERG channel inhibition (cardiac risk) |
ld50 | log mg/kg | Acute oral toxicity |
ames | probability | Mutagenicity (Ames test) |
dili | probability | Drug-induced liver injury risk |
skin_sensitization | probability | Skin sensitization potential |
carcinogenicity | probability | Carcinogenic potential |
clinical_toxicity |
Caco-2 permeability values above −5 log cm/s suggest good intestinal absorption. Values below −6 indicate poor passive permeability, though active transport can compensate.
Solubility above −4 log mol/L is generally favorable for oral drugs. Lower values may require formulation strategies.
P-gp substrate probability above 0.5 suggests the compound may be pumped out of cells, reducing bioavailability and potentially causing drug-drug interactions with P-gp inhibitors.
CYP inhibition probabilities above 0.7 warrant attention for drug-drug interaction potential. CYP3A4 inhibition is particularly significant because this enzyme metabolizes approximately 50% of marketed drugs.
High clearance predictions (hepatocyte or microsome) indicate rapid metabolism, which may limit exposure. Low clearance can cause accumulation.
hERG inhibition probability above 0.5 flags potential cardiac liability. This channel controls heart rhythm; blocking it can cause fatal arrhythmias. Most drug development programs include hERG screening as a safety gate.
Ames positivity (probability > 0.5) indicates potential mutagenicity. Mutagenic compounds are typically excluded from development unless the therapeutic benefit justifies the risk (e.g., oncology).
Both tools use Chemprop neural networks for ADMET prediction. Key differences:
| Aspect | Admetica | ADMET-AI |
|---|---|---|
| Properties | 23 models | 41 models |
| Training data | ChEMBL, AstraZeneca | Therapeutics Data Commons |
| Developer | Datagrok | Stanford/Greenstone |
| Additional properties | Skin sensitization, carcinogenicity, clinical toxicity | BBB penetration, half-life, oral bioavailability |
The tools complement each other. Running both provides broader coverage and a second opinion on shared endpoints.
| probability |
| Metabolized by CYP2D6 |
cyp3a4_substrate | probability | Metabolized by CYP3A4 |
| probability |
| General clinical toxicity risk |