AllMetal3D

Predict metal and water binding sites in proteins with 3D CNN models.

55 credits
Configure input settings on the left, then click "Submit"

Related tools

SuperWater

SuperWater

Predict protein hydration sites from a structure using a diffusion model with ESM features and a confidence-filtering head.

Aggrescan3D

Aggrescan3D

Faithful static-mode Aggrescan3D wrapper for per-residue aggregation propensity analysis from a single protein structure.

PROPKA 3

PROPKA 3

Predict pKa values of ionizable groups in proteins and protein-ligand complexes from 3D structure. PROPKA calculates environment-driven pKa shifts for standard ionizable residues, terminal groups, and supported ligand atom types.

ThermoMPNN

ThermoMPNN

Predict protein thermostability changes (ΔΔG) for point mutations using a graph neural network. Enables computational saturation mutagenesis screening to identify stabilizing mutations.

MolProbity

MolProbity

Validate protein structure quality with all-atom contact analysis, Ramachandran plots, rotamer assessment, and geometry checks.

Protein stability

Protein stability

Predict protein stability using validated BioPython methods: Instability Index, Aliphatic Index, GRAVY, flexibility analysis, and charge distribution

ScanNet

ScanNet

Geometric deep learning model for predicting protein binding sites directly from 3D structure. Identifies where proteins interact with other proteins, antibodies, or disordered proteins with high accuracy, including for novel protein folds.

DockQ

DockQ

Assess docking model quality by comparing predicted complexes against native references. DockQ v2.1.3 supports protein, nucleic-acid, and supported small-molecule interfaces with faithful upstream metrics.

DSSP

DSSP

Assign protein secondary structure using the DSSP algorithm. The gold standard for hydrogen bond-based structure assignment from coordinates.

IPSAE

IPSAE

Scoring function for interprotein interactions in AlphaFold2, AlphaFold3 and Boltz predictions. Calculates ipSAE, ipTM, pDockQ, pDockQ2, and LIS scores to assess protein-protein interface quality.

What is AllMetal3D?

AllMetal3D predicts metal binding sites in protein structures, including where a site is likely to occur, the most likely upstream identity class, and the coordination geometry. Its identity classifier reports the same classes exposed by the upstream project: Alkali, MG, CA, ZN, NonZNTM, and NoMetal. A companion model, Water3D, predicts likely water binding positions in the same framework.

Developed by Simon Duerr and Ursula Roethlisberger at EPFL's Laboratory of Computational Chemistry and Biochemistry, AllMetal3D addresses a persistent blind spot in structure prediction: deposited PDB structures frequently have missing or misassigned metal ions, and apo structures lack metals entirely. AllMetal3D benchmarks favorably against MetalSiteHunter, AlphaFold3, MIC, MIB2, and MetalHawk.

How to use AllMetal3D online

ProteinIQ runs AllMetal3D and Water3D on cloud GPU infrastructure—no software installation, no Python environment.

Input

InputDetails
Protein StructurePDB, ENT, CIF, or mmCIF file, or a RCSB PDB ID. The structure must contain protein atoms. Maximum file size 50 MB.

Settings

SettingDescription
ModelsAllMetal3D + Water3D (default) runs both models. Selecting one model reduces runtime.
Prediction modeFast (default) subsamples residues for speed. All residues scans every residue and is more thorough. Site-focused restricts analysis to a sphere around specified residues.
Central residueRequired for site mode only. One or more residue numbers separated by spaces (e.g., 101 203).
Site radiusRadius in Ångströms around the central residues in site mode. Range 4–50 Å, default 8 Å.
Clustering thresholdAgglomerative clustering cutoff in Ångströms. Lower values split nearby predictions into separate sites; higher values merge them. Range 1–10 Å, default 7 Å.
Probability thresholdMinimum confidence for a site to appear in results. Raise to see only high-confidence predictions. Range 0.05–1.0, default 0.25.
Batch sizeInference batch size. Lower values (e.g., 10) reduce GPU memory usage on very large structures. Default 50.

Output

Results for AllMetal3D are tabulated by site:

ColumnDescription
StructureSource structure identifier
SiteSequential site index
X, Y, ZPredicted metal coordinates in Ångströms
Location confidenceModel confidence in the predicted position (%)
Predicted identityMost likely upstream identity class (Alkali, MG, CA, ZN, NonZNTM, or NoMetal)
Identity confidenceConfidence in the metal identity classification (%)
Predicted geometryCoordination geometry (e.g., tetrahedral, octahedral)
Geometry confidenceConfidence in the geometry classification (%)

The detailed table also includes raw per-class probability columns for both identity and geometry, so you can inspect the full upstream distribution instead of only the top-ranked class.

Downloadable outputs include the predicted PDB (with metal coordinates appended) and CUBE-format electron density files for visualization.

How AllMetal3D works

AllMetal3D uses a fully convolutional 3D CNN trained on metal-containing structures from the PDB. The network processes a volumetric representation of the local protein environment centered on candidate positions. The architecture consists of five convolutional layers with 8, 60, 100, 80, and 30 channels, with leaky ReLU activation (slope 0.2) and max-pooling after the second layer. Processed features condense into a 1,280-dimensional fingerprint that feeds into separate fully-connected heads for identity and geometry classification.

Prediction follows a two-stage pipeline. AllMetal3Dloc first identifies candidate positions by scanning the structure and scoring each position for the likelihood of containing any metal. Positions above the probability threshold are then passed to the classification network, which outputs the most probable metal identity and coordination geometry.

Water3D applies the same 3D CNN framework trained specifically on crystallographic water positions, identifying ordered water molecules that are consistently present across multiple crystal structures of related proteins.

Interpreting results

Metal identity confidence

Identity classification accuracy varies substantially by metal type. Zinc sites are predicted most reliably given the abundance of Zn-containing structures in training data. Magnesium and sodium are more challenging due to their broad and irregular binding motifs.

ConfidenceInterpretation
>70%High-confidence assignment—treat as reliable
40–70%Moderate—consider the predicted identity likely but not certain
<40%Low—the model sees a metal-like environment but cannot distinguish identity well

Coordination geometry

The geometry classifier performs well for tetrahedral and octahedral arrangements, which dominate in natural proteins. Other geometries (square planar, trigonal bipyramidal, etc.) are classified less accurately and should be treated with caution.

Water3D sites

Water predictions reflect ordered, structurally conserved water positions. Loosely bound or disordered waters—which appear at low occupancy in crystal structures—will generally fall below the probability threshold.

Limitations

Metal selectivity depends partly on cellular context (compartmentalization, metal availability during protein synthesis) that cannot be inferred from structure alone. A site predicted as Mg²⁺ may be occupied by other divalent ions in different cellular environments.

Median positional error for metal site location is approximately 0.5 Å. This is sufficient for identifying which residues coordinate the metal, but marginal for distinguishing subtle geometry differences. Vacancy prediction—identifying sites that are empty in a given structure but could be occupied—was not reliable and is not reported.

Performance on Mg²⁺ and Na⁺/K⁺ is meaningfully lower than for transition metals, reflecting sparser training data and the tendency of these ions to adopt irregular or water-mediated coordination environments.