Click to upload (.pdb, .ent)
AllMetal3D predicts metal ion binding sites in protein structures—identifying where metals bind, which metal is likely present, and the coordination geometry. It extends Metal3D, a framework originally designed for zinc, to cover the full range of biologically relevant metals: alkali ions (Na⁺, K⁺), alkaline-earth ions (Mg²⁺, Ca²⁺), and transition metals (Zn²⁺, Fe³⁺, Mn²⁺, Cu²⁺, Ni²⁺, Co²⁺). A companion model, Water3D, predicts likely water binding positions in the same framework.
Developed by Simon Duerr and Ursula Roethlisberger at EPFL's Laboratory of Computational Chemistry and Biochemistry, AllMetal3D addresses a persistent blind spot in structure prediction: deposited PDB structures frequently have missing or misassigned metal ions, and apo structures lack metals entirely. AllMetal3D benchmarks favorably against MetalSiteHunter, AlphaFold3, MIC, MIB2, and MetalHawk.
ProteinIQ runs AllMetal3D and Water3D on cloud GPU infrastructure—no software installation, no Python environment.
| Input | Details |
|---|---|
Protein Structure | PDB or ENT file, or a RCSB PDB ID. The structure must contain protein atoms. Maximum file size 50 MB. |
| Setting | Description |
|---|---|
Models | AllMetal3D + Water3D (default) runs both models. Selecting one model reduces runtime. |
Prediction mode | Fast (default) subsamples residues for speed. All residues scans every residue and is more thorough. Site-focused restricts analysis to a sphere around specified residues. |
Central residue | Required for site mode only. One or more residue numbers separated by spaces (e.g., 101 203). |
Site radius | Radius in Ångströms around the central residues in site mode. Range 4–50 Å, default 8 Å. |
Clustering threshold | Agglomerative clustering cutoff in Ångströms. Lower values split nearby predictions into separate sites; higher values merge them. Range 1–10 Å, default 7 Å. |
Probability threshold | Minimum confidence for a site to appear in results. Raise to see only high-confidence predictions. Range 0.05–1.0, default 0.25. |
Batch size | Inference batch size. Lower values (e.g., 10) reduce GPU memory usage on very large structures. Default 50. |
Results for AllMetal3D are tabulated by site:
| Column | Description |
|---|---|
Structure | Source structure identifier |
Site | Sequential site index |
X, Y, Z | Predicted metal coordinates in Ångströms |
Location confidence | Model confidence in the predicted position (%) |
Predicted identity | Most likely metal ion (e.g., Zn²⁺, Ca²⁺) |
Identity confidence | Confidence in the metal identity classification (%) |
Predicted geometry | Coordination geometry (e.g., tetrahedral, octahedral) |
Downloadable outputs include the predicted PDB (with metal coordinates appended) and CUBE-format electron density files for visualization.
AllMetal3D uses a fully convolutional 3D CNN trained on metal-containing structures from the PDB. The network processes a volumetric representation of the local protein environment centered on candidate positions. The architecture consists of five convolutional layers with 8, 60, 100, 80, and 30 channels, with leaky ReLU activation (slope 0.2) and max-pooling after the second layer. Processed features condense into a 1,280-dimensional fingerprint that feeds into separate fully-connected heads for identity and geometry classification.
Prediction follows a two-stage pipeline. AllMetal3Dloc first identifies candidate positions by scanning the structure and scoring each position for the likelihood of containing any metal. Positions above the probability threshold are then passed to the classification network, which outputs the most probable metal identity and coordination geometry.
Water3D applies the same 3D CNN framework trained specifically on crystallographic water positions, identifying ordered water molecules that are consistently present across multiple crystal structures of related proteins.
Identity classification accuracy varies substantially by metal type. Zinc sites are predicted most reliably given the abundance of Zn-containing structures in training data. Magnesium and sodium are more challenging due to their broad and irregular binding motifs.
| Confidence | Interpretation |
|---|---|
| >70% | High-confidence assignment—treat as reliable |
| 40–70% | Moderate—consider the predicted identity likely but not certain |
| <40% | Low—the model sees a metal-like environment but cannot distinguish identity well |
The geometry classifier performs well for tetrahedral and octahedral arrangements, which dominate in natural proteins. Other geometries (square planar, trigonal bipyramidal, etc.) are classified less accurately and should be treated with caution.
Water predictions reflect ordered, structurally conserved water positions. Loosely bound or disordered waters—which appear at low occupancy in crystal structures—will generally fall below the probability threshold.
Metal selectivity depends partly on cellular context (compartmentalization, metal availability during protein synthesis) that cannot be inferred from structure alone. A site predicted as Mg²⁺ may be occupied by other divalent ions in different cellular environments.
Median positional error for metal site location is approximately 0.5 Å. This is sufficient for identifying which residues coordinate the metal, but marginal for distinguishing subtle geometry differences. Vacancy prediction—identifying sites that are empty in a given structure but could be occupied—was not reliable and is not reported.
Performance on Mg²⁺ and Na⁺/K⁺ is meaningfully lower than for transition metals, reflecting sparser training data and the tendency of these ions to adopt irregular or water-mediated coordination environments.
Geometry confidence | Confidence in the geometry classification (%) |