
PDB to SDF Converter
Convert PDB protein structure files to SDF format for cheminformatics and molecular analysis. Upload a PDB file or paste the structure data below.
PDB (Protein Data Bank) and SDF (Structure Data File) are both molecular file formats, but they serve different purposes. PDB files store 3D coordinates of macromolecules with biological context, while SDF files are designed for cheminformatics with rich chemical property storage.
The SDF format was developed by MDL (now part of BIOVIA) and has become the standard exchange format in computational chemistry. It wraps the older MOL file format while adding support for multiple molecules and associated data fields.
Converting PDB to SDF is essential when preparing structures for virtual screening, QSAR modeling, or chemical database applications that require the flexible data storage capabilities of SDF.
The converter transforms PDB's fixed-width column format into SDF's sectioned structure while preserving atomic coordinates and connectivity.
PDB ATOM records contain coordinates, atom names, residue information, and chain identifiers in fixed columns. These are mapped to SDF's atom block which stores coordinates and atom types in a more compact format.
SDF files require explicit bond information in the connection table. When "Generate bonds" is enabled, the converter infers chemical bonds from interatomic distances using standard covalent radii.
PDB CONECT records, if present, are also incorporated to ensure accurate bond representation for non-standard connectivity.
SDF files support arbitrary data fields after the M END marker. When metadata is included, information like chain IDs, residue names, and original PDB header data are preserved as searchable properties.
All chains exports the complete structure, First chain only extracts just chain A (or the first defined chain), and Specific chains lets you define exactly which chains to export.A,B,C).SDF is the preferred format for:
Use PDB to MOL2 instead if you need Tripos atom typing for molecular dynamics or SYBYL-based workflows.
1# Basic conversion2obabel input.pdb -O output.sdf3 4# Add hydrogens and compute 3D coordinates5obabel input.pdb -O output.sdf -h --gen3d6 7# Extract specific chains (requires preprocessing)8obabel input.pdb -O output.sdf -d # Remove hydrogens first if needed1from rdkit import Chem2 3# Read PDB and write SDF4mol = Chem.MolFromPDBFile('input.pdb')5if mol:6 writer = Chem.SDWriter('output.sdf')7 writer.write(mol)8 writer.close()1# In PyMOL command line2load input.pdb3save output.sdf