
Calculate solvent accessible surface area for protein structures
Solvent Accessible Surface Area (SASA) measures how much of a protein's surface is exposed to the surrounding solvent. This property reveals which residues are on the protein's exterior versus buried in its hydrophobic core.
SASA is fundamental to understanding protein folding, stability, and function. Exposed hydrophobic residues often indicate binding sites or regions that may aggregate. Changes in SASA between conformational states can quantify domain movements or ligand-induced structural changes.
For a comprehensive analysis of your structure, combine SASA with other structural tools like the Ramachandran Plot for backbone geometry or STRIDE for secondary structure assignment.
The Shrake-Rupley algorithm calculates SASA by computationally "rolling" a probe sphere (representing a water molecule) over the protein surface. First introduced in 1973, it remains the standard method for SASA calculation.
Each atom is represented as a sphere with its van der Waals radius. The algorithm expands these radii by the probe radius (typically 1.4 Å for water) to create an accessible sphere. Points distributed on this expanded sphere are tested for overlap with neighboring atoms.
The accessible surface area for each atom equals the fraction of test points not buried by neighbors, multiplied by the sphere's surface area:
SASAatom=NtotalNaccessible×4πr2where r is the van der Waals radius plus the probe radius, and N is the number of test points.
The algorithm uses standard van der Waals radii for each element. Carbon atoms have a radius of 1.7 Å, nitrogen 1.55 Å, oxygen 1.52 Å, and sulfur 1.8 Å. These radii define the physical size of each atom.
For residue-level output, the calculator also reports relative accessibility—the percentage of a residue's surface that is exposed compared to its maximum possible exposure. This is calculated as:
Relative accessibility=SASAmaxSASAresidue×100%Maximum SASA values are derived from Gly-X-Gly tripeptides, representing a fully exposed residue. Residues with relative accessibility below 20% are typically considered buried, while those above 50% are surface-exposed.
Upload one or more PDB files containing your protein structure. The calculator processes ATOM records and ignores hydrogen atoms by default. You can also fetch structures directly from the RCSB PDB using their 4-character IDs.
Output level: Choose the granularity of results. Structure returns a single total SASA value. Chain breaks down SASA by each polypeptide chain. Residue provides per-residue accessibility, which is most useful for identifying surface-exposed positions.
Probe radius: The radius of the virtual solvent sphere. The standard value of 1.4 Å represents a water molecule. Larger probes (1.8–2.0 Å) can model bulkier solvents or identify only the most accessible regions.
At the structure level, you get the total SASA in Ų along with counts of chains and residues. Typical globular proteins have SASA values ranging from a few thousand to tens of thousands of Ų, depending on size.
Each chain is listed separately with its total SASA and residue count. This is useful for comparing surface exposure between subunits or identifying which chains contribute most to the complex's surface.
The most detailed view shows:
Residues with high relative accessibility are good candidates for surface mutations or chemical modifications. Those with unexpectedly low accessibility despite being charged (Lys, Arg, Glu, Asp) may indicate buried salt bridges.
SASA analysis is valuable when designing mutations—you can confirm a target residue is surface-exposed before introducing modifications. It also helps identify potential binding interfaces, which often show intermediate accessibility values.
Comparing SASA between apo and ligand-bound structures quantifies the buried surface area upon binding, which correlates with binding affinity. Monitoring SASA changes across molecular dynamics trajectories reveals conformational dynamics.
The Shrake-Rupley algorithm treats the protein as a static structure. For flexibility, consider analyzing multiple conformations from NMR ensembles or molecular dynamics. The calculation excludes hydrogen atoms by default, which slightly underestimates true SASA.
Relative accessibility values assume the Gly-X-Gly reference state, which may not perfectly represent the local environment for all residues. Values exceeding 100% can occur for residues in extended conformations.
Shrake A, Rupley JA (1973). Environment and exposure to solvent of protein atoms. Lysozyme and insulin. Journal of Molecular Biology 79:351-371. doi:10.1016/0022-2836(73)90011-9