ProteinIQ
SASA calculator example image

SASA calculator

Calculate solvent accessible surface area for protein structures

What is SASA?

Solvent Accessible Surface Area (SASA) measures how much of a protein's surface is exposed to the surrounding solvent. This property reveals which residues are on the protein's exterior versus buried in its hydrophobic core.

SASA is fundamental to understanding protein folding, stability, and function. Exposed hydrophobic residues often indicate binding sites or regions that may aggregate. Changes in SASA between conformational states can quantify domain movements or ligand-induced structural changes.

For a comprehensive analysis of your structure, combine SASA with other structural tools like the Ramachandran Plot for backbone geometry or STRIDE for secondary structure assignment.

How does the Shrake-Rupley algorithm work?

The Shrake-Rupley algorithm calculates SASA by computationally "rolling" a probe sphere (representing a water molecule) over the protein surface. First introduced in 1973, it remains the standard method for SASA calculation.

The rolling ball concept

Each atom is represented as a sphere with its van der Waals radius. The algorithm expands these radii by the probe radius (typically 1.4 Å for water) to create an accessible sphere. Points distributed on this expanded sphere are tested for overlap with neighboring atoms.

The accessible surface area for each atom equals the fraction of test points not buried by neighbors, multiplied by the sphere's surface area:

SASAatom=NaccessibleNtotal×4πr2\text{SASA}_{\text{atom}} = \frac{N_{\text{accessible}}}{N_{\text{total}}} \times 4\pi r^2

where rr is the van der Waals radius plus the probe radius, and NN is the number of test points.

Van der Waals radii

The algorithm uses standard van der Waals radii for each element. Carbon atoms have a radius of 1.7 Å, nitrogen 1.55 Å, oxygen 1.52 Å, and sulfur 1.8 Å. These radii define the physical size of each atom.

Relative accessibility

For residue-level output, the calculator also reports relative accessibility—the percentage of a residue's surface that is exposed compared to its maximum possible exposure. This is calculated as:

Relative accessibility=SASAresidueSASAmax×100%\text{Relative accessibility} = \frac{\text{SASA}_{\text{residue}}}{\text{SASA}_{\text{max}}} \times 100\%

Maximum SASA values are derived from Gly-X-Gly tripeptides, representing a fully exposed residue. Residues with relative accessibility below 20% are typically considered buried, while those above 50% are surface-exposed.

Input requirements

PDB structure

Upload one or more PDB files containing your protein structure. The calculator processes ATOM records and ignores hydrogen atoms by default. You can also fetch structures directly from the RCSB PDB using their 4-character IDs.

Settings

  • Output level: Choose the granularity of results. Structure returns a single total SASA value. Chain breaks down SASA by each polypeptide chain. Residue provides per-residue accessibility, which is most useful for identifying surface-exposed positions.

  • Probe radius: The radius of the virtual solvent sphere. The standard value of 1.4 Å represents a water molecule. Larger probes (1.8–2.0 Å) can model bulkier solvents or identify only the most accessible regions.

Understanding the results

Structure-level output

At the structure level, you get the total SASA in Ų along with counts of chains and residues. Typical globular proteins have SASA values ranging from a few thousand to tens of thousands of Ų, depending on size.

Chain-level output

Each chain is listed separately with its total SASA and residue count. This is useful for comparing surface exposure between subunits or identifying which chains contribute most to the complex's surface.

Residue-level output

The most detailed view shows:

  • Residue: The three-letter amino acid code
  • Position: Sequence position within the chain
  • SASA (Ų): Absolute surface area for that residue
  • Rel. Accessibility (%): Percentage of maximum possible exposure

Residues with high relative accessibility are good candidates for surface mutations or chemical modifications. Those with unexpectedly low accessibility despite being charged (Lys, Arg, Glu, Asp) may indicate buried salt bridges.

Use cases

SASA analysis is valuable when designing mutations—you can confirm a target residue is surface-exposed before introducing modifications. It also helps identify potential binding interfaces, which often show intermediate accessibility values.

Comparing SASA between apo and ligand-bound structures quantifies the buried surface area upon binding, which correlates with binding affinity. Monitoring SASA changes across molecular dynamics trajectories reveals conformational dynamics.

Limitations

The Shrake-Rupley algorithm treats the protein as a static structure. For flexibility, consider analyzing multiple conformations from NMR ensembles or molecular dynamics. The calculation excludes hydrogen atoms by default, which slightly underestimates true SASA.

Relative accessibility values assume the Gly-X-Gly reference state, which may not perfectly represent the local environment for all residues. Values exceeding 100% can occur for residues in extended conformations.

  • Ramachandran Plot — Analyze backbone dihedral angles from the same PDB structure
  • STRIDE — Assign secondary structure elements from atomic coordinates
  • DSSP — Alternative secondary structure assignment algorithm
  • Radius of Gyration — Measure overall protein compactness
  • PDB Viewer — Visualize your structure in 3D

References

Shrake A, Rupley JA (1973). Environment and exposure to solvent of protein atoms. Lysozyme and insulin. Journal of Molecular Biology 79:351-371. doi:10.1016/0022-2836(73)90011-9