Ammm:Molecular surfaces: Difference between revisions

From biowiki
Jump to navigation Jump to search
Created page with "= Introduction = Calculating molecular surfaces is important and useful for structure-based drug design, representing molecules and computing inter-molecular interactions. It is also useful for displaying certain molecular properties on the surface, using color-coded and texture maps. Examples of such properties include atomic charge, electrostatic potential, hydrophobicity, polarizability, etc. = Types of Molecular Surfaces = File:Surfaces.png == Van der Waals S..."
 
No edit summary
 
Line 350: Line 350:
= Further Reading =
= Further Reading =


[[File:Lee-richards71.pdf‎]]
[[File:Lee-richards71.pdf‎ | thumb]]


[[File:Richards77.pdf]]
[[File:Richards77.pdf| thumb]]


[[File:Connolly.pdf]]
[[File:Connolly.pdf| thumb]]


[[File:Sanner96.pdf]]
[[File:Sanner96.pdf| thumb]]


[[File:Sanner2.pdf]]
[[File:Sanner2.pdf| thumb]]


[[File:Bajaj1999.pdf]]
[[File:Bajaj1999.pdf| thumb]]


[[File:jp2007.pdf]]
[[File:jp2007.pdf| thumb]]


[[File:Cossi96.pdf]]
[[File:Cossi96.pdf| thumb]]


= References =
= References =

Latest revision as of 15:55, 10 May 2022

Introduction

Calculating molecular surfaces is important and useful for structure-based drug design, representing molecules and computing inter-molecular interactions. It is also useful for displaying certain molecular properties on the surface, using color-coded and texture maps. Examples of such properties include atomic charge, electrostatic potential, hydrophobicity, polarizability, etc.

Types of Molecular Surfaces

Van der Waals Surface

This surface is a union-of-balls representation. While acceptable for small proteins such as ligands, it fails to provide a good enough representation for larger proteins, returning too much buried surface.

Solvent Accessible Surface (SAS)

The SAS is also know as the Lee-Richards surface [1], and is calculated using the "rolling ball" algorithm developed by Shrake and Rupley [2], in which a sphere representing the solvent is used to "probe" the surface of the molecule. An important consideration is the selection of the probe radius: if it is too large, the resulting surface tends to be smaller and less accurate, whereas a smaller radius gives a larger surface with more details (bumps, cavities, etc.). A radius of 1.4 Å is normally chosen, approximating water. Also important are the assignment of van der Waals radii, the quality of the atomic coordinates, and hydrogen atom positions.

Solvent Excluded Surface (SES)

An improvement over the Lee-Richards SAS surface, Richards' SES [3] was implemented by Connolly in 1983 and so, is also known as the Connolly surface[4]. The following two parts are computed:

  • contact surface, which is part of the van der Waals surface
  • reentrant surface, which is the inward facing surface of the probe when it is in contact with more than one surface atom

Analytical Molecular Surface Calculation Algorithm

This algorithm for calculating solvent excluded surfaces was presented by Connolly in 1983 [5].

Surface creation

The rolling of a probe sphere is best described in terms of translational degrees of freedom: for each atom the probe touches, it loses one degree of freedom. The main idea in the algorithm is to start with probe positions with the degree of freedom equal to zero, and then to work one's way up; lower degrees of freedom provide starting and stopping points for higher degrees.

Accordingly, there are three different types of surface pieces.

For each case, a different piece of surface is generated, defined by the following:

  • sphere: center, radius
  • torus: center, 2 radii, axial vector
  • boundary (circular arcs): center, radius, plane, end points

All of these pieces form a connected network, like a polyhedron where each piece is a face. The join between these faces is smooth, i.e. there is a well-defined tangent plane at all the points on an edge. This is in contrast with the van der Waals surface, where there are singularities.

The equations defining the surface are:

Torus construction

In order to build concave and saddle faces, a torus must be calculated for every pair of neighboring atoms. Two atoms i and j are neighbors if: .

When the probe is rolled over i and j, its center traces a circle and its surface traces the volume of a torus. The surface now connecting the two atoms is saddle-shaped.

The torus is described by its:

  • center: circle center
  • radius: circle radius
  • plane: plane circle lies in


Probe placement and concave faces

Along the torus central circle, there may be other atoms k, such that k is a mutual neighbor of i and j. In this case there are two possible probe placements, one on each side of the plane passing through the three atom centers. If the third atom k is too far away for the probe to collide with it, however, or if k is located between i and j such that the probe has a van der Waals clash with it, no probe placement is possible.


Saddle faces

Once concave edges have been generated, tori are examined to create saddle faces. Three cases occur:

  • buried tori generate no surface
  • free tori (special case: no concave edges, and convex edges form full contact circle)
  • partially accessible tori: one or more saddle-shaped rectangles are formed by choosing concave arcs that are adjacent in space and separated by convex arcs.

Convex faces

There are three types of atoms:

  • buried, for which no surface is generated
  • entirely accessible, which have no neighbors and a null edge list
  • partially accessible, which have an edge list of size 1 or more

For a given atom, convex edges belonging to its edge list form a cycle, which has an orientation (counter-clockwise). The interior of the cycle is part of the van der Waals surface of the atom (left of edge).


Self-intersecting surfaces

Such intersections, also known as singularities, which cause surfaces to become discontinued or non-smooth, may occur in the following two conditions:

  • a saddle face intersects itself if
  • two concave faces interpenetrate

These problems usually happen in deep grooves. Sanner provides a method for dealing with such problems in MSMS.

Fast and Robust Computation of Molecular Surfaces

In MSMS, Sanner extends previous work by Connolly by introducing the r-reduced surface, and presenting a method for handling singularities [6] [7].

The r-reduced surface

The r-reduced surface is equivalent to the -shape, where . Both the SAS and SES can be recovered from the r-reduced surface.

This surface is defined by the following:

  • face: polygon obtained by connecting centers of atoms probe is lying on
  • vertices: atom centers
  • edges: lines connecting vertices

An example reduced surface looks like this:

To each face corresponds a spheric reentrant face of SES, and for each edge, a toric reentrant face. Cycles of convex edges define contact faces.

Examples of molecular surface images: http://www.scripps.edu/~sanner/html/ms_gallery.html

Handling singularities


  • Radial singularities are detected during computation of r-reduced surface, and stored with edge. A toric reentrant face is then made out of the 2 triangular faces connected by a singular vertex or edge.
  • Non-radial singularities are handled in three step:
    • spheric reentrant faces are clipped
    • if a singular edge is inside a probe, it is entirely or partially "eaten"; singular vertices are added at the intersection of the probes, and the eaten part of the edge is replaced with singular edges lying on the intersection circles on the probes
    • for each pair of tangent probes generating spheric reentrant faces, not tangent to the same 3 atoms, clip faces at intersection circle C


Multiresolution Molecular Shapes

This algorithm was presented by Bajaj et al. in 1999 [8].

Supporting geometric structures

  • Power diagram: tiling of space into convex regions where tile i is the set of points nearest to vertex Pi, in power distance metric.
  • Regular triangulation: dual of the power diagram, like the Delauney triangulation is the dual of the Voronoi diagram.
  • Weighted alpha shapes: a simplex s in the regular triangulation belongs to the alpha shape if the weighted point orthogonal to the vertices of s is smaller than alpha. Example: an edge (u,v) is part of the zero-shape if the two balls centered at u and v intersect.

Decimation of molecular shapes and construction of multiresolution hierarchy

The goal of decimating a molecular structure is to produce a coarser and simpler representation of the model in order to reduce computational costs. This operation is performed on the set of weighted points, which makes it easier to have a hierarchical representation for multiple types of molecular shapes.

Coarsening is transitive. This allows for the construction of an efficient multiresolution representation of molecular models.

Vertex clustering is performed: if two balls overlap (i.e. the edge joining their centers belong to the zero-shape), then they are removed from the triangulation, and a new ball is inserted instead.

Two different clustering techniques can be used:

  • Local Vertex Clustering (LVC) (left)
  • Minimal Vertex Clustering (MVC) (right)

LVC is simpler and more efficient, but MVC returns a tighter coarsening.

The algorithm for building the multiresolution hierarchy depends on this clustering of edges in the zero-shape, creating consecutive levels of coarseness.

Examples

Application to Implicit Nonpolar Solvent Models

Implicit solvation models generally have two components:

  • electrostatic, or polar, which can be calculated using Poisson-Boltzmann model
  • non-electrostatic, or nonpolar, which can be estimated using SAS:

This model is widely used, although its correlation with the explicit solvent model is rather poor. It could be improved upon [9] by breaking down the nonpolar energy term into: and writing each one as follows:

where

is the solvent-solute distance and is the van der Waals attractive potential.

Software Packages

Example Using MSMS

Download

  • Included in the distribution are the following executables:
    • msms."arch": Binary version of MSMS compiled for "arch"
    • pdb_to_xyzr: Script to convert get XYZR coordinates from a PDB file
    • pdb_to_xyzrn: Script to convert get XYZRN coordinates from a PDB file

Input

  • MSMS takes XYZR (or XYZRN) files as input. The main difference between the two types of input files is that XYZRN files also contain the names of each atom at the end of the line, in addition to atomic coordinates and van der Waals radii. This can be useful if you want the surface on a per-atom basis.
  • Input files can be obtained by executing the following commands on PDB files:
$ pdb_to_xyzr 1HSG_r.pdb > 1HSG_r.xyzr
 1HSG_r.xyzr
 
 29.361   39.686    5.862 1.70
 30.307   38.663    5.319 2.00
 29.760   38.071    4.022 1.74
 28.600   38.302    3.676 1.40
 30.508   37.541    6.342 2.00
 29.296   37.591    7.162 2.00
 28.778   39.015    7.019 2.00
 ...
$ pdb_to_xyzrn 1HSG_r.pdb > 1HSG_r.xyzrn
 1HSG_r.xyzrn
 
 29.361000 39.686000 5.862000 1.700000 1 N_PRO_1
 30.307000 38.663000 5.319000 2.000000 1 CA_PRO_1
 29.760000 38.071000 4.022000 1.740000 1 C_PRO_1
 28.600000 38.302000 3.676000 1.400000 1 O_PRO_1
 30.508000 37.541000 6.342000 2.000000 1 CB_PRO_1
 29.296000 37.591000 7.162000 2.000000 1 CG_PRO_1
 28.778000 39.015000 7.019000 2.000000 1 CD_PRO_1
 ...

Generating the surface

  • After generating the input file, MSMS is run by:
$ msms -if 1HSG_r.xyzr -of 1HSG_r

This generates two files: 1HSG_r.face (triangulated surface faces) and 1HSG_r.vert (vertices).

  • To obtain the surface on a per-atom basis, run:
$ msms -if 1HSG_r.xyzrn -of 1HSG_r -af 1HSG_r
  • Using the -af option, an additional file is created: 1HSG_r.area, which contains the surface area on a per-atom basis.
$ msms.x86_64Linux2.2.6.1.staticgcc -if 1HSG_r.xyzr -of 1HSG_r
 MSMS 2.6.1
 Copyright M.F. Sanner (1994)
 Compilation flags -O2 -DVERBOSE -DTIMING
 INPUT  1HSG_r.xyzr 1514 spheres 0 collision only, radii  1.400 to  2.000
 PARAM  Probe_radius  1.500 density  1.000 hdensity  3.000
 REDUCED SURFACE ...
   RS component 0 identified 343 344 341
   0 free edge(s) found in component 0
   RS component  #faces  #edges #free_edges  #vertices   genus
        0        1738     2607          0        817      27
   Time Reduced Surface real: 0.04 user: 0.05 sys: 0.00
 ANALYTICAL SOLVENT EXCLUDED SURFACE...
   Component   0
       Time Surface real: 0.01 user: 0.01 sys: 0.00
       Time Singularities real: 0.00 user: 0.00 sys: 0.00
   SES comp.  #fac.  #edg. #s_edg. #vert. #s_vert. R_h C_h genus
        0     5314  10716    107    5402      188   0   0    1
 ANALYTICAL SURFACE AREA :
   Comp. probe_radius,   reent,    toric,   contact    SES       SAS
     0       1.500    2190.425  3376.937  2929.180  8496.541  9767.748
 TRIANGULATION...
   Component 0
   component#, vertices,  edges,  faces, genus, density
       0           9600   28800   19200    1     1.130
       Time Triangulation real: 0.03 user: 0.04 sys: 0.00
 NUMERICAL VOLUMES AND AREA
   Comp. probe_radius SES_volume SES_area)
      0      1.50    26064.947   8129.946
   Total ses_volume: 26064.947
 MSMS terminated normally
 Total Time real: 0.16 user: 0.14 sys: 0.01

Visualizing the output

MSMS output can be visualized using a number of graphics tools.

This example is displayed using DINO 3D (http://www.dino3d.org/intro.php).

 // load the structure dataset 
 load 1HSG_r.pdb -name 1HSG_r 
 
 // load the surface dataset 
 load 1HSG_r -type msms -name surf 
 
 // attach the surface to the structure 
 // (thereby assigning each surface point 
 // to an atom) 
 .surf attach .1HSG_r 
 
 // center on the surface 
 scene center [.surf] 
 
 // create a surface object 
 .surf new -name all 
 
 // adjust the clipping planes 
 scene autoslab 
HIV-1 Protease from 1HSG
 // color the surface object according 
 // to some underlying residue properties 
 // the variables used here are predefined 
 // by DINO, use 'var' to look at them 
 .surf.all set color=blue -sel $basic 
 .surf.all set color=blue4 -sel $basic2 
 .surf.all set color=red -sel $acidic 
 .surf.all set color=red4 -sel $acidic2 
 .surf.all set color=cyan -sel $polar 
 .surf.all set color=green -sel $aromatic 
 .surf.all set color=yellow -sel $aliphatic 
HIV-1 Protease from 1HSG
 // reset the color to white and make the surface 
 // transparent  to show the underlying structure 
 .surf.all set color=white 
 .1HSG_r new -name all 
 .1HSG_r.all render custom
 .surf.all render t=0.4 
HIV-1 Protease from 1HSG

More tutorials are available at: http://www.dino3d.org/tutorials/

Further Reading

References

  1. Lee, B. and Richards, F.M., The Interpretation of Protein Structures: Estimation of Static Accessibility, 1971. J. Appl. Cryst. 55: p. 379-400.
  2. Shrake A. and Rupley J.A., Environment and exposure to solvent of protein atoms. Lysozyme and insulin, 1973. J. Mol. Biol. 79(2): p. 351-371.
  3. Richards, F.M., Areas, Volumes, Packing, and Protein Structure, 1977. Ann. Rev. Biophys. Bioeng. 6: p. 151-176.
  4. Connolly, M.L., Analytical Molecular Surface Calculation, 1983. J. Appl. Cryst. 16: p. 548-558.
  5. Connolly M.L., Analytical Molecular Surface Calculation, 1983. J. Appl. Cryst. 16: p. 548-558.
  6. Sanner, M., Olson, A.J. and Sephner, J.-C., Fast and Robust Computation of Molecular Surfaces, 1995. Proc. 11th ACM Symp. Comp. Geom.. C6-C7.
  7. Sanner, M., Olson, A.J. and Sephner, J.-C., Reduced Surface: An Efficient Way to Compute Molecular Surfaces, 1996. Biopolymers. 38. pp. 305-320.
  8. Bajaj, C.L., Pascucci, V. Shamir, A., Holt, R.J. and Netravali A.N., Multiresolution Molecular Shapes, 1999. TICAM Report. 99-42.
  9. Tan, C., Tan Y.H. and Luo R., Implicit Nonpolar Solvent Models, 2007. J. Phys. Chem. B. 111. pp. 12263-12274.