3dSS: 3D structural superposition
K. Sumathi
1
P. Ananthalakshmi
1
M. N. A. Md. Roshan
1
K. Sekar
0
1
0
Supercomputer Education and Research Centre, Indian Institute of Science
, Bangalore 560 012,
India
1
Bioinformatics Centre
3dSS is a web-based interactive computing server, primarily designed to aid researchers, to superpose two or several 3D protein structures. In addition, the server can be effectively used to find the invariant and common water molecules present in the superposed homologous protein structures. The molecular visualization tool RASMOL is interfaced with the server to visualize the superposed 3D structures with the water molecules (invariant or common) in the client machine. Furthermore, an option is provided to save the superposed 3D atomic coordinates in the client machine. To perform the above, users need to enter Protein Data Bank (PDB)-id(s) or upload the atomic coordinates in PDB format. This server uses a locally maintained PDB anonymous FTP server that is being updated weekly. This program can be accessed through our Bioinformatics web server at the URL http://cluster.physics.iisc.ernet.in/3dss/ or http://10.188.1.15/3dss/.
-
In the post-genome era, the structural and conformational
properties of the 3D protein molecules are of considerable
interest owing to its importance in various biological
processes. Owing to the recent technological advances like
high power tunable synchrotron radiation, powerful number
crunching computers and due to ambitious structural genomics
programs in different parts of the world, there has been a
tremendous increase in the number of protein structures in
the Protein Data Bank (PDB) (1). Now there are 34 000
3D structures available in this entity. Analysis of the 3D
structure of protein molecules is greatly enhanced by understanding
the relationship between the individual protein molecules.
Furthermore, knowledge of the 3D structural relationship
between different protein molecules is a key issue in
understanding the structure and function. In order to find
the common structural region, one need to lay one molecule
over the other by appropriate rotation and translation and this
process is termed as superposition of the 3D structures. Several
programs are available in the literature (29) for this purpose.
Most of these programs are stand-alone versions and have
their own merits and demerits. Two most recent ones are
web-based servers, namely, SSM (8) and SuperPose (9).
The program SSM uses the procedure of matching graphs
generated using the secondary structural elements followed
by the alignment of Ca atoms of the protein molecule.
Using one of the programs (9), SuperPose, we experienced
problems while trying to superpose multiple structures as
well as portions of molecules. In fact, it was difficult to
superpose different subunits available in multi-subunit protein
structures. In addition, most of the existing programs use
only the first model of the ensemble in the case of structures
solved using NMR technique and there is no provision for the
users to superpose all the models in the ensemble.
It is well known that water molecules play a vital role in
protein structures, aiding in stabilizing the protein fold and in
ligand design (1014). In addition, investigations on the
invariant water molecules in several well studied homologous
protein structures shed light on the specific roles of water
molecules such as catalytic, structural and functional (1518).
Thus, it is necessary to find the invariant and common water
molecules (for definition see below) in homologous protein
structures, for which 3D structural superposition step is
crucial. But the existing programs do not have provisions
for the users to identify the invariant and common water
molecules. Hence, we created a unique computing server to
superpose the three-dimensional structures and to find the
invariant and common water molecules in homologous protein
structures.
The water molecules present in two highly similar (the best
example is native structure and its mutant structure) or highly
This work is dedicated to the late professor M. Sundaralingam
The authors wish it to be known that, in their opinion, the first two authors should be regarded as joint First Authors
homologous structures (the inhibitor free and inhibitor bound
structure) are known as invariant water molecules. Further,
such situation is also possible in multi-subunit protein
structures. For example, if a molecule has four identical subunits,
the water molecules that interact with the residues in the same
position in different subunits (e.g. subunit A and B) can be
considered as invariant water molecules. On the other hand,
common water molecules are those, which lie at the interface
and interact with the selected subunits.
In the computing server, two widely recognized programs
STAMP (19) and ProFit (A. C. R. Martin, http://www.bioinf.
org.uk/software/profit/) are deployed for superposition
purposes. The program STAMP uses multiple sequence
alignment using the amino acid sequence information followed by
an initial superposition of structures. In contrast, the program
ProFit uses the McLachlan fitting algorithm, essentially a
steepest descent minimization (3). The user-friendly
molecular visualization tool RASMOL (20) is interfaced to view the
superposed molecules in the client machine. This server is
developed using PERL, HTML and JAVASCRIPT. Ploticus
[Copy right 19982002, Stephan C. Grugg ()],
a data display engine is used for generating plots to display
root mean square deviation (r.m.s.d.) graphically.
DATA PRESENTATION AND AVAILABILITY
The software is developed and optimized for Intel based
Solaris (Version 10.0) and is driven by 3.0 GHz pentium IV
processor equipped with 2 GB RD RAM. This operating
system is chosen for better security, scalability and reliability.
The software and its functionalities are well tested on
Windows 95/98/2000, Linux and SGI platforms. During
validation of the software, we realized that two web browsers,
namely, NETSCAPE (version 4.7 and 7.2) and MOZILLA
behaved well. To visualize the superposed 3D structures in
the client machine, user needs to interface the molecular
visualization tool RASMOL (only for the first time usage of the
software) and the necessary instructions are provided in the
link (http://cluster.physics.iisc.ernet.in/3dss/rasmol.html). The
following are the four major options provided in the proposed
computing server.
(a) Superpose only two structures,
(b) Superpose several structures,
(c) Superpose subunits within a structure, and
(d) Superpose different models present in NMR ensemble.
All the above options, allow users to select the structures
available in the PDB by providing its unique PDB-id or by
up-loading the 3D atomic coordinates (PDB format) from the
local hard disk of the client machine. Once the file is uploaded,
the program automatically culls the input PDB file and
displays all the chain details of the structure in a convenient
form. Using (...truncated)