3dSS: 3D structural superposition (pdf)

Article PDF cannot be displayed. You can download it here:

https://academic.oup.com/nar/article-pdf/34/suppl_2/W128/7622580/gkl036.pdf

3dSS: 3D structural superposition

W128–W132 Nucleic Acids Research, 2006, Vol. 34, Web Server issue doi:10.1093/nar/gkl036 3dSS: 3D structural superposition K. Sumathi1, P. Ananthalakshmi1, M. N. A. Md. Roshan1 and K. Sekar1,2,* 1 Bioinformatics Centre and 2Supercomputer Education and Research Centre, Indian Institute of Science, Bangalore 560 012, India Received October 6, 2005; Revised and Accepted November 28, 2005 ABSTRACT 3dSS is a web-based interactive computing server, primarily designed to aid researchers, to superpose two or several 3D protein structures. In addition, the server can be effectively used to find the invariant and common water molecules present in the superposed homologous protein structures. The molecular visualization tool RASMOL is interfaced with the server to visualize the superposed 3D structures with the water molecules (invariant or common) in the client machine. Furthermore, an option is provided to save the superposed 3D atomic coordinates in the client machine. To perform the above, users need to enter Protein Data Bank (PDB)-id(s) or upload the atomic coordinates in PDB format. This server uses a locally maintained PDB anonymous FTP server that is being updated weekly. This program can be accessed through our Bioinformatics web server at the URL http://cluster.physics.iisc.ernet.in/3dss/ or http://10.188.1.15/3dss/. INTRODUCTION In the post-genome era, the structural and conformational properties of the 3D protein molecules are of considerable interest owing to its importance in various biological processes. Owing to the recent technological advances like high power tunable synchrotron radiation, powerful number crunching computers and due to ambitious structural genomics programs in different parts of the world, there has been a tremendous increase in the number of protein structures in the Protein Data Bank (PDB) (1). Now there are 34 000 3D structures available in this entity. Analysis of the 3D structure of protein molecules is greatly enhanced by understanding the relationship between the individual protein molecules. Furthermore, knowledge of the 3D structural relationship between different protein molecules is a key issue in understanding the structure and function. In order to find the common structural region, one need to lay one molecule over the other by appropriate rotation and translation and this process is termed as superposition of the 3D structures. Several programs are available in the literature (2–9) for this purpose. Most of these programs are stand-alone versions and have their own merits and demerits. Two most recent ones are web-based servers, namely, SSM (8) and SuperPose (9). The program SSM uses the procedure of matching graphs generated using the secondary structural elements followed by the alignment of Ca atoms of the protein molecule. Using one of the programs (9), SuperPose, we experienced problems while trying to superpose multiple structures as well as portions of molecules. In fact, it was difficult to superpose different subunits available in multi-subunit protein structures. In addition, most of the existing programs use only the first model of the ensemble in the case of structures solved using NMR technique and there is no provision for the users to superpose all the models in the ensemble. It is well known that water molecules play a vital role in protein structures, aiding in stabilizing the protein fold and in ligand design (10–14). In addition, investigations on the invariant water molecules in several well studied homologous protein structures shed light on the specific roles of water molecules such as catalytic, structural and functional (15–18). Thus, it is necessary to find the invariant and common water molecules (for definition see below) in homologous protein structures, for which 3D structural superposition step is crucial. But the existing programs do not have provisions for the users to identify the invariant and common water molecules. Hence, we created a unique computing server to superpose the three-dimensional structures and to find the invariant and common water molecules in homologous protein structures. BACKGROUND The water molecules present in two highly similar (the best example is native structure and its mutant structure) or highly *To whom correspondence should be addressed. Tel: +91 080 23601409; Fax: +91 080 23600085; Email: , This work is dedicated to the late professor M. Sundaralingam The authors wish it to be known that, in their opinion, the first two authors should be regarded as joint First Authors The Author 2006. Published by Oxford University Press. All rights reserved. The online version of this article has been published under an open access model. Users are entitled to use, reproduce, disseminate, or display the open access version of this article for non-commercial purposes provided that: the original authorship is properly and fully attributed; the Journal and Oxford University Press are attributed as the original place of publication with the correct citation details given; if an article is subsequently reproduced or disseminated not in its entirety but only in part or as a derivative work this must be clearly indicated. For commercial re-use, please contact Nucleic Acids Research, 2006, Vol. 34, Web Server issue homologous structures (the inhibitor free and inhibitor bound structure) are known as invariant water molecules. Further, such situation is also possible in multi-subunit protein structures. For example, if a molecule has four identical subunits, the water molecules that interact with the residues in the same position in different subunits (e.g. subunit A and B) can be considered as invariant water molecules. On the other hand, common water molecules are those, which lie at the interface and interact with the selected subunits. In the computing server, two widely recognized programs STAMP (19) and ProFit (A. C. R. Martin, http://www.bioinf. org.uk/software/profit/) are deployed for superposition purposes. The program STAMP uses multiple sequence alignment using the amino acid sequence information followed by an initial superposition of structures. In contrast, the program ProFit uses the McLachlan fitting algorithm, essentially a steepest descent minimization (3). The user-friendly molecular visualization tool RASMOL (20) is interfaced to view the superposed molecules in the client machine. This server is developed using PERL, HTML and JAVASCRIPT. Ploticus [Copy right 1998–2002, Stephan C. Grugg ()], a data display engine is used for generating plots to display root mean square deviation (r.m.s.d.) graphically. DATA PRESENTATION AND AVAILABILITY The software is developed and optimized for Intel based Solaris (Version 10.0) and is driven by 3.0 GHz pentium IV W129 processor equipped with 2 GB RD RAM. This operating system is chosen for better security, scalability and reliability. The software and its functionalities are well tested on Windows 95/98/2000, Linu (...truncated)