SKPDB: a structural database of shikimate pathway enzymes (pdf)

Article PDF cannot be displayed. You can download it here:

http://www.biomedcentral.com/content/pdf/1471-2105-11-12.pdf

SKPDB: a structural database of shikimate pathway enzymes

Helen A Arcuri 0 4 Geraldo FD Zafalon 3 Evandro A Marucci 3 Carlos E Bonalumi 3 Nelson JF da Silveira 2 Jos M Machado 3 Walter F de Azevedo Jr 1 5 Mrio S Palma 0 4 0 CEIS/Departamento de Biologia, Instituto de Biociencias , UNESP, Rio Claro, Sao Paulo , Brasil 1 Faculdade de Biociencias, PUCRS , Porto Alegre, Rio Grande do Sul , Brasil 2 Departamento de Ciencias Exatas, UNIFAL , Alfenas, Minas Gerais , Brasil 3 Departamento de Ciencia da Computacao e Estatistica, UNESP/IBILCE , Sao Jose do Rio Preto, Sao Paulo , Brasil 4 CEIS/Departamento de Biologia, Instituto de Biociencias , UNESP, Rio Claro, Sao Paulo , Brasil 5 Faculdade de Biociencias, PUCRS , Porto Alegre, Rio Grande do Sul , Brasil Background: The functional and structural characterisation of enzymes that belong to microbial metabolic pathways is very important for structure-based drug design. The main interest in studying shikimate pathway enzymes involves the fact that they are essential for bacteria but do not occur in humans, making them selective targets for design of drugs that do not directly impact humans. Description: The ShiKimate Pathway DataBase (SKPDB) is a relational database applied to the study of shikimate pathway enzymes in microorganisms and plants. The current database is updated regularly with the addition of new data; there are currently 8902 enzymes of the shikimate pathway from different sources. The database contains extensive information on each enzyme, including detailed descriptions about sequence, references, and structural and functional studies. All files (primary sequence, atomic coordinates and quality scores) are available for downloading. The modeled structures can be viewed using the Jmol program. Conclusions: The SKPDB provides a large number of structural models to be used in docking simulations, virtual screening initiatives and drug design. It is freely accessible at http://lsbzix.rc.unesp.br/skpdb/. - Background The functional and structural characterisation of enzymes belonging to microbial metabolic pathways is very important for structure-based drug design [1]. The main interest in the study of shikimate pathway enzymes (Figure 1) involves the fact that they are not present in humans, but are essential for algae, higher plants, bacteria, fungi, and apicomplexan parasites. This makes them attractive targets for development of new antimicrobial drugs and decreases the impact of drugs in humans [2,3]. This pathway links the metabolism of carbohydrates to the biosynthesis of aromatic compounds through seven metabolic steps, where phosphoenolpyruvate (PEP) and erythrose 4-phosphate are converted to chorismate, which in turn is the common precursor for synthesising a series of aromatic compounds, naphtoquinones, menaquinones, and mycobactins [3,4]. Inhibition of the shikimate pathway has been effective in controlling bacterial growth [5], and in mycobacteria, this pathway has been shown to be essential for the viability of Mycobacterium tuberculosis [6-8]. The functional and structural characterisation of a protein sequence is one of the most frequent problems in structural molecular biology. This task is usually facilitated by an accurate three-dimensional (3D) structure of the studied protein, which is best determined by experimental methods such as X-ray crystallography and NMR spectroscopy [9]. In the absence of an experimentally determined 3D structure, the modeling (comparative or by homology) can sometimes provide a useful 3D model for a target protein [10]. In the present work, we used comparative modeling at a large scale for predicting protein structures through the program MODELLER [11]. The automation of large-scale comparative modeling involves assembling a software pipeline, which consists of modules for fold assignment, template selection, target-template alignment, model generation, and model evaluation. Computer programs for these individual operations already exist, and it may seem trivial to combine them [11,12]. One example of large-scale Figure 1 The sequence of seven metabolic steps in the shikimate pathway, from phosphoenolpyruvate and erythrose 4-phosphate to chorismate, adapted from the site: http://www. chem.qmul.ac.uk/iubmb/enzyme/reaction/misc/shikim.html. comparative modeling for complete genomes has been described for sequences encoded in the Mycobacterium tuberculosis and Xylella fastidiosa genomes in the DBMODELING database [13,14]. The challenge in largescale comparative modeling is to build an automated, fast, robust, sensitive, and accurate pipeline applicable to whole genomes; such a pipeline should perform at least as well as a human expert on individual proteins. However, since the accuracy of structural models is highly dependent on sequence identity between template and target, it is necessary to make clear to the user that only models presenting high structural quality should be used in such efforts. Molecular modeling of these enzymes generated the SKPDB database, in which all structural models were built by using alignments presenting more than 30% sequence identity, generating models with medium and high accuracy [10,15]. SKPDB is a relational database of protein structure predicted by comparative modeling or solved by X-ray crystallography, applied to the study of shikimate pathway enzymes of microorganisms and plants. This database is freely accessible for all users on the Web, providing us with a large number of structural models for use in structure-based virtual screening and molecular docking analysis. Furthermore, SKPDB also provides a docking interface, which allows the user to carry out geometric docking simulations against the molecular models available in the database. Construction and Content Molecular modeling in large scale Homology modeling is usually the method of choice when there is a clear relationship of homology between the sequences of a target protein and at least one experimentally determined three-dimensional structure. This computational technique is based on the assumption that tertiary structures of two proteins will be similar if their sequences are related, and it is the approach most likely to give accurate results [16]. The number of protein sequences that can be modeled and the accuracy of the predictions are increasing steadily due to the growth in the number of experimentally determined protein structures and because of the improvements in the modeling software. It is currently possible to model with useful accuracy significant parts of approximately one half of all known protein sequences [17]. The molecular modeling in this work was performed by the MODELLER version 9v4 [10,18] program, which is a computer program for comparative protein structure modeling http://salilab.org/modeller. The program extracts atom-atom distance and dihedral angle restraints on the target from the template structure, and combines them with general rules of protein structure such as bond length and angl (...truncated)