Nucleic Acids Research

http://nar.oxfordjournals.org

List of Papers (Total 27,745)

Expansion of the Gene Ontology knowledgebase and resources

The Gene Ontology (GO) is a comprehensive resource of computable knowledge regarding the functions of genes and gene products. As such, it is extensively used by the biomedical research community for the analysis of -omics and related data. Our continued focus is on improving the quality and utility of the GO resources, and we welcome and encourage input from researchers in all...

TSTMP: target selection for structural genomics of human transmembrane proteins

The TSTMP database is designed to help the target selection of human transmembrane proteins for structural genomics projects and structure modeling studies. Currently, there are only 60 known 3D structures among the polytopic human transmembrane proteins and about a further 600 could be modeled using existing structures. Although there are a great number of human transmembrane...

TransportDB 2.0: a database for exploring membrane transporters in sequenced genomes from all domains of life

All cellular life contains an extensive array of membrane transport proteins. The vast majority of these transporters have not been experimentally characterized. We have developed a bioinformatic pipeline to identify and annotate complete sets of transporters in any sequenced genome. This pipeline is now fully automated enabling it to better keep pace with the accelerating rate...

The SWISS-MODEL Repository—new features and functionality

SWISS-MODEL Repository (SMR) is a database of annotated 3D protein structure models generated by the automated SWISS-MODEL homology modeling pipeline. It currently holds >400 000 high quality models covering almost 20% of Swiss-Prot/UniProtKB entries. In this manuscript, we provide an update of features and functionalities which have been implemented recently. We address...

RepeatsDB 2.0: improved annotation, classification, search and visualization of repeat protein structures

RepeatsDB 2.0 (URL: http://repeatsdb.bio.unipd.it/) is an update of the database of annotated tandem repeat protein structures. Repeat proteins are a widespread class of non-globular proteins carrying heterogeneous functions involved in several diseases. Here we provide a new version of RepeatsDB with an improved classification schema including high quality annotations for ∼5400...

PCDDB: new developments at the Protein Circular Dichroism Data Bank

The Protein Circular Dichroism Data Bank (PCDDB) has been in operation for more than 5 years as a public repository for archiving circular dichroism spectroscopic data and associated bioinformatics and experimental metadata. Since its inception, many improvements and new developments have been made in data display, searching algorithms, data formats, data content, auxillary...

ECOD: new developments in the evolutionary classification of domains

Evolutionary Classification Of protein Domains (ECOD) (http://prodata.swmed.edu/ecod) comprehensively classifies protein with known spatial structures maintained by the Protein Data Bank (PDB) into evolutionary groups of protein domains. ECOD relies on a combination of automatic and manual weekly updates to achieve its high accuracy and coverage with a short update cycle. ECOD...

CATH: an expanded resource to predict protein function through structure and sequence

The latest version of the CATH-Gene3D protein structure classification database has recently been released (version 4.1, http://www.cathdb.info). The resource comprises over 300 000 domain structures and over 53 million protein domains classified into 2737 homologous superfamilies, doubling the number of predicted protein domains in the previous version. The daily-updated CATH-B...

Protein Data Bank Japan (PDBj): updated user interfaces, resource description framework, analysis tools for large structures

The Protein Data Bank Japan (PDBj, http://pdbj.org), a member of the worldwide Protein Data Bank (wwPDB), accepts and processes the deposited data of experimentally determined macromolecular structures. While maintaining the archive in collaboration with other wwPDB partners, PDBj also provides a wide range of services and tools for analyzing structures and functions of proteins...

The RCSB protein data bank: integrative view of protein, gene and 3D structural information

The Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB, http://rcsb.org), the US data center for the global PDB archive, makes PDB data freely available to all users, from structural biologists to computational biologists and beyond. New tools and resources have been added to the RCSB PDB web portal in support of a ‘Structural View of Biology...

WERAM: a database of writers, erasers and readers of histone acetylation and methylation in eukaryotes

In this work, we developed a database WERAM (http://weram.biocuckoo.org/) for histone acetyltransferases, histone deacetylases, histone methyltransferases, histone demethylases and acetyl- or methyl-binding proteins, which catalyze, remove and recognize histone acetylation and methylation sites as ‘writers’, ‘erasers’ and ‘readers’, and synergistically determine the ‘histone code...

Membranome: a database for proteome-wide analysis of single-pass membrane proteins

The Membranome database was developed to assist analysis and computational modeling of single-pass (bitopic) transmembrane (TM) proteins and their complexes by providing structural information about these proteins on a genomic scale. The database currently collects data on >6000 bitopic proteins from Homo sapiens, Arabidopsis thaliana, Dictyostelium discoideum, Saccharomyces...

LinkProt: a database collecting information about biological links

Protein chains are known to fold into topologically complex shapes, such as knots, slipknots or complex lassos. This complex topology of the chain can be considered as an additional feature of a protein, separate from secondary and tertiary structures. Moreover, the complex topology can be defined also as one additional structural level. The LinkProt database (http://linkprot...

JET2 Viewer: a database of predicted multiple, possibly overlapping, protein–protein interaction sites for PDB structures

The database JET2 Viewer, openly accessible at http://www.jet2viewer.upmc.fr/, reports putative protein binding sites for all three-dimensional (3D) structures available in the Protein Data Bank (PDB). This knowledge base was generated by applying the computational method JET2 at large-scale on more than 20 000 chains. JET2 strategy yields very precise predictions of interacting...

FuzDB: database of fuzzy complexes, a tool to develop stochastic structure-function relationships for protein complexes and higher-order assemblies

FuzDB (http://protdyn-database.org) compiles experimentally observed fuzzy protein complexes, where intrinsic disorder (ID) is maintained upon interacting with a partner (protein, nucleic acid or small molecule) and directly impacts biological function. Entries in the database have both (i) structural evidence demonstrating the structural multiplicity or dynamic disorder of the...

DisProt 7.0: a major update of the database of disordered proteins

The Database of Protein Disorder (DisProt, URL: www.disprot.org) has been significantly updated and upgraded since its last major renewal in 2007. The current release holds information on more than 800 entries of IDPs/IDRs, i.e. intrinsically disordered proteins or regions that exist and function without a well-defined three-dimensional structure. We have re-curated previous...

ATGC database and ATGC-COGs: an updated resource for micro- and macro-evolutionary studies of prokaryotic genomes and protein family annotation

The Alignable Tight Genomic Clusters (ATGCs) database is a collection of closely related bacterial and archaeal genomes that provides several tools to aid research into evolutionary processes in the microbial world. Each ATGC is a taxonomy-independent cluster of 2 or more completely sequenced genomes that meet the objective criteria of a high degree of local gene order (synteny...

ADPriboDB: The database of ADP-ribosylated proteins

ADP-ribosylation refers to the addition of one or more ADP-ribose units onto proteins post-translationally. This protein modification is often added by ADP-ribosyltransferases, commonly known as PARPs, but it can also be added by other enzymes, including sirtuins or bacterial toxins. While past literature has utilized a variety of methods to identify ADP-ribosylated proteins...

CDD/SPARCLE: functional classification of proteins via subfamily domain architectures

NCBI's Conserved Domain Database (CDD) aims at annotating biomolecular sequences with the location of evolutionarily conserved protein domain footprints, and functional sites inferred from such footprints. An archive of pre-computed domain annotation is maintained for proteins tracked by NCBI's Entrez database, and live search services are offered as well. CDD curation staff...

InterPro in 2017—beyond protein family and domain annotations

InterPro (http://www.ebi.ac.uk/interpro/) is a freely available database used to classify protein sequences into families and to predict the presence of important domains and sites. InterProScan is the underlying software that allows both protein and nucleic acid sequences to be searched against InterPro's predictive models, which are provided by its member databases. Here, we...

PANTHER version 11: expanded annotation data from Gene Ontology and Reactome pathways, and data analysis tool enhancements

The PANTHER database (Protein ANalysis THrough Evolutionary Relationships, http://pantherdb.org) contains comprehensive information on the evolution and function of protein-coding genes from 104 completely sequenced genomes. PANTHER software tools allow users to classify new protein sequences, and to analyze gene lists obtained from large-scale genomics experiments. In the past...

The neXtProt knowledgebase on human proteins: 2017 update

The neXtProt human protein knowledgebase (https://www.nextprot.org) continues to add new content and tools, with a focus on proteomics and genetic variation data. neXtProt now has proteomics data for over 85% of the human proteins, as well as new tools tailored to the proteomics community. Moreover, the neXtProt release 2016-08-25 includes over 8000 phenotypic observations for...

Uniclust databases of clustered and deeply annotated protein sequences and alignments

We present three clustered protein sequence databases, Uniclust90, Uniclust50, Uniclust30 and three databases of multiple sequence alignments (MSAs), Uniboost10, Uniboost20 and Uniboost30, as a resource for protein sequence analysis, function prediction and sequence searches. The Uniclust databases cluster UniProtKB sequences at the level of 90%, 50% and 30% pairwise sequence...

UniProt: the universal protein knowledgebase

The UniProt knowledgebase is a large resource of protein sequences and associated detailed annotation. The database contains over 60 million sequences, of which over half a million sequences have been curated by experts who critically review experimental and predicted data for each protein. The remainder are automatically annotated based on rule systems that rely on the expert...

TFBSbank: a platform to dissect the big data of protein–DNA interaction in human and model species

Genome-wide transcription factors (TFs) binding data has been extensively generated in the past few years, which poses a great challenge to data interpretation. Therefore, comprehensive and dedicated functional annotation databases for TF–DNA interaction are in great demands to manage, explore and utilize those invaluable data resources. Here, we constructed a platform ‘TFBSbank...