Database: The Journal of Biological Databases and Curation

<a href="http://www.oxfordjournals.org/our_journals/databa/about.html">http://www.oxfordjournals.org/our_journals/databa/about.html</a>

List of Papers (Total 687)

The drug-minded protein interaction database (DrumPID) for efficient target analysis and drug development

Jan 2016 | Meik Kunz, Chunguang Liang, Santosh Nilla, et al.

The drug-minded protein interaction database (DrumPID) has been designed to provide fast, tailored information on drugs and their protein networks including indications, protein targets and side-targets. Starting queries include compound, target and protein interactions and organism-specific protein families. Furthermore, drug name, chemical structures and their SMILES notation...

Jan 2016
Meik Kunz, Chunguang Liang, Santosh Nilla, et al.

Chemical-induced disease relation extraction with various linguistic features

Jan 2016 | Jinghang Gu, Longhua Qian, Guodong Zhou

Understanding the relations between chemicals and diseases is crucial in various biomedical tasks such as new drug discoveries and new therapy developments. While manually mining these relations from the biomedical literature is costly and time-consuming, such a procedure is often difficult to keep up-to-date. To address these issues, the BioCreative-V community proposed a...

Jan 2016
Jinghang Gu, Longhua Qian, Guodong Zhou

Genic insights from integrated human proteomics in GeneCards

Jan 2016 | Simon Fishilevich, Shahar Zimmerman, Asher Kohn, et al.

GeneCards is a one-stop shop for searchable human gene annotations (http://www.genecards.org/). Data are automatically mined from ∼120 sources and presented in an integrated web card for every human gene. We report the application of recent advances in proteomics to enhance gene annotation and classification in GeneCards. First, we constructed the Human Integrated Protein...

Jan 2016
Simon Fishilevich, Shahar Zimmerman, Asher Kohn, et al.

RSIADB, a collective resource for genome and transcriptome analyses in Rhizoctonia solani AG1 IA

Jan 2016 | Lei Chen, Peng Ai, Jinfeng Zhang, et al.

Rice [Oryza sativa (L.)] feeds more than half of the world’s population. Rhizoctonia solani is a major fungal pathogen of rice causing extreme crop losses in all rice-growing regions of the world. R. solani AG1 IA is a major cause of sheath blight in rice. In this study, we constructed a comprehensive and user-friendly web-based database, RSIADB, to analyse its draft genome and...

Jan 2016
Lei Chen, Peng Ai, Jinfeng Zhang, et al.

Centralizing content and distributing labor: a community model for curating the very long tail of microbial genomes

Jan 2016 | Tim E. Putman, Sebastian Burgstaller-Muehlbacher, Andra Waagmeester, et al.

The last 20 years of advancement in sequencing technologies have led to sequencing thousands of microbial genomes, creating mountains of genetic data. While efficiency in generating the data improves almost daily, applying meaningful relationships between taxonomic and genetic entities on this scale requires a structured and integrative approach. Currently, knowledge is...

Jan 2016
Tim E. Putman, Sebastian Burgstaller-Muehlbacher, Andra Waagmeester, et al.

CoDNaS 2.0: a comprehensive database of protein conformational diversity in the native state

Jan 2016 | Alexander Miguel Monzon, Cristian Oscar Rohr, María Silvina Fornasari, et al.

CoDNaS (conformational diversity of the native state) is a protein conformational diversity database. Conformational diversity describes structural differences between conformers that define the native state of proteins. It is a key concept to understand protein function and biological processes related to protein functions. CoDNaS offers a well curated database that is...

Jan 2016
Alexander Miguel Monzon, Cristian Oscar Rohr, María Silvina Fornasari, et al.

myPhyloDB: a local web server for the storage and analysis of metagenomic data

Jan 2016 | Daniel K. Manter, Matthew Korsa, Caleb Tebbe, et al.

myPhyloDB v.1.1.2 is a user-friendly personal database with a browser-interface designed to facilitate the storage, processing, analysis, and distribution of microbial community populations (e.g. 16S metagenomics data). MyPhyloDB archives raw sequencing files, and allows for easy selection of project(s)/sample(s) of any combination from all available data in the database. The...

Jan 2016
Daniel K. Manter, Matthew Korsa, Caleb Tebbe, et al.

CD-REST: a system for extracting chemical-induced disease relation in literature

Jan 2016 | Jun Xu, Yonghui Wu, Yaoyun Zhang, et al.

Mining chemical-induced disease relations embedded in the vast biomedical literature could facilitate a wide range of computational biomedical applications, such as pharmacovigilance. The BioCreative V organized a Chemical Disease Relation (CDR) Track regarding chemical-induced disease relation extraction from biomedical literature in 2015. We participated in all subtasks of this...

Jan 2016
Jun Xu, Yonghui Wu, Yaoyun Zhang, et al.

CCProf: exploring conformational change profile of proteins

Jan 2016 | Che-Wei Chang, Chai-Wei Chou, Darby Tien-Hao Chang

In many biological processes, proteins have important interactions with various molecules such as proteins, ions or ligands. Many proteins undergo conformational changes upon these interactions, where regions with large conformational changes are critical to the interactions. This work presents the CCProf platform, which provides conformational changes of entire proteins, named...

Jan 2016
Che-Wei Chang, Chai-Wei Chou, Darby Tien-Hao Chang

Discovering biomedical semantic relations in PubMed queries for information retrieval and database curation

Jan 2016 | Chung-Chi Huang, Zhiyong Lu

Identifying relevant papers from the literature is a common task in biocuration. Most current biomedical literature search systems primarily rely on matching user keywords. Semantic search, on the other hand, seeks to improve search accuracy by understanding the entities and contextual relations in user keywords. However, past research has mostly focused on semantically...

Jan 2016
Chung-Chi Huang, Zhiyong Lu

HNdb: an integrated database of gene and protein information on head and neck squamous cell carcinoma

Jan 2016 | Tiago Henrique, Nelson José Freitas da Silveira, Arthur Henrique Cunha Volpato, et al.

The total amount of scientific literature has grown rapidly in recent years. Specifically, there are several million citations in the field of cancer. This makes it difficult, if not impossible, to manually retrieve relevant information on the mechanisms that govern tumor behavior or the neoplastic process. Furthermore, cancer is a complex disease or, more accurately, a set of...

Jan 2016
Tiago Henrique, Nelson José Freitas da Silveira, Arthur Henrique Cunha Volpato, et al.

The Disease Portals, disease–gene annotation and the RGD disease ontology at the Rat Genome Database

Jan 2016 | G. Thomas Hayman, Stanley J. F. Laulederkind, Jennifer R. Smith, et al.

The Rat Genome Database (RGD; http://rgd.mcw.edu/) provides critical datasets and software tools to a diverse community of rat and non-rat researchers worldwide. To meet the needs of the many users whose research is disease oriented, RGD has created a series of Disease Portals and has prioritized its curation efforts on the datasets important to understanding the mechanisms of...

Jan 2016
G. Thomas Hayman, Stanley J. F. Laulederkind, Jennifer R. Smith, et al.

Assessing the state of the art in biomedical relation extraction: overview of the BioCreative V chemical-disease relation (CDR) task

Jan 2016 | Chih-Hsuan Wei, Yifan Peng, Robert Leaman, et al.

Manually curating chemicals, diseases and their relationships is significantly important to biomedical research, but it is plagued by its high cost and the rapid growth of the biomedical literature. In recent years, there has been a growing interest in developing computational approaches for automatic chemical-disease relation (CDR) extraction. Despite these attempts, the lack of...

Jan 2016
Chih-Hsuan Wei, Yifan Peng, Robert Leaman, et al.

GO annotation in InterPro: why stability does not indicate accuracy in a sea of changing annotations

Jan 2016 | Amaia Sangrador-Vegas, Alex L. Mitchell, Hsin-Yu Chang, et al.

The removal of annotation from biological databases is often perceived as an indicator of erroneous annotation. As a corollary, annotation stability is considered to be a measure of reliability. However, diverse data-driven events can affect the stability of annotations in both primary protein sequence databases and the protein family databases that are built upon the sequence...

Jan 2016
Amaia Sangrador-Vegas, Alex L. Mitchell, Hsin-Yu Chang, et al.

CSCdb: a cancer stem cells portal for markers, related genes and functional information

Jan 2016 | Yi Shen, Heming Yao, Ao Li, et al.

Cancer stem cells (CSCs), which have the ability to self-renew and differentiate into various tumor cell types, are a special class of tumor cells. Characterizing the genes involved in CSCs regulation is fundamental to understand the mechanisms underlying the biological process and develop treatment methods for tumor therapy. Recently, much effort has been expended in the study...

Jan 2016
Yi Shen, Heming Yao, Ao Li, et al.

High-performance integrated virtual environment (HIVE): a robust infrastructure for next-generation sequence data analysis

Jan 2016 | Vahan Simonyan, Konstantin Chumakov, Hayley Dingerdissen, et al.

The High-performance Integrated Virtual Environment (HIVE) is a distributed storage and compute environment designed primarily to handle next-generation sequencing (NGS) data. This multicomponent cloud infrastructure provides secure web access for authorized users to deposit, retrieve, annotate and compute on NGS data, and to analyse the outcomes using web interface visual...

Jan 2016
Vahan Simonyan, Konstantin Chumakov, Hayley Dingerdissen, et al.

From one to many: expanding the Saccharomyces cerevisiae reference genome panel

Jan 2016 | Stacia R. Engel, Shuai Weng, Gail Binkley, et al.

In recent years, thousands of Saccharomyces cerevisiae genomes have been sequenced to varying degrees of completion. The Saccharomyces Genome Database (SGD) has long been the keeper of the original eukaryotic reference genome sequence, which was derived primarily from S. cerevisiae strain S288C. Because new technologies are pushing S. cerevisiae annotation past the limits of any...

Jan 2016
Stacia R. Engel, Shuai Weng, Gail Binkley, et al.

KinetochoreDB: a comprehensive online resource for the kinetochore and its related proteins

Jan 2016 | Chen Li, Steve Androulakis, Ashley M. Buckle, et al.

KinetochoreDB is an online resource for the kinetochore and its related proteins. It provides comprehensive annotations on 1554 related protein entries in terms of their amino acid sequence, protein domain context, protein 3D structure, predicted intrinsically disordered region, protein–protein interaction, post-translational modification site, functional domain and key metabolic...

Jan 2016
Chen Li, Steve Androulakis, Ashley M. Buckle, et al.

Sustainable funding for biocuration: The Arabidopsis Information Resource (TAIR) as a case study of a subscription-based funding model

Jan 2016 | Leonore Reiser, Tanya Z. Berardini, Donghui Li, et al.

Databases and data repositories provide essential functions for the research community by integrating, curating, archiving and otherwise packaging data to facilitate discovery and reuse. Despite their importance, funding for maintenance of these resources is increasingly hard to obtain. Fueled by a desire to find long term, sustainable solutions to database funding, staff from...

Jan 2016
Leonore Reiser, Tanya Z. Berardini, Donghui Li, et al.

R-Syst::diatom: an open-access and curated barcode database for diatoms and freshwater monitoring

Jan 2016 | Frédéric Rimet, Philippe Chaumeil, François Keck, et al.

Diatoms are micro-algal indicators of freshwater pollution. Current standardized methodologies are based on microscopic determinations, which is time consuming and prone to identification uncertainties. The use of DNA-barcoding has been proposed as a way to avoid these flaws. Combining barcoding with next-generation sequencing enables collection of a large quantity of barcodes...

Jan 2016
Frédéric Rimet, Philippe Chaumeil, François Keck, et al.

Wikidata as a semantic framework for the Gene Wiki initiative

Jan 2016 | Sebastian Burgstaller-Muehlbacher, Andra Waagmeester, Elvira Mitraka, et al.

Open biological data are distributed over many resources making them challenging to integrate, to update and to disseminate quickly. Wikidata is a growing, open community database which can serve this purpose and also provides tight integration with Wikipedia. In order to improve the state of biological data, facilitate data management and dissemination, we imported all human and...

Jan 2016
Sebastian Burgstaller-Muehlbacher, Andra Waagmeester, Elvira Mitraka, et al.

HistoneDB 2.0: a histone database with variants—an integrated resource to explore histones and their variants

Jan 2016 | Eli J. Draizen, Alexey K. Shaytan, Leonardo Mariño-Ramírez, et al.

Compaction of DNA into chromatin is a characteristic feature of eukaryotic organisms. The core (H2A, H2B, H3, H4) and linker (H1) histone proteins are responsible for this compaction through the formation of nucleosomes and higher order chromatin aggregates. Moreover, histones are intricately involved in chromatin functioning and provide a means for genome dynamic regulation...

Jan 2016
Eli J. Draizen, Alexey K. Shaytan, Leonardo Mariño-Ramírez, et al.

Chado use case: storing genomic, genetic and breeding data of Rosaceae and Gossypium crops in Chado

Jan 2016 | Sook Jung, Taein Lee, Stephen Ficklin, et al.

The Genome Database for Rosaceae (GDR) and CottonGen are comprehensive online data repositories that provide access to integrated genomic, genetic and breeding data through search, visualization and analysis tools for Rosaceae crops and Gossypium (cotton). These online databases use Chado, an open-source, generic and ontology-driven database schema for biological data, as the...

Jan 2016
Sook Jung, Taein Lee, Stephen Ficklin, et al.

An integrative data analysis platform for gene set analysis and knowledge discovery in a data warehouse framework

Jan 2016 | Yi-An Chen, Lokesh P. Tripathi, Kenji Mizuguchi

Data analysis is one of the most critical and challenging steps in drug discovery and disease biology. A user-friendly resource to visualize and analyse high-throughput data provides a powerful medium for both experimental and computational biologists to understand vastly different biological data types and obtain a concise, simplified and meaningful output for better knowledge...

Jan 2016
Yi-An Chen, Lokesh P. Tripathi, Kenji Mizuguchi

dbWGFP: a database and web server of human whole-genome single nucleotide variants and their functional predictions

Jan 2016 | Jiaxin Wu, Mengmeng Wu, Lianshuo Li, et al.

The recent advancement of the next generation sequencing technology has enabled the fast and low-cost detection of all genetic variants spreading across the entire human genome, making the application of whole-genome sequencing a tendency in the study of disease-causing genetic variants. Nevertheless, there still lacks a repository that collects predictions of functionally...

Jan 2016
Jiaxin Wu, Mengmeng Wu, Lianshuo Li, et al.