The mouse Gene Expression Database (GXD): 2017 update

Nucleic Acids Research, Jan 2017

The Gene Expression Database (GXD; www.informatics.jax.org/expression.shtml) is an extensive and well-curated community resource of mouse developmental expression information. Through curation of the scientific literature and by collaborations with large-scale expression projects, GXD collects and integrates data from RNA in situ hybridization, immunohistochemistry, RT-PCR, northern blot and western blot experiments. Expression data from both wild-type and mutant mice are included. The expression data are combined with genetic and phenotypic data in Mouse Genome Informatics (MGI) and made readily accessible to many types of database searches. At present, GXD includes over 1.5 million expression results and more than 300 000 images, all annotated with detailed and standardized metadata. Since our last report in 2014, we have added a large amount of data, we have enhanced data and database infrastructure, and we have implemented many new search and display features. Interface enhancements include: a new Mouse Developmental Anatomy Browser; interactive tissue-by-developmental stage and tissue-by-gene matrix views; capabilities to filter and sort expression data summaries; a batch search utility; gene-based expression overviews; and links to expression data from other species.

Article PDF cannot be displayed. You can download it here:

https://nar.oxfordjournals.org/content/45/D1/D730.full.pdf

The mouse Gene Expression Database (GXD): 2017 update

D730–D736 Nucleic Acids Research, 2017, Vol. 45, Database issue doi: 10.1093/nar/gkw1073 Published online 28 November 2016 The mouse Gene Expression Database (GXD): 2017 update Jacqueline H. Finger, Constance M. Smith, Terry F. Hayamizu, Ingeborg J. McCright, Jingxia Xu, Meiyee Law, David R. Shaw, Richard M. Baldarelli, Jon S. Beal, Olin Blodgett, Jeff W. Campbell, Lori E. Corbani, Jill R. Lewis, Kim L. Forthofer, Pete J. Frost, Sharon C. Giannatto, Lucie N. Hutchins, Dave B. Miers, Howie Motenko, Kevin R. Stone, Janan T. Eppig, James A. Kadin, Joel E. Richardson and Martin Ringwald* The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, USA Received September 14, 2016; Revised October 21, 2016; Editorial Decision October 24, 2016; Accepted October 28, 2016 The Gene Expression Database (GXD; www. informatics.jax.org/expression.shtml) is an extensive and well-curated community resource of mouse developmental expression information. Through curation of the scientific literature and by collaborations with large-scale expression projects, GXD collects and integrates data from RNA in situ hybridization, immunohistochemistry, RT-PCR, northern blot and western blot experiments. Expression data from both wild-type and mutant mice are included. The expression data are combined with genetic and phenotypic data in Mouse Genome Informatics (MGI) and made readily accessible to many types of database searches. At present, GXD includes over 1.5 million expression results and more than 300 000 images, all annotated with detailed and standardized metadata. Since our last report in 2014, we have added a large amount of data, we have enhanced data and database infrastructure, and we have implemented many new search and display features. Interface enhancements include: a new Mouse Developmental Anatomy Browser; interactive tissueby-developmental stage and tissue-by-gene matrix views; capabilities to filter and sort expression data summaries; a batch search utility; gene-based expression overviews; and links to expression data from other species. INTRODUCTION Developmental gene expression information from wild-type and mutant mice provides crucial insights into the molecular mechanisms of human development, differentiation and disease. The longstanding objective of GXD has been to capture and integrate these complex and heterogeneous expression data, generated by many laboratories and scattered through thousands of publications, and to make them readily accessible to researchers via biologically and biomedically relevant searches. Designed from the beginning as an open-ended system that can robustly integrate different types of expression data, GXD collects RNA and protein expression information from RNA in situ hybridization, immunohistochemistry, in situ reporter (knock in), RT-PCR, northern blot, and western blot experiments (1,2). Data are acquired from the literature and through collaboration with projects that generate the types of data GXD collects at a large scale. All these data are reviewed and annotated by GXD Curators in standardized ways by making extensive use of controlled vocabularies and ontologies. As an integral component of the larger Mouse Genome Informatics (MGI) resource (3–5), GXD combines the expression data with genetic, functional, phenotypic, and disease-oriented data, thereby enabling unique and powerful search capabilities. GXD and its user interfaces have been described previously (6,7). Here, we focus on progress made since our last report in the NAR Database Issue (8). DATA CONTENT AND PROGRESS IN DATA ACQUISITION Comprehensive literature survey We systematically survey journals to find all publications examining endogenous gene expression during mouse development. In a first curation step for each paper, we annotate the genes and ages analyzed and the expression assay types used. Annotations are based on the entire publication, including supplemental information, and employ official nomenclature for genes. This information, combined with bibliographic information from PubMed, is made available for searches via the Gene Expression Literature Query form (http://www.informatics.jax.org/gxdlit). GXD’s liter- * To whom correspondence should be addressed. Tel: +1 207 288 6436; Fax: +1 207 288 6830; Email:  C The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact ABSTRACT Nucleic Acids Research, 2017, Vol. 45, Database issue D731 Figure 1. The Assay Details Page––annotated expression data are displayed with citation information, gene, and experimental information, such as the assay type, probe visualization method and link to probe specifics. The results section of the page includes the level and pattern of expression in anatomical structures (annotated using the Mouse Developmental Anatomy Ontology). Image panes accompany the result annotations when available. The pane can be viewed in the context of the full figure by clicking on the image or the link in the result table. Specimen information includes the age, genetic background and specimen preparation methods. Further details can be viewed by expanding the ‘more’ toggle (boxed region indicated by red circle & arrow). If the specimen is from a mutant mouse, the official allele name is shown with links to all the information for that allele in MGI, including all the other expression data for this specific mouse mutant. ature content records are comprehensive and up-to-date from 1990 to the present. GXD has records for over 24 300 references and >15 500 genes. The Gene Expression Literature Query provides researchers with an effective tool for finding publications with specific expression data, and it helps GXD Curators to prioritize papers for detailed expression annotation. Detailed expression data In a second curation step, the expression data are annotated in detail. For each expression assay, we record information about the gene studied, the strength and pattern of expression in specific anatomical structures, the probes and experimental conditions used, and the age and genetic background of the specimen(s) analyzed. Images of the data accompany the annotations when available. Standard gene, D732 Nucleic Acids Research, 2017, Vol. 45, Database issue Figure 2. The Mouse Developmental Anatomy Browser––divided into three sections, the Anatomy Search, the Anatomical Term Detail and the Anatomical Tree View, the user can quickly find terms and explore the mouse anatomy ontology. As indicated in the Term Detail section, individual structures can have multiple parents; ‘branchial arch endoderm’ has Parent Terms ‘embryo endoderm’ (...truncated)


This is a preview of a remote PDF: https://nar.oxfordjournals.org/content/45/D1/D730.full.pdf
Article home page: http://nar.oxfordjournals.org/content/45/D1/D730.abstract

Jacqueline H. Finger, Constance M. Smith, Terry F. Hayamizu, Ingeborg J. McCright, Jingxia Xu, Meiyee Law, David R. Shaw, Richard M. Baldarelli, Jon S. Beal, Olin Blodgett, Jeff W. Campbell, Lori E. Corbani, Jill R. Lewis, Kim L. Forthofer, Pete J. Frost, Sharon C. Giannatto, Lucie N. Hutchins, Dave B. Miers, Howie Motenko, Kevin R. Stone, Janan T. Eppig, James A. Kadin, Joel E. Richardson, Martin Ringwald. The mouse Gene Expression Database (GXD): 2017 update, Nucleic Acids Research, 2017, pp. D730-D736, 45/D1, DOI: 10.1093/nar/gkw1073