Functional Categories Associated with Clusters of Genes That Are Co-Expressed across the NCI-60 Cancer Cell Lines

PLOS ONE, Dec 2019

Background The NCI-60 is a panel of 60 diverse human cancer cell lines used by the U.S. National Cancer Institute to screen compounds for anticancer activity. In the current study, gene expression levels from five platforms were integrated to yield a single composite transcriptome profile. The comprehensive and reliable nature of that dataset allows us to study gene co-expression across cancer cell lines. Methodology/Principal Findings Hierarchical clustering revealed numerous clusters of genes in which the genes co-vary across the NCI-60. To determine functional categorization associated with each cluster, we used the Gene Ontology (GO) Consortium database and the GoMiner tool. GO maps genes to hierarchically-organized biological process categories. GoMiner can leverage GO to perform ontological analyses of gene expression studies, generating a list of significant functional categories. Conclusions/Significance GoMiner analysis revealed many clusters of coregulated genes that are associated with functional groupings of GO biological process categories. Notably, those categories arising from coherent co-expression groupings reflect cancer-related themes such as adhesion, cell migration, RNA splicing, immune response and signal transduction. Thus, these clusters demonstrate transcriptional coregulation of functionally-related genes.

Functional Categories Associated with Clusters of Genes That Are Co-Expressed across the NCI-60 Cancer Cell Lines

et al. (2012) Functional Categories Associated with Clusters of Genes That Are Co- Expressed across the NCI-60 Cancer Cell Lines. PLoS ONE 7(1): e30317. doi:10.1371/journal.pone.0030317 Functional Categories Associated with Clusters of Genes That Are Co-Expressed across the NCI-60 Cancer Cell Lines Barry R. Zeeberg 0 1 William Reinhold 0 1 Rene Snajder 0 1 Gerhard G. Thallinger 0 1 John N. Weinstein 0 1 Kurt W. Kohn 0 1 Yves Pommier 0 1 Ilya Ulasov, University of Chicago, United States of America 0 Current address: Departments of Bioinformatics and Computational Biology and Systems Biology, M.D. Anderson Cancer Center , Houston, Texas , United States of America 1 1 Laboratory of Molecular Pharmacology, Center for Cancer Research, National Cancer Institute, National Institutes of Health (NIH) , Bethesda , Maryland, United States of America, 2 Institute for Genomics and Bioinformatics, Graz University of Technology , Graz , Austria Background: The NCI-60 is a panel of 60 diverse human cancer cell lines used by the U.S. National Cancer Institute to screen compounds for anticancer activity. In the current study, gene expression levels from five platforms were integrated to yield a single composite transcriptome profile. The comprehensive and reliable nature of that dataset allows us to study gene coexpression across cancer cell lines. Methodology/Principal Findings: Hierarchical clustering revealed numerous clusters of genes in which the genes co-vary across the NCI-60. To determine functional categorization associated with each cluster, we used the Gene Ontology (GO) Consortium database and the GoMiner tool. GO maps genes to hierarchically-organized biological process categories. GoMiner can leverage GO to perform ontological analyses of gene expression studies, generating a list of significant functional categories. Conclusions/Significance: GoMiner analysis revealed many clusters of coregulated genes that are associated with functional groupings of GO biological process categories. Notably, those categories arising from coherent co-expression groupings reflect cancer-related themes such as adhesion, cell migration, RNA splicing, immune response and signal transduction. Thus, these clusters demonstrate transcriptional coregulation of functionally-related genes. - Funding: This research was supported by the Intramural Research Program of the National Institutes of Health, National Cancer Institute, Center for Cancer Research, Research and the Austrian Ministry of Science and Research, GEN-AU project Bioinformatics Integration Network. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. Competing Interests: The authors have declared that no competing interests exist. The NCI-60 is a panel of 60 human cancer cell lines that has been used by the Developmental Therapeutics Program (DTP) of the U.S. National Cancer Institute to screen compounds plus natural products since 1990 [1,2]. The NCI-60 panel includes cell lines from colorectal (CO), renal (RE), ovarian (OV), prostate (PR), lung (LC), breast (BR), and central nervous system (CNS) cancer origin, as well as leukemias (LE) and melanomas (ME). We and our many collaborators around the world have profiled the NCI-60 more comprehensively at the DNA, RNA, protein, mutation, functional, and pharmacological levels than any other set of cells in existence [1,2,3,4,5,6]. The NCI-60 data have been widely used in cancer research and bioinformatics, but the multiple datasets may be most informative for the recognition of complex biosignatures. Such biosignatures may in turn lead to increased understanding of cell phenotypes and pathway relationships within the cell. We previously developed GoMiner [7] and High-Throughput GoMiner [8], applications that organize lists of interesting genes (for example, under- and over-expressed genes from a microarray experiment) for biological interpretation in the context of the Gene Ontology [9,10]. GoMiner and related tools typically generate a list of significant functional categories. In addition to lists and tables, High-Throughput GoMiner can provide two kinds of clustered image maps (CIMs) as graphical output. Integrative categories versus experiments CIMs capture the relationships between categories and multiple experiments; individual categories versus genes CIMs capture the relationships between categories and genes. Both types of CIMs are used to present the results in the present work. In the past decade, systems biology has become increasingly prominent as the numbers of analyzable genes and biological parameters have increased, and is beginning to show their functional relationships. A standard approach for studying systems biology with genomic data is to cluster genes whose expression profiles co-vary either over a time course or across multiple samples. For example, Garraway et al. [11] performed an integrated supervised analysis of SNP array and gene expression data to identify MITF as a lineage survival oncogene amplified in malignant melanoma. A number of additional gene expression microarray demonstrate the potential of gene co-expression studies. For example, Prieto et al. [12] used the Affymetrix HGU133A platform to identify co-expression networks in a diversity of human tissue samples. Their network revealed a map of coexpression clusters organized in well-defined functional constellations. Two major regions in this network corresponded to genes involved in nuclear and mitochondrial metabolism. That study is not directly relevant to cancer, though, since no cancer tissues were included in the study. Choi et al. [13] did study cancer tissues, but had unfortunately culled published data from what would now be considered to be outdated (Affymetrix U95A) or unreliable (cDNA) platforms. Also, the data obtained on different platforms needed to be reconciled, and the date of the studies preceded the availability of reliable resources like AffyProbeMiner [14] and SpliceCenter [15]. Nevertheless, Choi was able to detect functional differences between normal growth and cancer in terms of gene coexpression changes in broad areas of physiology: energy metabolism, the cell cycle, immune activation and collagen production. Other studies have been focused on tissue-specific genes. Cho et al. [16] revealed many pathways related to the pathophysiology of lung cancer: Cytokine Network and TNF/Stress Related Signaling pathway pair; thrombin signaling and protease-activated receptors pathway; Cell Cycle: G1/S Check Point and Inhibition of Cellular Proliferation by Gleevec. Likewise, the studies of Lai et al. [17] were restricted to prostate cancer and developed a statistical method for identifying differential genegene coexpression patterns in different cell states. For a gene of interest, other genes are selected that have differential genegene coexpression patterns with this gene in different cell states. By us (...truncated)


This is a preview of a remote PDF: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0030317&type=printable
Article home page: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0030317

Barry R. Zeeberg, William Reinhold, René Snajder, Gerhard G. Thallinger, John N. Weinstein, Kurt W. Kohn, Yves Pommier. Functional Categories Associated with Clusters of Genes That Are Co-Expressed across the NCI-60 Cancer Cell Lines, PLOS ONE, 2012, 1, DOI: 10.1371/journal.pone.0030317