A High-Dimensional, Deep-Sequencing Study of Lung Adenocarcinoma in Female Never-Smokers

PLOS ONE, Dec 2019

Background Deep sequencing techniques provide a remarkable opportunity for comprehensive understanding of tumorigenesis at the molecular level. As omics studies become popular, integrative approaches need to be developed to move from a simple cataloguing of mutations and changes in gene expression to dissecting the molecular nature of carcinogenesis at the systemic level and understanding the complex networks that lead to cancer development. Results Here, we describe a high-throughput, multi-dimensional sequencing study of primary lung adenocarcinoma tumors and adjacent normal tissues of six Korean female never-smoker patients. Our data encompass results from exome-seq, RNA-seq, small RNA-seq, and MeDIP-seq. We identified and validated novel genetic aberrations, including 47 somatic mutations and 19 fusion transcripts. One of the fusions involves the c-RET gene, which was recently reported to form fusion genes that may function as drivers of carcinogenesis in lung cancer patients. We also characterized gene expression profiles, which we integrated with genomic aberrations and gene regulations into functional networks. The most prominent gene network module that emerged indicates that disturbances in G2/M transition and mitotic progression are causally linked to tumorigenesis in these patients. Also, results from the analysis strongly suggest that several novel microRNA-target interactions represent key regulatory elements of the gene network. Conclusions Our study not only provides an overview of the alterations occurring in lung adenocarcinoma at multiple levels from genome to transcriptome and epigenome, but also offers a model for integrative genomics analysis and proposes potential target pathways for the control of lung adenocarcinoma.

A High-Dimensional, Deep-Sequencing Study of Lung Adenocarcinoma in Female Never-Smokers

Deep-Sequencing Study of Lung Adenocarcinoma in Female Never-Smokers. PLoS ONE 8(2): e55596. doi:10.1371/journal.pone.0055596 A High-Dimensional, Deep-Sequencing Study of Lung Adenocarcinoma in Female Never-Smokers Sang Cheol Kim 0 Yeonjoo Jung 0 Jinah Park 0 Sooyoung Cho 0 Chaehwa Seo 0 Jaesang Kim 0 Pora Kim 0 Jehwan Park 0 Jihae Seo 0 Jiwoong Kim 0 Seongjin Park 0 Insu Jang 0 Namshin Kim 0 Jin Ok Yang 0 Byungwook Lee 0 Kyoohyoung Rho 0 Yeonhwa Jung 0 Juhee Keum 0 Jinseon Lee 0 Jungho Han 0 Sangeun Kang 0 Sujin Bae 0 So-Jung Choi 0 Sujin Kim 0 Jong-Eun Lee 0 Wankyu Kim 0 Jhingook Kim 0 Sanghyuk Lee 0 William C S Cho, Queen Elizabeth Hospital, Hong Kong 0 1 Korean Bioinformation Center (KOBIC), Korea Research Institute of Bioscience and Biotechnology , Daejeon , Korea , 2 Ewha Research Center for Systems Biology (ERCSB), Ewha Womans University , Seoul , Korea , 3 Division of Life and Pharmaceutical Sciences and the Center for Cell Signaling and Drug Discovery Research, Ewha Womans University , Seoul , Korea , 4 Samsung Biomedical Research Institute (SBRI) and Cancer Research Institute, Samsung Medical Center, Sungkyunkwan University School of Medicine , Seoul , Korea , 5 Department of Pathology, Samsung Medical Center, Sungkyunkwan University School of Medicine , Seoul , Korea , 6 DNA Link Inc., Seoul, Korea, 7 Department of Thoracic Surgery, Samsung Medical Center, Sungkyunkwan University School of Medicine , Seoul , Korea Background: Deep sequencing techniques provide a remarkable opportunity for comprehensive understanding of tumorigenesis at the molecular level. As omics studies become popular, integrative approaches need to be developed to move from a simple cataloguing of mutations and changes in gene expression to dissecting the molecular nature of carcinogenesis at the systemic level and understanding the complex networks that lead to cancer development. Results: Here, we describe a high-throughput, multi-dimensional sequencing study of primary lung adenocarcinoma tumors and adjacent normal tissues of six Korean female never-smoker patients. Our data encompass results from exome-seq, RNAseq, small RNA-seq, and MeDIP-seq. We identified and validated novel genetic aberrations, including 47 somatic mutations and 19 fusion transcripts. One of the fusions involves the c-RET gene, which was recently reported to form fusion genes that may function as drivers of carcinogenesis in lung cancer patients. We also characterized gene expression profiles, which we integrated with genomic aberrations and gene regulations into functional networks. The most prominent gene network module that emerged indicates that disturbances in G2/M transition and mitotic progression are causally linked to tumorigenesis in these patients. Also, results from the analysis strongly suggest that several novel microRNA-target interactions represent key regulatory elements of the gene network. Conclusions: Our study not only provides an overview of the alterations occurring in lung adenocarcinoma at multiple levels from genome to transcriptome and epigenome, but also offers a model for integrative genomics analysis and proposes potential target pathways for the control of lung adenocarcinoma. - Funding: This work was funded by grants from the KRIBB Research Initiative Program (S.L.), National Research Foundation of Korea (grant numbers 20110019747 and 20110002321 to S.L. and grant number 20110014992 to WK), GIST Systems Biology Infrastructure Establishment Grant through ERCSB (SL and JK), and RPGrant 2011 of Ewha Womans University (SC). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. Competing Interests: The authors acknowledge that two of the authors, Sujin Kim and Jong-Eun Lee belong to a commercial company, DNA Link Inc. The authors duly state here that no competing interests, financial or non-financial, are incurred by this affiliation. Furthermore, this does not alter the authors adherence to the PLOS ONE policies on sharing data and materials. . These authors contributed equally to this work. Recent advances in DNA sequencing technology have revolutionized genomics and biomedical research, especially in the field of cancer research [1]. Various types of mutations as well as large scale chromosomal aberrations are being reported and cataloged, and the rate of data accumulation will likely accelerate for the foreseeable future. This should certainly apply to lung cancer which is currently the second most common cancer and the primary cause of mortality among cancer-related death in the United States [2]. The first complete sequence of a lung adenocarcinoma genome revealed about 50 000 single nucleotide variations in the tumor relative to normal lung [3]. This was followed by the sequencing study of a small-cell lung cancer genome which highlighted the role of tobacco carcinogens such as polycyclic aromatic hydrocarbons in shaping mutational patterns in lung cancers from smokers [4]. Transcriptome analysis of multiple lung adenocarciFigure 1. Circos plot of somatic mutations, copy number variations, transcriptome expression, and structural variations. From inside to out, structural variations (purple and orange), copy number variations (gain in dark red, loss in dark blue, mRNA expression (up in gold, down in olive), differentially expressed microRNAs (up in red, down in green), DNA methylation with sky-blue background (up in dark orange, down in chartreuse), somatic mutations with a gene symbols, and chromosomal cytobands. doi:10.1371/journal.pone.0055596.g001 noma patients using next-generation sequencing (NGS) recently showed the existence of a fusion gene containing the tyrosine kinase domain of the c-RET oncogene in 1%2% of patients; this fusion leads to aberrant activation of RET kinase and is considered to be a new driver mutation of lung adenocarcinoma [5]. This finding was further confirmed through an independent study using a combination of targeted sequencing with an integrated molecular- and histopathology-based screening system [6]. Given that patients with c-RET fusions do not harbor mutations or fusions in EGFR, KRAS or ALK oncogenes, it is likely that c-RET fusion genes represent lung adenocarcinoma drivers and will lead to the definition of a new subclass of lung cancer [5]. Identifying mutations with high probabilities of being drivers, mutations that confer genes with oncogenic activity, is clearly a prototypical and certainly already a productive application of NGS, but the greater challenge is moving beyond the simple cataloging of mutations and establishing means for integrating diverse high-throughput data generated by NGS [7] to understand cancer at the multiple levels of gene networks and signaling pathways [8]. In this report, we describe a high-dimensional, highthroughput sequencing study of primary lung tumors and matched normal tissues isolated from 6 Korean female never-smoker pat (...truncated)


This is a preview of a remote PDF: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0055596&type=printable
Article home page: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0055596

Sang Cheol Kim, Yeonjoo Jung, Jinah Park, Sooyoung Cho, Chaehwa Seo, Jaesang Kim, Pora Kim, Jehwan Park, Jihae Seo, Jiwoong Kim, Seongjin Park, Insu Jang, Namshin Kim, Jin Ok Yang, Byungwook Lee, Kyoohyoung Rho, Yeonhwa Jung, Juhee Keum, Jinseon Lee, Jungho Han, Sangeun Kang, Sujin Bae, So-Jung Choi, Sujin Kim, Jong-Eun Lee, Wankyu Kim, Jhingook Kim, Sanghyuk Lee. A High-Dimensional, Deep-Sequencing Study of Lung Adenocarcinoma in Female Never-Smokers, PLOS ONE, 2013, 2, DOI: 10.1371/journal.pone.0055596