Does Selection against Transcriptional Interference Shape Retroelement-Free Regions in Mammalian Genomes?

PLOS ONE, Nov 2008

Background Eukaryotic genomes are scattered with retroelements that proliferate through retrotransposition. Although retroelements make up around 40 percent of the human genome, large regions are found to be completely devoid of retroelements. This has been hypothesised to be a result of genomic regions being intolerant to insertions of retroelements. The inadvertent transcriptional activity of retroelements may affect neighbouring genes, which in turn could be detrimental to an organism. We speculate that such retroelement transcription, or transcriptional interference, is a contributing factor in generating and maintaining retroelement-free regions in the human genome. Methodology/Principal Findings Based on the known transcriptional properties of retroelements, we expect long interspersed elements (LINEs) to be able to display a high degree of transcriptional interference. In contrast, we expect short interspersed elements (SINEs) to display very low levels of transcriptional interference. We find that genomic regions devoid of long interspersed elements (LINEs) are enriched for protein-coding genes, but that this is not the case for regions devoid of short interspersed elements (SINEs). This is expected if genes are subject to selection against transcriptional interference. We do not find microRNAs to be associated with genomic regions devoid of either SINEs or LINEs. We further observe an increased relative activity of genes overlapping LINE-free regions during early embryogenesis, where activity of LINEs has been identified previously. Conclusions/Significance Our observations are consistent with the notion that selection against transcriptional interference has contributed to the maintenance and/or generation of retroelement-free regions in the human genome.

Does Selection against Transcriptional Interference Shape Retroelement-Free Regions in Mammalian Genomes?

Willerslev E (2008) Does Selection against Transcriptional Interference Shape Retroelement-Free Regions in Mammalian Genomes? PLoS ONE 3(11): e3760. doi:10.1371/journal.pone.0003760 Does Selection against Transcriptional Interference Shape Retroelement-Free Regions in Mammalian Genomes? Tobias Mourier 0 Eske Willerslev 0 Rodolfo Aramayo, Texas A&M University, United States of America 0 Ancient DNA and Evolution Group, Department of Biology, University of Copenhagen , Copenhagen , Denmark Background: Eukaryotic genomes are scattered with retroelements that proliferate through retrotransposition. Although retroelements make up around 40 percent of the human genome, large regions are found to be completely devoid of retroelements. This has been hypothesised to be a result of genomic regions being intolerant to insertions of retroelements. The inadvertent transcriptional activity of retroelements may affect neighbouring genes, which in turn could be detrimental to an organism. We speculate that such retroelement transcription, or transcriptional interference, is a contributing factor in generating and maintaining retroelement-free regions in the human genome. Methodology/Principal Findings: Based on the known transcriptional properties of retroelements, we expect long interspersed elements (LINEs) to be able to display a high degree of transcriptional interference. In contrast, we expect short interspersed elements (SINEs) to display very low levels of transcriptional interference. We find that genomic regions devoid of long interspersed elements (LINEs) are enriched for protein-coding genes, but that this is not the case for regions devoid of short interspersed elements (SINEs). This is expected if genes are subject to selection against transcriptional interference. We do not find microRNAs to be associated with genomic regions devoid of either SINEs or LINEs. We further observe an increased relative activity of genes overlapping LINE-free regions during early embryogenesis, where activity of LINEs has been identified previously. Conclusions/Significance: Our observations are consistent with the notion that selection against transcriptional interference has contributed to the maintenance and/or generation of retroelement-free regions in the human genome. - Transposable elements are genetic elements that are capable of proliferating withinand even betweengenomes. The elements can be broadly divided into two classes [1]: Class I elements transpose via an RNA intermediate that is reverse transcribed to DNA. We henceforth refer to class I elements as retroelements. Class II elements transpose via a DNA intermediate. With a few recorded exceptions (e.g. refs [2,3]) retroelements are found in all eukaryotic genomes examined, and nearly half of the human genome sequence can be attributed to the activity of retroelements [4]. Recently, Simons and colleagues identified almost 1000 regions in the genomes of human and mouse of at least 10 kilo base pairs (kbp) in size with no transposable elements [5]. Such regions termed TFRs for Transposon-Free Regionswere found to be conserved among other mammals, and associated with microRNAs and genes encoding transcription factors [5,6]. The authors hypothesized that the TFRs encode regions of essential regulatory information that are intolerant to the insertion of transposable elements. The hypothesised selective disadvantage of transposable elements may in many cases be a result of disruption of the informational content of the sequence in which the transposable element is inserted. Yet, the transcriptional activity of transposable elements could be an additional contributor to the deleterious effects of transposable elements, which are presumably selected against in TFRs. This implies that retroelements are not just avoided in TFRs due to the insertion per se, but also to minimize spurious transcription from retroelements. I.e. it is not necessarily the insertion of a sequence that has a deleterious effect, but rather the subsequent transcriptional activity from the inserted sequence. Retroelements contain promoters and transcription factor binding sites necessary for their own transcription. Occasionally, the transcription may continue into adjacent regions. If these adjacent regions encode genes, the transcription may potentially result in transcripts containing both transposable element sequence and gene sequence [7,8], or for example, repress endogenous transcription of the neighbouring gene by promoter competition [9]. Transcriptional interference may potentially occur at different stages of transcription, of which some are experimentally verified and others are purely speculative (see [10] and references therein). Retroelements display a great divergence in transcriptional capacity and activity. Short interspersed elements (SINEs) contain a weak internal polymerase III promoter [11], usually not capable of initiating transcription by itself [12]. Further, the polymerase III generates only shorter transcripts. In contrast, long interspersed elements (LINEs) and Long terminal repeat (LTR) elements harbour polymerase II transcription start sites that are capable of transcribing into adjacent genomic regions [13,14]. LINEs even contain an additional promoter situated in the antisense orientation, which is known to transcribe neighbouring genes [15,16]. The difference in transcriptional features between different transposable elements predicts that the elements will differ in their capabilities in transcriptional interference of neighbouring genes. Firstly, polymerase II transcribed elements will be able to transcribe into adjacent genes, which is not expected for polymerase III transcribed elements. Secondly, as protein-coding genes and presumable microRNAs [17] are transcribed by polymerase II, promoter competition will exclusively be expected from transposable elements transcribed by this polymerase. Thirdly, any physical interaction between transcriptional complexes is expected to be most prominent from polymerase II transcribed elements, simply because these transcriptional complexes will move further along the genome. Consequently, the impact of transcriptional interference should be highest for LINEs and LTR elements, and we are thus able to test the hypothesis that transcriptional interference is contributing to the existence and maintenance of TFRs: Protein-coding genes and RNA genes that are sensitive to the deleterious effects of transcriptional interference should be enriched in genomic regions devoid of polymerase II transcribed transposable elements, whereas this should not be the case for regions devoid of polymerase III transcribed transposable elements. Which genes are then susceptible to the deleterious effects of transcriptional interference? Two conditions must be fulfilled. First, the precise regulation of the genes must be crucial to the organism, and second, the space and time (i.e. developmental stage (...truncated)


This is a preview of a remote PDF: http://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0003760&type=printable
Article home page: http://www.plosone.org/article/info%3Adoi%2F10.1371%2Fjournal.pone.0003760

Tobias Mourier, Eske Willerslev. Does Selection against Transcriptional Interference Shape Retroelement-Free Regions in Mammalian Genomes?, PLOS ONE, 2008, Volume 3, Issue 11, DOI: 10.1371/journal.pone.0003760