Exploiting the Transcriptome of Euphrates Poplar, Populus euphratica (Salicaceae) to Develop and Characterize New EST-SSR Markers and Construct an EST-SSR Database

PLOS ONE, Dec 2019

Background Microsatellite markers or Simple Sequence Repeats (SSRs) are the most popular markers in population/conservation genetics. However, the development of novel microsatellite markers has been impeded by high costs, a lack of available sequence data and technical difficulties. New species-specific microsatellite markers were required to investigate the evolutionary history of the Euphratica tree, Populus euphratica, the only tree species found in the desert regions of Western China and adjacent Central Asian countries. Methodology/Principal Findings A total of 94,090 non-redundant Expressed Sequence Tags (ESTs) from P. euphratica comprising around 63 Mb of sequence data were searched for SSRs. 4,202 SSRs were found in 3,839 ESTs, with 311 ESTs containing multiple SSRs. The most common motif types were trinucleotides (37%) and hexanucleotides (33%) repeats. We developed primer pairs for all of the identified EST-SSRs (eSSRs) and selected 673 of these pairs at random for further validation. 575 pairs (85%) gave successful amplification, of which, 464 (80.7%) were polymorphic in six to 24 individuals from natural populations across Northern China. We also tested the transferability of the polymorphic eSSRs to nine other Populus species. In addition, to facilitate the use of these new eSSR markers by other researchers, we mapped them onto Populus trichocarpa scaffolds in silico and compiled our data into a web-based database (http://202.205.131.253:8080/poplar/resources/static_page/index.html). Conclusions The large set of validated eSSRs identified in this work will have many potential applications in studies on P. euphratica and other poplar species, in fields such as population genetics, comparative genomics, linkage mapping, QTL, and marker-assisted breeding. Their use will be facilitated by their incorporation into a user-friendly web-based database.

Exploiting the Transcriptome of Euphrates Poplar, Populus euphratica (Salicaceae) to Develop and Characterize New EST-SSR Markers and Construct an EST-SSR Database

Populus euphratica (Salicaceae) to Develop and Characterize New EST-SSR Markers and Construct an EST-SSR Database. PLoS ONE 8(4): e61337. doi:10.1371/journal.pone.0061337 Exploiting the Transcriptome of Euphrates Poplar, Populus euphratica (Salicaceae) to Develop and Characterize New EST-SSR Markers and Construct an EST- SSR Database Fang K. Du 0 Fang Xu 0 Hong Qu 0 Sisi Feng 0 Jijun Tang 0 Rongling Wu 0 Jean-Marc Lacape, CIRAD, France 0 1 Center for Computational Biology, National Engineering Laboratory for Tree Breeding, College of Biological Sciences and Biotechnology, Beijing Forestry University , Beijing, China, 2 Center for Bioinformatics , National Laboratory of Protein Engineering and Plant Genetic Engineering, College of Life Sciences, Peking University , Beijing , China , 3 Department of Computer Science and Engineering, University of South Carolina , Columbia, South Carolina , United States of America Background: Microsatellite markers or Simple Sequence Repeats (SSRs) are the most popular markers in population/ conservation genetics. However, the development of novel microsatellite markers has been impeded by high costs, a lack of available sequence data and technical difficulties. New species-specific microsatellite markers were required to investigate the evolutionary history of the Euphratica tree, Populus euphratica, the only tree species found in the desert regions of Western China and adjacent Central Asian countries. Methodology/Principal Findings: A total of 94,090 non-redundant Expressed Sequence Tags (ESTs) from P. euphratica comprising around 63 Mb of sequence data were searched for SSRs. 4,202 SSRs were found in 3,839 ESTs, with 311 ESTs containing multiple SSRs. The most common motif types were trinucleotides (37%) and hexanucleotides (33%) repeats. We developed primer pairs for all of the identified EST-SSRs (eSSRs) and selected 673 of these pairs at random for further validation. 575 pairs (85%) gave successful amplification, of which, 464 (80.7%) were polymorphic in six to 24 individuals from natural populations across Northern China. We also tested the transferability of the polymorphic eSSRs to nine other Populus species. In addition, to facilitate the use of these new eSSR markers by other researchers, we mapped them onto Populus trichocarpa scaffolds in silico and compiled our data into a web-based database (http://202.205.131.253:8080/ poplar/resources/static_page/index.html). Conclusions: The large set of validated eSSRs identified in this work will have many potential applications in studies on P. euphratica and other poplar species, in fields such as population genetics, comparative genomics, linkage mapping, QTL, and marker-assisted breeding. Their use will be facilitated by their incorporation into a user-friendly web-based database. - Funding: This work is financially supported by Fundamental Research Funds for the Central Universities (TD 2012-01), the Research Fund for the Doctoral Programme of Higher Education of China to FKD; Changjiang Scholars Award and Thousand-person Plan Award to RLW. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. Competing Interests: Co-author Rongling Wu is a PLOS ONE Editorial Board member; this does not alter the authors adherence to all the PLOS ONE policies on sharing data and materials. Populus euphratica Oliv. (Salicaceae) inhabits semi-arid areas, with a natural distribution ranging from Western China and the Middle East to Spain and Western Morocco. As a desert tree, it is considered to be salinity and drought tolerant [1]. Moreover, it plays an important role in maintaining the ecological equilibrium of desert environments by counteracting the effects of dust storms, stabilizing sand, and retaining water [2]. Changes in its habitats and distribution have caused natural stands of P. euphratica to diminish over the last few decades, and some populations have disappeared [3]. P. euphratica is a useful model organism for studying salt and drought resistance mechanisms in trees because it can tolerate NaCl concentrations of up to 450 mM [4,5] and mannitol concentrations up to 400 mM [6] in controlled experiments. Much of the work conducted on this species to date has focused on the physiological mechanisms responsible for its resistance to abiotic factors (e.g. [7] and references therein). More recent studies using Amplified Fragment Length Polymorphism (AFLP) or microsatellite analysis based on polyacrylamide gel electrophoresis (PAGE) with silver staining have provided useful information on its genetics and evolutionary history [80]. However, partly because of the scarcity of suitable molecular markers, much remains to be learned about the genetic factors responsible for the ability of P. euphratica to cope with various adverse environmental conditions. SSRs (simple sequence repeats) [11] are tandem repeated DNA sequences with a repeating motif of one to six base pairs. They are also known as STRs (short tandem repeats) [12], SSLPs (simple sequence length polymorphisms) [13], and microsatellites [14]. SSRs are common and widely distributed in prokaryotic and eukaryotic genomes [15]. Because of their high rates of mutation, they tend to be highly polymorphic [16]. This makes it possible to distinguish between alleles based on their length using either classical PAGE, or by DNA sequencing when working on a larger scale or when higher accuracy is required [17]. Because of these useful properties, SSRs are the most popular markers in population genetics [17] and have been used in many applications such as gene tagging, Quantitative Trait Locus (QTL) mapping, marker assisted selection (MAS), parentage analysis, fingerprinting, and phylogenetic and taxonomic studies [18]. Two categories of SSRs can be defined based on their locations in the genome: Expressed Sequence Tag-SSRs (eSSRs) are embedded in transcribed sequences [19], while genomic SSRs (gSSRs) are embedded in both transcribed and non-transcribed sequences. Until recently, the lack of the detailed genomic information for many plants has meant that SSR marker development was primarily conducted using selective hybridization strategies, which require fragmented genomic DNA that is bound to either a nylon membrane using repeat-containing probes or to biotinylated probes that are subsequently captured on streptavidin-coated beads [2023]. This is complicated by the difficulty of preparing uniformly digested genome DNA, the cumbersome nature of the molecular cloning, the limited scope for obtaining sufficiently long sequences, and the high cost of the overall process. However, thousands of ESTs can be obtained from a single cDNA library comprising fragments with lengths of 300500 bp, from which plenty of eSSRs can be designed. Because ESTs are conserved across phylogenetically related species, eSSRs developed for one species are often highly transferable and thi (...truncated)


This is a preview of a remote PDF: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0061337&type=printable
Article home page: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0061337

Fang K. Du, Fang Xu, Hong Qu, Sisi Feng, Jijun Tang, Rongling Wu. Exploiting the Transcriptome of Euphrates Poplar, Populus euphratica (Salicaceae) to Develop and Characterize New EST-SSR Markers and Construct an EST-SSR Database, PLOS ONE, 2013, Volume 8, Issue 4, DOI: 10.1371/journal.pone.0061337