InDel markers: An extended marker resource for molecular breeding in chickpea

PLOS ONE, Mar 2019

Chickpea is one of the most important food legumes that holds the key to meet rising global food and nutritional demand. In order to deploy molecular breeding approaches in crop improvement programs, user friendly and cost effective marker resources remain prerequisite. The advent of next generation sequencing (NGS) technology has resulted in the generation of several thousands of markers as part of several large scale genome sequencing and re-sequencing initiatives. Very recently, PCR based Insertion-deletions (InDels) are becoming a popular gel based genotyping solution because of their co-dominant, inexpensive, and highly polymorphic nature. With an objective to expand marker resources for genomics assisted breeding (GAB) in chickpea, whole genome re-sequencing data generated on five parental lines of one interspecific (ICC 4958 × PI 489777) and two intra-specific (ICC 283 × ICC 8261 and ICC 4958 × ICC 1882) mapping populations, were used for identification of InDels. A total of 231,658 InDels were identified using Dindel software with default parameters. Further, a total of 8,307 InDels with ≥20 bp size were selected for development of gel based markers, of which primers could be designed for 7,523 (90.56%) markers. On average, markers appeared at a frequency of 1,038 InDels/LG with a maximum number of markers on CaLG04 (1,952 InDels) and minimum on CaLG08 (360 InDels). In order to validate these InDels, a total of 423 primer pairs were randomly selected and tested on the selected parental lines. A high amplification rate of 80% was observed ranging from 46.06 to 58.01% polymorphism rate across parents on 3% agarose gel. This study clearly reflects the usefulness of available sequence data for the development of genome-wide InDels in chickpea that can further contribute and accelerate a wide range of genetic and molecular breeding activities in chickpea.

A PDF file should load here. If you do not see its contents the file may be temporarily unavailable at the journal website or you do not have a PDF plug-in installed and enabled in your browser.

Alternatively, you can download the file locally and open with any standalone PDF reader:

https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0213999&type=printable

InDel markers: An extended marker resource for molecular breeding in chickpea

March InDel markers: An extended marker resource for molecular breeding in chickpea Ankit Jain 0 1 Manish Roorkiwal 0 1 Sandip KaleID 0 1 Vanika Garg 0 1 Ramakrishna Yadala 0 1 Rajeev K. VarshneyID 0 1 0 International Crops Research Institute for the Semi-Arid Tropics (ICRISAT) , Hyderabad , India , 2 Leibniz- Institut f u ?r Pflanzengenetik und Kulturpflanzenforschung (IPK) , Gatersleben , Germany 1 Editor: Swarup Kumar Parida, National Institute for Plant Genome Research , INDIA Chickpea is one of the most important food legumes that holds the key to meet rising global food and nutritional demand. In order to deploy molecular breeding approaches in crop improvement programs, user friendly and cost effective marker resources remain prerequisite. The advent of next generation sequencing (NGS) technology has resulted in the generation of several thousands of markers as part of several large scale genome sequencing and re-sequencing initiatives. Very recently, PCR based Insertion-deletions (InDels) are becoming a popular gel based genotyping solution because of their co-dominant, inexpensive, and highly polymorphic nature. With an objective to expand marker resources for genomics assisted breeding (GAB) in chickpea, whole genome re-sequencing data generated on five parental lines of one interspecific (ICC 4958 ? PI 489777) and two intra-specific (ICC 283 ? ICC 8261 and ICC 4958 ? ICC 1882) mapping populations, were used for identification of InDels. A total of 231,658 InDels were identified using Dindel software with default parameters. Further, a total of 8,307 InDels with 20 bp size were selected for development of gel based markers, of which primers could be designed for 7,523 (90.56%) markers. On average, markers appeared at a frequency of 1,038 InDels/LG with a maximum number of markers on CaLG04 (1,952 InDels) and minimum on CaLG08 (360 InDels). In order to validate these InDels, a total of 423 primer pairs were randomly selected and tested on the selected parental lines. A high amplification rate of 80% was observed ranging from 46.06 to 58.01% polymorphism rate across parents on 3% agarose gel. This study clearly reflects the usefulness of available sequence data for the development of genome-wide InDels in chickpea that can further contribute and accelerate a wide range of genetic and molecular breeding activities in chickpea. - Data Availability Statement: All relevant data are within the manuscript and its Supporting Information files. Funding: The authors are thankful to Bill & Melinda Gates Foundation (Tropical Legumes III), and Department of Biotechnology, Govt of India under IABF scheme for financial assistance. The work reported in this article was undertaken as a part of the CGIAR Research Program on Grain Legumes and Dryland Cereals (GLDC). ICRISAT is a member of the CGIAR. Introduction Chickpea (Cicer arietinum L.) is a self-pollinated crop with a basic chromosome number eight and ~740 Mbp genome size [ 1 ]. Chickpea is predominently grown on low input marginal lands of arid and semi-arid regions [ 2 ]. It is considered as an important component of subsistence farming in developing countries especially to resource poor farmers. A well balanced nutritional food with 20?30% protein, ~40% carbohydrates, minerals, vitamins, soluble and insoluble fiber, chickpea is an ideal human diet and animal feed, thus plays a significant role in food and nutritional security globally [ 3 ]. Like other legumes, chickpea is also known to symbiotically fix atmospheric nitrogen with rhizobia, thus improving the soil health which makes it ideal for crop rotation programs [ 4 ]. It is estimated that, chickpea can fix up to 140 Kg N ha-1, thus minimizing the application of additional Nitrogen fertilizer in the field [ 5 ]. Currently, chickpea is being grown across 55 nations with an acreage area over 14.56 million hectares resulting in an annual yield of 14.78 million tonnes (FAO 2017). About 1 t ha-1 average productivity falls far below the actual potential (6 t ha-1) of the crop when grown under optimal conditions. Variable abiotic stresses such as temperature, drought, salinity, and biotic factors such as Fusarium wilt (FW) caused by Fusarium oxysporum f.sp. ciceri and Ascochyta blight (AB) caused by Ascochyta rabiei (Pass.), are major factors contributing to productivity losses in chickpea [ 6 ]. Among abiotic stresses, terminal drought is a major production constraint as it delays flowering and affects seed yield. Drought alone is estimated to reduce yield in chickpea by 33% annually [ 7 ] and is expected to become more severe under predicted climate change scenarios. Therefore, there is a dire need to develop improved chickpea varieties that can withstand various biotic and abiotic stresses. Deployment of genomics assisted breeding (GAB), that is the integration of genomic approaches in breeding, is a powerful approach in enhancing crop productivity [ 8,9 ]. Thus, it is imperative to identify and further utilize genomic regions/genes/alleles that are responsible for higher crop productivity using GAB technologies. Until last decade, application of GAB approaches had been a challenging task because of the meagre availability of genomic resources, making chickpea an orphan crop. However, the last decade has witnessed a tremendous increase in the availability of genomic resources to harness the variability of germplasm resources. A shift from isozyme and random amplified polymorphic DNA (RAPD) to amplified fragment length polymorphism (AFLP), simple sequence repeat (SSRs), and single nucleotide polymorphisms (SNPs) has occurred and these markers have been applied in a variety of scenarios ranging from diversity studies to genetic maps construction and QTL analysis for some of the most important agronomic traits [10]. The advent of next-generation sequencing (NGS) and high-throughput genotyping technologies has reduced the genotyping and sequencing cost drastically, which enabled the availability of a draft genome, further enhancing the depth of understanding and extending the genomic resources in chickpea [ 1 ]. NGS based technologies have resulted in the availability of large genomic resources and enriched the marker repository. However, there is still a need to validate these in silico resources and develop markers that can be efficiently used with limited infrastructure requirements. In the recent past, polymorphism attributed by PCR based InDels have received more attention because of their co-dominant inheritance, reproducibility and easy to use nature [ 11 ]. InDels are structural variations distributed abundantly throughout the genome, arising as a result of polymerase slippage, transposons, unequal crossing-over etc., that may sometimes lead to the gain/loss of function in the organism [ 12?15 ]. The most common categories of InDels involve single base pair insertion and deletion, monomeric base pair expansion and multi base pair expansion [ 11,16?17 ]. However, InDels containing random sequences and transposon insertions are comparatively less prevalent among genomes [11]. InDels are being used in a variety of applications including population genetics, taxon diagnostic markers, genetic map construction and association mapping in different crop plants viz. rice (Oryza sativa L.) [18], Arabidopsis (Arabidopsis thaliana) [ 19 ], barley (Hordeum vulgare) [ 20 ], tomato (Solanum lycopersicum L.) [ 21 ], pepper (Capsicum annuum) [ 22 ], Phaseolus (Phaseolus vulgaris L.) [ 23 ] and Brassica (Brassica rapa) [ 24 ] etc. Further, as InDels can be genotyped with simple gel based size separation procedures and the absence of stutter bands makes InDels more 2 / 13 valuable. In some of the previous studies, InDels were also found to be more polymorphic than microsatellite markers [ 24,25 ]. The availability of the whole genome sequence of chickpea [ 1, 26, 27 ] has resulted in vast genome information and further paved the way for various large scale resequencing initiatives ([ 28,29 ], unpublished data) making it easy to capture the variation existing among genotypes. With an objective to enhance the marker repertoire and develop the breeder friendly markers with limited infrastructure requirements in chickpea, the current study focuses on identification and development of InDels in five chickpea parental lines. Some of these randomly selected markers have been validated for their efficacy on agarose gel electrophoresis. Materials and methods DNA isolation A high throughput mini-DNA extraction method was standardized, with certain modifications from an earlier method [ 30 ]. In brief, the method involved following steps: harvested leaves from 15 days old seedlings, were grounded using steel balls in preheated (65?C) CTAB buffer (100 mM Tris-HCl (pH-8), 1.4 M NaCl, 20 mM EDTA, CTAB (2?3% w/v)) with GenoGrinder (Spex CertiPrep, USA) at 1500 rpm for 2 mins. Ground samples with CTAB buffer were incubated for 10 mins at 65?C. After bringing CTAB buffer mixed grounded sample to room temperature, it was subjected to solvent extraction by mixing an equal volume of chloroformisoamyl alcohol (24:1), followed by centrifugation at 5500 rpm for 10 mins. The aqueous phase was collected and DNA precipitation was done by adding 0.7 volume of isopropyl alcohol, and subject to a brief incubation at -80?C. DNA was precipitated by centrifugation of the the mixture at 5500 rpm for 10 mins. In order to purify the DNA, precipitated DNA was suspended in low TE buffer (10 mM Tris EDTA (pH-8)) and simultaneously was subjected to RNAse treatment (10 mg/ml) for 30 mins at 37?C. An equal volume of phenol-chloroform-isoamyl alcohol (25:24:1) was added to RNAse treated DNA and it was centrifuged. The aqueous phase collected was subject to DNA precipitation with 10% of Sodium acetate (3M NaOAc (pH-5.2)) and double volume of ethanol, followed by overnight incubation at -80?C. DNA was precipitated and washed with 70% ethanol. After drying at room temperature pellets were re-suspended in low-salt TE and stored at 4?C until further use. Estimation of quality and quantity of DNA was done using both agarose gel electrophoresis and spectrophotometer (Shimadzu UV160A, Japan). Resequencing and InDels screening The whole genome re-sequencing (WGRS) data on selected five parental lines of chickpea inter- and intra-specific populations (ICC 1882, ICC 4958, ICC 283, ICC8261, PI 489777) generated as part of an earlier study [ 28 ] were used for InDels identification in the current study. The WGRS data for these parental lines were first aligned against the chickpea reference genome [ 1 ] using the BWA software and using default parameters [ 31 ]. The aligned data in BAM format were then used for searching InDels using Dindel software [ 32 ] with default parameters for diploid species. Briefly, DinDel first extracts all the indels from BAM file and group them in the window of 150 bp. It then identifys the candidate haplotypes and realing the reads around these candidate indels. Finally, it produces a vcf file with indel calls and qualities. The snpEff software was used for InDel annotation. In order to develop the cost effective gel based markers, InDels with size 20 bp were considered. Primers for the flanking region of the identified InDels were designed using Primer3 [ 33 ]. 3 / 13 PCR amplification For validating identified InDels, PCR reaction mix was prepared for 10 ?l volume consisting of DNA 10 ng, 0.1 U of Taq polymerase (Kappa), 5X Taq buffer with 25 mm MgCl2, 5 ?M of forward and reverse primers, 2.5mM dNTPs. PCR amplifications were conducted with ABI thermal cycler (PE Applied Biosystems, CA) using a common series of touchdown PCR amplification thermal profile. A touch down PCR amplification thermal profile consisted of 3 min of initial denaturation cycle at 94?C, followed by 10 cycles of denaturation at 94?C for 20 sec, annealing at 60?C for 20 sec and extension at 72?C for 30 sec, with a 0.5 ?C decrease per cycle followed by 40 cycles of denaturation, annealing at 55?C and extension for the same duration as before with a final extension at 72?C for 20 min. Amplicons were resolved on 3% agarose gel electrophoresis, visualized and documented under UV light using a gel documentation system (Alpha Innotech gel documentation system). On the basis of banding pattern of intraspecific (ICC 283 ? ICC 8261 and ICC 4958 ? ICC 1882) and interspecific parental lines (ICC 4958 ? PI 489777) data were recorded. Results Sequence analysis and InDels identification Resequencing data of five parental lines consisted of >106 million high quality reads with a minimum of 14.97 million reads for ICC 1882 to maximum of 43.57 million reads for ICC 4958. This accounted for 79.76% alignment of high quality reads with the reference genome. Sequence data showed an average genome coverage of 81.68%, with maximum for ICC 4958 (84.04%) and minimum for PI 489777 (79.76%) with mean depth ranging from 6.21 (ICC 1882) to 14.26 (ICC 4958) [ 28 ]. Screening of data with Dindel resulted in identification of a total 2,31,658 InDels across selected parental lines (Fig 1a). Of these identified InDels, 52.88% Fig 1. Distribution of InDels identified in five parental (A- ICC 1882, B- ICC 4958, C- ICC 283, D- ICC 8261 and E- PI 489777) along the eight linkage groups of chickpea. (A) Circular representation of the distribution of insertions and deletions in the chickpea genome. (B) Comparative distribution of insertion and deletions among five parental lines. 4 / 13 (1,22,512) were attributed to insertions and 47.12% (1,09,146) were attributed to deletions category (Table 1, Fig 1b). In total 1,61,784 (69.84%) unique InDels across these five different parents were identified. It was interesting to note that the range of insertions (51.97?54.18%) and deletions (45.82?48.02%) were not found to vary significantly among all the selected parental line individually. Among individual parental lines, maximum number of InDels (49.66%) were observed in PI 489777, the wild chickpea (C. reticulatum) accession, followed by ICC 4958 with 38,180 (16.91%), whereas abundance of InDels in remaining three parental lines was found almost in similar range i.e. 10.83% to 11.43% (Fig 1b, Table 1). With increase in InDels size, a decrease in abundance of InDels was observed (Fig 2a and 2b). A large proportion of identified InDels was contributed by homozygous InDels (~91?97%) among parental lines except ICC 4958 where homozygous InDels accounted for competitively lower proportion (~54.5%) than rest of the parental lines (Table 2). Establishment of PCR based InDels resource In order to establish a PCR based marker resource, we filtered out InDels with <20 bp size. As a result, 8,307 InDels with 20 bp size could be obtained. A higher abundance of InDels, was observed on CaLG04 with 1,952 InDels and minimum number was observed on CaLG08 with 360 InDels (Fig 3a). In total, 80.64% of InDels were observed in intergenic regions followed by 17% in intronic regions. Overall 2.36% of InDels were found to fall in coding region (Fig 3b and 3c). Based on length variations among InDels existing in parental lines, a total of 2,687 (32.35%) and 2,524 (30.38%) InDels were found to be polymorphic between ICC 4958 ? ICC 1882 and ICC 283 ? ICC 8261 respectively. Similarly, 6,275 (75.54%) InDels were identified as polymorphic between ICC 4958 and PI 489777. In total, 852 (10.26%) InDels were found to be polymorphic across all three populations undertaken in the study (Fig 4). Primer pairs could be designed successfully for 7,523 (90.56%) InDels and are now available for the chickpea community as supplementary material for use in their respective chickpea programs (S1 Table). Validation of selected InDels In order to validate the identified InDels, a total of 423 InDels were selected randomly and primer pairs were synthesized. The selected 423 primer pairs were used for amplification on five selected parental lines (S2 Table). Approximately 80% primer pairs resulted in successful amplification and amplicons were found producing expected band size on agarose gel. In total, 276 InDels were found to be polymorphic among the parental lines of inter-specific and intraspecific crosses. Overall, we observed 331 (ICC 4958 ? PI 489777) to 343 (ICC 1882 ? ICC 4958) primer pairs that yielded amplicons (excluding non-specific, multiple bands etc.) (Table 3). Based on amplification, a high polymorphic rate of 46.06% (158 InDels) and 56.43% (193 InDels) was observed between intra-specific parental lines and a polymorphic rate of 58.01% (192 InDels) was observed between parental lines of the inter-specific mapping population (Table 3). The maximum number of markers showing polymorphism between parental lines of the intra-specific populations namely ICC 283 ? ICC 8261 and ICC 1882 ? ICC 4958, came from CaLG01 with a polymorphic rate of 68% and 71.15% respectively. In the case of parental lines of ICC 4958 ? PI 489777, maximum polymorphism rate was observed for InDels present on CaLG08 (85.71%) followed by InDels on CaLG01 that showed 78% polymorphism rate (Table 3). Validation results indicated InDels from CaLG06 attributed the least polymorphism rate (33.33%-43.08%) among intra-specific parental lines, whereas, for inter-specific parental lines, InDels from CaLG02 showed the minimum polymorphism rate (15.79%) (Table 3). 5 / 13 InDelmarkersinchickpea 6 3 0 7 1 7 8 2 a 9 4 9 6 5 6 7 8 R .3 .1 .3 .5 .3 .3 .0 .2 0 0 0 0 0 0 0 0 s P s C I s n o i t r e 8 0 7 1 5 3 1 9 3 s 2 7 8 2 1 29 ,63 43 ,4 1 3 ,0 ,2 ,4 ,1 , In ,1 1 1 5 1 1 1 6 1 4 9 6 3 6 2 0 5 0 9 8 6 5 9 9 9 1 .0 .0 .2 .0 .0 .0 .0 a .0 0 0 0 0 0 0 0 R s n 8 o 3 459 lite 14 41 71 75 11 847 ,105 94 ,816 e ,3 ,6 ,5 ,1 ,2 , C 1 1 6 1 2 2 7 1 C D 2 I s n io 7 tr 1 4 3 1 7 6 5 1 e 2 7 5 7 0 69 ,04 02 ,1 0 s 7 ,9 ,8 ,7 ,5 , In ,2 1 1 6 1 2 2 8 2 1 8 3 7 6 3 6 0 8 5 4 4 4 6 7 7 0 .0 .0 .1 .0 .0 .0 .0 a .0 0 0 0 0 0 0 0 .s R e n i l s l n a t 83 ito p I e v i f s f n 3 n 5 7 3 3 9 e 2 le 32 4 7 0 0 4 8 2 ,8 ra CC eD 18 95 74 34 10 17 16 52 11 o io 7 sp tr 3 3 0 6 9 7 96 e 1 0 10 ,50 63 ,3 u s 10 ,17 63 ,8 ,2 , ro In ,2 1 9 3 1 2 2 6 1 g eagk Ra .090 .058 .045 .147 .035 .056 .069 .073 n 0 0 0 0 0 0 0 0 1 li 0 t .t 0 h s 9 ig n 9 sseo 8182 liteo 6 8 3 9 051 1239 r e 01 61 07 ,36 61 ,50 ,35 46 ,1 .0 ca C D ,2 9 8 3 7 1 1 5 1 ls en s C e o le I s D l.p D n In a In ito 8 fo rnu iftoon Isren ,1352 ,6161 869 ,3843 269 ,3801 ,0861 636 ,9531 ecadn j/o1731 ibu p bun ./10 itsrD rgou iteav i.roog .le1b ieagkn a10LG a20LG a30LG a40LG a50LG a60LG a70LG a80LG ltao :leaRa :tt//spd a L C C C C C C C C T R h T 6/13 Fig 2. Relationship between InDels frequency and InDel lengths. (A) All length InDels frequency distribution in chickpea. (B) >20 bp InDels length distribution. Total Fig 3. (A) Distribution of InDels ( 20 bp) across the eight linkage groups of chickpea. (B) Distribution of InDels in different genomic regions of Chickpea. (C) distribution of InDels in coding region of chickpea. 7 / 13 Fig 4. Venn diagram reflecting number of polymorphic InDels between interspecific (ICC 4958 ? PI 489777) and intra-specific (ICC 283? ICC 8261 and ICC 4958 ? ICC 1882) chickpea parental lines. ICC 1882 ? ICC 4958 % Total amplified Polymorphic markers markers 68.00 52 37 63.89 36 21 52.17 48 17 60.71 56 27 45.24 41 15 43.08 66 22 66.67 37 16 63.64 7 3 56.43 343 158 ICC 4958 ? PI 489777 % Total amplified Polymorphic markers markers 71.15 50 39 58.33 38 6 35.42 45 24 48.21 51 36 36.59 41 25 33.33 65 40 43.24 34 16 42.86 7 6 46.06 331 192 Discussion In the era of NGS, a large number of genomic resources are being established. These resources have proven to be useful in enhancing genetic gains by implementation of GAB tools, and % 78.00 15.79 53.33 70.59 60.98 61.54 47.06 85.71 58.01 have successfully resulted in increased productivity in many crop plants [ 8,9 ]. In the case of chickpea, incorporation of molecular markers in improvement programmes has resulted in the development of some superior lines including lines with enhanced yield under rainfed conditions in JG 11 background [34], lines resistant to FW and AB in the genetic background of C 214 [ 35 ] and, lines resistant to FW in the genetic background of Pusa 256 [ 36 ]. Advanced molecular breeding approaches like genomic selection have already been initiated for yield related traits in chickpea [ 37,38 ]. In order to enhance the base of molecular markers available in chickpea, NGS technologies are being exploited and are aiding in the development of different types of user friendly markers across a vast range of applications [ 39,40 ]. Recently a study on flowering time in chickpea also identified one 11-bp deletion in the early flowering 3 (elf3) gene was found to be associated with early flowering in germplasm and successfully converted into KASP based InDel marker [ 41 ]. Although high-throughput genotyping platforms such as ?Axiom CicerSNP Array? have become available [ 42 ], high infrastructure/costs and the specialized manpower requirements associated with these technologies keep them out of reach for low technology laboratories in many developing countries. However, low infrastructure laboratories can deploy PCR based markers for genotyping their germplasm. Considering the high potential of InDels in comparison to SSRs and SNPs especially from the perspective of genotyping, we undertook resequencing data of five parental lines of interspecific and intra-specific RIL populations and screened InDels. The maximum number of InDels were found on CaLG04 and minimum number of InDels were observed at CaLG08 in four parental lines except wild chickpea PI 489777, which showed the lowest frequency of InDels on CaLG07 (Table 1). This observation is supported by previous studies reporting a large number of variants (SNPs and/or InDel) on pseudomolecule CaLG04 [ 28,29,43,44 ]. As per our study InDel markers have shown higher rate of polymorphism as compared to SSR markers and almost similar level when compared with SNP markers. For instance, genetic mapping study based on using SSR on mapping population of these parental lines (ICC 4958 ? ICC 1882 and ICC 8261 ? ICC 283) showed polymorphism rate of <10% [43] however InDel markers based on parental polymorpshism in present study showed polymorphism rate of ~50% (Table 3). Similarly, genotyping on these SNP markers on these population (ICC 4958 ? ICC 1882 and ICC 8261 ? ICC 283) resulted in polymorphism rate of ~40% [ 42 ]. With an increase in InDels size, a decrease in abundance of InDels was observed (Fig 2a and 2b). A negative relationship was observed between the InDels size and abundance which has also been observed in previous studies in different crop plants [ 45 ]. In addition, the cost for genotyping using InDel markers is comparatively less and provide a cost effective approach for genotyping. For instance, several SNP genotyping service provider also provide genotyping at comparative cost with InDel markers, however SNP genotyping services are cost effective only in case are being used with large volumes of samples. However genotyping cost for InDel markers increase linearly with number of samples and can be undertaken with using commonly available equipment in lab [ 46 ]. It was interesting to observe that a decline in number of InDel loci with an increase in InDel length was not perfectly symmetrical. Due to difference in the size of different linkage groups and to make parallel comparison of InDels across the eight linkage groups, the total number of InDels was normalized in order to assess relative abundance (Ra) which is a measure of InDels distribution per Kb length of each linkage group (Table 1). Among all parental lines, CaLG04 continued to show maximum Ra than the rest of the linkage groups among different parental lines ranging from 0.147 InDels/ Kb for ICC 1882 and ICC 283 to 0.567 InDels/Kb for PI 489777. Our results indicated that despite showing the minimum number of InDels on CaLG08, this linkage group did not account for the least Ra for any of the parental lines, which could be due to the smaller size of CaLG08 in comparison to other linkage groups. A high abundance of homozygous InDels was 9 / 13 observed in the current study. Similar high abundance of homozygous InDels have also been observed for human genomes [ 47 ]. The huge abundance of homozygous InDels could be due to artifact, misclassification of heterozygous InDels possibly because of allele dropouts and heterozygous InDels being missed, which has also been observed in the case of humans by [ 47 ]. In order to resolve amplicons appropriately on agarose gels, InDels with 20 bp size were selected leading to a drastic reduction in the number of InDels. As an objective of this study, PCR based InDels were developed by identifying InDel sites and designing the primers for the flanking region. This led to the development of a resource with 7,523 primer pairs (S1 Table). Primers could not be designed for few InDels where primer designing criteria were not met. InDels size differences existing among parental lines showed polymorphic potential of these markers in intra-specific and inter-specific RIL parental lines, which was further reaffirmed by validating randomly selected 423 primer pairs using agarose gel electrophoresis. Interestingly, some markers which were monomorphic in a particular population during in silico identification, were found to be polymorphic during PCR amplification and validation process. Such differences in polymorphism could be due to sequencing errors in the WGRS data which might have resulted in such InDel errors [ 21 ]. Similarly, reverse cases where InDels showing length difference in silico were found to be monomorphic during PCR and validation process were also observed. Such observations could be attributed to the incapability of gel based systems to resolve shorter InDels thus giving the impression as monomorphic markers. Non-amplification of certain primer pairs could be attributed to mismatch at primer site or existence of secondary structures of primers at annealing temperature leading to failed amplifications. Highest polymorphism was observed between the parental lines of the inter-specific population using both the PCR based approach as well as when comparing the InDels across the parental combinations, this can be explained by the more diverse genetic background of the wild and the cultivated line. Conclusion The wealth of sequencing data generated using NGS technologies has resulted in the identification of millions of genome-wide markers. As the second most common variations after SNPs, InDels have the capability to affect/modify the function of genes. Unlike SNPs, InDels can be used in regular laboratories without much infrastructure and therefore can serve as a user friendly and cost effective marker system with a better polymorphic rate comparative to SSRs. The present study reports a repository of more than 7,000 potential InDel markers that might play an important role in different genetic studies and can be exploited for chickpea improvement through GAB approaches. Utility of these markers has also been established by using randomly selected 423 markers on 5 different chickpea accessions. Supporting information S1 Table. Detailed primer pair profile for InDels with 20 bp size. (XLSX) S2 Table. Polymorphism data for InDels validated using PCR amplification and gel electrophoresis, in five parental line combinations. (XLSX) Acknowledgments The authors are thankful to Dr. Millicent Rose Smith for diligent proofreading of this MS. 10 / 13 Author Contributions Conceptualization: Manish Roorkiwal, Rajeev K. Varshney. Data curation: Ankit Jain, Sandip Kale, Vanika Garg, Ramakrishna Yadala. Formal analysis: Manish Roorkiwal, Sandip Kale, Vanika Garg. Funding acquisition: Rajeev K. Varshney. Investigation: Ankit Jain. Methodology: Ankit Jain, Sandip Kale, Ramakrishna Yadala. Project administration: Manish Roorkiwal. Resources: Rajeev K. Varshney. Supervision: Manish Roorkiwal, Rajeev K. Varshney. Writing ? original draft: Ankit Jain. Writing ? review & editing: Manish Roorkiwal, Rajeev K. Varshney. 11 / 13 18. 25. Wu K, Yang M, Liu H, Tao Y, Mei J, Zhao Y. Genetic analysis and molecular characterization of Chinese sesame (Sesamum indicum L.) cultivars using insertion-deletion (InDel) and simple sequence repeat (SSR) markers. BMC Genet. 2014; 15: 35. https://doi.org/10.1186/1471-2156-15-35 PMID: 24641723 12 / 13 1. Varshney RK , Song C , Saxena RK , Azam S , Yu S , Sharpe AG , et al. Draft genome sequence of chickpea (Cicer arietinum) provides a resource for trait improvement . Nat. Biotechnol . 2013 ; 31 : 240 - 246 . https://doi.org/10.1038/nbt.2491 PMID: 23354103 2. Croser JS , Ahmad F , Clarke HJ , Siddique KHM . Utilisation of wild Cicer in chickpea improvementprogress, constraints, and prospects . Crop Pasture Sci . 2003 ; 54 : 429 - 444 . 3. Jukanti AK , Gaur PM , Gowda CLL , Chibbar RN . Nutritional quality and health benefits of chickpea (Cicer arietinum L.): a review . Br. J. Nutr . 2012 ; 108 : S12 - S26 . 4. Gan Y , Johnston AM , Diane Knight J , McDonald C , Stevenson C. Nitrogen dynamics of chickpea: Effects of cultivar choice, N fertilization, Rhizobium inoculation, and cropping systems . Can. J. Plant Sci . 2010 ; 90 : 655 - 666 . 5. Saraf CS , Rupela OP , Hegde DM , Yadav RL , Shivkumar BG , Bhattarai S , et al. Biological nitrogen fixation and residual effects of winter grain legumes in rice and wheat cropping systems of the Indo-Gangetic Plain . In: Residual effects of legumes in rice and wheat cropping systems of the Indo-Gangetic plain , Kumar Rao JVDK , Johansen C , and Rego TJ, Eds., Oxford & IBH Publishing Co Pvt Ltd , New Delhi; 1998 ; pp. 14 - 30 . 6. Ramirez ML , Cendoya E , Nichea MJ , Zachetti VGL , Chulze SN . Impact of toxigenic fungi and mycotoxins in chickpea: a review . Curr. Opin. Food Sci . 2018 ; 23 : 32 - 37 . 7. Kashiwagi J , Krishnamurthy L , Purushothaman R , Upadhyaya HD , Gaur PM , Gowda CLL , et al. Scope for improvement of yield under drought through the root traits in chickpea (Cicer arietinum L.) . Field Crops Res . 2015 ; 170 : 47 - 54 . 8. Varshney RK , Graner A , Sorrells ME . Genomics-assisted breeding for crop improvement . Trends Plant Sci . 2005 ; 10 : 621 - 630 . https://doi.org/10.1016/j.tplants. 2005 . 10 .004 PMID: 16290213 9. Kole C , Muthamilarasan M , Henry R , Edwards D , Sharma R , Abberton M , et al. Application of genomics-assisted breeding for generation of climate resilient crops: progress and prospects . Front. Plant Sci . 2015 ; 6 : 563 . https://doi.org/10.3389/fpls. 2015 .00563 PMID: 26322050 10. Roorkiwal M , Jain A , Thudi M , Varshney RK . Advances in Chickpea Genomic Resources for Accelerating the Crop Improvement . In: The Chickpea Genome, Varshney et al. (eds.) , Springer International Publishing. 2017 pp. 53 - 68 . 11. Vasema?gi A , Gross R , Palm D , Paaver T , Primmer CR . Discovery and application of insertion-deletion (INDEL) polymorphisms for QTL mapping of early life-history traits in Atlantic salmon . BMC Genomics 2010 ; 11 : 156 . https://doi.org/10.1186/ 1471 -2164-11-156 PMID: 20210987 12. Lovett ST . Encoded errors: mutations and rearrangements mediated by misalignment at repetitive DNA sequences . Mol. Microbiol . 2004 ; 52 : 1243 - 1253 . https://doi.org/10.1111/j.1365- 2958 . 2004 . 04076 . x PMID : 15165229 13. Mullaney JM , Mills RE , Pittard WS , Devine SE . Small insertions and deletions (INDELs) in human genomes . Human Mol. Genet . 2010 ; 19 : R131 - R136 . 14. Terakami S , Matsumura Y , Kurita K , Kanamori H , Katayose Y , Yamamoto T , et al. Complete sequence of the chloroplast genome from pear (Pyrus pyrifolia): genome structure and comparative analysis . Tree Genet . Genomes 2012 ; 8 : 841 - 854 . 15. Rockah-Shmuel L , To?th-Petro?czy A?, Sela A , Wurtzel O , Sorek R , Tawfik DS . Correlated occurrence and bypass of frame-shifting insertion-deletions (InDels) to give functional proteins . PLoS Genet . 2013 ; 9: e1003882 . https://doi.org/10.1371/journal.pgen. 1003882 PMID: 24204297 16. Newman TL , Tuzun E , Morrison VA , Hayden KE , Ventura M , McGrath SD , et al. A genome-wide survey of structural variation between human and chimpanzee . Genome Res . 2005 ; 15 : 1344 - 1356 . https:// doi.org/10.1101/gr.4338005 PMID: 16169929 17. Mills RE , Luttig CT , Larkins CE , Beauchamp A , Tsui C , Pittard WS , et al. An initial map of insertion and deletion (INDEL) variation in the human genome . Genome Res . 2006 ; 16 : 1182 - 1190 . https://doi.org/ 10.1101/gr.4565806 PMID: 16902084 Wu DH , Wu HP , Wang CS , Tseng HY , Hwu KK . Genome-wide InDel marker system for application in rice breeding and mapping studies . Euphytica 2013 ; 192 : 131 - 143 . 19. P?curar DI , P?curar ML , Street N , Bussell JD , Pop TI , Gutierrez L , et al. A collection of INDEL markers for map-based cloning in seven Arabidopsis accessions . J. Exp. Bot . 2012 ; 63 : 2491 - 2501 . https://doi. org/10.1093/jxb/err422 PMID: 22282537 20. Zhou G , Zhang Q , Tan C , Zhang X , Li C . Development of genome-wide InDel markers and their integration with SSR, DArT and SNP markers in single barley map . BMC Genomics 2015 ; 16 : 804 . https://doi. org/10.1186/s12864-015 -2027-x PMID : 26474969 21. Yang J , Wang Y , Shen H , Yang W. In silico identification and experimental validation of insertion-deletion polymorphisms in tomato genome . DNA Res . 2014 ; 21 : 429 - 438 . https://doi.org/10.1093/dnares/ dsu008 PMID: 24618211 22. Li W , Cheng J , Wu Z , Qin C , Tan S , Tang X , et al. An InDel-based linkage map of hot pepper (Capsicum annuum) . Mol. Breed . 2015 ; 35 : 32 . https://doi.org/10.1007/s11032-015 -0219-3 PMID: 25620878 23. Moghaddam SM , Song Q , Mamidi S , Schmutz J , Lee R , Cregan P , et al. Developing market class specific InDel markers from next generation sequence data in Phaseolus vulgaris L. Front . Plant Sci . 2014 ; 5 : 185 . https://doi.org/10.3389/fpls. 2014 .00185 PMID: 24860578 24. Liu B , Wang Y , Zhai W , Deng J , Wang H , Cui Y , et al. Development of InDel markers for Brassica rapa based on whole-genome re-sequencing . Theor. Appl. Genet . 2013 ; 126 : 231 - 239 . https://doi.org/10. 1007/s00122-012 -1976-6 PMID: 22972202 26. Jain M , Misra G , Patel RK , Priya P , Jhanwar S , Khan AW , et al. A draft genome sequence of the pulse crop chickpea (Cicer arietinum L.) . Plant J . 2013 ; 74 : 715 - 729 . https://doi.org/10.1111/tpj.12173 27. Gupta S , Nawaz K , Parween S , Roy R , Sahu K , Kumar Pole A , et al. Draft genome sequence of Cicer reticulatum L., the wild progenitor of chickpea provides a resource for agronomic trait improvement . DNA Res . 2016 ; 24 : 1 - 10 . 28. Thudi M , Khan AW , Kumar V , Gaur PM , Katta K , Garg V , et al. Whole genome re-sequencing reveals genome-wide variations among parental lines of 16 mapping populations in chickpea (Cicer arietinum L.) . BMC Plant Biol . 2016 ; 16 : 10 . https://doi.org/10.1186/s12870-015 -0690-3 PMID: 26822060 29. Thudi M , Chitikineni A , Liu X , He W , Roorkiwal M , Yang W , et al. Recent breeding programs enhanced genetic diversity in both desi and kabuli varieties of chickpea (Cicer arietinum L.) . Sci. Rep . 2016 ; 6 : 38636 . https://doi.org/10.1038/srep38636 PMID: 27982107 30. Cuc LM , Mace ES , Crouch JH , Quang VD , Long TD , Varshney RK . Isolation and characterization of novel microsatellite markers and their application for diversity assessment in cultivated groundnut (Arachis hypogaea) . BMC Plant Biol . 2008 ; 8 : 55 . https://doi.org/10.1186/ 1471 -2229-8-55 PMID: 18482440 31. Li H , Durbin R . Fast and accurate short read alignment with Burrows-Wheeler Transform . Bioinformatics 2009 ; 25 : 1754 - 1760 . https://doi.org/10.1093/bioinformatics/btp324 PMID: 19451168 32. Albers CA , Lunter G , MacArthur DG , McVean G , Ouwehand WH , Durbin R. Dindel : accurate indel calls from short-read data . Genome Res . 2011 ; 21 : 961 - 973 . https://doi.org/10.1101/gr.112326.110 PMID: 20980555 33. Rozen S , Skaletsky H. Primer3 on the WWW for general users and for biologist programmers . Methods Mol. Biol . 2000 ; 132 : 365 - 386 . PMID: 10547847 34. Varshney RK , Gaur PM , Chamarthi SK , Krishnamurthy L , Tripathi S , Kashiwagi J , et al. Fast-track introgression of QTL-hotspot for root traits and other drought tolerance traits in JG 11, an elite and leading variety of chickpea . Plant Genome 2013 ; 6 : 1 -9 https://doi.org/10.3835/plantgenome2013. 07 .0022 35. Varshney RK , Mohan SM , Gaur PM , Chamarthi SK , Singh VK , Samineni S , et al. Marker-assisted backcrossing to introgress resistance to Fusarium wilt (FW) race 1 and Ascochyta blight (AB) in C 214, an elite cultivar of chickpea . Plant Genome 2014 ; 7 : 1 -11 https://doi.org/10.3835/plantgenome2013. 10 . 0035 36. Pratap A , Chaturvedi SK , Tomar R , Rajan N , Malviya N , Thudi M , et al. Marker-assisted introgression of resistance to fusarium wilt race 2 in Pusa 256, an elite cultivar of desi chickpea . Mol. Genet . Genomics 2017 ; 292 : 1237 - 1245 . https://doi.org/10.1007/s00438-017-1343-z PMID: 28668975 37. Roorkiwal M , Rathore A , Das RR , Singh MK , Jain A , Srinivasan S , et al. Genome-enabled prediction models for yield related traits in chickpea . Front. Plant Sci . 2016 ; 7 : 1666 . https://doi.org/10.3389/fpls. 2016 .01666 PMID: 27920780 38. Roorkiwal M , Jarquin D , Singh MK , Gaur PM , Bharadwaj C , Rathore A , et al. Genomic-enabled prediction models using multi-environment trials to estimate the effect of genotype ? environment interaction on prediction accuracy in chickpea . Sci. Rep . 2018 ; 8 : 11701 . https://doi.org/10.1038/s41598-018- 30027-2 39. Das S , Upadhyaya HD , Srivastava R , Bajaj D , Gowda CLL , Sharma S , et al. Genome-wide insertiondeletion (InDel) marker discovery and genotyping for genomics-assisted breeding applications in chickpea . DNA Res . 2015 ; 22 : 377 - 386 . https://doi.org/10.1093/dnares/dsv020 40. Srivastava R , Singh M , Bajaj D , Parida SK . A high-resolution InDel (insertion-deletion) markersanchored consensus genetic map identifies major QTLs governing pod number and seed yield in chickpea . Front. Plant Sci . 2016 ; 7 : 1362 . https://doi.org/10.3389/fpls. 2016 .01362 41. Ridge S , Deokar A , Lee R , Daba K , Macknight RC , Weller JL , et al. The chickpea early flowering 1 (Efl1) locus is an ortholog of Arabidopsis ELF3 . Plant Physiol . 2017 175: 802 - 815 https://doi.org/10. 1104/pp. 17 .00082 PMID: 28818860 42. Roorkiwal M , Jain A , Kale SM , Doddamani D , Chitikineni A , Thudi M , et al. Development and evaluation of high density SNP array (Axiom CicerSNP Array) for high resolution genetic mapping and breeding applications in chickpea . Plant Biotechnol. J . 2018 ; 16 : 890 - 901 . https://doi.org/10.1111/pbi.12836 43. Varshney RK , Thudi M , Nayak SN , Gaur PM , Kashiwagi J , Krishnamurthy L , et al. Genetic dissection of drought tolerance in chickpea (Cicer arietinum L.) . Theor. Appl. Genet . 2014 ; 127 : 445 - 462 . https://doi. org/10.1007/s00122-013 -2230-6 PMID: 24326458 44. Jaganathan D , Thudi M , Kale S , Azam S , Roorkiwal M , Gaur PM , et al. Genotyping-by-sequencing based intra-specific genetic map refines a ?QTL-hotspot? region for drought tolerance in chickpea . Mol. Gen. Genomics 2015 ; 290 : 559 - 571 . 45. Salathia N , Lee HN , Sangster TA , Morneau K , Landry CR , Schellenberg K , et al. Indel arrays: an affordable alternative for genotyping . Plant J . 2007 ; 51 : 727 - 737 . https://doi.org/10.1111/j. 1365 - 313X . 2007 . 03194 . x PMID : 17645438 46. Hou X , Li L , Peng Z , Wei B , Tang S , Ding M , et al. A platform of high-density INDEL/CAPS markers for map-based cloning in Arabidopsis . Plant J. 2010 ; 63 : 880 - 888 . https://doi.org/10.1111/j. 1365 - 313X . 2010 . 04277 . x PMID : 20561258 47. Jiang J , Gao Y , Hou Y , Li W , Zhang S , Zhang Q , et al. Whole-genome resequencing of Holstein bulls for Indel discovery and identification of genes associated with milk composition traits in dairy cattle . PloS One 2016 ; 11 : e0168946. https://doi.org/10.1371/journal.pone. 0168946 PMID: 28030618


This is a preview of a remote PDF: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0213999&type=printable

Ankit Jain, Manish Roorkiwal, Sandip Kale, Vanika Garg, Ramakrishna Yadala, Rajeev K. Varshney. InDel markers: An extended marker resource for molecular breeding in chickpea, PLOS ONE, 2019, DOI: 10.1371/journal.pone.0213999