Skip to main content

Utility of EST-derived SSR in cultivated peanut (Arachis hypogaea L.) and Arachiswild species



Lack of sufficient molecular markers hinders current genetic research in peanuts (Arachis hypogaea L.). It is necessary to develop more molecular markers for potential use in peanut genetic research. With the development of peanut EST projects, a vast amount of available EST sequence data has been generated. These data offered an opportunity to identify SSR in ESTs by data mining.


In this study, we investigated 24,238 ESTs for the identification and development of SSR markers. In total, 881 SSRs were identified from 780 SSR-containing unique ESTs. On an average, one SSR was found per 7.3 kb of EST sequence with tri-nucleotide motifs (63.9%) being the most abundant followed by di- (32.7%), tetra- (1.7%), hexa- (1.0%) and penta-nucleotide (0.7%) repeat types. The top six motifs included AG/TC (27.7%), AAG/TTC (17.4%), AAT/TTA (11.9%), ACC/TGG (7.72%), ACT/TGA (7.26%) and AT/TA (6.3%). Based on the 780 SSR-containing ESTs, a total of 290 primer pairs were successfully designed and used for validation of the amplification and assessment of the polymorphism among 22 genotypes of cultivated peanuts and 16 accessions of wild species. The results showed that 251 primer pairs yielded amplification products, of which 26 and 221 primer pairs exhibited polymorphism among the cultivated and wild species examined, respectively. Two to four alleles were found in cultivated peanuts, while 3–8 alleles presented in wild species. The apparent broad polymorphism was further confirmed by cloning and sequencing of amplified alleles. Sequence analysis of selected amplified alleles revealed that allelic diversity could be attributed mainly to differences in repeat type and length in the microsatellite regions. In addition, a few single base mutations were observed in the microsatellite flanking regions.


This study gives an insight into the frequency, type and distribution of peanut EST-SSRs and demonstrates successful development of EST-SSR markers in cultivated peanut. These EST-SSR markers could enrich the current resource of molecular markers for the peanut community and would be useful for qualitative and quantitative trait mapping, marker-assisted selection, and genetic diversity studies in cultivated peanut as well as related Arachis species. All of the 251 working primer pairs with names, motifs, repeat types, primer sequences, and alleles tested in cultivated and wild species are listed in Additional File 1.


Cultivated peanut (Arachis hypogaea L.) is grown on 25.5 million hectares with a total global production of about 35 million tons. It is an allotetraploid (2n = 4× = 40) and belongs to Arachis genus, which can be grouped into nine sections and includes approximately 80 species [1]. A large amount of morphological and agronomic variation is evident among accessions of cultivated peanuts, but extremely low levels of polymorphism were observed using restriction fragment length polymorphism (RFLP), randomly amplified polymorphic DNA (RAPD) and amplified fragment length polymorphisms (AFLP) [25]. Only simple-sequence repeats (SSRs) showed a potential for use in genetic studies of cultivated peanuts [611]. However it is expensive, labor-intensive and time-consuming to develop SSR markers from genomic DNA libraries [12]. To date, the number of available SSRs is grossly inadequate for mapping studies. Although several peanut genetic maps have been published [1316], the existing maps do not have sufficient markers to be highly useful for genetic studies. Thus, there is great need for development of novel SSR markers.

Recently, EST-SSRs have received much attention as the increasing amounts of ESTs being deposited in databases for various plants [1719]. EST-SSR can be rapidly developed from EST database by data mining at low cost, and due to their existence in transcribed region of genome, they can lead to the development of gene-based maps which may help to identify candidate function genes and increase the efficiency of marker-assisted selection [20]. In addition, EST-SSRs show a higher level of transferability to closely related species than genomic SSR markers [17, 21] and can be served as anchor markers for comparative mapping and evolutionary studies [22]. Similar advantages of EST-SSRs have been reported for a number of plant species, such as grape [17], Medicago species [23], soybean [24], sugarcane [25], maize [18, 19, 24, 26], rice [18, 2729], rye [2931], and wheat [27, 32, 33], indicating that EST-SSR markers have potential for use in peanut genetic studies.

In peanut, only two studies described the development of EST-SSR in cultivated peanut and wild species [34, 35]. Luo et al (2005) developed 44 EST-SSR markers from 1,350 cultivated peanut ESTs, nine of which exhibited polymorphism among 24 cultivated peanut lines. Proite et al (2007) developed 188 EST-SSRs from 8,785 A. stenosperma (Arachis species) ESTs, of which, 21 were polymorphic for an AA genome mapping population and 4 for a range of cultivated peanut genotypes. In this study, we screened a much larger number of ESTs (24, 238) from cultivated peanut with the following objectives: (1) to analyze the frequency and distribution of SSRs in transcribed regions of cultivated peanut genome; (2) to assess the validity of developed EST-SSR markers for detection of the polymorphism in cultivated peanut genotypes and their transferability to related wild species; (3) to develop new EST-SSR markers for both cultivated peanut and wild species.


Type and frequency of peanut EST-SSRs

A total of 24,238 ESTs with an average length of 550 bp were used to evaluate the presence of SSR motifs. To eliminate redundant sequences and improve the sequence quality, the TIGR Gene Indices Clustering Tools (TGICL) [36] was employed to obtain consensus sequences from overlapping clusters of ESTs. A cluster was defined here as a group of overlapping EST sequences (at least 50 nucleotides with 90% identity and unmatched length less than 20 nucleotides). Totally, 11,431 potential unique ESTs including 1,434 contigs and 9,997 singletons were generated. As shown in Table 1, a total of 881 SSRs were identified from 780 unique ESTs, with an average of one SSR per 7.3 kb. Of those, 85 (about 10.9%) ESTs contained more than one SSR and 59 (about 7.6%) were compound SSRs that have more than one repeat type. Analysis of SSR motifs revealed that the proportion of SSR unit sizes was not evenly distributed. The occurrences of different repeat units were tri- (63.9%), di- (32.7%), tetra- (1.7%), penta- (0.7%), and hexa-nucleotide (1.0%). The mean SSR length of each unit varied between 18 and 37 bp. The overall average SSR length was 20 bp with a maximum of 86 bp di-nucleotide repeat (AG/CT). A total of 27 SSR motifs were listed in Table 2. The AG/CT was the most frequent motif and accounted for 27.7%, followed by AAG/TTC (17.37%), AAT/TTA (11.9%), ACC/TGG (7.7%), ACT/TGA (7.26%) and AT/TA (6.3%). The remaining motifs presented a frequency of 23.3%. GC-only repeat was not observed.

Table 1 Summary of SSR search after sequences assembled and categorized
Table 2 Occurrence and number of repeats of 27 SSR motifs in cultivated peanut (Arachis hypogaea L.)

Primer design and validation

Among the 780 SSR-containing ESTs, 490 did not qualify for primer design as the flanking sequences were too short or poor quality. Therefore, only 290 primer pairs were designed and employed for validation of genic SSR markers (Table 3). Of these EST-SSRs, 65, 178 and 47 were observed in 5' untranslated terminal regions (UTR), translated regions and 3' UTR, respectively. After optimization, 251 primer pairs (86.5%) were successfully amplified in all cultivated peanut and wild species tested (Table 3), while the rest failed to give PCR products at various annealing temperature and Mg2+ concentrations. Out of 251 working primer pairs, 182 amplified the expected size of amplicons, 41 yielded PCR products larger than expected, revealing that an intron is inside the amplicons, and the amplified products of the remaining 28 primer pairs were smaller than expected, suggesting the occurrence of deletion within the genomic sequences or a lack of specificity (Additional File 1).

Table 3 Characteristics of cultivated peanut (Arachis hypogaea L.) EST-SSR and efficiency of markers development

EST-SSR polymorphism

In the present study, 251 valid EST-SSR primer pairs were used for assessment of the polymorphism among cultivated and wild Arachis species. Within cultivated peanuts, 26 (10.3%) EST-SSRs exhibited polymorphism (Table 3). A total of 55 alleles were detected and the average number of alleles per SSR marker was 2.1 with a range of 2–4 alleles based on the dominant scoring of the SSR bands characterized by the presence or absence of a particular band (Additional File 1). The PIC values ranged from 0.09 to 0.69 with an average value of 0.33. The greatest variation of SSR alleles was found for EM-78, which interacted with 4 alleles in 22 cultivated peanuts genotypes and the PIC value was 0.69.

The polymorphism of 251 cultivated peanut-derived EST-SSR in 16 accessions of wild species was evaluated. The results showed that 221 of 251 EST-SSR loci (88%) were polymorphic (Table 3), with a total of 867 alleles (Additional File 1). Allelic diversity was estimated for those polymorphic EST-SSR markers. The number of alleles detected among 16 wild species ranged from 2 to 9, with an average of 3.9 alleles per locus (Additional File 1). A maximum of 9 alleles were observed for primer EM-71. The PIC values varied between 0.594 and 0.820 with an average value of 0.721.

Sequence comparison of SSR bands

For further understanding of the EST-SSR polymorphism at the nucleotide level, the amplified products of primer EM-31 from two genotypes of cultivated peanuts and three accessions of wild species were cloned and sequenced (Figure 1, Figure 2). All the sequenced alleles from both cultivars and wild species were highly identical to the original locus (EST sequence) from which the EST-SSR marker EM-31 was mined. Sequence alignment showed that all the primer-binding regions are highly conserved. Allelic diversity could be attributed mainly to differences in repeat type and length in the microsatellite regions, although some variations such as repeat number or insertions of additional motifs were observed in the microsatellite regions. In addition, a few single base substitutions were observed in the microsatellite flanking regions. Out of them, one occurred in A. cardenasii, one in A. duranensis, and two in A. pintoi.

Figure 1

Polyacrylamide gel electrophoresis patterns of microsatellite alleles amplified with the primer EM-31. The bands indicated by the arrows were sequenced. M represents the DNA molecular weight marker, and 1–38 represent PI 393531 (1), PI 390693 (2), Qiongshanhuasheng (3), Liaoningsilihong (4), Dedou (5), Guangliu (6), Sanyuening (7), Yueyou 20 (8), Spancross (9), Tennessee Red (10), Xiaoliuqiu (11), Yangjiangpudizan (12), Xihuagoudo (13), Padou (14), Bo-50 (15), Yingdejidouzai (16), Heyuanbanman (17), Tuosunxiaohuasheng (18), Sunoleic 97R (19), Tifrunner (20), Georgia Green (21), NC940-22 (22), A. villosa (23), A. stenosperma (24), A. correntina (25), A. cardenasii (26), A. magna (27), A. duranensis (28), A. chacoensis (29), A. batizocoi (30), A. helodes (31), A. monticola (32), A. pintoi (33), A. paraguariensis (34), A. pusilla (35), A. rigonii (36), A. appressipila (37), A. glabrata (38).

Figure 2

Alignment of sequences obtained from five SSR bands amplified by EM-31 primers and original SSR-derived EST sequence(EM-31). Primer sequences are indicated by underlined arrows. Repetitive sequences are indicated in dashed box. Point mutations and indel regions are marked by box with solid line.


Frequency and distribution of EST-SSRs

The frequency of SSRs in SSR-ESTs more accurately reflects the density of SSRs in the transcribed region of the genome. However, random sequencing within cDNA libraries usually resulted in a high proportion of redundant ESTs. In this study, to reduce the dataset size and avoid overestimation of the EST-SSR frequency, SSR search were performed following redundancy elimination. A total of 11,432 potential unique EST sequences (about 6.4 Mb) were used for SSR search and 6.8% (780) of ESTs contained specified SSR motifs, generating 881 unique SSRs. This is a relatively higher abundance of SSRs for peanut ESTs, compared to the previous reports for maize (1.4%), barley (3.4%), wheat (3.2%), soyghum (3.6%), rice (4.7%) [18], Medicago truncatula (3.0%) [23] and wild Arachis species [34]. The different abundance of SSRs was known to be dependent on the SSR search criteria, the size of the dataset, the database-mining tools and different species [22]. In this work, the frequency of occurrence for EST-derived SSRs was one EST-SSR in every 7.3 kb. In previous reports, an EST-SSR occurs every 13.8 kb in Arabidopsis thaliana, 3.4 kb in rice, 8.1 kb in maize, 7.4 kb in soybean, 11.1 kb in tomato, 20.0 kb in cotton and 14.0 kb in poplar [37]. The variations of frequencies among different studies were mainly due to the criteria used to identify SSR in the database mining.

In earlier reports, tri-nucleotide repeats were generally the most common motif found in both monocots [22] and dicots [23]. During the process of mining EST-SSRs in the various plant species, tri-nucleotide was also observed to be most frequent [26], regardless of the EST-SSR search criteria. Until now, only one report described that di-nucleotide repeats were most abundant followed by tri- or mono-nucleotide repeats in dicots [38]. In the present investigation, tri-nucleotide repeat was found to be abundant followed by di-nucleotide. In term of single SSR motif, the di-nucleotide motif (AG/TC) n was highest frequent [18, 39]. Among the di-nucleotide motifs, the two most dominant motif types were AG and AT, representing an average frequency of 24.7% and 6.4%, respectively. This was in agreement with recent studies in cultivated peanut (Arachis hypogaea L.) [35] and wild Arachis species [34]. In this work, the AAG with 17.4% of frequency following di-nucleotide motif AG was the most abundant in the ten tri-nucleotide motifs. In other plant species, the most frequent tri-nucleotide repeat motifs were (AAC/TTG) n in wheat, (AGG/TCC) n in rice, (CCG/GGC) n in maize, (AAG/TTC) n in soybean, and (CCG/GGC) n in barley and sorghum [18, 19, 39, 40]. The previous studies of Arabidopsis [37] and soybean [24] also suggested that the tri-nucleotide AAG motif may be common motif in dicots. In contrast, the abundance of the tri-nucleotide CCG repeat motif was favored overwhelmingly in cereal species [18, 19, 32] and also considered as a specific feature of monocot genome, which may be due to increasing the G + C content [26].

Validation and polymorphism of EST-SSR markers

In this study, a total of 290 designed primer pairs were used for validation of the EST-SSR markers. Of these, 251 (86.5%) yielded amplicons in both cultivated peanut and wild species. This result was similar to previous studies in which a success rate of 60–90% amplification has been reported [21, 25, 4042]. In those studies, they also reported a similar success rate of amplification for both genomic SSRs and EST-SSRs. However, EST-SSRs were reported to be less polymorphic than genomic SSRs in crop plants due to greater DNA sequence conservation in transcribed regions [17, 28, 4346]. Previous studies highlighted the fact that EST-SSR markers have higher transferability and better applicability than genomic SSR markers [17, 4749]. In addition to high transferability, EST-SSRs were good candidates for the development of conserved orthologous markers for genetic analysis and breeding of different species [22]. Pervious reports showed that the transferability of EST-SSRs from one species to another ranged from 40–89% [21, 23, 24, 27, 29, 40, 41, 50, 51]. Our results indicated that 100% of EST-SSR amplifiable primers for cultivated peanut can produce amplicons in Arachis wild species.

In the present investigation, the mean percentage of polymorphic loci of EST-SSR markers was 9.96% in cultivated peanuts. This value was lower than those of genomic SSR found in earlier studies [7, 12, 47], but higher than the percentage of polymorphic loci in cultivated peanut observed using RAPD (6.6%) [5] and AFLP (6.7%) [4]. No major difference was observed in terms of allele numbers and PIC values for the EST-SSR markers among the cultivated genotypes, while significant difference was observed among wild species. Therefore, the low level of EST-SSR polymorphism detected in cultivated peanuts may be compensated by their higher potential for cross-species transferability to wild species. In the present study, 100% transferability of EST-SSR with 86.6% polymorphism from cultivated peanut to wild Arachis species was observed. The value is higher than that of genomic SSR cross-transferability [10]. The high level of transferability indicated that these markers would be highly effective for molecular study of Arachis species. Since current molecular markers display a low level of genetic polymorphism in cultivated peanuts [24, 6, 52, 53], it is difficult to construct a high-density genetic linkage map for cultivated peanut which could be used in breeding programs. However, a genetic map constructed using wild species together with transferable molecular markers derived from cultivated peanuts would contribute to understanding the introgression of genes from wild species to cultivated peanuts [10, 13]. Therefore, the development of a set of transferable EST-SSR markers from cultivated peanuts will be a great benefit to construct a high-density genetic map of wild species. The map would allow the identification of markers, especially transferable EST-SSR markers, associated with resistance or other agronomic traits in wild species, and in turn, help to discover corresponding markers or genes in cultivated peanuts.

Additionally, a comparison of sequences of cross-species amplicons generated by primer EM-31 further confirmed the conservation and transferability of the developed EST-SSR loci. In general, the amplified regions were found to be similar to the original peanut EST sequences from which the SSRs were developed and their comparisons across species (Figure 2) correlated the observed 'cross-species alleles' precisely with the expected length variations. Furthermore, in addition to the variation of the number of SSR repeat, the allele sequences also indicated that a few additional point mutation in the SSR motifs flanking regions. Similar variation has been reported in earlier studies [39, 47, 54, 55]. This phenomenon is supposed to be the innate evolving nature of the genome, and thus can be indicative of the evolutionary relationships of the tested taxa [47].


EST-SSR markers developed in this study will complement the genomic SSR markers and provide a valuable resource for linkage mapping, gene and QTL identification, and marker-assisted selection in peanut genetic study. Since these markers were developed based on expressed sequence and they are conserved across Arachis genus, they may be valuable for comparative genome mapping and functional analysis of candidate genes. In addition, these markers may be potentially useful for study of pods traits because majority of these EST sequences were derived from pods at three developmental stages.


Plant materials and DNA extraction

In the present study, twenty-two accessions of cultivated peanut(A.hypogaea L.) corresponding to two subspecies (hypogaea and fastigiata) and sixteen accessions of wild species from seven sections of the genus Arachis were used (Additional File 2). The leaf samples of each accession were collected from Peanut Germplasm Bank located in Crops Research Institute, Guangdong Academy of Agriculture, Guangzhou, China. the genomic DNA was extracted as described by Sharma [56].

Data mining for SSR marker

A total of 24,238 EST sequences including 20,160 developed by Guo et al (2008)[57] and 4078 retrieved from National Center of Biotechnology Information (NCBI) were used in this study. These ESTs were assembled using the TGICL program [36]. A Perl script known as MIcroSAtellite (MISA was used to mine microsatellites. In this work, SSRs were considered for primer design that fitted the following criteria: a minimum length of 14 bp, excluding polyA and polyT repeat, at least 7 repeat units in case of di-nucleotide and at least 5 repeat units for tri-, tetra-, penta- and hexa-nucleotide SSRs. Therefore, the paired numbers representing SSR motif length and the minimum repeat number in the MISA configuration file (misa.ini) were modified to 2–7, 3–5, 4–5, 5-5 and 6-5 (mono-type excluded).

Primer design and PCR amplification

Using Primer Premier 5 program (Whitehead Institute for Biomedical Research, Cambridge, Mass), primers were designed based on the following core criteria: (1) melting temperature (Tm) between 52°C and 63°C with 60°C as optimum; (2) product size ranging from 100 bp to 350 bp; (3) primer length ranging from 18 bp to 24 bp with amplification rate larger than 80%; (4) GC% content between 40% and 60%. The parameters were modified when unsuitable primer pairs were retrieved by the program. PCR analysis was performed in a total volume of 20 μl with the following cycling profile: 1 cycle of 5 min at 94°C, an annealing temperature of 55°C for 35 cycles (1 min at 94°C, 30 s at 55°C, 45 s at 72°C) and an additional cycle of 10 min at 72°C. Each of the primer pairs was screened twice to confirm the repeatability of the observed bands in each genotype. PCR products were separated on 6% polyacrylamide denaturing gels. The gels were silver stained for SSR bands detection.

Sequencing of PCR bands

The SSR alleles amplified in two cultivars and three wild species for EM-31 primer were individually cloned and sequenced. PCR amplification products were separated by 6% polyacrylamide gel and target allele bands were excised and dipped in 10 μl of nuclease free water for 30 min. Another round of PCR was made following the same protocol with recycled DNA as template. The second-round PCR products were separated in a 2% agarose gel and the target band was purified using TIANGEN Gel Extracting Kit (TIANGEN Inc. Beijing China). The purified PCR fragment from agarose gel was cloned using the Takara TA cloning kit pMD-18 (Takara, Dalian, China). The ligation product was transformed into competent Escherichia coli cells. The positive clones identified by PCR were sequenced by Invitrogen Company. The final edited sequences belonging to different genotypes were compared with the original SSR containing EST sequence using Omiga program [58], and the exported multiple sequence alignment was modified by Genedoc

Data scoring and statistical analysis

The allelic and genotypic frequencies were calculated for the samples analyzed. The genetic diversity of the samples as a whole was estimated based on the number of alleles per locus (total number of alleles/number of loci), the percentage of polymorphic loci (number of polymorphic loci/total number of loci analyzed) and polymorphism information content (PIC). The polymorphism was determined according to the presence or absence of the SSR locus. The value of PIC was calculated using the formula

where P i is the frequency of an individual genotype generated by a given EST-SSR primer pair and summation extends over n alleles.


  1. 1.

    Krapovickas A, Gregory WC: Taxonomia del genero Arachis (Leguminosae). Bonplandia. 1994, 8: 1-186.

    Google Scholar 

  2. 2.

    Halward TM, Stalker HT, Larue EA, Kochert G: Genetic variation detectable with molecular markers among unadapted germplasm resources of cultivated peanut and related wild species. Genome. 1991, 34: 1013-1020.

    Article  Google Scholar 

  3. 3.

    Kochert G, Halward T, Branch WD, Simpson CE: RFLP variability in peanut (Arachis hypogaea L.) cultivars and wild species. Theor Appl Genet. 1991, 81 (5): 565-570. 10.1007/BF00226719.

    PubMed  Article  Google Scholar 

  4. 4.

    He GH, Prakash CS: Identification of polymorphic DNA markers in cultivated peanut (Arachis hypogaea L.). Euphytical. 1997, 97 (2): 143-149. 10.1023/A:1002949813052.

    Article  Google Scholar 

  5. 5.

    Subramanian V, Gurtu S, Nageswara Rao RC, Nigam SN: Identification of DNA polymorphism in cultivated groundnut using random amplified polymorphic DNA (RAPD) assay. Genome. 2000, 43 (4): 656-660. 10.1139/gen-43-4-656.

    PubMed  Article  Google Scholar 

  6. 6.

    Hopkins MS, Casa AM, Wang T, Mitchell SE, Dean RE, Kochert GD, Kresovich S: Discovery and characterization of polymorphic simple sequence repeats (SSRs) in peanut. Crop Sci. 1999, 39 (4): 1243-1247.

    Article  Google Scholar 

  7. 7.

    Ferguson ME, Burow MD, Schulze SR, Bramel PJ, Paterson AH, Kresovich S, Mitchell S: Microsatellite identification and characterization in peanut (A. hypogaea L.). Theor Appl Genet. 2004, 108 (6): 1064-1070. 10.1007/s00122-003-1535-2.

    PubMed  Article  Google Scholar 

  8. 8.

    Moretzsohn Mde C, Hopkins MS, Mitchell SE, Kresovich S, Valls JF, Ferreira ME: Genetic diversity of peanut (Arachis hypogaea L.) and its wild relatives based on the analysis of hypervariable regions of the genome. BMC Plant Biol. 2004, 4: 11-10.1186/1471-2229-4-11.

    PubMed  Article  Google Scholar 

  9. 9.

    Tang R, Gao G, He L, Han Z, Shan S, Zhong R, Zhou C, Jiang J, Li Y, Zhuang W: Genetic Diversity in Cultivated Groundnut Based on SSR Markers. J Genet Genomics. 2007, 34 (5): 449-459. 10.1016/S1673-8527(07)60049-6.

    PubMed  Article  Google Scholar 

  10. 10.

    Gimenes MA, Hoshino AA, Barbosa AV, Palmieri DA, Lopes CR: Characterization and transferability of microsatellite markers of the cultivated peanut (Arachis hypogaea L.). BMC Plant Biol. 2007, 7: 9-10.1186/1471-2229-7-9.

    PubMed  PubMed Central  Article  Google Scholar 

  11. 11.

    He G, Meng RH, Gao H, Guo B, Gao G, Newman M, Pittman RN, Prakash CS: Simple sequence repeat markers for botanical varieties of cultivated peanut (Arachis hypogaea L.). Euphytical. 2005, 142 (1): 131-136. 10.1007/s10681-005-1043-3.

    Article  Google Scholar 

  12. 12.

    He G, Meng RH, Newman M, Gao GQ, Pittman RN, Prakash CS: Microsatellites as DNA markers in cultivated peanut (Arachis hypogaea L.). BMC Plant Biol. 2003, 3: 3-10.1186/1471-2229-3-3.

    PubMed  PubMed Central  Article  Google Scholar 

  13. 13.

    Moretzsohn MC, Leoi L, Proite K, Guimaraes PM, Leal-Bertioli SC, Gimenes MA, Martins WS, Valls JF, Grattapaglia D, Bertioli DJ: A microsatellite-based, gene-rich linkage map for the AA genome of Arachis (Fabaceae). Theor Appl Genet. 2005, 111 (6): 1060-1071. 10.1007/s00122-005-0028-x.

    PubMed  Article  Google Scholar 

  14. 14.

    Halward T, Stalker HT, Kochert G: Development of an RFLP linkage map in diploid peanut species. Theor Appl Genet. 1993, 87 (3): 379-384. 10.1007/BF01184927.

    PubMed  Article  Google Scholar 

  15. 15.

    Burow MD, Simpson CE, Starr JL, Paterson AH: Transmission genetics of chromatin from a synthetic amphidiploid to cultivated peanut (Arachis hypogaea L.) broadening the gene pool of a monophyletic polyploid species. Genetics. 2001, 159 (2): 823-837.

    PubMed  PubMed Central  Google Scholar 

  16. 16.

    Varshney RK, Bertioli DJ, Moretzsohn MC, Vadez V, Krishnamurthy L, Aruna R, Nigam SN, Moss BJ, Seetha K, Ravi K, et al: The first SSR-based genetic linkage map for cultivated groundnut (Arachis hypogaea L.). Theor Appl Genet. 2009, 118 (4): 729-739. 10.1007/s00122-008-0933-x.

    PubMed  Article  Google Scholar 

  17. 17.

    Scott KD, Eggler P, Seaton G, Rossetto M, Ablett EM, Lee LS, Henry RJ: Analysis of SSRs derived from grape ESTs. Theor Appl Genet. 2000, 100 (5): 723-726. 10.1007/s001220051344.

    Article  Google Scholar 

  18. 18.

    Kantety RV, La Rota M, Matthews DE, Sorrells ME: Data mining for simple sequence repeats in expressed sequence tags from barley, maize, rice, sorghum and wheat. Plant Mol Biol. 2002, 48 (5–6): 501-510. 10.1023/A:1014875206165.

    PubMed  Article  Google Scholar 

  19. 19.

    Varshney RK, Thiel T, Stein N, Langridge P, Graner A: In silico analysis on frequency and distribution of microsatellites in ESTs of some cereal species. Cell Mol Biol Lett. 2002, 7 (2A): 537-546.

    PubMed  Google Scholar 

  20. 20.

    Gupta PK, Rustgi S: Molecular markers from the transcribed/expressed region of the genome in higher plants. Funct Integr Genomics. 2004, 4 (3): 139-162. 10.1007/s10142-004-0107-0.

    PubMed  Article  Google Scholar 

  21. 21.

    Saha MC, Mian MA, Eujayl I, Zwonitzer JC, Wang L, May GD: Tall fescue EST-SSR markers with transferability across several grass species. Theor Appl Genet. 2004, 109 (4): 783-791. 10.1007/s00122-004-1681-1.

    PubMed  Article  Google Scholar 

  22. 22.

    Varshney RK, Graner A, Sorrells ME: Genic microsatellite markers in plants: features and applications. Trends Biotechnol. 2005, 23 (1): 48-55. 10.1016/j.tibtech.2004.11.005.

    PubMed  Article  Google Scholar 

  23. 23.

    Eujayl I, Sledge MK, Wang L, May GD, Chekhovskiy K, Zwonitzer JC, Mian MA: Medicago truncatula EST-SSRs reveal cross-species genetic markers for Medicago spp. Theor Appl Genet. 2004, 108 (3): 414-422. 10.1007/s00122-003-1450-6.

    PubMed  Article  Google Scholar 

  24. 24.

    Gao L, Tang J, Li H, Jia J: Analysis of microsatellites in major crops assessed by computational and experimental approaches. Mol Breed. 2003, 12 (3): 245-261. 10.1023/A:1026346121217.

    Article  Google Scholar 

  25. 25.

    Cordeiro GM, Casu R, McIntyre CL, Manners JM, Henry RJ: Microsatellite markers from sugarcane (Saccharum spp.) ESTs cross transferable to erianthus and sorghum. Plant Sci. 2001, 160 (6): 1115-1123. 10.1016/S0168-9452(01)00365-X.

    PubMed  Article  Google Scholar 

  26. 26.

    Morgante M, Hanafey M, Powell W: Microsatellites are preferentially associated with nonrepetitive DNA in plant genomes. Nat Genet. 2002, 30 (2): 194-200. 10.1038/ng822.

    PubMed  Article  Google Scholar 

  27. 27.

    Yu JK, La Rota M, Kantety RV, Sorrells ME: EST derived SSR markers for comparative mapping in wheat and rice. Mol Genet Genomics. 2004, 271 (6): 742-751. 10.1007/s00438-004-1027-3.

    PubMed  Article  Google Scholar 

  28. 28.

    Cho YG, Ishii T, Temnykh S, Chen X, Lipovich L, McCouch SR, Park WD, Ayres N: Diversity of microsatellites derived from genomic libraries and GenBank sequences in rice (Oryza sativa L.). Theor Appl Genet. 2000, 100 (5): 713-722. 10.1007/s001220051343.

    Article  Google Scholar 

  29. 29.

    Varshney RK, Sigmund R, Börner A, Korzun V, Stein N, Sorrells ME, Langridge P, Graner A: Interspecific transferability and comparative mapping of barley EST-SSR markers in wheat, rye and rice. Plant Sci. 2005, 168 (1): 195-202. 10.1016/j.plantsci.2004.08.001.

    Article  Google Scholar 

  30. 30.

    Dracatos PM, Dumsday JL, Olle RS, Cogan NO, Dobrowolski MP, Fujimori M, Roderick H, Stewart AV, Smith KF, Forster JW: Development and characterization of EST-SSR markers for the crown rust pathogen of ryegrass (Puccinia coronata f.sp. lolii). Genome. 2006, 49 (6): 572-583. 10.1139/G06-006.

    PubMed  Article  Google Scholar 

  31. 31.

    Kuleung C, Baenziger PS, Dweikat I: Transferability of SSR markers among wheat, rye, and triticale. Theor Appl Genet. 2004, 108 (6): 1147-1150. 10.1007/s00122-003-1532-5.

    PubMed  Article  Google Scholar 

  32. 32.

    Gao LF, Jing RL, Huo NX, Li Y, Li XP, Zhou RH, Chang XP, Tang JF, Ma ZY, Jia JZ: One hundred and one new microsatellite loci derived from ESTs (EST-SSRs) in bread wheat. Theor Appl Genet. 2004, 108 (7): 1392-1400. 10.1007/s00122-003-1554-z.

    PubMed  Article  Google Scholar 

  33. 33.

    Gadaleta A, Mangini G, Mulè G, Blanco A: Characterization of dinucleotide and trinucleotide EST-derived microsatellites in the wheat genome. Euphytica. 2007, 153 (1–2): 73-85.

    Google Scholar 

  34. 34.

    Proite K, Leal-Bertioli SC, Bertioli DJ, Moretzsohn MC, da Silva FR, Martins NF, Guimaraes PM: ESTs from a wild Arachis species for gene discovery and marker development. BMC Plant Biol. 2007, 7: 7-10.1186/1471-2229-7-7.

    PubMed  PubMed Central  Article  Google Scholar 

  35. 35.

    Luo M, Dang P, Guo B, He G, Holbrook CC, Bausher MG, Lee RD: Generation of Expressed Sequence Tags (ESTs) for Gene Discovery and Marker Development in Cultivated Peanut. Crop Sci. 2005, 45: 346-353.

    Article  Google Scholar 

  36. 36.

    Pertea G, Huang X, Liang F, Antonescu V, Sultana R, Karamycheva S, Lee Y, White J, Cheung F, Parvizi B, et al: TIGR Gene Indices clustering tools (TGICL): a software system for fast clustering of large EST datasets. Bioinformatics. 2003, 19 (5): 651-652. 10.1093/bioinformatics/btg034.

    PubMed  Article  Google Scholar 

  37. 37.

    Cardle L, Ramsay L, Milbourne D, Macaulay M, Marshall D, Waugh R: Computational and experimental characterization of physically clustered simple sequence repeats in plants. Genetics. 2000, 156 (2): 847-854.

    PubMed  PubMed Central  Google Scholar 

  38. 38.

    Kumpatla SP, Mukhopadhyay S: Mining and survey of simple sequence repeats in expressed sequence tags of dicotyledonous species. Genome. 2005, 48 (6): 985-998. 10.1139/g05-060.

    PubMed  Article  Google Scholar 

  39. 39.

    Peakall R, Gilmore S, Keys W, Morgante M, Rafalski A: Cross-species amplification of soybean (Glycine max) simple sequence repeats (SSRs) within the genus and other legume genera: implications for the transferability of SSRs in plants. Mol Biol Evol. 1998, 15 (10): 1275-1287.

    PubMed  Article  Google Scholar 

  40. 40.

    Thiel T, Michalek W, Varshney RK, Graner A: Exploiting EST databases for the development and characterization of gene-derived SSR-markers in barley (Hordeum vulgare L.). Theor Appl Genet. 2003, 106 (3): 411-422.

    PubMed  Google Scholar 

  41. 41.

    Yu JK, Dake TM, Singh S, Benscher D, Li W, Gill B, Sorrells ME: Development and mapping of EST-derived simple sequence repeat markers for hexaploid wheat. Genome. 2004, 47 (5): 805-818. 10.1139/g04-057.

    PubMed  Article  Google Scholar 

  42. 42.

    Gupta PK, Rustgi S, Sharma S, Singh R, Kumar N, Balyan HS: Transferable EST-SSR markers for the study of polymorphism and genetic diversity in bread wheat. Mol Genet Genomics. 2003, 270 (4): 315-323. 10.1007/s00438-003-0921-4.

    PubMed  Article  Google Scholar 

  43. 43.

    Rungis D, Berube Y, Zhang J, Ralph S, Ritland CE, Ellis BE, Douglas C, Bohlmann J, Ritland K: Robust simple sequence repeat markers for spruce (Picea spp.) from expressed sequence tags. Theor Appl Genet. 2004, 109 (6): 1283-1294. 10.1007/s00122-004-1742-5.

    PubMed  Article  Google Scholar 

  44. 44.

    Eujayl I, Sorrells M, Baum M, Wolters P, Powell W: Assessment of genotypic variation among cultivated durum wheat based on EST-SSRS and genomic SSRS. Euphytica. 2001, 119: 39-43. 10.1023/A:1017537720475.

    Article  Google Scholar 

  45. 45.

    Chabane K, Ablett GA, Cordeiro GM, Valkoun J, Henry RJ: EST versus Genomic Derived Microsatellite Markers for Genotyping Wild and Cultivated Barley. Genet Resour Crop Evol. 2005, 5 (7): 903-909. 10.1007/s10722-003-6112-7.

    Article  Google Scholar 

  46. 46.

    Russell J, Booth A, Fuller J, Harrower B, Hedley P, Machray G, Powell W: A comparison of sequence-based polymorphism and haplotype content in transcribed and anonymous regions of the barley genome. Genome. 2004, 47 (2): 389-398.

    PubMed  Article  Google Scholar 

  47. 47.

    Aggarwal RK, Hendre PS, Varshney RK, Bhat PR, Krishnakumar V, Singh L: Identification, characterization and utilization of EST-derived genic microsatellite markers for genome analyses of coffee and related species. Theor Appl Genet. 2007, 114 (2): 359-372. 10.1007/s00122-006-0440-x.

    PubMed  Article  Google Scholar 

  48. 48.

    Wanga ML, Barkleya NA, Yua JK, Deana RE, Newmana ML, Sorrellsa ME, Pedersona GA: Transfer of simple sequence repeat (SSR) markers from major cereal crops to minor grass species for germplasm characterization and evaluation. Plant Genet Res. 2005, 3: 45-57. 10.1079/PGR200461.

    Article  Google Scholar 

  49. 49.

    Guo W, Wang W, Zhou B, Zhang T: Cross-species transferability of G. arboreum-derived EST-SSRs in the diploid species of Gossypium. Theor Appl Genet. 2006, 112 (8): 1573-1581. 10.1007/s00122-006-0261-y.

    PubMed  Article  Google Scholar 

  50. 50.

    Holton TA, Christopher JT, McClure L, Harker N, Henry RJ: Identification and mapping of polymorphic SSR markers from expressed gene sequences of barley and wheat. Mol Breed. 2002, 9 (2): 63-71. 10.1023/A:1026785207878.

    Article  Google Scholar 

  51. 51.

    Saha MC, Mian R, Zwonitzer JC, Chekhovskiy K, Hopkins AA: An SSR- and AFLP-based genetic linkage map of tall fescue (Festuca arundinacea Schreb.). Theor Appl Genet. 2005, 110 (2): 323-336. 10.1007/s00122-004-1843-1.

    PubMed  Article  Google Scholar 

  52. 52.

    Halward TM, Stalker HT, Larue EA, Kochert G: Use of single primer DNA amplification in genetic studies of peanut. Plant Mol Biol. 1992, 84: 201-208.

    Google Scholar 

  53. 53.

    Paik-Ro OG, Smith RL, Knauft DA: Restriction fragment length polymorphism evaluation of six peanut species within the Arachis section. Theor Appl Genet. 1992, 84 (1–2): 201-208.

    PubMed  Google Scholar 

  54. 54.

    Shepherd LD, Lambert DM: Mutational bias in penguin microsatellite DNA. J Hered. 2005, 96 (5): 566-571. 10.1093/jhered/esi070.

    PubMed  Article  Google Scholar 

  55. 55.

    Sethy NK, Choudhary S, Shokeen B, Bhatia S: Identification of microsatellite markers from Cicer reticulatum: molecular variation and phylogenetic analysis. Theor Appl Genet. 2006, 112 (2): 347-357. 10.1007/s00122-005-0135-8.

    PubMed  Article  Google Scholar 

  56. 56.

    Sharma KK, Lavanya M, Anjaiah V: A Method for Isolation and Purification of Peanut Genomic DNA Suitable for Analytical Applications. Plant Mol Biol Reptr. 2000, 18: 393a-393h. 10.1007/BF02825068.

    Article  Google Scholar 

  57. 57.

    Guo B, Chen X, Dang P, Scully BT, Liang X, Holbrook CC, Yu J, Culbreath AK: Peanut gene expression profiling in developing seeds at different reproduction stages during Aspergillus parasiticus infection. BMC Dev Biol. 2008, 8: 12-10.1186/1471-213X-8-12.

    PubMed  PubMed Central  Article  Google Scholar 

  58. 58.

    Kramer JA: Omiga: a PC-based sequence analysis tool. Mol Biotechnol. 2001, 19 (1): 97-106. 10.1385/MB:19:1:097.

    PubMed  Article  Google Scholar 

Download references


This research was funded by a grant from National High Technology Research Development Project (863) of China (No 2006AA0Z156, 2006AA10A115) and Science Foundation of Guangdong province (No 07117967).

Author information



Corresponding author

Correspondence to Xuanqiang Liang.

Additional information

Authors' contributions

All authors read and approved the final manuscript. XL participated in conceiving the study and drafting the manuscript. XC participated in conceiving the study, sequence analysis and drafting the manuscript. YH participated in conceiving the study, the development of SSR markers and data analysis. HL developed the SSRs and designed the SSR primers. GZ and SL planted and collected the peanut materials. BG participated in the development of SSR markers.

Electronic supplementary material


Additional File 1: List of EST-SSR primers developed from cultivated peanut ESTs. The file contains a table that lists primer names, repeat motifs, primer sequences, allele number and product length for the newly developed EST-SSR markers. (XLS 104 KB)


Additional File 2: List of cultivated peanut and wild species materials used in this study. The file includes a table that lists the name, type, ploidy and origin of 22 genotypes of cultivated peanuts and 16 accessions of wild species tested in this study. (XLS 20 KB)

Authors’ original submitted files for images

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and Permissions

About this article

Cite this article

Liang, X., Chen, X., Hong, Y. et al. Utility of EST-derived SSR in cultivated peanut (Arachis hypogaea L.) and Arachiswild species. BMC Plant Biol 9, 35 (2009).

Download citation


  • Wild Species
  • Polymorphism Information Content
  • Amplify Fragment Length Polymorphism
  • Repeat Type
  • Arachis Species