- Research article
- Open Access
Transcript profiling reveals expression differences in wild-type and glabrous soybean lines
© Hunt et al; licensee BioMed Central Ltd. 2011
- Received: 29 August 2011
- Accepted: 26 October 2011
- Published: 26 October 2011
Trichome hairs affect diverse agronomic characters such as seed weight and yield, prevent insect damage and reduce loss of water but their molecular control has not been extensively studied in soybean. Several detailed models for trichome development have been proposed for Arabidopsis thaliana, but their applicability to important crops such as cotton and soybean is not fully known.
Two high throughput transcript sequencing methods, Digital Gene Expression (DGE) Tag Profiling and RNA-Seq, were used to compare the transcriptional profiles in wild-type (cv. Clark standard, CS) and a mutant (cv. Clark glabrous, i.e., trichomeless or hairless, CG) soybean isoline that carries the dominant P1 allele. DGE data and RNA-Seq data were mapped to the cDNAs (Glyma models) predicted from the reference soybean genome, Williams 82. Extending the model length by 250 bp at both ends resulted in significantly more matches of authentic DGE tags indicating that many of the predicted gene models are prematurely truncated at the 5' and 3' UTRs. The genome-wide comparative study of the transcript profiles of the wild-type versus mutant line revealed a number of differentially expressed genes. One highly-expressed gene, Glyma04g35130, in wild-type soybean was of interest as it has high homology to the cotton gene GhRDL1 gene that has been identified as being involved in cotton fiber initiation and is a member of the BURP protein family. Sequence comparison of Glyma04g35130 among Williams 82 with our sequences derived from CS and CG isolines revealed various SNPs and indels including addition of one nucleotide C in the CG and insertion of ~60 bp in the third exon of CS that causes a frameshift mutation and premature truncation of peptides in both lines as compared to Williams 82.
Although not a candidate for the P1 locus, a BURP family member (Glyma04g35130) from soybean has been shown to be abundantly expressed in the CS line and very weakly expressed in the glabrous CG line. RNA-Seq and DGE data are compared and provide experimental data on the expression of predicted soybean gene models as well as an overview of the genes expressed in young shoot tips of two closely related isolines.
- Digital Gene Expression
- Soybean Genome
- Trichome Development
- Trichome Formation
- Digital Gene Expression Data
Plant trichomes are appendages that originate from epidermal cells and are present on the surface of various plant organs such as leaves, stems, pods, seed coats, flowers, and fruits. Trichome morphology, varying greatly among species, includes types that are unicellular, multicellular, glandular, non-glandular (as in soybean), single stalks (soybean), or branched structures (Arabidopsis) . Various functions have been ascribed to trichomes, including roles as attractants of pollinators, in protection from herbivores and UV light, and in transpiration and leaf temperature regulation [2–4].
The genetic control of non-glandular trichome initiation and development has been studied extensively in Arabidopsis and cotton. In Arabidopsis, several genes were identified that regulate trichome initiation and development. A knockout of GLABRA1 (GL1) results in glabrous Arabidopsis plants . The GL1 encodes a R2R3 MYB transcription factor that binds either GL3 or ENHANCER OF GLABRA3 (EGL3), basic helix-loop-helix (bHLH) transcription factors, which in turn bind to TRANSPARENT TESTA GLABRA (TTG) protein, a WD40 transcription factor [6, 7]. The binding of GL1-GL3/EGL3-TTG1 forms a ternary complex, which initiates the progression of an epidermal cell development into a trichome by binding to the GLABRA2 (GL2) gene, which encodes a homodomain/leucine zipper transcription factor .
Microarray gene expression analysis of two Arabidopsis mutants lacking trichomes with wild-type Arabidopsis trichomes identified several cell-wall related up-regulated genes . Transcriptome analyses of wild-type trichomes and the double mutant gl3-sst sim trichomes in Arabidopsis identified four new genes: HDG2, BLT, PEL3, and SVB that are potentially associated with trichome development .
Cotton fibers are single celled trichomes that develop from the surface of cotton seed . The development of cotton fibers goes through four stages of development: differentiation/fiber initiation, expansion/elongation, secondary cell wall biosynthesis, and maturity [11, 12]. Unlike Arabidopsis, the specific genes/proteins involved in cotton fiber initiation have not been clearly elucidated. Several different approaches have been taken to study cotton fiber initiation and elongation, including studying gene expression in normal fibers [12–14], comparing gene expression in fiber development mutants to normal cotton varieties [13, 15–17], and using existing EST or gene sequences from cotton or Arabidopsis clones [18–23].
Microarray studies comparing cotton fiber initiation mutants identified six clones falling into either BURP-containing protein or RD22-like protein that were over expressed in cotton fibers in wild-type compared with the mutant lines [15, 16]. These six clones are all members of the BURP domain gene family as the RD22 protein that was identified in Arabidopsis is also a member of the BURP domain family of proteins .
Soybean has 23 possible BURP domain containing genes which are classified into five subfamilies: BNM2-like, USP-like, RD22-like, PG1β-like, and BURPV (a new subfamily) depending on the translated products homology to these founding members of the BURP family [25, 26]. BURP genes are plant-specific and with diverse functions in plants [24, 25].
Unlike Arabidopsis and cotton, the developmental genetics of soybean trichomes has not been studied extensively. However, there are several soybean trichome developmental mutants available, including P1 (glabrous), pc (curly pubescence), Pd (dense pubescence), Ps (sparse pubescence), and p2 (puberulent) that are each controlled by a different single Mendelian locus . These mutants have been used to relate the importance of trichome to insect resistance [4, 28, 29], evapotranspiration [2, 30, 31] and other yield related characteristics. However, until now, none of these glabrous classical mutations has been studied at the molecular level. We studied the dominant P1 glabrous soybean mutant using two high throughput transcript sequencing technologies to reveal major expression differences between the two genotypes. RNA and DNA blots further characterized a highly differentially expressed BURP family member Glyma04g35130 that varied between the two genotypes and may be associated with trichome development in soybean although it is not a candidate for the P1 locus.
DGE library construction and identification of authentic tags
We first used Illumina DGE Tag Profiling to determine the differential gene expression between wild-type Clark standard (CS) and glabrous-mutant Clark glabrous (CG) in shoot tip tissue. The CG isoline was developed by backcrossing the P1 glabrous mutant into Clark for six generations . Total RNA isolated from shoot tips of both CS and CG plants was analyzed by Illumina DGE tag profiling to create transcriptome profiles of the two isolines. DGE tags are 16-nucleotide long and are designed to be derived from the 3'UTR of the transcript. DGE data provide a quantitative measure of transcript abundance in the RNA population and can also identify previously unannotated genes. The majority of DGE tags are expected to match only one location in the genome, with the remaining tags matching duplicated genes, alternate transcripts, antisense strands, or repeated sequences .
Transcriptome comparison of Clark standard and Clark glabrous with DGE tag profiling
Top DGE tags and RNA-Seq RPKM for genes that are over expressed either in Clark standard (a) or Clark glabrous (b).
DGE Tag ID
copper ion binding protein
BURP domain protein
no Glyma model
cyclic nucleotide-gated channel B
no Glyma model
small polyprotein 2
chlorophyll a-b binding protein 1
Blue copper protein precursor
cytochrome P450 monooxygenase CYP84A16
phosphatidylserine decarboxylase invertase/pectin methylesterase inhibitor family
lipid transfer protein
hypothetical protein invertase/pectin methylesterase inhibitor family
putative wall-associated kinase
zinc ion binding protein
copper amine oxidase
putative diphosphonucleotide phosphatase
Aspartic proteinase nepenthesin-1 precursor
no Glyma model
root nodule extensin
auxin down-regulated protein
receptor-like protein kinase
Among the tags overexpressed in the CS line, one particular tag corresponds to a gene located on Glyma04 chromosome, specifically Glyma04g35130, and showed >2000-fold expression difference between CS/CG = 2,545/1.06 tags per million aligned tags (Table 1). The Glyma04g35130 gene is a member of the BURP gene family. It has high homology to the cotton gene- RESISTANCE TO DROUGHT RD22-like 1 (GhRDL1), involved in cotton fiber initiation and member of the BURP protein domain family [15, 16]. Soybean has a total of 23 BURP domain containing genes and BURP gene family members from other species are known to have diverse functions . Some of the proposed functions of BURP family members include: regulation of fruit ripening in tomato [36, 37], response to drought stress induced by abscisic acid in Arabidopsis , tapetum development in rice , and seed coat development in soybean . In Clark, the DGE0000012 tag found to correspond to Glyma04g35130 is the 12th most abundant tag in the DGE data set. For perspective, the 4th most abundant tag with a normalized count of 4,903 tags matches a chlorophyll a/b binding factor as do several of the most abundant tags (Additional file 2).
For further verification of differential expression, we used DESeq package in R without replications as described . This condition relies on the assumption that in the isolines most genes will be similarly expressed, thus treating the two lines as repeats. This analysis produced the same list of significant up and down-regulated genes. Lists of all differentially expressed genes in CS versus CG or vice versa are shown in Additional file 4A & 4B, respectively, using the DESeq package.
Comparison of DGE data with RNA-Seq
The sequencing of CS and CG transcriptome by RNA-Seq generated 91.4 and 88.7 million 75-bp reads, respectively from an independent biological sample of the CS and CG shoot tips. These tags were mapped to the 78,744 soybean gene models using Bowtie . RNA-Seq data was normalized in reads per kilo base of gene model per million mapped reads (RPKM) as the sensitivity of RNA-Seq depends on the transcript length . RNA-Seq analysis revealed that at the cutoff point of 10 RPKM, a total of 11,574 and 14,378 genes were expressed in CS and CG, respectively. At a cutoff of 1 RPKM, however, 41,972 and 44,120 genes were expressed in CS and CG, respectively. Together, the results suggest that in the RNA-Seq transcriptome, ~50% of genes are expressed in both wild-type and mutant soybean.
Expression of BURP gene family members as measured by DGE and RNA-Seq.
RNA blots confirm the dramatic transcript level differences of Glyma04 BURPgene in Clark standard and Clark glabrous
DNA blot comparison of the Glyma04g35130 BURPgene in Clark standard and Clark glabrous
Sequence Analysis of Glyma04g35130BURP Gene of Clark standard and Clark glabrous
Expression analysis of soybean orthologs to known genes involved in trichome development reveal low transcript levels in young shoot tips of both lines
Comparison of DGE and RNA-Seq expression in soybean Clark standard and Clark glabrous of genes influencing trichome development in Arabidopsis
Other studies have shown that MYB transcription factor genes CAPRICE (CPC), TRICHOMELESS (TCL1) and TRIPTYCHON (TRY) are negative regulators of trichome development [46–48]. Elevated levels of SPLs (SQUAMOSA PROMOTER BINDING PROTEIN LIKE) produced fewer trichomes in Arabidopsis. SPL9 directly activates the expression of MYB transcription factor genes such as TRICHOMELESS1 (TCL1) and TRIPTYCHON (TRY), which are the negative regulators of trichome development . Again, no substantial differences were found between the two soybean genotypes (Table 3).
While microarrays have been used extensively to reveal physiological trends from transcriptome analyses of soybean developmental stages or organ systems, fewer reports to date have focused on transcriptome analysis of near isogenic lines using either microarrays [50, 51] or high throughput sequence analysis [52, 53]. Here we compared high throughput sequencing using Digital Gene Expression and RNA-Seq transcriptome profiles of wild-type soybean (CS) and a glabrous-mutant (CG) with the dominant P1 mutation in soybean. DGE produces 16-nucleotide long tags generally specific to 3' end of each mRNA that provide information on quantitative expression of genes, rare transcripts, and also reveals novel or unannotated genes. However, since DGE data often represent the 3' end, it is essential that the databases or reference genome contain that information. We found that many of the annotated gene models in the soybean gene do not extend sufficiently to represent the DGE tags and extending the models to 250 bases at the 5' and 3' ends enables many more tags to align to the models.
Compared to DGE, RNA-Seq produces even greater numbers of reads, up to hundreds of millions in one sequencing lane. The reads are also longer, generally 75 bp and correspond to the entire coding region thus giving more depth and range of coverage. The majority of the genes that are over-expressed in CS as compared to CG were also over expressed in RNA-Seq data or a vice versa but their expression fold changes were different. The use of different technology in DGE and RNA-Seq that produced 16 bp tags from 3'ends and 75 bp tags from whole transcripts, respectively, resulted in differences between DGE and RNA-Seq data. RNA-Seq is potentially a more comprehensive way to measure transcriptome abundance, composition, and splice variants, and it also enables discover of new exons or genes. Soybean has a large and highly duplicated genome, rich in paralogs and gene families. This presents a challenge when mapping DGE tags to a specific gene, since they could equally well map to the other gene homologs in the genome. Yet, both DGE and RNA-Seq data has enabled nearly the same trend of differential expression for many of the gene models.
DGE and RNA-Seq analyses of CS and CG soybean isolines revealed several hundred genes with differential expression. Among them, the Glyma04g35130 BURP gene had a strong transcript level differences between the two lines. Additional validation came from RNA blots, which confirmed that the Glyma04g35130 BURP gene was strongly expressed in CS tissues and not in the glabrous CG isolines. There are also structural (SNP) differences between the CS and CG isolines for this gene. However, the parallel of high transcript levels for trichome-containing plants breaks down for the cv. Williams 82 which has trichomes but also has a very low level of transcripts in shoot tips of the Glyma04g35130 BURP gene as shown by Northern blotting (data not shown). The most distinguishing structural feature difference between the Glyma04g35130 BURP genes in the three cultivars is the presence of the 60 bp repeats, and an additional exon in the CS and CG lines compared to cv. Williams 82, and the addition of one nucleotide C in CG as compared to the other two.
The Glyma04g35130 BURP gene showed high homology to the cotton gene RESISTANCE TO DROUGHT RD22-like 1 (GhRDL1) that is involved in cotton fiber initiation and is also a member of the BURP protein family. The Glyma04g35130 BURP gene and SCB1, seed coat burp domain protein 1 (Glyma07g28940) fall into one BURP protein family- BURPV, when 41 BURP proteins from different species were classified into 5 subfamilies . SCB1 may play a role in the differentiation of the seed coat parenchyma cells and is localized on the cell wall of soybean . But it should be noted that despite high sequence homology among the BURP domain containing genes, the function of each BURP protein seems to greatly vary among plants. The Glyma04g35130 BURP gene does not seem to have a direct role in trichome formation but the possibility is open that it may be indirectly involved in some soybean genotypes.
Although sequence comparison of transcripts from cv. Williams 82, CS, and CG showed 98% identity, but it also revealed various SNP's, insertions, and deletions in CS and CG when compared to cv. Williams 82 (Figure 7). These differences in the transcript sequences such as ~60 bp insertion in the third exon of CS and addition of one nucleotide C in CG resulted in premature stop codons and also disturbed the frame in both CS and CG (Figure 7 & 8). One might also expect differences in the upstream promoter regions of the Glyma04g35130 BURP between CS and CG genes based on the dramatic transcript level differences between the two genotypes as shown by DGE and confirmed by RNA blotting. The number of RFLPs seen in the CS vs. CG DNA blots suggested more family members that may differ by various indels. By comparing the BURP EST probe against the cv. Williams 82 soybean genome sequence , seven potential BURP gene family members were found that have sequence homology to the probe (Table 2) but only Glyma04g35130 stood out as highly differentially expressed between the two genotypes. Up to 23 total genes with BURP protein domains exist in soybean  but only seven are related to the Glyma04g35130 as assessed by e value of <10-6.
Some genes involved in the initiation of trichome development have been particularly well characterized in Arabidopsis. As shown in Table 3, the transcript levels of soybean orthologs to some of the Arabidopsis genes were very low and did not vary considerably between the two genotypes even in the RNA-Seq data that yielded nearly 70 million mapped reads from the young shoot tips of each genotype. It may be necessary to assay earlier stages of trichome development using laser capture microdissection to find transcripts in early trichome formation in specific cell types. Alternatively, soybean may have different and undiscovered mechanisms for trichome formation.
Digital gene profiling and high throughput RNA-Seq revealed thousands of genes expressed in young trifoliate shoot tips of soybean. The data show a direct comparison of both methods. Many genes show agreement of the same trend of gene expression between the isolines but the two techniques produce differences in the ratios. Both methods allowed distinguishing gene family members in many cases. Comparison of isolines delineated changes in transcript abundance between wild-type soybean and glabrous-mutant on a genome-wide scale. Many genes showed similar expression levels between the two isolines as expected but the data also delineated the genes that are over-expressed or under-expressed in CS and may provide an insight into trichome gene expression in soybean, as the CG mutants lack any non glandular trichomes. The identification of a highly expressed member of the BURP gene family, Glyma04g35130, in CS that has almost no transcript presence in CG, may indicate its involvement in trichome formation or function in certain genotypes although it is not a candidate for the dominant P1 locus. Orthologs for Arabidopsis genes involved in trichome development were only very weakly expressed and did not vary considerabley between the two genotypes. This study represents a first step in expanding the study of trichome genetics into the economically important soybean plant.
Plant Materials and Genetic Nomenclature
The two isolines of Glycine max used for this study-Clark standard (L58-231) (CS) and Clark glabrous (L62-1385) (CG) were obtained from the USDA Soybean Germplasm Collections (Department of Crop Sciences, USDA/ARS University of Illinois, Urbana IL). CG mutant was generated by introgression of the P1 glabrous mutant line (T145) into CS for six generations. Plants were grown in the greenhouse for one month and tissues were harvested and sampled from each plant including leaves (four stages from young to older leaves), shoot tips, root, hypocotyl, cotyledons, seed coats, and stem tissue. Multiple plant and tissue samples were used for each extraction in a 12 ml extraction volume. All tissues were quick frozen in liquid nitrogen and stored at -80°C. The tissues were then lyophilized and stored at -20°C.
DGE Library Construction and Data Analysis
Shoot tips from green house grown soybean isolines: CS and CG were collected approximately 4 weeks after planting and immediately frozen in liquid nitrogen. The RNA from multiple shoot tips and leaves was extracted using a modification of the McCarty method  using a 12 ml protocol with phenol chloroform extraction and lithium chloride precipitation.
Library construction was carried out at Illumina, Inc., San Diego, using illumina's DGE tag profiling technology. Briefly, double-stranded cDNA's were synthesized using oligo(dT) beads and cDNA's were digested with NlaIII or DpnII restriction enzymes and ligated to defined gene expression adapter (GEX NlaIII Adapter 1, containing another restriction enzyme MmeI). Following MmeI digestion of cDNA's, which cuts 17 bp downstream, the GEX Adapter 2 was ligated at the site of MmeI cleavage. The GEX Adapter 2 contains sequences complementary to the oligos attached to the flow cell surface. Tags flanked by both adapters were enriched by PCR using primers that anneal to the ends of the adapters. The PCR products were gel purified before loading onto the illumina cluster station for sequencing.
After adapter trimming, the tags were 16-nucleotide long corresponding to 3'end of the transcript. Approximately 5.2 million DGE tags were sequenced from each library and the total counts for each unique read were determined and a unique DGE ID number was assigned to each unique tag, resulting in approximately 85,000 tags for each library where at least one library contained at least 5 counts per tag. The sequences of the DGE sequence tags and counts in each library are shown in Additional File 1.
DGE tags were aligned to the 78,774 cDNA gene models (known as Glyma models) predicted from the soybean reference genome of cv. Williams 82  and available at the Phtozome web site  using Bowtie . Using a stringent criterion of 0 mismatches within the 16 nucleotide tag alignments, most of the tags aligned to the models but large numbers of tags did not. In order to retrieve alignments where the models did not call sufficient 3'UTR sequence, we extended the Glyma models at both the 5' and 3' ends by 250 bases in each direction. Of the 5.2 million raw DGE reads for each library, approximately 4.7 million aligned to the extended Glyma models. DGE data was normalized per million aligned reads.
In addition to alignments to the Glyma models, candidate soybean ESTs corresponding to the tags were used for further verification of the DGE differentially expressed tags referenced in the Table 1. First, each read was compared to the publically available soybean EST sequences available at NCBI via a BLASTN search. Each read was used to identify 100% matches, and only clones matching at least three separate ESTs were used for further analysis. The identified ESTs corresponding to each read were then compared with the non-redundant sequence database at NCBI, using BLASTX. Reads were included in the final list only if all three (or two, 100% identical to reads) had matching annotations. For differential gene expression analysis with count data using a negative binomial distribution without replication, the DESeq package in R was used .
The RNA from multiple shoot tips was extracted using a modification of the McCarty method  using a 12 ml protocol with phenol chloroform extraction and lithium chloride precipitation. The shoot tips were harvested from a second biological replication of ~4-week old plants grown in green house. Library construction and high-throughput sequencing was carried out using RNA-Seq technology at using Illumina GaII instruments by the Keck Center, University of Illinois.
RNA-Seq Allignment and Data Normalization
The 75 bp reads were mapped to the 78,744 Glyma cDNA gene models  using Bowtie  with up to 3 mismatches allowed and up to 25 alignments. A total of the 91.4 and 88.7 million reads were generated in each lane of Illumina sequencing for the CS and CG libraries, respectively. Of these, 65.4 (71%) and 70.3 (79%) million reads aligned to the 78,744 target Glyma models with the Bowtie criteria used. RNA-Seq data was normalized in reads per kilobase of gene model per million mapped reads (RRKM) as the RNA-Seq depends on the transcript length  as the reads will map to all positions of the transcript, unlike DGE tags which are predominantly found adjacent to the first DpnII site at the 3' end of the transcript. The RNA-Seq data discussed in this publication have been deposited in NCBI's Gene Expression Omnibus  and are accessible through GEO Series accession number GSE33155.
Annotation of Glyma models
Coding region gene models were collected from the masked soybean genome from Phytozome version 4.0 GFF file . In addition to the PFAM, KOG and Panther annotations downloaded from Phytozome, the 78,744 models (that include both high and low confidence models) were further annotated using BLASTX against the non-redundant (nr) database of the National Center for Biotechnology Information  and trEMBL and Swiss prot of the European Bioinformatics Institute  on a Time Logic CodeQuest DeCypher Engine.
BURP Gene Cloning and Sequence Analysis
Primers from the cv. Williams 82 genomic sequence [33, 34] were used to amplify the full-length BURP gene from CS and CG genomic DNA using the primers 5' ACATCATTCTAAAAGACATAGACTA3' and 5' TGACCTGTTAGCTTTATGAT3'. A cDNA sequence was amplified from CS root tissue using RT-PCR with primers designed on 5' and 3' untranslated regions (5' CCACCTAAACCATAAGTCCTATTGG3' and 5' CCTATTACTAAAATGTAGGTTCAGTAAAGGTAG3'). All genomic and cDNA sequences were cloned and confirmed by DNA sequencing. The cDNA and genomic sequences of Glyma04g35130 from both lines, CS and CG were compared to determine the number of introns and exons in the gene.
Total RNA was extracted from the frozen leaves, roots, hypocotyls, seed coats, and cotyledons of CS and CG using standard phenol chloroform method with lithium chloride precipitation . RNA samples were quantified by spectrophotometer and the integrity was confirmed using agarose gel electrophoresis. RNA was stored at -80°C until further use.
For RNA gel blot analysis, 10 μg of total RNA was electrophoresed through 1.2% agarose/1.1% formaldehyde gels  blotted onto nitrocellulose membranes (Schleicher & Schuell, Keene, NH) via capillary action with 10× SSC (1.5 M NaCl and 0.15 M sodium citrate, pH = 7) overnight. After blotting, RNA was cross-linked to the nitrocellulose membranes with UV radiation by a UV cross-linker (Stratagene, La Jolla, CA). Nitrocellulose RNA gel blots were then prehybridized, hybridized, washed, and exposed to Hyperfilm (Amersham, Piscataway, NJ) as described by Todd and Vodkin (1996) .
A 1.4 kb probe for BURP gene was amplified from EST (Gm-r1083-3435) and labeled with [α-32P]dATP by random primer reaction method .
For DNA blots, genomic DNA was isolated from lyophilized soybean shoot tips using the method described by Dellaporta in 1993  with minor modifications. Genomic DNAs were digested with six different restriction enzymes including BamHI, HindIII, EcoRI, DraI, BglII, and EcoRV in separate reactions. Ten micrograms of digested genomic DNA from each sample was separated on 0.7% agarose gels. The gels were then treated sequentially with depurination solution (0.25 M HCl), denaturation solution (1.5 M NaCl, 0.5 M NaOH), and neutralization solution (1 M Tris, 1.5 M NaCl [pH 7.4]). The gels were then taken through the same blotting transfer protocol described above for Northern blots along with prehybridization, hybridization (with the appropriate [α-32P]dATP labeled probed), washing, and exposure to Hyperfilm (Amersham, Piscataway, NJ). The same EST probe used for RNA blot was used in the DNA blots.
We are grateful to Sean Bloomfield, Achira Kulasekara, and Cameron Lowe for help with data analysis. The research was funded by support from the Illinois Soybean Association and the USDA.
- Werker E: Trichome diversity and development. Adv Bot Res. 2000, 31: 1-35.View ArticleGoogle Scholar
- Ghorashy SR, Pendelton JW, Bernard RL, Bauer ME: Effect of leaf pubescence on transpiration, photosynthetic rate and seed yield of three near-isogenic lines of soybeans. Crop Sci. 1971, 11: 426-427. 10.2135/cropsci1971.0011183X001100030035x.View ArticleGoogle Scholar
- Nielsen DC, Blad BL, Verma SB, Rosenberg NJ, Specht JE: Influence of soybean pubescence type on radiation balance. Agron J. 1984, 76: 924-929. 10.2134/agronj1984.00021962007600060015x.View ArticleGoogle Scholar
- Brodbeck BV, Andersen PC, Mizell RF, Oden S: Comparative nutrition and developmental biology of xylem-feeding leafhoppers reared on four genotypes of Glycine max. Environ Entomol. 2004, 33 (2): 165-173. 10.1603/0046-225X-33.2.165.View ArticleGoogle Scholar
- Herman PL, Marks MD: Trichome development in Arabidopsis thaliana. II. Isolation and complementation of the GLABROUS1 gene. Plant cell. 1989, 1: 1051-55.PubMedPubMed CentralView ArticleGoogle Scholar
- Payne CT, Zhang F, Lloyd AM: GL3 encodes a bHLH protein that regulates trichome development in Arabidopsis through interaction with GL1 and TTG1. Genetics. 2000, 156: 1349-62.PubMedPubMed CentralGoogle Scholar
- Zhao M, Morohashi K, Hatlestad G, Grotewold E, Lloyd A: The TTG1-bHLHMYB complex controls trichome cell fate and patterning through direct targeting of regulatory loci. Development. 2008, 135: 1991-1999. 10.1242/dev.016873.PubMedView ArticleGoogle Scholar
- Cristina MD, Sessa G, Dolan L, Linstead P, Baima S, Ruberti I, Morelli G: The Arabidopsis Athb-10 (GLABRA2) is an HD-Zip protein required for regulation of root hair development. Plant J. 1996, 10 (3): 393-402. 10.1046/j.1365-313X.1996.10030393.x.PubMedView ArticleGoogle Scholar
- Jakoby M, Falkenhan D, Mader MT, Brininstool G, Wischnitzki E, Platz N, Hudson A, Hülskamp M, Larkin J, Schnittger A: Transcriptional profiling of mature Arabidopsis trichomes reveals that NOECK encodes the MIXTA-like transcriptional regulator MYB106. Plant Physiol. 2008, 148: 1583-1602. 10.1104/pp.108.126979.PubMedPubMed CentralView ArticleGoogle Scholar
- Marks MD, Wenger JP, Gilding E, Jilk R, Dixon RA: Transcriptome analysis of Arabidopsis wild- type and gl3-sst sim trichomes identifies four additional genes required for trichome development. Mol Plant. 2009, 2: 803-822. 10.1093/mp/ssp037.PubMedPubMed CentralView ArticleGoogle Scholar
- Lee JJ, Woodward AW, Chen ZJ: Gene expression changes and early events in cotton fibre development. Ann Bot. 2007, 100: 1391-1401. 10.1093/aob/mcm232.PubMedPubMed CentralView ArticleGoogle Scholar
- Arpat AB, Waugh M, Sullivan JP, Gonzales M, Frisch D, Main D, Wood T, Leslie A, Wing RA, Wilkins TA: Functional genomics of cell elongation in developing cotton fibres. Plant Mol Biol. 2004, 54: 911-929. 10.1007/s11103-004-0392-y.PubMedView ArticleGoogle Scholar
- Taliercio EW, Boykin D: Analysis of gene expression in cotton fiber initials. BMC Plant Biol. 2007, 7: 22-10.1186/1471-2229-7-22.PubMedPubMed CentralView ArticleGoogle Scholar
- Alabady MS, Youn E, Wilkins TA: Double feature selection and cluster analyses in mining of microarray data from cotton. BMC Genomics. 2008, 9: 295-10.1186/1471-2164-9-295.PubMedPubMed CentralView ArticleGoogle Scholar
- Li XB, Cai L, Cheng NH, Liu JW: Molecular characterization of the cotton GhTUB1 gene that is preferentially expressed in fibre. Plant Physiol. 2002, 130: 666-674. 10.1104/pp.005538.PubMedPubMed CentralView ArticleGoogle Scholar
- Lee JJ, Hassan OS, Gao W, Wei NE, Kohel RJ, Chen XY, Payton P, Sze SH, Stelly DM, Chen ZJ: Development and Gene Expression Analyses of a Cotton Naked Seed Mutant. Planta. 2006, 223: 418-432. 10.1007/s00425-005-0098-7.PubMedView ArticleGoogle Scholar
- Wu AM, Ling C, Liu JY: Isolation of a cotton reversibly glycosylated polypeptide (GhRGP1) promoter and its expression activity in transgenic tobacco. J Plant Physiol. 2006, 163: 426-435. 10.1016/j.jplph.2005.06.014.PubMedView ArticleGoogle Scholar
- Loguercio LL, Zhang JQ, Wilkins TA: Differential regulation of size novel MYB-domain genes defines two distinc expression patterns in allotetraploid cotton (Gossypium hirsutum L.). Mol Gen Genet. 1999, 261: 660-671. 10.1007/s004380050009.View ArticleGoogle Scholar
- Sou J, Liang X, Pu L, Zhang Y, Xue Y: Identification of GhMYB109 encoding a R2R3 MYB transcription factor that expressed specifically in fiber initials and elongating fibers of cotton (Gossypium hirsutum L.). Biochim Biophys Acta. 2003, 1630: 25-34.View ArticleGoogle Scholar
- Humphries JA, Walker AR, Timmis JN, Orford SJ: Two WD-Repeat Genes from Cotton are Functional Homologues of the Arabidopsis thaliana TRANSPARENT TESTA GLABRA1 (TTG1) Gene. Plant Mol Biol. 2005, 57: 67-81. 10.1007/s11103-004-6768-1.PubMedView ArticleGoogle Scholar
- Luo M, Xiao Y, Li X, Lu X, Deng W, Li D, Hou L, Hu M, Li Y, Pei Y: GhDET2, a steroid 5α-reductase, plays an important role in cotton fibre cell initiation and elongation. Plant J. 2007, 51: 419-430. 10.1111/j.1365-313X.2007.03144.x.PubMedView ArticleGoogle Scholar
- Guan XY, Li QJ, Shan CM, Wang S, Mao YB, Wang LJ, Chen XY: The HD-Zip IV gene GaHOX1 from cotton is a functional homologue of the Arabidopsis GLABRA2. Physiol Plantarum. 2008, 134: 174-182. 10.1111/j.1399-3054.2008.01115.x.View ArticleGoogle Scholar
- Shangguan XX, Xu B, Yu ZX, Wang LJ, Chen XY: Promoter of a cotton fibre MYB gene functional in trichomes of Arabidopsis and glandular trichomes of tobacco. J Exp Bot. 2008, 59 (13): 3533-3542. 10.1093/jxb/ern204.PubMedPubMed CentralView ArticleGoogle Scholar
- Hattori J, Boutilier KA, van Lookeren Campagne MM, Miki BL: A conserved BURP domain defines a novel group of plant proteins with unusual primary structure. Mol Gen Genet. 1998, 259: 424-428. 10.1007/s004380050832.PubMedView ArticleGoogle Scholar
- Granger C, Coryell V, Khanna A, Keim P, Vodkin L, Shoemaker RC: Identification, structure, and differential expression of a BURP domain containing protein family in soybean. Genome. 2002, 45: 693-701. 10.1139/g02-032.PubMedView ArticleGoogle Scholar
- Xu H, Li Y, Yan Y, Wang K, Gao Y, Hu Y: Genome-scale identification of soybean BURP domain-containing genes and their expression under stress treatments. BMC Plant Biol. 2010, 10: 197-10.1186/1471-2229-10-197.PubMedPubMed CentralView ArticleGoogle Scholar
- Bernard RL, Singh BB: Inheritance of pubescence type in soybeans: glabrous, curly, dense, sparse and puberulent. Crop Sci. 1969, 9: 192-197. 10.2135/cropsci1969.0011183X000900020025x.View ArticleGoogle Scholar
- Singh BB, Hadley HH, Bernard RL: Morphology of pubescence in soybeans and its relationship to plant vigor. Crop Sci. 1971, 11: 13-16. 10.2135/cropsci1971.0011183X001100010004x.View ArticleGoogle Scholar
- Lam W-KF, Pedigo LP: Effect of trichome density on soybean pod feeding by adult bean leaf beetles (coleoptera: chrysomelidae). J Econ Entomol. 2001, 94 (6): 1459-1463. 10.1603/0022-0493-94.6.1459.PubMedView ArticleGoogle Scholar
- Baldocchi DD, Verma SB, Rosenberg NJ, Blad BL, Garay A, Specht JE: Leaf pubescence effects on the mass and energy exchange between soybean canopies and the atmosphere. Agron J. 1983, 75: 537-543. 10.2134/agronj1983.00021962007500030028x.View ArticleGoogle Scholar
- Specht JE, Blad BL, Garay AF: Water use efficiency in soybean pubescence density isolines - a calculation procedure for estimating daily values. Agron J. 1986, 78: 483-486. 10.2134/agronj1986.00021962007800030018x.View ArticleGoogle Scholar
- Saha S, Sparks AB, Rego C, Viatcheslav A, Wang CJ, Vogelstein B, Kinzler KW, Velculescu VE: Using the transcriptome to annotate the genome. Nat Biotech. 2002, 20: 508-512. 10.1038/nbt0502-508.View ArticleGoogle Scholar
- Schmutz J, Cannon SB, Schlueter J, Ma J, Mitros T, Nelson W, Hyten DL, et al: Genome sequence of the palaeopolyploid soybean. Nature. 2010, 463: 178-183. 10.1038/nature08670.PubMedView ArticleGoogle Scholar
- Joint Genome Institute/Phytozome/. [http://www.phytozome.net/soybean.php]
- Langmead B, Trapnell C, Pop M, Salzberg SL: Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 2009, 10: R25-10.1186/gb-2009-10-3-r25.PubMedPubMed CentralView ArticleGoogle Scholar
- Zheng L, Heupel RC, DellaPenna D: The β Subunit of tomato fruit polygalacturonase isoenzyme 1: isolation, characterization, and identification of unique structural features. Plant Cell. 1992, 4: 1147-1156.PubMedPubMed CentralGoogle Scholar
- Watson CF, Zheng L, DellaPenna D: Reduction of tomato polygalacturonase β subunit expression affects pectin solubilization and degradation during fruit ripening. Plant Cell. 1994, 6: 1623-1634.PubMedPubMed CentralGoogle Scholar
- Shinozaki KY, Shinozaki K: The plant hormone abscisic acid mediates the drought-induced expression but not the seed-specific expression of rd22, a gene responsive to dehydration stress in Arabidopsis thaliana. Mol Gen Genet. 1993, 238: 17-25.PubMedGoogle Scholar
- Wang A, Xia Q, Xie W, Datla R, Selvaraj G: The classical ubisch bodies carry a sporophytically produced structural protein (RAFTIN) that is essential for pollen development. Proc Natl Acad Sci USA. 2003, 100: 14487-14492. 10.1073/pnas.2231254100.PubMedPubMed CentralView ArticleGoogle Scholar
- Batchelor AK, Boutilier K, Miller SS, Hattori J, Bowman LA, Hu M, Lantin S, Johnson DA, Miki BL: SCB1, a BURP-domain protein gene, from developing soybean seed coats. Planta. 2002, 215: 523-532. 10.1007/s00425-002-0798-1.PubMedView ArticleGoogle Scholar
- Anders S, Huber W: Differential expression analysis for sequence count data. Genome Biol. 2010, 11: R106-10.1186/gb-2010-11-10-r106.PubMedPubMed CentralView ArticleGoogle Scholar
- Mortazavi A, Williams BA, McCue K, Schaeffer L, Wold B: Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat Methods. 2008, 5: 621-628. 10.1038/nmeth.1226.PubMedView ArticleGoogle Scholar
- Oppenheimer DG, Herman PL, Sivakumaran S, Esch J, Marks MD: A myb gene required for leaf trichome differentiation in Arabidopsis is expressed in stipules. Cell. 1991, 67: 483-493. 10.1016/0092-8674(91)90523-2.PubMedView ArticleGoogle Scholar
- Walker AR, Davison PA, Bolognesi-Winfield AC, James CM, Srinivasan N, Blundell TL, Esch JJ, Marks MD, Gray JC: The TRANSPARENT TESTA GLABRA1 locus, which regulates trichome differentiation and anthocyanin biosynthesis in Arabidopsis, encodes a WD40 repeat protein. Plant Cell. 1999, 11: 1337-1350.PubMedPubMed CentralView ArticleGoogle Scholar
- Zhang F, Gonzalez A, Zhao M, Payne CT, Lloyd A: A network of redundant bHLH proteins functions in all TTG1-dependent pathways of Arabidopsis. Development. 2003, 130: 4859-4869. 10.1242/dev.00681.PubMedView ArticleGoogle Scholar
- Wada T, Tachibana T, Shimura Y, Okada K: Epidermal cell differentiation in Arabidopsis determine by a Myb homolog, CPC. Science. 1997, 277: 1113-1116. 10.1126/science.277.5329.1113.PubMedView ArticleGoogle Scholar
- Schellmann S, Schnittger A, Kirik V, Wada T, Okada K, Beermann A, Thumfahrt J, Jurgens G, Hulskamp M: TRIPTYCHON and CAPRICE mediate lateral inhibition during trichome and root hair patterning in Arabidopsis. EMBO J. 2002, 21: 5036-5046. 10.1093/emboj/cdf524.PubMedPubMed CentralView ArticleGoogle Scholar
- Wang S, Kwak SH, Zeng Q, Ellis BE, Chen XY, Schiefelbein J, Chen JG: TRICHOMELESS1 regulates trichome patterning by suppressing GLABRA1 in Arabidopsis. Development. 2007, 134: 3873-3882. 10.1242/dev.009597.PubMedView ArticleGoogle Scholar
- Yu N, Cai WJ, Wang S, Shan CM, Wang LJ, Chen XY: Temporal control of trichome distribution by microRNA 156- targeted SPL genes in Arabidopsis thaliana. Plant Cell. 2010, 2322-2335.Google Scholar
- Zabala G, Vodkin LO: The wp mutation of Glycine max carries a gene-fragment-rich transposon of the CACTA superfamily. Plant Cell. 2005, 17: 2619-2632. 10.1105/tpc.105.033506.PubMedPubMed CentralView ArticleGoogle Scholar
- O'Rourke JA, Charlson DV, Gonzalex DO, Vodkin LO, Graham MA, Cianzio SR, Grusak MA, Shoemaker RC: Microarray analysis of iron deficiency chlorosis in near-isogenic soybean lines. BMC Genomics. 2007, 8: 476-10.1186/1471-2164-8-476.PubMedPubMed CentralView ArticleGoogle Scholar
- Tuteja JH, Zabala G, Varala K, Hudson M, Vodkin LO: Endogenous, tissue-specific short interfering RNAs silence the chalcone synthase gene family in Glycine max seed coats. Plant Cell. 2009, 21: 3063-3077. 10.1105/tpc.109.069856.PubMedPubMed CentralView ArticleGoogle Scholar
- Severin AJ, Peiffer GA, Xu WW, Hyten D, Bucciarelli B, O'Rourke JA, Bolon YT, Grant D, Farmer AD, May GD, Vance CPl, Shoemaker RC, Stupar RM: An integrative approach to genomics introgression mapping. Plant Physiol. 2010, 154: 3-12. 10.1104/pp.110.158949.PubMedPubMed CentralView ArticleGoogle Scholar
- McCarty DR: A simple method for extractions of RNA from maize tissue. Maize Genetics Cooperation News Letter. 1986, 60: 61.Google Scholar
- National Center for Biotechnology Information. [http://0-www.ncbi.nlm.nih.gov.brum.beds.ac.uk/]
- European Bioinformatics Institute. [http://www.ebi.ac.uk/uniprot/]
- Sambrook J, Fritsch EF, Maniatis T: Molecular Cloning: A LaboratoryManual. Cold Spring Harbor, NY: Cold Spring Harbor Laboratory; 1989.Google Scholar
- Todd JJ, Vodkin LO: Duplications that suppress and deletions that restore expression from a chalcone synthase multigene family. Plant Cell. 1996, 8: 687-699.PubMedPubMed CentralView ArticleGoogle Scholar
- Feinberg AP, Vogelstein B: A technique for radiolabeling DNA restriction endonuclease fragments to high specific activity. Anal Biochem. 1983, 132: 6-13. 10.1016/0003-2697(83)90418-9.PubMedView ArticleGoogle Scholar
- Dellaporta SL: Plant DNA miniprep version 2.1-2.3.Edited by: Freeling M,Walbot V. The Maize Handbook Springer-Verlag, New York; 1993:522-525.Google Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.