- Research article
- Open Access
Transcriptomic changes due to water deficit define a general soybean response and accession-specific pathways for drought avoidance
BMC Plant Biology volume 15, Article number: 26 (2015)
Among abiotic stresses, drought is the most common reducer of crop yields. The slow-wilting soybean genotype PI 416937 is somewhat robust to water deficit and has been used previously to map the trait in a bi-parental population. Since drought stress response is a complex biological process, whole genome transcriptome analysis was performed to obtain a deeper understanding of the drought response in soybean.
Contrasting data from PI 416937 and the cultivar ‘Benning’, we developed a classification system to identify genes that were either responding to water-deficit in both genotypes or that had a genotype x environment (GxE) response. In spite of very different wilting phenotypes, 90% of classifiable genes had either constant expression in both genotypes (33%) or very similar response profiles (E genes, 57%). By further classifying E genes based on expression profiles, we were able to discern the functional specificity of transcriptional responses at particular stages of water-deficit, noting both the well-known reduction in photosynthesis genes as well as the less understood up-regulation of the protein transport pathway. Two percent of classifiable genes had a well-defined GxE response, many of which are located within slow-wilting QTLs. We consider these strong candidates for possible causal genes underlying PI 416937’s unique drought avoidance strategy.
There is a general and functionally significant transcriptional response to water deficit that involves not only known pathways, such as down-regulation of photosynthesis, but also up-regulation of protein transport and chromatin remodeling. Genes that show a genotypic difference are more likely to show an environmental response than genes that are constant between genotypes. In this study, at least five genes that clearly exhibited a genotype x environment response fell within known QTL and are very good candidates for further research into slow-wilting.
Soybean is a primary contributor to worldwide food production. Water deficit dramatically limits growth and yield in crop plants, particularly for soybean, and the problem will likely be exacerbated by climate change. Irrigation is costly and often not a viable option for many soybean farmers. According to the USDA Economic Research Service report, only 8% of the U.S. soybean acreage is irrigated (http://www.ers.usda.gov/). Therefore, the development of drought-tolerant cultivars is critical in order to reduce the impact of drought stress on soybean production.
From a soybean breeding perspective, cultivar development is limited by the narrow diversity of elite germplasm, particularly with regard to drought tolerance . Fortunately, a small number of land-races exhibit drought tolerance. One Japanese lace-race, PI 416937, retains yields in spite of drought  and was initially identified due to its slow-wilting phenotype. Further physiological characterization showed that PI 416937 has lower stomatal conductance , constant transpiration rate under vapor pressure deficit (VPD) above 2.0 kPa , and lower radiation use efficiency .
VPD is the difference between the water-vapor pressure in the air and the vapor pressure at which water-vapor condenses. At low VPD, dew forms, and, as VPD rises, plants transpire due to evaporation from the stomata. Interestingly, PI 46937 initially exhibits a conventional, linear increase in transpiration rate in response to VPD; yet, as the VPD continues to rise, the transpiration rate of PI 46937 stabilizes - a response that differentiates it from elite cultivars . Transpiration rate is reduced within 40 minutes after exposure to cycloheximide, a bacterially-derived compound which inhibits protein translation . This result indicates that symplastic/transcellular water pathway is maintained by continuous protein turnover. One explanation for PI 416937’s unique response to increased VPD is that the transcription of proteins mediating transpiration rate is being modulated relative to elite cultivars. To examine this possibility, we used deep sequencing of mRNAs (RNA-seq) to assay the transcriptomic response to water deficit in both PI 416937 and Benning, a common drought-sensitive cultivar.
Plant breeders are interested in identifying genes that confer drought-tolerance that can then be used for marker assisted selection. Since drought-tolerance is a highly complex trait, a whole-genome perspective is required. Still, previous attempts to understand drought tolerance using whole-genome transcript profiles often relied on the relative difference in pre- versus post-drought conditions for a single genotype . Observing the final product of an elaborate chain of transcriptional events does not easily translate to either a better understanding of the plant’s responses or to improved plant varieties. One way to focus the search for useful drought tolerance genes is to compare differential expression of genes between genotypes that exhibit varying levels of drought tolerance. Indeed, this has been done previously in soybean for a relatively uncharacterized soybean variety . While this study hinted genetic mechanisms that may confer drought resistance, the resistant variety used had not been extensively characterized in terms of its physiological response to water deficit, thus limiting the ability to connect genetic and physiological pathways. The study also illustrated the analytical difficulties of emphasizing only pairwise differences for samples that range across genotypes and environmental conditions. Here we apply a classification system to categorize genes based on the combination of genotypic and environmental response data. This approach allowed us to differentiate gene expression patterns that characterize a general soybean response from patterns that may be conferring PI 416937’s distinct transpiration rate profile. An additional benefit of comparing PI 416937 and Benning transcriptional profiles is that they are the parents for a mapping population previously used to identify slow-wilting QTL ; thus, genotypic differences in expression could be correlated to genetic polymorphisms segregating between the two lines.
PI 416937 exhibits a slow-wilting phenotype
As described in Methods, to create rapid water deficit, each genotype was gently removed from soil, washed, and exposed to constant ambient air for the remainder of the experiment. After 6 and 12 h of drying treatment, both genotypes did not show differences in wilting phenotype (Figure 1). However, the slow wilting genotype PI 416937 still maintained its shape whereas the fast wilting Benning was wrinkled and wilted after 24 hr of drying, clearly representing different levels of drought avoidance between two genotypes. After 36 hr, genotype PI 416937 also showed a wilting phenotype and Benning showed severe leaf curling.
Transcriptome data for sensitive and tolerant soybean genotypes is highly reproducible
A total of 24 samples comprised of two soybean genotypes without drying treatment (controls, 0 h) and imposed drought stress (6, 12, and 24 hr) were used for transcriptome sequencing using Illumina HiSeq2000 system (Table 1). One library of PI 416937 6 hr replicate 3 was lost during library preparation procedure, thus PI 416937 replicate 2 was sequenced twice. Hiseq 2000 sequencing resulted in from 9.5 million (M) to 26.4 M reads per sample. The reads for each biological replicate were mapped independently to the reference genome. There were no genes with significant differences at the transcriptional level between PI 416937 6 hr replicate 2 analyzed in two different lanes, showing that the sequencing reaction and subsequent analysis introduced very little error (Additional file 1). Moreover, across biological replicates, the number of gene models with no significant difference ranged from 99.10% and 99.98% (Additional file 1), indicating high reproducibility.
PI 416937 and Benning have similar transcriptional response to water deficit but exhibit numerous genotypic differences
We attempted to combine data across genotypes and time-points in order to classify these expression profiles of expression into biologically relevant categories. Our categories were based on varying degrees of genotypic versus environmental responses (Table 2 and ). Generally, the classification system took into account the coefficient of variation across time-points as well as the statistical significance as assessed by cuffdiff (see Methods).
Figure 2 illustrates gene expression profiles and their classification. G-only genes differed by genotype, but were relatively constant with regard to environmental change. E-only genes showed similar levels for both genotypes at individual time-points, but varied between time-points. G + E genes had both a genotypic difference and an environmental response. For GxE genes, the genetic background conditioned the environmental response. GxE genes had highly variable differences between the two genotypes at different time-points; for example, a GxE gene might have a log2 ratio FPKMBenning to FPKMPI-416937 of 1.2 at 6 hr, but a difference of 3.3 at 12 hr. These GxE were particularly interesting because they suggest the genes that might be mediating phenotypic differences in wilting response. Because of the highly stringent criteria used to define the above categories, there were many cases where ambiguous gene expression profiles clearly exhibited a response, but were undefined. We further categorized these genes depending on whether they exhibited an extreme environmental or genotypic response for at least one time point. These are defined with the ambiguous suffix in Table 2 and Figure 2 (see Methods for more details).
A large fraction of gene models were not tested due to lack of transcript from sampled tissues and conditions (Table 2). Thirty-three percent of classifiable genes (all genes except Untested, Low-expression, and Ambiguous) were expressed at constant levels regardless of drought stress or genotype. Even with the large number of genes showing constant expression, very few exhibited a G-only response: 1%, [100 * (G-only/(G-only + Constant)]. Indeed, 96% of classifiable genes that were differentially expressed between genotypes – G-only, GxE, G + E, and G + E-ambiguous - exhibited an environmental response. Therefore, assuming the ratio of GxE to G + E genes holds for the G + E-ambiguous category, genotype generally appears to interact with the environment in a nonlinear way. All genes are listed along with their categories and expression profiles (Additional file 2).
E gene profiles define a general soybean response
Because we used two diverse soybean genotypes in this study, we could postulate a generic transcription response of soybean to water deficit. In order to elucidate this response, we further characterized the expression profiles of genes that showed a shared environmental response but little (E-ambigous) or no (E-only) genotypic difference (Figure 2), which we refer to as E genes. We formalized eight models to represent the average expression profile of these genes (Figure 3C): Up-early, in which genes were expressed to their maximum level within the first 6 hrs; Up-linear, in which genes continually increased over the time-course; and Up-late, in which genes stayed constant till the 24 hr time-point. We similarly defined a Down-early, Down-linear, and Down-late. Peak and Trough expression patterns were either up-then-down or down-then-up, respectively, across the time-course. Note that the shape of the expression profile, not its absolute level, dictates its classification.
The fraction of up and down-regulated genes was similar (Figure 3A). Roughly half of the up-regulated genes exhibited a linear increase in expression. In contrast, the down-regulated genes were more evenly divided between early and linear responses. We additionally assessed the maximum magnitude relative to the control (0 hr) of all E-genes. Most genes had a range of between 1 and 3 log2 units (2 to 8-fold greater or less than 0 hr), but some exhibited very high changes in expression, on the order of 6 to 8 log2 units (Figure 3B). While there was the expected correlation between set size and range, both linearly and late down-regulated genes appear to change more extensively than up-regulated genes with similar profiles. Thus, on balance, the number of transcripts in the leaf should decline with time under drought.
We assessed each profile set separately for possible enrichment in functionally related genes. Using AgriGO, we found distinct and highly significant patterns of functional bias (Table 3). Indeed, the fact that these categories are quite distinct indicates that our choice to group E genes by expression profiles was generally valid. Genes associated with photosynthesis and lipid metabolism were rapidly reduced and remain low (Figure 3C). A distinct set of photosynthesis genes were also continually reduced across the time-course. Towards 24 hr, genes involved in translation were down-regulated, resulting in a general decline in cellular metabolism. On the other hand, protein transport genes were up-regulated rapidly and stayed at relatively high levels. As cell metabolism declined, proteolysis and autophagy genes were increasingly transcribed. No significant categories were associated with Up-late genes. This observation stands to reason as most cellular processes appeared to decline in activity as water deficit continued. Somewhat surprisingly, Peak genes also show little or no functional enrichment. Interestingly, a clear drop in the transcription of chromatin remodeling genes was observed at 6 to 12 hr; transcription returned to 0 hr levels at 24 hr time point. Both Peak and Trough genes may represent genes that are oscillating in circadian cycles, and have little to do with drought response. Chromatin remodeling genes generally appear to be constant regardless of time of day , suggesting that this response is a reaction to initial water deficit and downstream physiological symptoms.
We additionally assessed the GO enrichment of genes with very high-dynamic range in a category-wise fashion. These results generally bore out the functional enrichment analysis performed above, but were often less definitive (data not shown).
Genotypic differences in transcription
Given the utility of characterizing E gene profiles, we extended this analysis to GxE genes. In this case we included a ninth model, Constant, in addition to those described above. No E genes should be constant, so the Constant model was not applied to that group, whereas one of the two genotypes of a GxE gene might show constant expression across time points.
We initially characterized the relative frequency for each possible combination of environmental responses specific to each GxE gene (Figure 4A). The pattern observed deviates strongly from random expectation (p-value < 10−69, Chi-squared test). As shown, most combinations fall along the linear axis, indicating that, even for GxE genes, the basic environmental response is the same, differing only by magnitude at a particular point. Indeed, there are very few examples of up-regulation in one genotype and down-regulation in another. In terms of combinations that are enriched but do not fall on the linear axis, most of these are not far from the axis, indicating that, even when expression profiles are distinct, they are not dramatically different. The most aberrant combination involves genes that are down-regulated late in Benning and show up-regulation and then down-regulation, or ‘peak’ profiles, in PI 416937. In examining the profiles of these genes, we found that PI 416937 genes most commonly peaked at a much higher levels than the relatively constant Benning genes. Note, this did not have to be true, as a gene could start higher in Benning than PI 416937 and then decline late as in Glyma07g01940 (Figure 4B); the absolute value of a profile is normalized by the maximum expression value, thus only the shape of the profiles are considered. Though the number was too small for robust enrichment statistics, of the seven genes that did show a sharp peak in early expression in PI 41937, such as Glyma17g05520 or Glyma07g17361, most are annotated as being transcription factors or as having some regulatory function at the protein level.
One hypothesis to explain PI 416937’s slow-wilting response is that genes associated with water transport in PI 416937 have reduced expression during water deficit, thus reducing transpiration and facilitating water retention (see Introduction). Only a very small fraction of GxE genes that were down regulated in PI 416937 had strikingly different expression profiles in Benning. It is possible that the functionally significant changes in gene expression are not qualitative, such as differences in profile, but quantitative, as suggested by the sharp diagonal in Figure 4A. Thus, given that most GxE genes exhibited similar profiles, we looked for time points that were commonly differentiating the two genotypes.
Looking only at genes that had the same profile – that fell along the diagonal in Figure 4A – we analyzed the genotypic differences for each gene at each time-point (Figure 4C). For example, because the units of the y-axis are log2(FPKMPI 416937/FPKMBenning), positive values indicate that PI 416937 genes had higher expression than Benning genes at a given time-point. We observed that no particular time-point had a biased genotypic difference when considering all profiles regardless of profile type (‘All’ in Figure 4C). When we grouped genes based on up or down-regulation, we observed a small bias at the 6 hr time-point; in other words, for genes that were similarly down-regulated in both genotypes, PI 416937 genes were not down-regulated as substantially as Benning, particularly at 6 hr. It is possible that these genes represent, in effect, a delayed response to water deficit. Whether this response is causally related to resistance to wilting, or is merely a byproduct of undergoing less water deficit is unknown. The lack of any visual phenotype at this stage would suggest the former (Figure 1). Still, this observation is the opposite of what would be expected under a model in which PI 416937 differentially down-regulates expression of a subset of genes in order to reduce transpiration levels.
Genomic bias of GxE genes and known QTLs for slow canopy wilting
In our previous QTL study using 150 recombinant inbred lines (RILs) derived from a Benning and PI 416937 cross, seven QTL responsible for canopy wilting were identified. Of those, two and five favorable QTL alleles were found from Benning and PI 416937, respectively .
We compared the distributions of genes across the genome with those genes found within QTL intervals. There was no significant deviation from the expectation predicted by the genome-wide distribution (Figure 5). This finding is not surprising given that the QTL intervals are large and the majority of genes within a given interval are not expected to deviate sharply from the genome-wide distributions. Still, several genes within the known QTL have a clear GxE signal (Additional file 3 and Additional file 4) and are promising candidates for further investigation.
The distribution and/or expression levels of aquaporins are thought to be important in mediating PI 416937’s unique response to drought [4,6]. We additionally compared the categorical distribution of aquaporins to the genome-wide expectation (Figure 5). Though the sample is small, the distribution is significantly different than background (p-val < 0.05, Chi-squared test), indicating that aquaporins are more likely to respond transcriptionally to water deficit and also that they are more likely to have genotypic differences in their response. No aquaporin genes classified as being GxE-type genes fell within the known QTL interval.
Large-scale transcriptional reprogramming has long been interpreted as a mechanism of minimizing the effect of drought stress in plants [12,13]. The aim of this study was to identify a general response to drought stress in soybean and to compare differences at the transcriptional level between two accessions differing in canopy wilting phenotype. Although the drying treatment in this study is far from the actual drought stress under field conditions, it allowed us to measure transcriptional responses to water deficit, a major component of drought stress.
The majority of genes that we could confidently characterize a drought response were classified as E genes, indicating that they had roughly identical expression patterns for both genotypes (Figure 2 and Table 2). Prior to any noticeable phenotypic effect (Figure 1), dramatic transcriptional changes were occurring in both genotypes (Figure 2). While genes that are up or down-regulated late may be due to the physiological repercussions of canopy wilting, both early and linearly responsive genes are abundant (Figure 3) and likely responding to immediate water-deficit.
The most obvious response shared by sensitive and tolerant genotypes was down-regulation of photosynthesis related genes (Figure 3). There have been contradictory observations with regard to photosynthesis under drought stress, and this discrepancy is thought to be caused by differences in the severity of stress imposed on plants . When plants encountered mild or moderate drought stress, photosynthetic acclimation was observed [12,15-17]. In contrast, photosynthesis has been reported as one of the primary process to be adversely affected under severe drought [16-19]. Thus, our treatment appears to be simulating severe drought.
Another response shared by sensitive and tolerant genotypes was up-regulation of genes associated with autophagy and nutrient starvation. Autophagy is an essential protein degradation process induced by abiotic stresses such as starvation, drought, salt, pathogen, and oxidative stress [20,21]. Photosynthetic constraint is one cause of carbon starvation, and carbon starvation induces autophagy . The breakdown of oxidized proteins during oxidative stress and aggregated proteins in nutrient-starved cells can ensure cellular survival by maintaining cellular energy levels .
Prior to autophagy-related gene up-regulation, there was a rapid increase in genes involved in protein localization (Figure 3), primarily within the vesicular trafficking pathway. To our knowledge, this has not been observed in soybean, but has some precedent in Arabidopsis where up-regulation of related genes promoted osmotic stress tolerance . Interestingly, other reports in Arabidopsis have implicated the downregulation of vesicle-trafficking-related SNARE protein in salt tolerence ; suppression of the gene in roots suppressed the production of reactive oxygen species by preventing vesicle fusion with the tonoplast. The connection between salt and water stress is complex , but the above findings in conjuction with those presented here, indicate that the shoot and root are exhibiting very distinct vesicle-trafficking profiles.
Chromatin remodeling genes have an unusual Trough expression pattern in both genotypes (Figure 3 and Table 3). Chromatin regulation responses to drought, cold, and salinity stress have been described in Arabidopsis [26,27]. It was reported that the histone H3 modification correlates with gene activation of the drought stress-inducible genes, such as responsive to dehydration (RD) 29A, RD29B, and related to AP2.4 (RAP2.4) . Moreover some chromatin remodeling and modifying enzymes such as histone modification enzymes, linker histone H1, and components of chromatin remodeling complex have been shown to function in plant abiotic stress responses . The initial down-regulation of these genes may reflect the expansion of euchromatin associated with the major transcriptional reprogramming that is occurring even at early stages of water-deficit, while the late up-regulation counters this trend, returning much of the genome to heterochromatin, under extreme physiological stress .
We had strong evidence for the differential expression between genotypes for 2,138 transcripts for at least one time-point (Table 2). For 25% of these, we could say with confidence that the genotype was conditioning the environmental response (GxE genes in Table 2). Less than 4% of these genotypically different genes had a constant expression in both genotypes (G-only genes in Table 2). Note, this result is not predicted by the ratio of Constant to E-only genes (Table 2), suggesting that genes that differ between genotypes are generally disposed to be stress responsive. This stands to reason in that stress-response regimes are likely to be selected under unique local environmental conditions .
The three major categories enriched in GxE genes were photosynthesis, innate immune response, and apoptosis genes, with a FDR of 5.2E−06, 2.3E−07, and 4.9E−06, respectively. Photosynthesis genes were substantially down-regulated under drought stress in both soybeans, however, photosynthesis genes of tolerant soybean were less affected at an early stage (6 hr) of water-deficit (Additional file 5). This is supported by prior studies that showed lower decrease of net photosynthesis rate or chlorophyll content in tolerant versus sensitive genotype under salt or drought stress [26,30].
Perhaps more interesting are the innate immune response and apoptosis genes, which show dramatic differences between genotypes and across conditions. Immune response genes are also a major target of local adaptation and have been previously identified as eQTLs for differential drought response . Contrary to the expectation based on E-only profiles, apoptotic GxE genes are primarily down-regulated and vary most commonly in their initial expression levels (Additional file 6), indicating that physiological responses to wilting are not manifesting these differences. Still, the biochemical connection between water-deficit and apoptotic/immune response is tenuous, and functional enrichment in GxE categories may reflect overlapping local adaptations to stress in general, and not drought specifically. We anticipate that further fine-mapping studies will help resolve these questions.
To that end, one motivation for this study was the prior development of a genetic mapping population generated from a cross of the two lines assayed herein . We did not identify a significant relationship between genes within previously identified QTL regions and GxE genes (Figure 5). Additionally, the region containing the strongest QTL, qSW-Gm12, with an R2 of 0.27 , did not have a significant enrichment in GxE or G + E genes (not shown). This result is not unexpected given that the QTL are not particularly well resolved and they could be mediating differences in slow-canopy wilting through any number of mechanisms [32,33]. Still, each of the QTL regions did contain GxE genes, and we propose these genes to be prime targets for fine-mapping, particularly those that have strikingly distinct expression profiles and act early in water deficit (Additional file 3 and Additional file 4).
The large majority of GxE genes exhibited quantitative differences in expression levels at particular points rather than qualitatively different profiles (Figure 4A). The exception to this trend was a small group of regulatory proteins that peaked in PI 416937 and remained relatively low and constant in Benning until 24 hr (Figure 4B). Though none of these genes fell directly within the range specified by the QTL mapping discussed above, chromosomes 5 and 17 contain QTLs nearby two of the most striking GxE profiles, Glyma02g45280 and Glyma17g05520. These genes are another set of promising leads in identifying solutions to problems posed by drought.
Drought reduces yield in all crops, particularly soybeans. The response to drought is biochemically complex and entails major changes in gene expression. To that end, genome-wide expression data can be useful in improving plants to be robust to drought. However, it is difficult for plant researchers and breeders to employ genome-wide data because the results, in isolation, are often impressionistic and the experimental design does not focus on refining genomic loci that are causally underlying phenotypic variation. Here we used two relevant breeding lines, Benning and PI 416937 that have been used previously by our group as parents in a mapping population. These two lines exhibit strikingly different wilting responses, as shown here and in previous work, and their progeny were used to identify QTL underlying the slow-canopy wilting trait. We could therefore compare genes that have strikingly different profiles between genotypes with these QTL in order to resolve those QTL further and to understand their functions. To facilitate this comparison, we also developed a computational pipeline that allowed us to characterize the transcriptional response of each gene based on observations across the entire time-course and between the two genotypes. This approach allowed us to differentiate between genes that form a shared response and those that distinguish genotypes.
Taken together, we feel this study offers the following insights: 1) There is a general and functionally significant transcriptional response to water deficit that involves not only known pathways, such as down-regulation of photosynthesis, but also up-regulation of protein transport and chromatin remodeling; 2) Genes that show a genotypic difference are more likely to show an environmental response than genes that are constant between genotypes; 3) At least five genes that clearly exhibited a GxE response fell within the known QTL and are very good candidates for further research into slow-canopy wilting.
Plant materials and drought stress treatment
Both Benning (drought sensitive, elite US soybean cultivar) and PI 416937 (drought tolerant, Japanese landrace) were planted in the greenhouse on June 18, 2012 with 12/12 hours light/dark regime. At the R2 stage of flowering (September 7, 2012), plants were removed from pots, roots were washed, and the whole plants exposed to air. After 0, 6, 12, and 24 hr intervals, leaves were collected from both Benning and PI 416937 with three biological replicates, frozen in liquid nitrogen and stored at −80°.
Total RNA extraction and library preparation
Tissues were ground under liquid nitrogen. The total RNA from leaf tissues was extracted using Trizol reagent (Invitrogen) and RNA-Seq libraries were prepared using TruSeq RNA Sample Prep Kits (Illumina) according to the manufacture’s recommendations. RNA-Seq libraries were constructed from two genotypes, four treatment time (0, 6, 12, and 24 hr), and three biological replicates. All libraries were barcoded using 24 index adapters, quantified using Bioanalyzer DNA 1000 Chip (Agilent Technology 2100 Bioanalyzer) and normalized to 10 nM.
RNA sequencing and sequence analysis
All libraries were sequenced using the HiSeq2000 at the Genomics and Microarray Core at the University of Colorado Denver. Three lanes of HiSeq were used and each biological replicates was sequenced in different lanes according to proper blocking and randomization procedures . Libraries were pooled equimolarly. Using TopHat2 (version 2.0.8b), sequencing reads (Table 1) were aligned to the Glyma 1.1 transcriptome (54,175 genes; 73,320 transcripts) determined by the Soybean Genome Project, US Department of Energy, Joint Genome Institute  (http://www.phytozome.net/soybean.php). Technical replicates (PI 416937 6 h replicate 2 and 3) were merged prior to testing the transcriptome variances. Mapped reads were provided as an input to Cufflinks (version 2.1.1) for transcript assembly, and then, differential expression of genes between genotypes and between treatments were analyzed by CuffDiff . After consolidation of identical genes, 53,645 genes were included in the Cuffdiff analysis. Gene abundance is given as fragments per kilobase of exon per million fragments mapped (FPKM). Differential expression statistics were based on the log2 FPKM ratios .
Classifier system for genotypic response profiles
All genes are classified via the following set of rules. For brevity, the following acronyms are used:
GD, Genotypic difference at a single time-point calculated as log2 FPKM ratio of Benning genotype to PI 416937. In other words, a GD > 2 indicates a more than 4-fold difference in mean genotypic expression levels at a single time-point.
ED, Environmental difference calculated as log2 FPKM ratio of 0 hr to 6, 12, or 24 hr for a given genotype. All genes have a 0 hr ED of 0.
SGD, Significant GD with q-value <0.05.
SED, Significant ED with q-value <0.05)
CV + , 7 out of 8 time-points (4 time-points × 2 genotypes) had a coefficient of variation of <0.4 across the 3 replicates.
SS + , Same sign for GDs; not a mixture of positive and negative GDs.
The classification key is described below as pseudo-code using underlined control words ‘IF’, ‘THEN’, and ‘ELSE IF’:
IF No time-point in either genotype had an average FPKM of >1 THEN call Untested
ELSE IF the highest FPKM of any time-point was <4 & the average FPKM across all time-points was <2 THEN call Low-expression
ELSE IF CV+ & at least one SGD was >2.8-fold the GD at another time-point THEN call GxE
ELSE IF CV+ & ≥3 SGDs & both genotypes have ≥1 SEDs & SS+ THEN call G + E
ELSE IF CV+ & ≤1 SGDs & both genotypes have ≥1 SEDs THEN call E-only
ELSE IF CV+ & 0 SEDs for at least one genotype & ≥3 SGDs & SS+ THEN call G-only
ELSE IF CV+ & 0 SEDs for at least one genotype & ≤1 SGDs & no average ED >2 THEN call Constant
ELSE IF At least one average ED >2 & ≥1 SGD had a q-value <0.001 THEN call G + E-ambiguous
ELSE IF At least one average ED >2 THEN call E-ambiguous
ELSE call Ambiguous
Of note, log scores are used in expression data because fold changes in expression are more meaningful than absolute changes. Yet, as commonly occurs in expression data, expression of one gene may be, for example, 0 at 0 hr and 50 at 6 hr. Though this is an important result with regard to our biological questions, it presents conceptual and, certainly mathematical, problems. For our purposes we used the following heuristic: if expression (FPKM) was <1 for one entry and >5 for the compared entry, then we gave the difference a score of 5, which represents an upper limit for the expression range across the time-course (Figure 3B). Alternatively, if the entry was <1 and the compared entry was <5, then we gave it a score of 0. An identical approach was taken for comparisons that would result in a negative score. For range calculations (Figures 3B and 4C), such values were ignored.
Defining water-deficit response curves
The shape of expression profiles were characterized further. ED values in a time-course were normalized by the maximum ED. These profiles were then compared to nine explicit models, described graphically in Figure 3C. Explicit models are given as Additional file 7. The profile was classified as the model with the lowest sum of squared differences between model and observation for the given time-points.
We use AgriGO web service to perform GO enrichment analysis on specific expression groups using Singular Enrichment Analysis . Each analysis used ‘Glycine max v1.1’ as the species and ‘Soybean genome locus (phytozome v1.1)’ as the reference.
Availability of supporting data
The raw data sets supporting the results of this article are available in the Short Read Archive under BioProject accession: PRJNA259941 (http://0-www.ncbi.nlm.nih.gov.brum.beds.ac.uk/bioproject/?term=PRJNA259941).
Vapor pressure deficit
Quantitative trait locus
Recombinant inbred line
Fragments per kilobase of transcript per million mapped reads
Sneller C, Dombek D. Use of irrigation in selection for soybean yield potential under drought. Crop Sci. 1997;37(4):1141–7.
Sloane RJ, Patterson RP, Carter TE. Field drought tolerance of a soybean plant introduction. Crop Sci. 1990;30(1):118–23.
Tanaka Y, Fujii K, Shiraiwa T. Variability of Leaf Morphology and Stomatal Conductance in Soybean [Glycine max (L.) Merr.] Cultivars All rights reserved. Crop Sci. 2010;50(6):2525–32.
Fletcher ALR, Jr ALH ST. Transpiration responses to vapor pressure deficit in well watered’slow-wilting’ and commercial soybean. Environ Exp Bot. 2007;61:145–51.
Ries LL, Purcell LC, Carter TE, Edwards JT, King CA. Physiological traits contributing to differential canopy wilting in soybean under drought. Crop Sci. 2012;52(1):272.
Sadok W, Sinclair TR. Transpiration response of ‘slow-wilting’ and commercial soybean (Glycine max (L.) Merr.) genotypes to three aquaporin inhibitors. J Exp Bot. 2010;61(3):821–9.
Le DT, Nishiyama R, Watanabe Y, Tanaka M, Seki M, Ham LH, et al. Differential gene expression in soybean leaf tissues at late developmental stages under drought stress revealed by genome-wide transcriptome analysis. PLoS One. 2012;7(11):e49522.
Chen LM, Zhou XA, Li WB, Chang W, Zhou R, Wang C, et al. Genome-wide transcriptional analysis of two soybean genotypes under dehydration and rehydration conditions. BMC Genomics. 2013;14:687.
Abdel-Haleem H, Carter Jr TE, Purcell LC, King CA, Ries LL, Chen P, et al. Mapping of quantitative trait loci for canopy-wilting trait in soybean (Glycine max L. Merr). Theor Appl Genet. 2012;125(5):837–46.
Grishkevich V, Yanai I. The genomic determinants of genotype x environment interactions in gene expression. Trends Genet. 2013;29(8):479–87.
Covington MF, Maloof JN, Straume M, Kay SA, Harmer SL. Global transcriptome analysis reveals circadian regulation of key pathways in plant growth and development. Genome Biol. 2008;9(8):R130.
Thimm O, Bläsing O, Gibon Y, Nagel A, Meyer S, Krüger P, et al. Mapman: a user-driven tool to display genomics data sets onto diagrams of metabolic pathways and other biological processes. Plant J. 2004;37(6):914–39.
Yamaguchi-Shinozaki K, Shinozaki K. Organization of cis-acting regulatory elements in osmotic- and cold-stress-responsive promoters. Trends Plant Sci. 2005;10(2):88–94.
Des Marais DL, McKay JK, Richards JH, Sen S, Wayne T, Juenger TE. Physiological genomics of response to soil drying in diverse Arabidopsis accessions. Plant Cell. 2012;24(3):893–914.
Yamaguchi-Shinozaki K, Shinozaki K. A novel cis-acting element in an Arabidopsis gene is involved in responsiveness to drought, low-temperature, or high-salt stress. Plant Cell. 1994;6(2):251–64.
Watkinson JI, Sioson AA, Vasquez-Robinet C, Shukla M, Kumar D, Ellis M, et al. Photosynthetic acclimation is reflected in specific patterns of gene expression in drought-stressed loblolly pine. Plant Physiol. 2003;133(4):1702–16.
Chaves MM, Flexas J, Pinheiro C. Photosynthesis under drought and salt stress: regulation mechanisms from whole plant to cell. Ann Bot. 2009;103(4):551–60.
Zhou J, Wang X, Jiao Y, Qin Y, Liu X, He K, et al. Global genome expression analysis of rice in response to drought and high-salinity stresses in shoot, flag leaf, and panicle. Plant Mol Biol. 2007;63(5):591–608.
Degenkolbe T, Do P, Zuther E, Repsilber D, Walther D, Hincha D, et al. Expression profiling of rice cultivars differing in their tolerance to long-term drought stress. Plant Mol Biol. 2009;69(1–2):133–53.
Liu Y, Xiong Y, Bassham DC. Autophagy is required for tolerance of drought and salt stress in plants. Autophagy. 2009;5(7):954–63.
Vásquez-Robinet C, Watkinson JI, Sioson AA, Ramakrishnan N, Heath LS, Grene R. Differential expression of heat shock protein genes in preconditioning for photosynthetic acclimation in water-stressed loblolly pine. Plant Physiol and Biochem. 2010;48(4):256–64.
Han S, Yu B, Wang Y, Liu Y. Role of plant autophagy in stress response. Protein Cell. 2011;2(10):784–91.
McDowell NG. Mechanisms linking drought, hydraulics, carbon metabolism, and vegetation mortality. Plant Physiol. 2011;155(3):1051–9.
Mazel A, Leshem Y, Tiwari BS, Levine A. Induction of salt and osmotic stress tolerance by overexpression of an intracellular vesicle trafficking protein AtRab7 (AtRabG3e). Plant Physiol. 2004;134(1):118–28.
Leshem Y, Melamed-Book N, Cagnac O, Ronen G, Nishri Y, Solomon M, et al. Suppression of Arabidopsis vesicle-SNARE expression inhibited fusion of H2O2-containing vesicles with tonoplast and increased salt tolerance. Proc Natl Acad Sci U S A. 2006;103(47):18008–13.
Colom MR, Vazzana C. Photosynthesis and PSII functionality of drought-resistant and drought-sensitive weeping lovegrass plants. Environ Exp Bot. 2003;49(2):135–44.
Han S-K, Wagner D. Role of chromatin in water stress responses in plants. J Exp Bot. 2014;65(10):2785–99.
Kim J-M, To TK, Nishioka T, Seki M. Chromatin regulation functions in plant abiotic stress responses. Plant Cell Environ. 2010;33(4):604–11.
Kim JM, To TK, Ishida J, Matsui A, Kimura H, Seki M. Transition of chromatin status during the process of recovery from drought stress in Arabidopsis thaliana. Plant Cell Physiol. 2012;53(5):847–56.
Lu KX, Cao BH, Feng XP, He Y, Jiang DA. Photosynthetic response of salt-tolerant and sensitive soybean varieties. Photosynthetica. 2009;47(3):381–7.
Lowry DB, Logan TL, Santuari L, Hardtke CS, Richards JH, DeRose-Wilson LJ, et al. Expression quantitative trait locus mapping across water availability environments reveals contrasting associations with genomic features in Arabidopsis. Plant Cell. 2013;25(9):3266–79.
Abdel-Haleem H, Lee GJ, Boerma RH. Identification of QTL for increased fibrous roots in soybean. Theor Appl Genet. 2011;122(5):935–46.
King CA, Purcell LC, Brye KR. Differential wilting among soybean genotypes in response to water deficit. Crop Sci. 2009;49(1):290–8.
Auer PL, Doerge RW. Statistical design and analysis of RNA sequencing data. Genetics. 2010;185(2):405–16.
Goodstein DM, Shu S, Howson R, Neupane R, Hayes RD, Fazo J, et al. Phytozome: a comparative platform for green plant genomics. Nucleic Acids Res. 2012;40(D1):D1178–86.
Trapnell C, Roberts A, Goff L, Pertea G, Kim D, Kelley DR, et al. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nat Protoc. 2012;7(3):562–78.
Du Z, Zhou X, Ling Y, Zhang Z, Su Z. agriGO: a GO analysis toolkit for the agricultural community. Nucleic Acids Res. 2010;38(Web Server issue):W64–70.
Funding for this research was provided by the United Soybean Board. We thank Dr. Roger Boerma for his advice on the manuscript.
The authors declare that they have no competing interests.
JHS performed sequence analysis, interpreted the results, and wrote the manuscript. JNV performed expression analysis, interpreted the results, and wrote the manuscript. HAH performed drought treatment and prepared RNA samples, and CC prepared library for RNA-seq. BA and KDK provided assistance in the data analysis, management, and data interpretation. ZL and SAJ designed the experiment and provided guidance of the study as well as the manuscript. All authors have read and approved the final manuscript.
Jin Hee Shin and Justin N Vaughn contributed equally to this work.
Gene models with no significant difference at the transcriptional level between biological replicates.
An Excel spreadsheet containing all genes, their expression categories, and log 2 ratios for genotypic difference and environmental responses. 'Untested' genes are not included.
Profiles of GxE and G+E genes that overlap previously identified QTL. See Figure 2 for details.
Genes that overlap previously identified QTL and exhibit GxE or G+E expression profiles.
Profiles of photosynthesis genes exhibiting GxE patterns. See Figure 2 for details.
Explicit models used to characterize the expression profile of a particular gene. A value of 1 represents the maximum expression level relative to control.
About this article
Cite this article
Shin, J.H., Vaughn, J.N., Abdel-Haleem, H. et al. Transcriptomic changes due to water deficit define a general soybean response and accession-specific pathways for drought avoidance. BMC Plant Biol 15, 26 (2015) doi:10.1186/s12870-015-0422-8
- Drought stress
- Glycine max
- Quantitative trait loci (QTL)
- Genotype x environment