- Research article
- Open Access
Genome-wide annotation of the soybean WRKY family and functional characterization of genes involved in response to Phakopsora pachyrhiziinfection
BMC Plant Biology volume 14, Article number: 236 (2014)
Many previous studies have shown that soybean WRKY transcription factors are involved in the plant response to biotic and abiotic stresses. Phakopsora pachyrhizi is the causal agent of Asian Soybean Rust, one of the most important soybean diseases. There are evidences that WRKYs are involved in the resistance of some soybean genotypes against that fungus. The number of WRKY genes already annotated in soybean genome was underrepresented. In the present study, a genome-wide annotation of the soybean WRKY family was carried out and members involved in the response to P. pachyrhizi were identified.
As a result of a soybean genomic databases search, 182 WRKY-encoding genes were annotated and 33 putative pseudogenes identified. Genes involved in the response to P. pachyrhizi infection were identified using superSAGE, RNA-Seq of microdissected lesions and microarray experiments. Seventy-five genes were differentially expressed during fungal infection. The expression of eight WRKY genes was validated by RT-qPCR. The expression of these genes in a resistant genotype was earlier and/or stronger compared with a susceptible genotype in response to P. pachyrhizi infection. Soybean somatic embryos were transformed in order to overexpress or silence WRKY genes. Embryos overexpressing a WRKY gene were obtained, but they were unable to convert into plants. When infected with P. pachyrhizi, the leaves of the silenced transgenic line showed a higher number of lesions than the wild-type plants.
The present study reports a genome-wide annotation of soybean WRKY family. The participation of some members in response to P. pachyrhizi infection was demonstrated. The results contribute to the elucidation of gene function and suggest the manipulation of WRKYs as a strategy to increase fungal resistance in soybean plants.
Soybean (Glycine max) is one of the most important crops in the world. At present, one of the major diseases affecting soybean production is Asian Soybean Rust (ASR), which results from infection with Phakopsora pachyrhizi. Under conditions that are favorable for fungal propagation, infection results in yield losses ranging from 10 to 80% -.
Three infection types have been described on soybean accessions inoculated with P. pachyrhizi: (1) susceptible reaction characterized by "tan" lesions with many uredinia and prolific sporulation; (2) resistant reaction typified by reddish brown lesions with few uredinia and little to moderate sporulation; and (3) resistant reaction with no visible lesions or uredinia, conferring the immune phenotype ,. Six single dominant genes (Rpp1 to Rpp6) conditioning soybean resistance and/or immunity to P. pachyrhizi have been identified so far -. The effectiveness of these genes is limited through virulent ASR isolates that are able to overcome the resistance mechanism conferred by each of them ,. For this reason, the most successful method to control fungal spread is the application of fungicides, which are costly and have a negative impact on the environment, favor a selection of pathogen resistance and, in severe cases, are ineffective . In this context, understanding the molecular basis of the soybean defense against fungal infection and growth, identifying genes involved in susceptible or resistant response and characterizing their individual roles are key steps for engineering durable and quantitative disease resistance. Therefore, genetic transformation represents a powerful tool for functional studies.
Many studies have implicated a role for soybean WRKY transcription factors in the response to P. pachyrhizi infection -. WRKY genes might regulate the expression of defense genes, modulating immediate downstream target genes or activating/repressing other transcriptional factors .
WRKY transcription factors comprise one of the largest families of regulatory proteins in plants. Previous studies have identified 72 WRKY-encoding genes in Arabidopsis, approximately 100 members in rice -, 104 in poplar , 86 in Brachypodium distachyon, 80 in grape  and 116 and 102 genes in two different species of cotton . A genome-wide analysis in primitive eukaryotes  revealed the widespread occurrence of WRKY proteins.
The most prominent feature of these proteins is the WRKY domain, which is a highly conserved 60 amino acid region hallmarked by the heptapeptide WRKYGQK followed by a C2H2- or C2HC zinc-finger motif. As deduced from the results of a nuclear magnetic resonance analysis of a WRKY domain of AtWRKY4, the conserved WRKYGQK sequence is directly involved in DNA binding , but the zinc finger motif is also required . Most of the well-characterized WRKY proteins bind to the W-box element (C/T)TGAC(C/T) in the promoter region of the target genes . The specificity of the binding site is partly dependent on the DNA sequences adjacent to the W-box core, and the involvement of WRKY factors in protein complexes might be the major criteria in determining promoter selectivity .
The identification of 64 WRKY genes expressed in various soybean tissues and in response to abiotic stress was previously assessed using RT-PCR . However, due to the unavailability of the complete soybean genome sequence at that time, the number of members of this gene family was underrepresented. Yin et al.  identified 133 WRKY members in soybean genome. Now a day, several databases for soybean genome analysis are publicly available. PlantTFDB  SoyDB  and SoyTFKB  are transcription factor databases which contain valuable information, including protein sequence, protein domains, predicted tertiary structures and links to external databases. However, despite the usefulness, these databases have performed systematic annotations resulting in different numbers of soybean WRKY transcription factors and some incorrect gene models. So, until now, there is no a comprehensive curate list of soybean WRKY genes. Besides, there is inconsistent nomenclature for soybean WRKY members in the literature. The Phytozome database (http://www.phytozome.org) assigns names from Arabidopsis orthologs, while Zhou et al.  identified 64 soybean WRKY genes (deposited in http://0-www.ncbi.nlm.nih.gov.brum.beds.ac.uk/) and randomly assigned a number to each gene. Moreover, studies of the individual genes , have assigned numbers different from those proposed by Zhou et al. . The present study reports a genome-wide annotation of the WRKY family in soybean and a functional analysis of some genes involved in response to P. pachyrhizi infection.
Annotation and in silico characterization
In total, 182 potentially WRKY-encoding genes were identified and annotated in the present work (Table 1 and Additional file 1). Additionally, a total of 33 putative WRKY pseudogenes were found (Additional file 2). Some of them were identified in our search and other ones were previously described in the USM data set . Transcripts for 152 annotated WRKY genes were detected on SoyBase EST database (http://soybase.org/) and/or on five global expression experiments: SuperSAGE of soybean leaves 12, 24 and 48 hours after inoculation (hai) of P. pachyrhizi, RNA-Seq of microdissected lesions 10 days after inoculation of P. pachyrhizi, two different microarrays of leaves 12 and 120 hai of P. pachyrhizi (available in the current literature) and RNA-Seq expression data of healthy plants in different developmental stages , available at SoyBase . The GmWRKY genes were distributed over the 20 soybean chromosomes with protein sequences ranging from 121 to 1,356 amino acids in length (Table 1 and Additional file 1). There was an average of 9.1 WRKY genes per chromosome, with the highest number of genes (15 genes) located on chromosome 6.
The proteins were assigned to three major groups and subgroups in accordance with Eugelm et al. . Group I, II and III contained 31, 126 and 25 soybean WRKY genes, respectively (Table 1 and Additional file 1). A total of 13, 33, 42, 16 and 22 proteins were assigned to subgroups IIa, IIb, IIc, IId and IIe, respectively.
Although the WRKYGQK signature was highly conserved in the soybean WRKYs, 15 proteins with amino acid substitutions in the signature of the C-terminal domain were identified. These variant proteins were distributed among all groups, except subgroup IId. WRKYGKK was the most common variant and was shared by 11 genes. Other atypical sequences, such as WRKYGEK, WRKYEDK, WKKYGQK, CRKYGQK and WHQYGLK, occurred in single proteins. Nine WRKY proteins contained incomplete and/or amino acid substitutions in the zinc-finger sequence (Table 1 and Additional file 1). Some of these proteins contained patterns of zinc-finger motifs that have not been reported in the literature. Expression was detected for nine genes presenting modifications in the WRKY signature and for six genes with modifications in the zinc-finger motif, indicating that these genes might be functional. Moreover, another highly conserved domain, the zinc cluster, was identified upstream of the WRKY domain in IId gene members.
The phylogenetic approach performed with the WRKY domain sequences confirmed the division of GmWRKY members in the five groups (I, IIa + IIb, IIc, IId + IIe and III) (Figure 1 and Additional file 3). These groups correspond to the WRKY domain classification (groups and subgroups I, IIa, IIb, IIc, IId, IIe and III) that has already been demonstrated in other studies. Genes from Group IIa are closely related with those from Group IIb, while genes from Group IId are closely related with those from Group IIe.
Gene expression data
An overview of the differential expressed soybean WRKY genes that were modulated in response to P. pachyrhizi infection is presented in Table 2 and Additional file 4. The expression data were obtained from four global expression experiments: SuperSAGE of leaves 12, 24 and 48 hours after inoculation (hai), RNA-Seq of microdissected lesions 10 days after inoculation and two different microarrays of leaves 12 and 120 hai, available in the current literature ,. Seventy-five genes showed differential expression in at least one experiment, whereas 16 genes showed differential expression in more than one experiment. Genes from groups I, II and III responded to this stress condition.
Some of the genes that presented differential expression profiles in response to the fungus were randomly selected from each classification group for more detailed analyses. GmWRKY27 (Glyma15g00570) and GmWRKY125 (Glyma09g41050) were differentially expressed in three of the four experiments, while GmWRKY56 (Glyma08g23380), GmWRKY106 (Glyma07g02630) and GmWRKY20 (Glyma08g02580) in the two microarrays. GmWRKY139 (Glyma13g44730), GmWRKY46 (Glyma05g36970), GmWRKY57 (Glyma18g44560) were also analyzed because they were closely related to at least one of the genes evaluated above. Interestingly, none of these genes was expressed in rust infection lesions at ten days after fungus inoculation (RNA-Seq).
The differential expression of these genes was confirmed using RT-qPCR. The transcript levels during the course of fungus infection in a resistant genotype (PI561356) and in a susceptible genotype (Embrapa-48) were compared with those in the mock-inoculated plants (Figure 2).
The interaction among the genotypes, time-course and pathogen presence was highly significant (p < 0.0001). In the inoculated plants, the eight genes showed early expression in PI561356 (resistant) compared with Embrapa 48 (susceptible). In the Embrapa 48, the expression peaks were higher at 24 and/or 96 hai, while in PI561356, these peaks varied from one to 24 hai. Furthermore, GmWRKY56, GmWRKY106, GmWRKY20 and GmWRKY125 presented a stronger response in the resistant genotype. Interestingly, the homologous genes (GmWRKY27 and GmWRKY139, GmWRKY125 and GmWRKY57) did not overlap with their expression peaks in the resistant genotype. GmWRKY27 and GmWRKY57 showed higher expression levels at one hai followed by a decrease in expression, whereas GmWRKY139 and GmWRKY125 presented higher transcript levels at 12 hai.
GmWRKY27 overexpression and silencing in soybean plants
GmWRKY27 was selected for further functional characterization because it was one of the genes that showed differential expression in different experiments. Furthermore, it was also shown that this gene is involved in different abiotic stresses . To determine the functional role of the GmWRKY27 in response to P. pachyrhizi infection, soybean somatic embryos were transformed to obtain gene overexpression and silencing. In the overexpression experiments, GFP expression was detected in hygromycin-resistant globular embryos (Additional file 5A and B). The histodifferentiated embryos of nine independent transgenic lines (seven from Biobalistic and two from bombardment/Agrobacterium) were obtained. The presence of the T-DNA in the embryo genomes was confirmed using PCR, and the GmWRKY27 expression was significantly higher in the embryos of the four independent transgenic lines (Additional file 5C). However, the development of transgenic embryos overexpressing GmWRKY27 was not successful. As a consequence, those embryos were not able to develop into plants.
For gene silencing, a vector carrying a 176-bp inverted-repeat fragment sequence from GmWRKY27 was constructed. This fragment shared 83% similarity with the homologous region of GmWRKY139 and 70% and 67% similarity with GmWRKY56 and GmWRKY106 respectively. These data confirm the close relationship among the genes, which was also observed in the phylogenetic analysis (Figure 1). This high sequence similarity suggests that the silencing construct would target the four genes.
A more detailed structural analysis of the four homologous genes showed that the WRKYGQK signature, zinc-finger motif and other residues in the sequences were highly conserved among the four corresponding proteins (Figure 3A). The sequence identity of the complete proteins varied from 66% to 94% (Table 3). The four soybean genes were putative orthologs of AtWRKY40, AtWRKY18 and AtWRKY60 Arabidopsis genes, as shown in the phylogenetic tree (Additional file 3). The gene structure of GmWRKY27, GmWRKY139, GmWRKY56 and GmWRKY106 was similar, with the WRKY domain present in the fourth exon (Figure 3B). Interestingly, GmWRKY56 had four alternative transcripts, and one of the transcripts lacked the WRKY domain.
Two independent transgenic lines (cultivar BRSMG 68 Vencedora) carrying the silencing construct were obtained. The molecular analysis revealed that one of the repeats (176-bp fragment) was eliminated from the first line. Therefore, the post-transcriptional silencing was not triggered, which was confirmed using RT-qPCR (data not shown). In the second transgenic line (P3-2) the complete cassette was successfully integrated (data not shown). As anticipated, the results from the RT-qPCR analysis showed that the expression of the four homologous genes was significantly reduced (Figure 4). The transgenic line exhibited no major phenotypic alterations.
The silenced line was shown to be more susceptible to P. pachyrhizi
A detached leaf assay was performed to confirm the involvement of GmWRKY27, GmWRKY139, GmWRKY56 and GmWRKY106 in the soybean response to P. pachyrhizi infection. As previously described, detached leaf and intact plant bioassays revealed a high correlation . In the present study, "tan" lesions could be observed on all detached leaves of both transgenic and wild type samples at 12 days after P. pachyrhizi inoculation. However, the number of lesions was significantly higher in the leaves of the transgenic line (Figure 5). No visible differences were observed concerning the appearance of the lesions and pustule formation or eruption (data not shown).
Soybean WRKY genes
Whole genome sequencing  has facilitated the accurate annotation of soybean gene families. In this study, we present the annotation of 182 WRKY transcription factors in soybean. The transcripts of 152 genes were detected, suggesting they can be expressed at the protein level; however, specific conditions might be necessary for the successful transcription of the remaining genes.
As discussed before, there is inconsistent nomenclature for soybean WRKY members in the literature. To unify the terminology, we proposed a nomenclature based on the previously described WRKY-encoding genes , with some modifications. Data from sequence comparisons have shown that GmWRKY18 and GmWRKY35 is the same gene. In addition, GmWRKY3 does not exist in the soybean genome; indeed, this sequence represents a chimeric transcript produced through trans-splicing between N-terminal and C-terminal sequences from Glyma02g46690 and Glyma14g01980, respectively. The remaining 118 genes were numbered according to the order of the chromosomes (Table 1 and Additional file 1).
More WRKY genes have been identified in soybean than in other species, such as rice, Arabidopsis, cotton, grape and B. distachyon-. The duplication events have been greatly over-retained, specifically in the case of transcription factors . Thus, functional redundancy is a common feature in plant species. However, homologous genes might diverge in function providing a source of evolutionary novelty .
In soybean, the members of group I contained domains with a C2H2-type zinc-finger motif. The same characteristic is observed in Arabidopsis, while in rice, the WRKY domains of group I members include two types of zinc-finger motifs: C2H2 and C2HC ,.
Although the WRKYGQK signature was highly conserved among soybean WRKY proteins, as illustrated in Figure 6, variation was identified in 21 genes. Zhou et al.  previously showed that GmWRKY6 (Glyma08g15050) and GmWRKY21 (Glyma04g39650) contain the variant WRKYGKK rather than the conserved WRKYGQK motif. Slight variations in this region have also been reported in Arabidopsis, rice, tobacco, barley, canola and sunflower ,,-. Compared with Arabidopsis, which contains four WRKYGKK variants, the number of genes with a modified WRKYGQK motif is greater in soybean.
Some unusual GmWRKY-encoding genes (i.e., containing a modified WRKY signature and/or zinc-finger motif) produced mRNA (Table 2 and Additional file 4). Further analyses are necessary to determine whether these genes function as transcription factors or if they induce post-transcriptional regulation through RNAi, as previously suggested . Variant proteins might have abolished or decreased capacities to bind to the W-box ,. It has been suggested that WRKY proteins without the canonical WRKYGQK motif might have different binding sites ,, target genes and possibly divergent roles .
Despite the fact that the identification or prediction of many WRKY genes from different species has been previously achieved, only a small number of these have been functionally characterized. Information concerning the role of soybean genes (Glyma13g00380-GmWRKY13, Glyma04g39650-GmWRKY21, Glyma10g01450-GmWRKY54 and Glyma18g44560-GmWRKY57) during abiotic stress has been based on data obtained from heterologous expression systems ,. The data from expression analyses , or using transient gene silencing  supports a role for the WRKY genes in response to biotic stresses. Studies concerning global expression profiling have demonstrated the importance of WRKY-encoding genes in transcriptional reprogramming during P. pachyrhizi infection in soybean plants -.
To determine which soybean WRKY genes are involved in plant defense against P. pachyrhizi infection, we performed a series of analyses to examine their expression patterns after infection. We initially compared the microarray data available in the literature , with the results obtained from two additional experiments: SuperSAGE and RNA-Seq. Many genes were differentially expressed in only one library, while a few of them showed differential expression in more than one library. The modulation in the transcript levels of eight genes was validated, showing the reliability of data mining. The similar expression patterns in response to P. pachyrhizi infection was observed among closely related genes (Figure 1), such as GmWRKY61 (Glyma06g15220) and GmWRKY21 (Glyma04g39650), GmWRKY143 (Glyma14g11920) and GmWRKY63 (Glyma17g33920), GmWRKY106 and GmWRKY56, GmWRKY58 (Glyma04g40130) and GmWRKY97 (Glyma06g14720). This similar expression pattern suggests that these genes might share similar functions in disease resistance. The redundant function of GmWRKY genes might be beneficial in protecting the cell or organism under various stress conditions and eliciting multiple pathways that lead to the wide array of physiological responses that occur following pathogen infections .
Global expression data have suggested that the timing and the degree of induction of the defense pathway are determinants for the induction of soybean resistance to P. pachyrhizi,,,. In our study, the induced expression of GmWRKY20, GmWRKY27, GmWRKY46, GmWRKY57, GmWRKY56, GmWRKY106, GmWRKY125 and GmWRKY139 in response to P. pachyrhizi was earlier and/or stronger in the resistant genotype. The expression of most genes analyzed peaked at 12 hai in the resistant genotype; therefore, we propose that these genes might be involved in non-specific defense responses. Van de Mortel et al.  and Schneider et al.  reported that P. pachyrhizi infections induce biphasic global expression. Gene expression initially peaked at 12 hai, which corresponded with the early infection processes of appressoria formation and epidermal cell penetration. The authors suggested that this peak corresponded to a non-specific defense response similar to pathogen-triggered immunity. A second phase of gene expression, which began at 72 hai and continued until 288 hai, is coincident with haustoria formation and effector protein secretion. The authors suggested that this response is consistent with the activation of RPP2 and RPP3-mediated resistance. It has been shown that gene expression is rapid and increased in the incompatible interaction ,,.
The closely related genes GmWRKY27, GmWRKY139, GmWRKY56 and GmWRKY106 are putative orthologues of AtWRKY40, AtWRKY18 and AtWRKY60 Arabidopsis genes. In both species, these genes were classified into group IIa. The three Arabidopsis WRKYs are involved in stress responses, which include resistance against the bacteria Pseudomonas syringae and fungus Botrytis cinerea,. AtWRKY18 is a salicylic acid-induced gene that positively regulates SAR , and modulates PR gene expression; AtWRKY18 overexpression increases resistance to P. syringae. AtWRKY40 and AtWRKY60 proteins antagonize AtWRKY18 during P. syringae infection. The gain or loss of gene function in single, double or triple combination mutants resulted in increased susceptibility to B. cinerea. Some rice, barley and Brassica napus WRKY members from group IIa are also involved in the response to fungal and bacterial pathogens, as demonstrated using expression studies. OsWRKY62 and OsWRKY76 are upregulated in Magnaporthe grisea infected-leaves and downregulated in Xanthomonas oryzae-inoculated leaves . HvWRKY1 and HvWRKY2 play an important role in response to Blumeria graminis infection , and BnWRKY18 and BnWRKY40 play a role in the response to Sclerotinia sclerotiorum and Alternaria brassicae infections .
Most available information concerning soybean gene function is based on data obtained from heterologous expression systems. However, as the activity of many proteins frequently depends on specific interactions that are only found in homologous backgrounds, the present study was based on a homologous expression system. An RNA interference approach was used for the silencing of four soybean homologous genes (GmWRKY27, GmWRKY139, GmWRKY56 and GmWRKY106). The quadruple silencing is an advantage because a single knockout of transcription factors rarely results in altered phenotypes due to functional redundancy among closely related members . The transgenic RNAi line used in this study generated a significant reduction in the transcript levels of the four target genes. When infected with P. pachyrhizi, the transgenic line showed increased susceptibility to the fungus. Taken together, the results strongly suggest that at least one of the four genes might be involved in the soybean resistance phenotype.
Pandey et al.  silenced 64 soybean WRKYs individually using virus-induced gene silencing (VIGS) to test their involvement in Rpp2-mediated resistance against P. pachyrhizi infection. Three of these genes (GmWRKY45, GmWRKY40 and GmWRKY36) compromised the resistance phenotype when silenced. Phenotypic alterations were not evidenced when GmWRKY56 and GmWRKY106 genes were individually silenced. However, in the present study, an increased susceptibility to P. pachyrhizi infection was observed in the quadruple-silenced (GmWRKY27, GmWRKY139, GmWRKY56 and GmWRKY106) line, suggesting that this phenotype is a consequence of GmWRKY27 and/or GmWRKY139 silencing. Moreover, the four genes analyzed in this study could also play a synergistic role in the pathogenic defense response.
A previous study showed that GmWRKY27 is also strongly induced under conditions of drought and salt stress in the soybean . Altogether, these data suggest that this gene is probably involved in a non-specific response that occurs upstream of biotic and abiotic stress defense routes, in contrast with the specific Rpp2-response of the genes identified by Pandey et al.  in response to the fungal infection.
GmWRKY27 was selected for use in the overexpression study. Histodifferentiated embryos overexpressing this gene were obtained from four independent transformation experiments. However, the plants were not recovered. The most likely explanation is that the constitutive overexpression of the GmWRKY27 might affect the regeneration of plants. The use of constitutive promoters in investigation of genes whose constant overexpression has deleterious effects on the plant is a major limitation . Chen and Chen  reported that high levels of AtWRKY18 cause severe abnormalities in plant growth. Even at moderate levels, the individual or combinatorial overexpression of AtWRKY18, AtWRKY40 and AtWRKY60 leads to the development of smaller plants or death shortly after germination .
The deleterious effect of the excessive production of these WRKYs during plant growth suggested that the expression of this gene might require proper regulation during the activation of plant defense responses. However, in healthy plants, the expression of these genes is negatively regulated, as demonstrated by Chen and Chen  for the AtWRKY18.
To a certain extent, the lethality problems observed in this study could be partially overcome using tissue-specific, developmentally regulated or inducible promoters. Although the number of tissue-specific promoters has increased in recent years, soybean leaf-specific promoters are still unavailable.
In the present study, 182 WRKY transcription factors were annotated in soybean. Seventy-five genes were identified as involved in the soybean response to P. pachyrhizi infection based on transcriptional regulation. The participation of four genes in response to pathogen infection was demonstrated using an RNAi approach. Further investigations are required to provide clues regarding the functions of the individual genes. The results contribute to the elucidation of gene function and suggest the manipulation of WRKYs as a strategy to increase fungal resistance in soybean plants.
Database search and sequence annotation
To search for Glycine max (Gm) WRKY transcription factor we use two different approaches as follow: first we downloaded soybean proteome from Phytozome (http://www.phytozome.org) and SoyBase (http://soybase.org/) databases to perform a Batch BLAST using BLASTALL software . The WRKY domains previously identified in Arabidopsis, poplar  and soybean -, genomes were checked on the SMART Web Site and were used as queries to perform tblastp (e-value cut off of 10) searches. After doing Batch BLAST searches we checked for soybean WRKY genes in PlantTFDB (http://planttfdb.cbi.pku.edu.cn/) transcription factor database and USM data set .
Additionally, we used the coding sequences (CDS) to perform blast searches against the Phytozome database (www.phytozome.org) and PLAZA (http://bioinformatics.psb.ugent.be/plaza/) to retrieve any additional WRKY genes. The Phytozome database was also used to obtain the gene structures. The automated WRKY-predicted gene sequences that contained incorrect gene models (wrong start/stop codons or truncated proteins) were reannotated using GENSCAN  and FEGENESH  predictors, considering 2, 5 or 10-kb DNA sequences obtained from Gbrowse. The sequences were aligned with ClustalX v2.1 , and the domains manually examined. The sequences without conserved WRKYGQK domain signatures were discarded. The degree of conservation of the WRKYGQK and zinc finger domains was analyzed using the MEME suite (http://meme.sdsc.edu/meme/). The annotated genes were classified in groups and subgroups proposed consistent with the methods of Eugelm et al.  for Arabidopsis thaliana. A nomenclature for the WRKY-encoding genes identified in this work was adopted, according to the order of the chromosomes. The structures of the four soybean WRKY-encoding genes selected to the functional analysis and their alternative transcripts were analyzed using Fancy Gene v1.4 .
Soybean WRKY relationships
In order to classify the soybean WRKY genes identified, a phylogenetic approach was performed with two dataset: the first one contained only soybean WRKY sequences and the second included also Arabidopsis thaliana and Populus trichocarpa WRKY sequences, downloaded from PlantTFDB database. The multiple sequences alignments were performed with MUSCLE software , implemented in MEGA5 (Molecular Evolutionary analysis) software . Phylogenetic analyses were conducted with WRKY domain sequences using Bayesian approach implemented in BEAST1.7 software . The best-fit model of protein evolution was determined using ProTest , which selected the JTT model for protein matrix substitution. The Yule tree was selected as a tree prior for Bayesian analysis and 30.000.000 generations were performed with Markov Chain Monte Carlo (MCMC) algorithms. The trees were visualized and edited in FigTree v1.3.1 software .
Gene expression data mining
The GmWRKY CDSs were searched into RNA-Seq expression data  which is available at SoyBase . In addition, the expression profiles of the WRKY genes that were modulated in response to P. pachyrhizi infection were obtained from four different sources. The reaction of soybean plants to rust infection of the first three experiments was assessed by the inoculation of P. pachyrhizi spores collected in the field into plants maintained under greenhouse conditions at Embrapa Soja, Londrina, PR, Brazil. The sources used to obtain the expression profiles of the WRKY genes are described:
a) SuperSAGE: The libraries were constructed using the leaves of a soybean resistant genotype (PI561356), which carries the Rpp1 resistance gene, infected with P. pachyrhizi vs. uninfected leaves (mock inoculation/control) collected at 12, 24 and 48 hours after inoculation (hai). A Plant RNeasy kit (Qiagen) was used for RNA extraction and equal amounts of RNA from each sample were used to construct the RNA pools. The libraries (inoculated and mock) were constructed at GenXPro GmbH (Frankfurt, Germany) using previously described methods  and subsequently sequenced using the Illumina Genome Analyzer IIx. The SuperSAGE tags were analyzed using the DiscoverySpace software v.4.01  to identify unique (unitags) and differentially expressed tags (p ≤ 0.05). The libraries were constructed as part of the GenoSoja project (Brazilian Soybean Genome Consortium), and the results are available in the LGE (Laboratório de Genômica e Expressão, UNICAMP) Soybean Genome database  for members of the consortium.
b) RNA-Seq of lesion LCM (Laser Capture Microdissection): foliar segments (1 cm2) containing P. pachyrhizi lesions from two soybean resistant (PI561356) and susceptible BRS231  genotypes at the V2 growth stage were collected at 10 days after infection. The leaf segments were immediately fixed on ice in Farmer's solution , dehydrated and embedded on paraffin in accord with the methods of Cai and Lashbrook . Serial sections of 12-μm in thickness were generated using a rotary microtome and transferred to microscope membrane slides. Twenty sections containing a variable number of rust lesions were prepared for each biological replicate/treatment. The PixCell II LCM system (Arcturus) and CapSure Macro LCM (Arcturus) were used to collect the foliar cells within the lesion. Total RNA was extracted using the PicoPure RNA Isolation Kit (Arcturus) from the cells collected at a variable number of infection sites for each biological replicate. The synthesis of cDNA was conducted, and high-performance paired-end (108 bp) sequencing was performed on the Illumina genome analyzer GAAllx. Low-quality RNA-Seq reads were discarded. The reads (a total of 86,301,242) were aligned against the soybean genome, and the corresponding genes were predicted using the TopHat  and SOAP2  alignment programs. Gene expression was calculated using the FPKM (fragments per kilobase of exon per million fragments mapped) value . To identify differentially expressed genes, a pair-wise comparison between the FPKM values of both genotypes was performed using a t-test at the 99% confidence level. This library was constructed as part of the Biotecsur Consortium and the results are available  for members of the consortium.
c) Microarray : The expression o WRKY genes in the leaves of the soybean resistant genotype (PI970230), which carries the Rpp2 gene, and in the soybean susceptible genotype (Embrapa 48) in response to P. pachyrhizi infection were compared with that of uninfected leaves (mock inoculation). In the present study, the data obtained at 12 and 120 hai were considered because the highest gene expression was exhibited at these time points. Only the 46 probes previously described as WRKYs were examined. The specificity of probes was analyzed using the SoyBase and Phytozome databases. Probes with e-values <0.05 were considered.
d) Microarray : The global expression of the soybean cultivar Ankur (PI462312), which carries the Rpp3 resistance gene, which was inoculated with avirulent (Hawaii 94-1) and virulent (Taiwan 80-2) isolates of P. pachyrhizi, was analyzed. The Affy probe sets were searched using the tools available in the Soybase database. In the present study, only the WRKY probes that hybridized with a single locus in the soybean genome were selected. The data obtained at 12 and 120 hai were considered because the highest gene expression was exhibited at these time points. The genes with a p-value <0.05 were considered as differentially expressed.
P. pachyrhizi bioassay for gene expression analysis
Soybean plants were grown in a pot-based system maintained in greenhouse conditions at 28 ± 1°C under a 16/8 h light/dark cycle with a light intensity of 22.5 μEm-2s-1 in Embrapa Soja, Londrina, PR, Brazil. The Embrapa-48 genotype, which develops a "tan" lesion , was used as the susceptible standard, and the PI561356 genotype, which carries the Rpp1 resistance gene , was used as the resistant standard. ASR isolated from Brazilian fields was maintained in a susceptible cultivar. Spores harvested from leaves exhibiting sporulating uredinia and diluted in distilled water containing 0.05% Tween-20 to a final concentration of 3 × 105 spores/mL. The spore suspension was sprayed onto plantlets at the V2 developmental stage. The same solution without spores was used for the mock inoculation. Subsequently, the water-misted bags were placed over each pot for one day. One trifoliate leaf from each plant was collected at 1, 12, 24, 48, 96 and 192 hai, frozen in liquid nitrogen, and stored at -80°C. Three biological replicates from each genotype/treatment were analyzed.
Expression pattern analysis using reverse transcription and quantitative real-time PCR (real-time RT-qPCR)
Total RNA was extracted using TRIzol reagent (Invitrogen) and further treated with DNAse (Promega) according to the manufacturer's instruction. The first-strand cDNAs were obtained using 2 μg of DNA-free RNA using the M-MLV Reverse Transcriptase System (Invitrogen) with a 24-polyVT primer. The RT-qPCR was conducted using a StepOne Applied Biosystems Real-Time cycler™. The PCR-cycling conditions were implemented as follows: 5 min at 94°C, followed by 40 cycles of 10 s at 94°C, 15 s at 60°C and 15 s at 72°C, and a final step of 2 min at 60°C. A melting curve analysis was performed at the end of the PCR run over a range of 55-99°C, increasing the temperature stepwise by 0.1°C every 1 s. Each 25-μL reaction comprised 12.5 μL of diluted DNA template, 1X PCR buffer (Invitrogen), 2.4 mM of MgCl2, 0.024 mM of dNTPs, 0.1 μM of each primer, 2.5 μL of SYBR Green (1:100000-Molecular Probes Inc.) and 0.03 U of Platinum Taq DNA polymerase (Invitrogen). The cDNA (1:100) templates were evaluated. All PCR reactions were performed in technical quadruplicates. Reactions lacking templates were used as negative controls.
The PCR reactions were performed using gene-specific primers (Table 4). Primer-pairs designed to amplify F-Box proteins and metalloprotease sequences were used to normalize the amount of mRNA present in each sample. These genes were previously confirmed as good reference genes for the experimental conditions used in the present study . The expression analyses were performed after the comparative quantification of amplified products using the 2-ΔΔCt method . The results were statistically compared using variance analysis with three-factor factorial treatments: genotype, time and pathogen presence. The data were transformed using the weighted least squares method. The means were compared using Tukey's multiple comparison test.
Silencing and overexpression vectors construction
The open reading frame (ORF) of GmWRKY27 (Glyma15g00570), according to Phytozome v1.0, was amplified from the MGBR-46 Conquista soybean cultivar using a high-fidelity Taq DNA Polymerase (Pfu-Fermentas). The Gateway® System (Invitrogen) was used to recombine the PCR product into the overexpression pH7WG2D,1 vector . The T-DNA region of the resulting pH7WG2D,1-GmWRKY27 vector contained the GmWRKY27 gene ORF under the control of the CaMV 35S promoter, the hygromycin-phosphotransferase marker gene (hpt) and the green fluorescent protein reporter gene (gfp) (Figure 7A). A RNAi silencing vector was constructed using pH7GWIWG2(II),0 . The T-DNA region of the resulting pH7GWIWG2(II),0-GmWRKY27 contained inverted repeat fragments (176 bp) from the GmWRKY27 sequence, which were separated by an intron from the Arabidopsis genomic DNA sequence, under the control of the CaMV 35S promoter and the hygromycin-phosphotransferase marker gene (hpt) (Figure 7B). Both constructs were confirmed using DNA sequencing.
Soybean transformation and plant regeneration
Pods containing immature seeds of 3-5 mm in length from soybean cultivars MGBR 46 (Conquista), BRSMG 68 (Vencedora) and IAS5 were harvested from field grown plants. They are all susceptible to P. pachyrhizi. Somatic embryogenesis was induced from immature cotyledons and proliferated using the methods of Droste et al. . Proliferating embryogenic tissues were subjected to transformation through particle bombardment using a particle inflow gun (PIG)  following the procedure of Droste et al.  or using the combined methods of DNA-free particle bombardment and Agrobacterium transformation . After cultivating for three months in hygromycin-B selection medium, the hygromycin-resistant embryogenic soybean tissues were visually selected and individually cultured for the establishment of lines corresponding to putative independent transformation events.
Embryo histodifferentiation, conversion into plants and acclimation were carried out as previously described . All plants derived from an independent sample of hygromycin-resistant tissue were considered as cloned plants. The plants derived from non-transformed embryogenic tissues submitted to the same culture conditions were recovered and used as controls for molecular characterization and bioassays.
Screening for transgenic embryos and plants
Total DNA was extracted  from hygromycin-resistant histodifferentiated embryos and plant leaves. The putative transgenic embryos/plants were PCR-screened for the presence of the complete T-DNA using different primer combinations (Table 5). The PCR mixture consisted of 200 ng of template DNA, 0.4 mM of dNTPs, 0.4 μM of each primer, 2.5 mM of MgCl2, 1X Taq Buffer, 1 U of Taq DNA Polymerase (Invitrogen), and autoclaved distilled water in a final volume of 25 μl. The reactions were initially heated (5 min at 94°C) and subjected to 30 cycles of the following conditions: 45 s at 94°C, 45 s at 58°C and 1 min at 72°C. Subsequent to electrophoresis on a 1% agarose gel containing ethidium bromide (0.01 mg/L), the PCR products were visualized under ultraviolet light.
GFP expression was detected under blue light using an Olympus® fluorescence stereomicroscope equipped with a BP filter set containing a 488 nm excitation filter and a 505-530 nm emission filter. The images were captured using the QCapture Pro™ 6 software (QImaging®).
Gene overexpression or silencing was confirmed using RT-qPCR. The RNA extraction, cDNA synthesis and qPCR analysis were performed as described above.
A detached leaf method was used to evaluate the P. pachyrhizi infection . Three fully expanded leaves from each one transgenic line and two wild-type plants (2-month-old) were collected, rinsed with sterile distilled water and cut in 5 cm × 5 cm pieces. For the inoculation, 1 mL of a uredospore suspension (105 spores/mL) was dripped onto each leaf piece, which was subsequently placed with its abaxial side upwards in a Petri dish covered with wet filter paper. The material was incubated at 20°C under a 12/12 h light/dark cycle. The number of lesions and pustules (uredium) was recorded at 12 days after inoculation. A non-parametric Student's t-test was conducted to compare the effect of P. pachyrhizi on transgenic and non-transgenic plants. The results with p < 0.05 were considered significant.
Miles MR, Frederick RD, Hartman GL: Evaluation of soybean germplasm for resistance to Phakopsora pachyrhizi. Plant Health Prog 2006, doi:10.1094/PHP-2006-0104-01-RS.,
Ogle HJ, Byth DE, McLean R: Effect of rust (Phakopsora pachyrhizi) on soybean yield and quality in south-eastern Queensland. Australian J Agric Res. 1979, 30: 883-893. 10.1071/AR9790883.
Bromfield KR: Soybean Rust. Monograph No. 11. American Phytopathological Society, St. Paul, Minnesota; 1984.
Patil VS, Wuike RV, Thakare CS, Chirame BB: Viability of uredospores of Phakopsora pachyrhizi Syd. at different storage conditions. J Maharashtra Agric Universities. 1997, 22: 260-261.
Bromfield KR, Hartwig EE: Resistance to Soybean Rust and mode of inheritance. Crop Sci. 1980, 20: 254-255. 10.2135/cropsci1980.0011183X002000020026x.
Cheng YW, Chan KL: The breeding of `Tainung 3' soybean. J Taiwan Agric Res. 1968, 17: 30-35.
Hidayat OO, Somaatmadja S: Screening of soybean breeding lines for resistance to soybean rust (Phakopsora pachyrhizi Sydow). Soybean Rust Newsl. 1977, 1: 9-22.
Singh BB, Thapliyal PN: Breeding for resistance to Soybean Rust in India. Rust of Soybean: The Problem and Research Needs. Edited by: Ford RE, Sinclair JB. College of Agriculture, University of Illinois at Urbana-Champaign, Urbana, IL; 1977:62-65.
McLean RJ, Byth DE: Inheritance in resistance to rust (Phakopsora pachyrhizi) in soybeans. Aust J Agric Res. 1980, 31: 951-956. 10.1071/AR9800951.
Hartwig EE, Bromfield KR: Relationships among three genes conferring specific resistance to rust in soybeans. Crop Sci. 1983, 23: 237-239. 10.2135/cropsci1983.0011183X002300020012x.
Hartwig EE: Identification of a fourth major gene conferring resistance to soybean rust. Crop Sci. 1986, 26: 1135-1136. 10.2135/cropsci1986.0011183X002600060010x.
Monteros MJ, Missaoui AM, Phillips DV, Walker DR, Boerma HR: Mapping and confirmation of the `Hyuuga' red-brown lesion resistance gene for Asian Soybean Rust. Crop Sci. 2007, 47: 829-834. 10.2135/cropsci06.07.0462.
Garcia A, Calvo ES, de Souza KRA, Harada A, Hiromoto DM, Vieira LG: Molecular mapping of soybean rust (Phakopsora pachyrhizi) resistance genes: discovery of a novel locus and alleles. Theor Appl Genet. 2008, 117: 545-553. 10.1007/s00122-008-0798-z.
Li S, Smith JR, Ray JD, Frederick RD: Identification of a new soybean rust resistance gene in PI 567102B. Theor Appl Genet. 2012, 125 (1): 133-142. 10.1007/s00122-012-1821-y.
Bonde MR, Nester SE, Austin CN, Stone CL, Frederick RD, Hartman GL, Miles MR: Evaluation of virulence of Phakopsora pachyrhizi and P. meibomiae isolates. Plant Dis. 2006, 90: 708-716. 10.1094/PD-90-0708.
Sconyers LE, Kemerait RC, Brock J, Phillips DV, Jost PH, Sikora EJ, Gutierrez-Estrada A, Mueller JD, Marois JJ, Wright DL, Harmon CL: Asian soybean rust development in 2005: A perspective from the Southeastern United States. In APSnet Features 2006. doi:10.1094/APSnetFeatures-2006-0106.
van de Mortel M, Recknor JC, Graham MA, Nettleton D, Dittman JD, Nelson RT, Godoy CV, Abdelnoor RV, Almeida AMR, Baum TJ, Whitham SA: Distinct biphasic mRNA changes in response to Asian soybean rust infection. Mol Plant Microbe Interact. 2007, 20: 887-899. 10.1094/MPMI-20-8-0887.
Panthee DR, Yuan JS, Wright DL, Marois JJ, Mailhot D, Stewart CN: Gene expression analysis in soybean in response to the causal agent of Asian soybean rust (Phakopsora pachyrhizi Sydow) in an early growth stage. Funct Integr Genomics. 2007, 7: 291-301. 10.1007/s10142-007-0045-8.
Panthee DR, Marois JJ, Wright DL, Narvaez D, Yuan JS, Stewart CN: Differential expression of genes in soybean in response to the causal agent of Asian soybean rust (Phakopsora pachyrhizi Sydow) is soybean growth stage-specific. Theor Appl Genet. 2009, 118: 359-370. 10.1007/s00122-008-0905-1.
Choi JJ, Alkharouf NW, Schneider KT, Matthews BF, Frederick RD: Expression patterns in soybean resistant to Phakopsora pachyrhizi reveal the importance of peroxidases and lipoxygenases. Funct Integr Genomics. 2008, 8: 341-359. 10.1007/s10142-008-0080-0.
Tremblay A, Hosseini P, Alkharouf N, Li S, Matthewsa BF: Transcriptome analysis of a compatible response by Glycine max to Phakopsora pachyrhizi infection. Plant Sci. 2010, 179: 183-193. 10.1016/j.plantsci.2010.04.011.
Schneider KT, van de Mortel M, Bancroft TJ, Braun E, Nettleton D, Nelson RT, Frederick RD, Baum TJ, Graham MA, Whitham SA: Biphasic gene expression changes elicited by Phakopsora pachyrhizi in soybean correlate with fungal penetration and haustoria formation. Plant Physiol. 2011, 157: 355-371. 10.1104/pp.111.181149.
Pandey SP, Somssich IE: The role of WRKY transcription factors in plant immunity. Plant Physiol. 2009, 150: 1648-1655. 10.1104/pp.109.138990.
Eulgem T, Rushton PJ, Robatzek S, Somssich IE: The WRKY superfamily of plant transcription factors. Trends Plant Sci. 2000, 5: 199-206. 10.1016/S1360-1385(00)01600-9.
Xie Z, Zhang ZL, Zou X, Huang J, Ruas P, Thompson D, Shen QJ: Annotations and Functional Analyses of the Rice WRKY Gene Superfamily Reveal Positive and Negative Regulators of Abscisic Acid Signaling in Aleurone Cells. Plant Physiol. 2005, 137: 176-189. 10.1104/pp.104.054312.
Zhang Y, Wang L: The WRKY transcription factor superfamily: its origin in eukaryotes and expansion in plants. BMC Evolutionary Biol. 2005, 5: 1-12. 10.1186/1471-2148-5-1.
Wu KL, Guo ZJ, Wang HH, Li J: The WRKY family of transcription factors in rice and arabidopsis and their origins. DNA Res. 2005, 12: 9-26. 10.1093/dnares/12.1.9.
Ross CA, Liu Y, Shen QJ: The WRKY Gene Family in Rice (Oryza sativa). J Integr Plant Biol. 2007, 49: 827-842. 10.1111/j.1744-7909.2007.00504.x.
He HS, Dong Q, Shao YH, Jiang HY, Zhu SW, Cheng B, Xiang Y: Genome-wide survey and characterization of the WRKY gene family in Populus trichocarpa. Plant Cell Rep 2012, doi:10.1007/s00299-012-1241-0.,
Wen F, Zhu H, Li P, Jiang M, Mao W, Ong C, Chu Z: Genome-wide evolutionary characterization and expression analyses of wrky family genes in brachypodium distachyon. DNA Res 2014, 1-13. doi:10.1093/dnares/dst060.,
Zhang Y, Feng J: Identification and characterization of the grape WRKY family. BioMed Research International. Article ID. 2014, 787680: 14-
Dou L, Zhang X, Pang C, Song M, Wei H, Fan S, Yu S: Genome-wide analysis of the WRKY gene family in cotton. Mol Genet Genomics doi:10.1007/s00438-014-0872-y.,
Ülker B, Somssich IE: WRKY transcription factors: from DNA binding towards biological function. Curr Opin Plant Biol. 2004, 7: 491-498. 10.1016/j.pbi.2004.07.012.
Yamasaki K, Kigawa T, Inoue M, Tateno M, Yamasaki T, Yabuki T, Aoki M, Seki E, Matsuda T, Tomo Y, Hayami N, Terada T, Shirouzu M, Tanaka A, Seki M, Shinozaki K, Yokoyama S: Solution structure of an arabidopsis WRKY DNA binding domain. Plant Cell. 2005, 17: 944-956. 10.1105/tpc.104.026435.
Maeo K, Hayashi S, Kojima-Suzuki H, Morikami A, Nakamura K: Role of conserved residues of the WRKY domain in the DNA-binding of tobacco WRKY family proteins. Biosci Biotech Biochem. 2001, 65: 2428-2436. 10.1271/bbb.65.2428.
Eulgem T, Somssich IE: Networks of WRKY transcription factors in defense signaling. Curr Opin Plant Biol. 2007, 10: 366-371. 10.1016/j.pbi.2007.04.020.
Ciolkowski I, Wanke D, Birkenbihl RP, Somssich IE: Studies on DNA-binding selectivity of WRKY transcription factors lend structural clues into WRKY-domain function. Plant Mol Biol. 2008, 68: 81-92. 10.1007/s11103-008-9353-1.
Zhou Q, Tian A, Zou H, Xie Z, Lei G, Huang J, Wang C, Wang H, Zhang J, Chen S: Soybean WRKY-type transcription factor genes, GmWRKY13, GmWRKY21, and GmWRKY54, confer differential tolerance to abiotic stresses in transgenic Arabidopsis plants. Plant Biotechnol J. 2008, 6: 486-503. 10.1111/j.1467-7652.2008.00336.x.
Huang S, Gao Y, Liu J, Peng X, Niu X, Fei Z, Cao S, Liu Y: Genome-wide analysis of WRKY transcription factors in Solanum lycopersicum. Mol Genet Genomics. 2012, 287: 495-513. 10.1007/s00438-012-0696-6.
Zhang H, Jin JP, Tang L, Zhao Y, Gu XC, Gao G, Luo JC: PlantTFDB 2.0: update and improvement of the comprehensive plant transcription factor database. Nucleic Acids Res. 2011, 39: 1114-1117. 10.1093/nar/gkq1141.
Wang Z, Libault M, Joshi T, Valliyodan B, Nguyen H, Xu D, Stacey G, Cheng J: SoyDB: A Knowledge Database of Soybean Transcription Factors. BMC Plant Biol. 2010, 10: 14-26. 10.1186/1471-2229-10-14.
Soybean trancription factor knowledge base. , [http://www.igece.org/Soybean_TF/]
Zhang L, Wang X, Bi Y, Zhang C, Fan Y, Lei W: Isolation and functional analysis of transcription factor GmWRKY57b from soybean. Chin Sci Bulletin. 2008, 53: 3538-3545. 10.1007/s11434-008-0483-2.
Kang SG, Park E, Do KS: Identification of a pathogen-induced Glycine max transcription factor GmWRKY1. Plant Pathol J. 2009, 25: 381-388. 10.5423/PPJ.2009.25.4.381.
LGE genômica e expressão. , [http://www.lge.ibi.unicamp.br/soja/]
Severin AJ, Woody JL, Bolon Y, Joseph B, Diers BW, Farmer AD, Muehlbauer GJ, Nelson RT, Grant D, Specht JE, Graham MA, Cannon SB, May GD, Vance CP, Shoemaker RC: RNA-Seq Atlas of Glycine max: A guide to the soybean transcriptoma. BMC Plant Biol. 2010, 10: 160-176. 10.1186/1471-2229-10-160.
Grant D, Nelson R, Cannon S, Shoemaker R: SoyBase, the USDA-ARS soybean genetics and genomics database. Nucleic Acids Res. 2010, 38 (Database issue): D843-846. 10.1093/nar/gkp798.
Twizeyimana M, Bandyopadhyay R, Ojiambo P, Paul C, Hartman GL: A detached leaf method to evaluate soybean for resistance to rust National Soybean. Rust symposium. Proceedings of 2006 National Soybean rust symposium, Saint Louis. 2006.
Schmutz J, Cannon SB, Schlueter J, Ma J, Mitros T, Nelson W, Hyten DL, Song Q, Thelen JJ, Cheng J, Xu D, Hellsten U, May GD, Yu Y, Sakurai T, Umezawa T, Bhattacharyya MK, Sandhu D, Valliyodan B, Lindquist E, Peto M, Grant D, Shu S, Goodstein D, Barry K, Futrell-Griggs M, Abernathy B, Du J, Tian Z, Zhu L, Gill N, Joshi T, Libault M, Sethuraman A, Zhang XC, Shinozaki K, Nguyen HT, Wing RA, Cregan P, Specht J, Grimwood J, Rokhsar D, Stacey G, Shoemaker RC, Jackson SA: Genome sequence of the palaeopolyploid soybean. Nature. 2010, 463: 178-184. 10.1038/nature08670.
Freeling M: Bias in plant gene content following different sorts of duplication: tandem, whole-genome, segmental, or by transposition. Annual Rev Plant Biol. 2009, 60: 433-453. 10.1146/annurev.arplant.043008.092122.
Carretero-Paulet L, Galstyan A, Roig-Villanova I, Martinez-Garcia JF, Bilbao-Castro JR, Robertson DL: Genome wide classification and evolutionary analysis of the bHLH family of transcription factors in Arabidopsis, poplar, rice, moss and algae. Plant Physiol. 2010, 153: 1398-1412. 10.1104/pp.110.153593.
Rushton PJ, Bokowiec MT, Han S, Zhang H, Brannock JF, Chen X, Laudeman TW, Timko MP: Tobacco transcription factors: novel insights into transcriptional regulation in the solanaceae. Plant Physiol. 2008, 147: 280-295. 10.1104/pp.107.114041.
Rushton PJ, Somssich IE, Ringler P, Shen QJ: WRKY transcription factors. Trends Plant Sci. 2010, 15: 247-258. 10.1016/j.tplants.2010.02.006.
Mangelsen E, Kilian J, Berendzen KW, Kolukisaoglu UH, Harter K, Jansson C, Wanke D: Phylogenetic and comparative gene expression analysis of barley (Hordeum vulgare) WRKY transcription factor family reveals putatively retained functions between monocots and dicots. BMC Genomics. 2008, 9: 194-211. 10.1186/1471-2164-9-194.
van Verk MC, Pappaioannou D, Neeleman L, Bol JF, Linthorst HJM: A novel WRKY transcription factor is required for induction of PR-1a gene expression by salicylic acid and bacterial elicitors. Plant Physiol. 2008, 146: 1983-1995. 10.1104/pp.107.112789.
Yang B, Jiang Y, Rahman MH, Deyholos MK, Kav NNV: Identification and expression analysis of WRKY transcription factor in canola (Brassica napus L.) in response to fungal pathogens and hormone treatments. BMC Plant Biol. 2009, 9: 68-87. 10.1186/1471-2229-9-68.
Giacomelli JI, Ribichich KF, Dezar CA, Chan RL: Expression analyses indicate the involvement of sunflower WRKY transcription factors in stress responses, and phylogenetic reconstructions reveal the existence of a novel clade in the Asteraceae. Plant Sci. 2010, 178: 398-410. 10.1016/j.plantsci.2010.02.008.
Pandey AK, Yang C, Zhang C, Graham MA, Horstman HD, Lee Y, Zabotina OA, Hill JH, Pedley KF, Whitham SA: Functional Analysis of the Asian Soybean Rust Resistance Pathway Mediated by Rpp2. Mol Plant Microbe Interact. 2011, 24: 194-206. 10.1094/MPMI-08-10-0187.
Goellner K, Loehrer M, Langenbach C, Conrath U, Koch E, Schaffrath U: Phakopsora pachyrhizi, the causal agent of Asian soybean rust. Mol Plant Pathol. 2010, 11: 169-177. 10.1111/j.1364-3703.2009.00589.x.
Xu X, Chen C, Fan B, Chen Z: Physical and functional interactions between pathogen-induced Arabidopsis WRKY18, WRKY40, and WRKY60 transcription factors. Plant Cell. 2006, 18: 1310-1326. 10.1105/tpc.105.037523.
Chen HC, Lai Z, Shi J, Xiao Y, Chen Z, Xu X: Roles of arabidopsis WRKY18, WRKY40 and WRKY60 transcription factors in plant responses to abscisic acid and abiotic stress. BMC Plant Biol. 2010, 10: 281-296. 10.1186/1471-2229-10-281.
Yu D, Chen C, Chen Z: Evidence for an Important Role of WRKY DNA Binding Proteins in the Regulation of NPR1 Gene Expression. Plant Cell. 2001, 13: 1527-1539. 10.1105/tpc.13.7.1527.
Wang D, Amornsiripanitch N, Dong X: A Genomic approach to identify regulatory nodes in the transcriptional network of systemic acquired resistance in plants. Plos Pathog. 2006, 2: 1042-1050. 10.1371/journal.ppat.0020123.
Chen C, Chen Z: potentiation of developmentally regulated plant defense response by atwrky18, a pathogen-induced arabidopsis transcription factor. Plant Physiol. 2002, 129: 706-716. 10.1104/pp.001057.
Ryu H, Han M, Lee S, Cho JI, Sunggi HNR, Lee YH, Bhoo SH, Wang GL, Hahn TR, Jeon JS: A comprehensive expression analysis of the WRKY gene superfamily in rice plants during defense response. Plant Cell Rep. 2006, 25: 836-847. 10.1007/s00299-006-0138-1.
Zuo J, Chua N: Chemical-inducible systems for regulated expression of plant genes. Curr Opin Biotechnol. 2000, 11: 146-151. 10.1016/S0958-1669(00)00073-2.
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool. J Mol Biol. 1990, 215 (3): 403-410. 10.1016/S0022-2836(05)80360-2.
The GENSCAN Web Server at MIT. , [http://genes.mit.edu/GENSCAN.html]
FEGENESH. , [http://linux1.softberry.com/]
Thompson JD, Gibson TJ, Plewniak F, Jeanmougin F, Higgins DG: The ClustalX windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res. 1997, 24: 4876-4882. 10.1093/nar/25.24.4876.
FancyGENE. , [http://bio.ieo.eu/fancygene/]
Edgar RC: MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004, 32: 1792-1797. 10.1093/nar/gkh340.
Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S: MEGA5: Molecular Evolutionary Genetics Analysis using Maximum Likelihood, Evolutionary Distance, and Maximum Parsimony Methods. Mol Biol Evol. 2011, 28: 2731-2739. 10.1093/molbev/msr121.
Drummond AJ, Rambaut A: BEAST: Bayesian evolutionary analysis by sampling trees. BMC Evol Biol. 2007, 7: 214-10.1186/1471-2148-7-214.
Abascal F, Zardoya R, Posada D: ProtTest: selection of best-fit models of protein evolution. Bioinforma. 2005, 21: 2104-2105. 10.1093/bioinformatics/bti263.
Molecular Evolution, Phylogenetics and epidemiology - FigTree. , [http://tree.bio.ed.ac.uk/software/figtree/]
Matsumura H, Krueger DH, Kahl G, Terauchi R: SuperSAGE: A Modern Platform for Genome-Wide Quantitative Transcript Profiling. Curr Pharm Biotechnol. 2008, 9: 368-374. 10.2174/138920108785915157.
Robertson N, Oveisi-Fordorei M, Zuyderduyn SD, Varhol RJ, Fjell C, Marra M, Jones C, Siddiqui A: DiscoverySpace: an interactive data analysis application. Genome Biol. 2007, 8: R6-10.1186/gb-2007-8-1-r6.
LGE Genômica e Expressão. , [http://www.lge.ibi.unicamp.br/soja/]
Ribeiro AS, Moreira JUV, Pierozzi PHB, Rachid BF, Toledo JFF, Arias CAA, Soares RM, Godoy CV: Genetic control of Asian rust in soybean. Euphytica. 2007, 157: 15-25. 10.1007/s10681-007-9404-8.
Kerk NM, Ceserani T, Tausta SL, Sussex IM, Nelson TM, Ceserani T: Laser capture microdissection of cells from plant tissues. Plant Physiol. 2003, 132: 27-35. 10.1104/pp.102.018127.
Cai S, Lashbrook CC: Laser capture microdissection of plant cells from tape-transferred paraffin sections promotes recovery of structurally intact RNA for global gene profiling. Plant J. 2006, 48: 628-637. 10.1111/j.1365-313X.2006.02886.x.
Trapnell C, Pachter L, Salzberg SL: TopHat: discovering splice junctions with RNA-Seq. Bioinforma. 2009, 25: 1105-1111. 10.1093/bioinformatics/btp120.
Li R, Yu C, Li Y, Lam TW, Yiu SM, Kristiansen K, Wang J: SOAP2: an improved ultrafast tool for short read alignment. Bioinforma. 2009, 25: 1966-1967. 10.1093/bioinformatics/btp336.
Mortazavi A, Williams BA, McCue K, Schaeffer L, Wold B: Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat Methods. 2008, 5: 621-628. 10.1038/nmeth.1226.
Kim KS, Unfried JR, Hyten DL, Frederick RD, Hartman GL, Nelson RL, Song Q, Diers BW: Molecular mapping of soybean rust resistance in soybean accession PI 561356 and SNP haplotype analysis of the Rpp1 region in diverse germplasm. Theor Appl Genet. 2012, 125: 1339-1352. 10.1007/s00122-012-1932-5.
Libault M, Thibivilliers S, Bilgin DD, Radwan O, Benitez M, Clough SJ, Stacey G: Identification of four soybean reference genes for gene expression normalization. Plant Genome. 2008, 1: 44-54. 10.3835/plantgenome2008.02.0091.
Livak KJ, Schmittgen TD: Analysis of relative gene expression data using real-time quantitative PCR and the 2(-Delta Delta C(T)). Methods. 2001, 25: 402-408. 10.1006/meth.2001.1262.
Karimi M, Inze D, Depicker A: GATEWAY vectors for Agrobacterium-mediated plant transformation. Trends Plant Sci. 2002, 7: 193-195. 10.1016/S1360-1385(02)02251-3.
Droste A, Pasquali G, Bodanese-Zanettini MH: Transgenic fertile plants of soybean (Glycine max (L.) Merrill) obtained from bombarded embryogenic tissue. Euphytica. 2002, 127: 367-376. 10.1023/A:1020370913140.
Finer JJ, Vain P, Jones MW, McMullen MD: Development of the particle inflow gun for DNA delivery to plant cells. Plant Cell Rep. 1992, 11: 323-328. 10.1007/BF00233358.
Wiebke-Strohm B, Droste A, Pasquali G, Osório MB, Bucker-Neto L, Passaglia LMP, Bencke M, Homrich MS, Margis-Pinheiro M, Bodanese-Zanettini MH: Transgenic fertile soybean plants derived from somatic embryos transformed via the combined DNA-free particle bombardment and Agrobacterium system. Euphytica. 2011, 177: 343-354. 10.1007/s10681-010-0249-1.
Doyle JJ, Doyle JL: A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochem Bull. 1987, 19: 11-15.
We thank Dr. Elsa Mundstock and Gilberto P. Mesquita from "Núcleo de Assessoria Estatística” from Universidade Ferderal do Rio Grande do Sul for statistical support; to Dr. Cláudia Godoy for providing fungal isolates and Dr. Emerson Del Ponte, Dr. Cláudia Godoy, Dr. Juliano dos Santos, Larissa Bittecourt and Silvia Richter for their technical assistance. This work was supported by grants from the Conselho Nacional de Desenvolvimento Científico e Tecnológico and Consórcio Nacional do Genoma da Soja (CNPq-GENOSOJA) and BIOTECSUR (European Union/MERCOSUL).
The authors declare that they have no competing interest.
Conceived and designed the experiments: MB-M, BW-S, LBN, MM-P, MHB-Z, APC. Performed the experiments: MB-M, BW-S, FCM-G, MCCGdeC, RS, MBO. Vectors construction: MSH. Performed data analysis: MB-M, CC, BW-S, MCCGdeC, LBN, EM, GW, ACT-Z, MSH, RLMW. Wrote the paper: MB-M, BW-S, LBN. Revised the paper: MM-P, MHB-Z, RVA, ACT-Z. Supervised and coordinated the study: MHB-Z. All authors read and approved the final manuscript.
Electronic supplementary material
Additional file 1: Glycine max WRKY transcription factors (Chromosome 4 to 20).(DOCX 62 KB)
Additional file 3: The tree was reconstructed using a Bayesian (BA) method. A total of 289 amino acid sequences from Glycine max, Arabidopsis thaliana and Populus trichocarpa and 65 sites corresponding to WRKY domain were included in the analysis. The posteriori probability values are labeled above the branches and only values higher than 70% are presented. The groups I, IIa, IIb, IIc, IId, IIe and III are indicated. *Differentially expressed genes in response to P. pachyrhizi infection. (PDF 782 KB)
Additional file 4: WRKY encoding-genes under P. pachyrhizi infection (Group IIb to III).(DOCX 30 KB)
Additional file 5: GmWRKY27 . GFP expression analyses in wild type (A) and hygromycin-resistant embryogenic tissues (B). GFP expression was detected under blue light using a fluorescence stereomicroscope Olympus®, equipped with a BP filter set containing a 488 nm excitation filter and a 505-530 nm emission filter. (C) Expression levels (RT-qPCR) of the GmWRKY27 in wild-type (WT) soybean plants and in histodifferentiated embryos of different transgenic soybean lines. Venc (BRSMG68 Vencedora) P2-1, IAS-5 P1-1, Conq (MGBR-46 Conquista) P1-1 lines were obtained from Biobalistic and IAS-5 P3-1 line from Biobalistic/Agrobacterium transformation experiments. F-Box protein and metalloprotease reference genes were used as internal controls to normalize the amount of mRNA present in each sample. Transcript levels of WRKY genes present in the wt were used to calibrate the transcript amounts in transgenic embryos. *Means are significantly different in the wt and transgenic lines (Student's t-test, p < 0.05). (PDF 140 KB)
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.
About this article
Cite this article
Bencke-Malato, M., Cabreira, C., Wiebke-Strohm, B. et al. Genome-wide annotation of the soybean WRKY family and functional characterization of genes involved in response to Phakopsora pachyrhiziinfection. BMC Plant Biol 14, 236 (2014). https://0-doi-org.brum.beds.ac.uk/10.1186/s12870-014-0236-0
- Glycine max
- Genetic transformation
- Fungus resistance
- Transcription factors
- Asian Soybean Rust
- Functional analysis