- Research article
- Open Access
The evolutionary history of the sucrose synthase gene family in higher plants
BMC Plant Biology volume 19, Article number: 566 (2019)
Sucrose synthase (SUS) is widely considered a key enzyme participating in sucrose metabolism in higher plants and regarded as a biochemical marker for sink strength in crops. However, despite significant progress in characterizing the physiological functions of the SUS gene family, knowledge of the trajectory of evolutionary processes and significance of the family in higher plants remains incomplete.
In this study, we identified over 100 SUS genes in 19 plant species and reconstructed their phylogenies, presenting a potential framework of SUS gene family evolution in higher plants. Three anciently diverged SUS gene subfamilies (SUS I, II and III) were distinguished based on their phylogenetic relationships and unique intron/exon structures in angiosperms, and they were found to have evolved independently in monocots and dicots. Each subfamily of SUS genes exhibited distinct expression patterns in a wide range of plants, implying that their functional differentiation occurred before the divergence of monocots and dicots. Furthermore, SUS III genes evolved under relaxed purifying selection in dicots and displayed narrowed expression profiles. In addition, for all three subfamilies of SUS genes, the GT-B domain was more conserved than the “regulatory” domain.
The present study reveals the evolution of the SUS gene family in higher plants and provides new insights into the evolutionary conservation and functional divergence of angiosperm SUS genes.
Sucrose is the main end product of photosynthesis in higher plants and is exported from source leaves to sink organs. Sucrose catabolism in plants is one of the largest metabolic fluxes in the world, and it plays critical roles in carbon resource allocation and sugar signalling initiation [1, 2]. Sucrose is hydrolysed either by invertase (INV) into glucose and fructose or by SUS, which catalyses the reversible conversion of sucrose and uridine diphosphate (UDP) to fructose and UDP-glucose [1,2,3]. There is compelling evidence for the role of SUS in facilitating the entry of carbon into the metabolism of nonphotosynthetic plant cells and in determining sink strength in crop species. For instance, the rugosus4 (rug4) mutation in pea reduces seed mass and starch content , and the shrunken1 (sh1) mutation in maize leads to a shrunken seed phenotype due to the disruption of endosperm cell wall integration . Antisense inhibition of specific isoforms of SUS genes reduces fruit setting and the sucrose unloading capacity of young fruit in tomato , decreases starch accumulation in potato tubers , affects the biosynthesis of cellulose and starch in carrot , and represses fibre cell initiation, elongation, and seed development in cotton . Overexpression of the potato Sus4 gene increases the levels of starch, adenosine diphosphate (ADP)-glucose and UDP-glucose and total yield in potato , increases the levels of starch and ADP-glucose in maize seed endosperm , and reduces seed abortion and enhances fiber production in cotton .
SUS is encoded by a small multigene family in the higher plants examined to date. Studies on the SUS genes of individual species have revealed that structural conservation and expressional and functional divergence followed gene family evolution. The Arabidopsis SUS gene family contains only six SUS genes with different but partially overlapping expression profiles , and their roles have been investigated through corresponding knockout mutants . AtSUS1 and AtSUS4 show significant induction under hypoxia, and double mutant of these two genes exhibits reduced growth rates in hydroponic culture . AtSUS5 and AtSUS6 are expressed specifically in the phloem and have a specific function in callose synthesis . Pea harbours an SUS gene family containing at least three divergent genes, namely, Sus1, Sus2 and Sus3, and these three genes show distinct patterns of expression in different organs and during organ development. Of these genes, Sus1 displays a constitutive expression pattern and is highly expressed in the developing seed, and a lack of Sus1 activity in rug4 mutant of pea cannot be compensated by Sus2 and Sus3 . In other plants, such as cotton, poplar, citrus and grape, the tissue-specific and development-dependent expression patterns of different SUS genes imply that each SUS gene may have evolved specialized physiological functions [17,18,19,20]. However, whether the divergence of plant SUS genes in expression and function occurred after the emergence of specific species, or at least to some extent in the common ancestor of angiosperms, is still unknown.
Despite compelling advances in determining the physiological functions and regulatory mechanisms of SUS genes, our knowledge of the evolutionary processes of the SUS gene family in higher plants remains incomplete. Molecular genetic research has provided substantial insights into the physiological function of individual proteins, while evolutionary analysis will shed light on the origin and expansion history of the gene family and further provide new insights into functional implications from an evolutionary perspective. For several years, our understanding of the evolution of the SUS gene family in plants has been based on surveys of individual angiosperm species [17, 20, 21]. Therefore, a comprehensive evolutionary analysis at a larger scale is necessary to achieve a better understanding of the SUS gene evolution in higher plants. In the current work, sixteen angiosperm species and three gymnosperm, fern ally and bryophyte species were chosen to show how the current SUS genes evolved and diverged from ancestral angiosperm SUS lineages. We investigated the classification, gene duplication, structural features, selection pressures and expression profiles of plant SUS genes. These results will provide a fundamental reference for understanding the evolutionary history of SUS genes and how evolutionary divergence contributes to the functional diversity of SUS genes.
Classification of SUS genes in angiosperms
A total of 96 SUS genes were identified in 16 angiosperm species including 10 dicot plants, 5 monocot plants and a basal angiosperm, Amborella trichopoda, using a hidden Markov model (HMM) and BLASTP searches (Fig. 1). Of all species surveyed, Glycine max possessed the greatest number of SUS genes (12), a number six-fold greater than that observed in A. trichopoda (2). Each of the remaining 14 species contained 5 to 8 SUS genes.
The 16 species surveyed in this study occupy important phylogenetic locations, as they include three major angiosperm lineages (monocots, asterids and rosids) and a basal angiosperm (Fig. 1). To trace the phylogenetic relationship of SUS genes in angiosperms, we constructed an unrooted phylogenetic tree of 96 SUS genes from these 16 species. The phylogenetic tree clearly classified the SUS gene family into three subfamilies named SUS I, II and III. Each subfamily further clustered into 3 groups: a monocot group, dicot group and basal angiosperm group, except for the SUS I subfamily, in which the basal angiosperm group was missing (Fig. 2). AtSUS5 and AtSUS6 contain a 3′ extension . In the present study, almost all proteins of SUS III subfamily genes exhibited C-terminal extension (Additional file 1: Figure S1), indicating that SUS III genes of angiosperms may have evolved from a common ancestor.
Intron/exon arrangement is often regarded as an important parameter in gene phylogenies [29, 30]. To confirm the classification of SUS genes, we analysed the predicted intron/exon structure of the 96 SUS genes between the start and stop codons (Additional file 2: Figure S2). Based on the intron/exon arrangement and the exon length (Additional file 6: Table S1) of the 96 SUS genes that have been investigated, we reconstructed the proposed ancient intron/exons structure of SUS I, II and III genes in angiosperms (Fig. 3). The ancient SUS I and SUS II genes had 15 exons, while the ancient SUS III gene had 17 exons as it contained a 3′ extension. The length of the third to the fourteenth exons of the three ancient SUS genes was highly conserved. Intron loss was a common phenomenon in the descendants of the ancient SUS genes in angiosperms, especially in the dicot group of SUS I and SUS III genes. For the dicot group of the SUS I genes, introns were lost between the 5th and 6th exons and between the 12th and 13th exons. The introns were also lost between the 12th and 13th exons in the dicot group of SUS III genes. Intron loss occurred in only a small number of SUS II genes (Additional file 2: Figure S2). In general, SUS I, II and III genes had different intron/exon arrangements. Taken together, our phylogenetic and intron/exon structure analyses of the 96 SUS genes in angiosperms clearly show that there are three ancestral subfamilies of SUS genes predating the divergence of monocots and dicots.
Evolutionary trajectory of SUS genes in higher plants
To trace the evolution of SUS genes in higher plants, we obtained 25 SUS homologous sequences from Physcomitrella patens (moss), Selaginella moellendorffii (fern), Picea abies (gymnosperm) and 3 angiosperm species (A. trichopoda, Arabidopsis thaliana and Oryza sativa). These sequences were then used to reconstruct a phylogenetic tree to infer the evolutionary trajectory of SUS genes with 7 SUS sequences from cyanobacteria and green algae as an outgroup. Two clades (clade A and clade B) of SUS genes were characterized in seed plants, both of which contained a gymnosperm branch and an angiosperm branch. SUS I and SUS II genes form the angiosperm branch of clade A, and SUS III genes form the angiosperm branch of clade B (Fig. 4). SUS genes of clade A are close to those of the bryophyte and pteridophyte branch in the phylogenetic tree (Fig. 4), and they also have similar intron/exon arrangements (Fig. 3; Additional file 3: Figure S3), indicating that clade A genes are more conserved than clade B genes. Previous research revealed an ancestral seed plant whole-genome duplication (WGD) event (ε) and an ancestral angiosperm WGD event (ζ) occurring shortly before the diversification of extant seed plants and extant angiosperms, respectively . According to the phylogenetic relationship shown in Fig. 4, SUS I and SUS II genes may derive from the ancestral angiosperm WGD event, and clade A and clade B SUS genes may derive from the ancestral seed plant WGD event. Proteins of clade B SUS genes contained a C-terminal extension similar to that of bryophyte and pteridophyte SUS proteins (Additional file 4: Figure S4). However, clade B SUS genes exhibited two more 3′-end exons, which were not found in clade A, the bryophyte nor the pteridophyte (Fig. 3; Additional file 3: Figure S3), indicating that clade B SUS genes are a novel type of SUS gene in seed plants.
Furthermore, we inferred the expansion patterns of SUS genes in angiosperms. Tandem duplication rarely occurs in SUS genes, while the expansion of SUS genes in angiosperms is mainly through WGD. In monocots, a WGD event (τ) has been inferred that occurred after their divergence from the eudicot clade , and a whole-genome triplication event (γ) is probably shared by all core eudicots [33,34,35]. A series of successful WGDs has been inferred in association with the Cretaceous-Paleogene (K-Pg) boundary [36, 37]. There are also more recent WGDs that seem to have occurred independently in many different plant lineages (Fig. 1). Taking the SUS genes in soybean as an example, the ‘gamma’ event, the papilionoid lineage WGD event, and the soybean lineage-specific WGD event increased the number of SUS genes by 2, 2 and 5, respectively (Fig. 5; Additional file 7: Table S2). Because of the high retention ratio of duplicated SUS genes in the soybean-lineage-specific WGD event, the number of SUS genes in soybeans is almost twice that in other species (Fig. 1). Interestingly, no expansion was found in the SUS II genes in the five monocot plants we investigated; they each contained only one SUS II gene, which was the same as the number found in A. trichopoda.
In summary, there are two monophyletic clades of SUS genes in seed plants, and clade B SUS genes may be a novel clade as they contain two more 3′-end exons. In angiosperms, there are three subfamilies of SUS genes, and the expansion of SUS genes in angiosperms is mainly through WGD, although the SUS II genes in grass have not expanded.
SUS III genes evolved under relaxed purifying selection in dicots
There is increasing evidence that different subfamilies from the same gene family may experience different purification selection pressures, form relatively fixed expression patterns and have different functions [38, 39]. To determine whether SUS genes are under different evolutionary constraints in angiosperms, the overall ratio of nonsynonymous substitutions per nonsynonymous site to synonymous substitutions per synonymous site (ω) values for the three subfamilies of SUS genes were calculated (Fig. 6; Additional file 8: Table S3). The overall ω value of SUS II genes (0.0630) was significantly lower than that of SUS I genes (0.0842) and SUS III genes (0.0962) in monocots, which indicated that the SUS II genes in monocots experienced strong purifying selection, probably because each of the five monocots we surveyed had only one SUS II gene (Fig. 1). In dicots, the overall ω value of SUS III genes (0.0990) was significantly higher than that of SUS I genes (0.0865) and SUS II (0.0889) genes, suggesting that SUS III genes were subjected to relaxed purifying selection and may have acquired new functions.
Expression profiles of SUS genes
Furthermore, we investigated the expression profiles of three subfamilies of SUS genes in A. thaliana, G. max, Solanum lycopersicum, Solanum tuberosum and O. sativa from publicly available RNA-seq data (Fig. 7). In Arabidopsis, AtSUS5 and AtSUS6 from the SUS III subfamily were expressed only in specific tissues, while SUS I (AtSUS1 and AtSUS4) and SUS II (AtSUS2 and AtSUS3) genes showed broader expression than SUS III genes (Fig. 7a). A similar scenario was also found in four other plant species (tomato, potato, soybean and rice; Fig. 7b-c). These expression data revealed that SUS III genes exhibited more tissue-specific expression patterns than SUS I and SUS II genes. Furthermore, SUS I genes from the five species displayed constitutive expression in diverse vegetative and reproductive organs (Fig. 7).
The distinct expression patterns of SUS genes in dicots (Fig. 7) agree with the different selection pressures they experienced (Fig. 6). SUS III genes evolved under relaxed purifying selection in dicots and showed tissue-specific expression patterns, while SUS I genes, which experienced greater evolutionary constraints, showed broader expression patterns. Interestingly, although SUS III genes did not evolve under relaxed evolutionary constraints, in contrast to SUS I genes in monocots, SUS III genes from rice still exhibited tissue-specific expression patterns (Fig. 7e). These findings reveal the conserved functions of SUS I genes in maintaining cellular sucrose metabolism, and SUS III genes may acquire new functions in the evolution of angiosperms. A similar pattern was observed for another gene involved in sucrose metabolism. The neutral/alkaline invertase gene showed a broad or constitutive expression pattern and experienced greater evolutionary constraints than the acid invertase genes, which exhibited more tissue-specific expression patterns .
CTD, EPBD and GT-B domains were subject to different selection pressures
The plant SUS polypeptide chain consists of a cellular targeting domain (CTD), an early nodulin 40 (ENOD40) peptide-binding domain (EPBD), a typical GT-B domain, and a C-terminal (Fig. 8a) [41, 42]. The N-terminal “regulatory” domain, including the CTD and EPBD, is involved in cellular targeting, and the GT-B domain is involved in the glycosyl transfer reaction. The general kinetic properties of all six SUS genes in Arabidopsis, which rely on the GT-B domain, are closely related to each other . Depending on the metabolic environment, SUS alters its cellular location from the cytosol to sites of cellulose, callose, and starch biosynthesis by its interactions with various organelle membranes [43,44,45] and cytoskeletal actin  through its “regulatory” domain . The CTD, EPBD and GT-B domain of SUS exhibit different functions, and these domains may also experience different selection pressures. We calculated the ω values of three subfamilies of SUS genes in monocots and dicots. To our surprise, the ω values of the GT-B domains of all three subfamilies of SUS genes are lower than those of the “regulatory” domains (Fig. 8b-c; Additional file 9: Table S4). The active sites in the GT-B domains are almost identical, both within and among the three subfamilies (Additional file 5: Figure S5). Therefore, the GT-B domains are more conserved and are subjected to more evolutionary constraints than the “regulatory” domains.
Sucrose is found in a wide range of organisms including cyanobacteria, unicellular algae and especially higher plants. It is usually synthesized in cyanobacteria under salt or osmotic stress and is believed to help maintain osmotic balance [47, 48]. However, in most higher plants, sucrose is the main end product of photosynthesis, and sucrose metabolism plays pivotal roles in the allocation of carbon resources and in the initiation of sugar signalling [1, 2]. Sucrose is cleaved either by SUS into UDP-glucose and fructose or by INV into glucose and fructose . The relationship between evolutionary steps and the functional implications of three types of INV have been elucidated by Wan et al. . Given that the evolutionary history of SUS genes in higher plants remains fragmented and elusive, a comprehensive understanding of their evolutionary trajectory, structural features, expression profiles, and functional significance will be valuable for improving crop yield by optimizing carbon resource allocation.
Origin, evolution and classification of plant SUS genes
The SUS gene might have originated in proteobacteria or a common ancestor of proteobacteria and cyanobacteria, and plants may have inherited it from cyanobacteria . Benefiting from the whole-genome sequencing of various plant species, a large number of SUS genes have been identified through comparative genome approaches, which can be used to investigate the origin, evolution and classification of SUS genes in plants. In our present study, the phylogenetic analysis showed that SUS genes from plants formed a monophyletic group, suggesting that all plant SUS genes might have originated from a common ancestor  (Fig. 4). WGD, or polyploidy, which is often followed by substantial gene loss and diploidization, is a common phenomenon in plants . The retained duplicated genes not only provide the genetic material necessary for biological innovation but also give rise to the diversity of plant homologous genes. SUS genes from seed plants formed two monophyletic clades (clade A and clade B) (Fig. 4), probably because the ancestor of seed plants experienced a WGD event , and the duplicated SUS gene copy was retained. Moreover, two subfamilies of angiosperm SUS genes in clade A (SUS I and SUS II) (Fig. 4) might stem from the ancestral angiosperm WGD event . Plant SUS genes have historically been divided into three major subfamilies based on their phylogenetic relationship (Sus1, Sus A and New Group/NG) [13, 50]. The phylogenetic analysis of angiosperm SUS genes in our research is consistent with this classification, and we renamed them SUS I, II and III, respectively (Fig. 2). Furthermore, clade A genes are closer to the original type of plant SUS gene than are clade B genes, as the former genes clustered with the SUS genes of the bryophyte and pteridophyte in the phylogenetic tree (Fig. 4) and have ancient intron/exon structures similar to those of the bryophyte and pteridophyte SUS genes (Fig. 3; Additional file 3: Figure S3). In contrast, clade B genes may be a new type that appeared in seed plants.
Each subfamily of angiosperm SUS genes consisted of at least two independent groups, i.e., a monocot group and a dicot group (Fig. 2), which was consistent with the classification of 55 SUS genes in angiosperms . In other studies involving the classification of angiosperm SUS genes, SUS II genes were also composed of the genes from monocots and dicots; however, genes from monocots and dicots did not group independently [17, 19], probably because the number of monocot SUS genes used was relatively small. Our results support the view that the SUS genes from monocots and dicots in each subfamily evolved independently. After the split of monocots and dicots, their ancestors underwent specific WGD events [32, 33, 35, 52]. Many species emerged during subsequent evolution and experienced lineage-specific WGD events . WGD events and subsequent retention and loss of specific SUS genes led to different evolutionary trajectories of SUS genes in different species [17, 18, 20].
Intron/exon structures, to a certain extent, allow us to predict the possible origin and relationships of SUS genes [13, 50]. In our study, the first 14 exons of the three ancient SUS genes of angiosperms were highly conserved (Fig. 3), further suggesting that all three subfamilies of SUS genes may be derived from a common ancestor . The most obvious difference in intron/exon structures among the three SUS subfamilies is that most SUS III genes have two more exons at the 3′ end (Additional file 2: Figure S2), which are not found in bryophyte and pteridophyte SUS genes (Additional file 3: Figure S3). However, the function of the 3′ extension in SUS III genes remains unknown. Each subfamily of SUS genes had different degrees of intron loss, and the intron loss between the 12th and 13th exons was conserved in dicots of both SUS I and III genes (Additional file 2: Figure S2) [13, 50]. The evolutionary and functional significance of intron loss in SUS genes requires further research.
SUS I genes may play critical roles in sucrose metabolism in an O2-deficient environment
Sucrose metabolism is vital to multicellular plants and is degraded by either SUS or INV; however, the precise roles of these enzymes in specific plants remain largely unknown [2, 15]. Both SUS and INV appear as multiple, distinct isoforms. The cytoplasmic INV (CIN) genes are ancient and may play pivotal roles in maintaining cytosolic sugar homeostasis and cellular functions. The cell wall INV (CWIN) and vacuolar INV (VIN) genes are subject to relaxed purifying selection pressure, and CWIN genes have coevolved with vascular plants, probably as a functional component of phloem unloading . SUS genes can be clearly divided into three subfamilies in angiosperms (Fig. 2) [13, 20]; however, the precise functions of each subfamily have not been elucidated. We speculate that each subfamily of SUS genes may have different functions.
Conversion of sucrose to hexose phosphates via SUS requires only half the adenosine triphosphate (ATP) needed for conversion via INV, and the SUS route is thought to be more effective than the INV route in an O2-deficient environment, where ATP synthesis may be limited [1, 14, 15]. The induction of some SUS genes by hypoxia or anoxia is a widespread phenomenon in both monocot and dicot species. The expression of Sus1 and Sh1 in maize, Ss1 in wheat, and Susy∗Dc1 in carrot is induced or enhanced under hypoxic or anoxic conditions [53,54,55], and all these genes originate from the SUS I gene subfamily. Likewise, in A. thaliana, transcript levels increase for AtSUS1 and AtSUS4 but not other SUS genes under hypoxia (Additional file 10: Table S5) [13, 56], and the double mutant of these two genes shows marked growth retardation under hypoxia . Antisense suppression of a cucumber SUS I gene (CsSUS3) reduces hypoxic stress tolerance . Consistently, some SUS genes have long been considered a biochemical marker for sink strength, especially in metabolically highly active or bulky organs where the endogenous oxygen level may be low. For instance, mutation of maize lacking either Sus1 or Sh1 leads to reduced starch content , whereas antisense inhibition of specific SUS genes drastically reduces starch accumulation in potato tubers , and represses fibre elongation and seed development in cotton . Furthermore, we investigated the SUS genes associated with sink strength and found that almost all of these SUS genes derived from SUS I (Table 1). Accordingly, we speculate that SUS I genes but not SUS II and III genes are responsible for sucrose conversion in an O2-deficient environment. Consistent with this view, haplotype association revealed that two SUS I genes (TaSus1 and TaSus2) from wheat were associated with thousand kernel weight, which mainly depends on the rate and amount of starch synthesis [58, 59]. The SUS I genes showed constitutive expression in diverse vegetative and reproductive organs (Fig. 7), and their roles in sucrose metabolism could be replaced by INV genes under normal oxygen levels . However, in the case of insufficient oxygen content, SUS I genes are irreplaceable.
SUS II and SUS I genes have similar ancient intron/exon structures and may stem from the ancestral angiosperm WGD event. These two subfamilies SUS genes showed different expression profiles (Fig. 7) , indicating that their functions may have undergone a certain degree of differentiation. For example, two SUS II genes from Arabidopsis, namely, AtSUS2 and AtSUS3, are not induced in response to O2 deficiency, and the double mutant of these two genes is not obviously different from the wild-type (WT) control, although these two genes are strongly expressed in seeds [13, 14]. Jiang et al.  reported a cotton SUS II gene (Table 1), GhSusA1, which is closely associated with productivity as a key regulator of sink strength, indicating that some SUS II genes may have functional overlap with the SUS I gene in specific plants.
SUS III genes exhibit a narrow expression profile, although their functions remain unknown
SUS III genes have intron/exon structures differing from those of SUS I and SUS II genes (Additional file 2: Figure S2), exhibit a narrow expression profile (Fig. 7), and are subject to relaxed purifying selection pressure in dicots (Fig. 6), suggesting that SUS III genes may have functions different from those of SUS I and SUS II genes. Wan et al.  reported that CWIN genes, which emerged in higher plants and show tissue-specific expression patterns, likely coevolved with the vascular development of higher plants. Two Arabidopsis SUS III genes, namely, AtSUS5 and AtSUS6, are expressed only in specific tissues and organs (Fig. 6), and the proteins encoded by these two genes are present specifically in the phloem . A double mutant of these two genes shows a thinner callose layer lining the pores of sieve plates than did WT plants. These two SUS III genes are considered to be involved in callose formation in the sieve plate . Thus, we propose that SUS III genes may also be involved in the vascular development of higher plants.
Evolutionary conservation and divergence of plant SUS genes
As discussed above, all plant SUS genes may have evolved from a common ancestor. In the plant species examined to date, SUS is encoded by a small multigene family. Comparative screening of the intron/exon structures of three subfamilies of SUS genes indeed revealed that the number and position of introns are highly conserved in angiosperms, although some introns were lost in specific SUS genes (Fig. 3; Additional file 2: Figure S2). Furthermore, all 16 active sites in the GT-B domain of AtSUS1 are almost identical among angiosperm SUS genes  (Additional file 5: Figure S5), and the GT-B domain has undergone strong purifying selection (Fig. 8). In addition, the isoforms of SUS from different subfamilies in Arabidopsis have similar kinetic properties . All these results suggest that the SUS gene is structurally and functionally conserved in plants. Therefore, we speculate that different SUS genes may fulfil similar functions in different cell types or organelles at different developmental stages or under different stress conditions. Our analysis showed that three subfamilies of angiosperm SUS genes displayed distinct expression profiles, and these expression profiles may have been formed before the divergence of monocots and dicots. In general, homologous or duplicated SUS genes derived from WGD events within each subfamily inherited the expression patterns of their ancestors (Fig. 7). Moreover, according to the metabolic environment, SUS changes its cellular location to take part in the biosynthesis of cellulose, callose, and starch through its interactions with various organelle membranes and cytoskeletal actin . The specificity of spatiotemporal expression and the variability of protein subcellular localization contribute to the functional diversity of SUS genes. Identifying the function of individual SUS genes is challenging not only because of the functional redundancy among duplicated SUS genes , but also because the VIN gene can partially functionally replace the SUS gene .
The angiosperm SUS gene family can be divided into three subfamilies (SUS I, II and III) based on their phylogenetic relationships and unique intron/exon structures, and they were found to have evolved independently in monocots and dicots. Each subfamily of SUS genes exhibited distinct expression patterns in a wide range of plants, and SUS III genes evolved under relaxed purifying selection in dicots and displayed narrowed expression profiles. This work should provide a foundation for understanding the evolutionary history of SUS genes and how evolutionary divergence contributes to the functional diversity of SUS genes.
Materials and methods
The genomic sequences, annotations and gene models of A. thaliana, Capsella rubella, Brassica rapa, G. max, Pinguicula vulgaris, Medicago truncatula, Vitis vinifera, S. lycopersicum, S. tuberosum, Daucus carota, Setaria italica, Zea mays, Sorghum bicolor, Brachypodium distachyon, O. sativa, A. trichopoda, S. moellendorffii, P. patens, and Coccomyxa subellipsoidea were collected from Phytozome (https://phytozome.jgi.doe.gov/pz/portal.html). Data for P. abies were downloaded from ConGenIE (http://congenie.org/). The tetrameric structure of AtSUS1 from A. thaliana was obtained from the Research Collaboratory for Structural Bioinformatics (RCSB) Protein Data Bank (PDB code 3S27).
Identification of SUS genes
We combined an HMM and BLASTP searches to identify putative SUS genes in the 19 species. First, the HMM profiles of the SUS domain (PF00862) and glycosyl transferase group 1 domain (PF00534) were obtained from the Pfam website (http://pfam.xfam.org/), and these two HMM profiles were then employed as queries to identify all possible SUS genes using HMMER (V3.0) software. Second, the amino acid sequences of the six AtSUS genes  and seven OsSUS genes  were used to run a BLASTP search against all protein sequences in each species, with the threshold expectation value set to 1E-10. All hits obtained from HMM and BLASTP searches were merged together, and the redundant hits were removed. Finally, all candidate sequences were further subjected to online Pfam analysis (http://pfam.xfam.org/) to further confirm that they had both a SUS domain and glycosyl transferase group 1 domain. The protein sequences lacking the SUS domain or the glycosyl transferase group 1 domain were removed.
Phylogenetic and gene structural analysis
Amino acid sequences were aligned using ClustalW, and gaps and poorly aligned sections were manually removed. Phylogenetic tree reconstruction was performed with the maximum likelihood (ML) approach using the aligned amino acid sequences in MEGA v7.0. The parameters were as follows: model, WAG; bootstraps, 1000 replicates; and gaps/missing data, partial deletion. The structure of the SUS genes was parsed from general feature format (GFF) files, and diagrams of the intron/exon structures were drawn using the online program Plant Intron Exon Comparison and Evolution (PIECE) (https://wheat.pw.usda.gov/piece/GSDraw.php).
Estimation of K a/K S ratios
The codon sequences of homologous gene pairs were aligned using ClustalW based on the amino acid sequences. The ratio of nonsynonymous substitutions per nonsynonymous site (Ka) to synonymous substitutions per synonymous site (KS) (ω value) was calculated using DnaSP version 5. Saturation effects were avoided by discarding the gene pairs for which KS > 2. The ω value is commonly considered a measure of selection at the protein level, with values of ω > 1, =1 and < 1 indicating positive selection, neutral evolution and negative or purifying selection, respectively.
Availability of data and materials
The datasets supporting the conclusions of this manuscript are included within the article and its additional files.
Cellular targeting domain
Cell wall INV
Early nodulin 40
ENOD40 peptide-binding domain
Fragments per kilobase of exon model per million reads mapped
Hidden Markov model
Million years ago
Protein Data Bank
Whelan And Goldman
Koch K. Sucrose metabolism: regulatory mechanisms and pivotal roles in sugar sensing and plant development. Curr Opin Plant Biol. 2004;7:235–46.
Ruan YL. Sucrose metabolism: gateway to diverse carbon use and sugar signaling. Annu Rev Plant Biol. 2014;65:33–67.
Kleczkowski LA, Kunz S, Wilczynska M. Mechanisms of UDP-glucose synthesis in plants. Crit Rev Plant Sci. 2010;29:191–203.
Craig J, Barratt P, Tatge H, Déjardin A, Handley L, Gardner CD, Barber L, Wang T, Hedley C, Martin C, et al. Mutations at the rug4 locus alter the carbon and nitrogen metabolism of pea plants through an effect on sucrose synthase. Plant J. 1999;17:353–62.
Chourey PS, Taliercio EW, Carlson SJ, Ruan YL. Genetic evidence that the two isozymes of sucrose synthase present in developing maize endosperm are critical, one for cell wall integrity and the other for starch biosynthesis. Mol Gen Genet. 1998;259:88–96.
D'Aoust MA, Yelle S, Nguyen-Quoc B. Antisense inhibition of tomato fruit sucrose synthase decreases fruit setting and the sucrose unloading capacity of young fruit. Plant Cell. 1999;11:2407–18.
Zrenner R, Salanoubat M, Willmitzer L, Sonnewald U. Evidence of the crucial role of sucrose synthase for sink strength using transgenic potato plants (Solanum tuberosum L.). Plant J. 1995;7:97–107.
Tang GQ, Sturm A. Antisense repression of sucrose synthase in carrot (Daucus carota L.) affects growth rather than sucrose partitioning. Plant Mol Biol. 1999;41:465–79.
Ruan YL, Llewellyn DJ, Furbank RT. Suppression of sucrose synthase gene expression represses cotton fiber cell initiation, elongation, and seed development. Plant Cell. 2003;15:952–64.
Baroja-Fernández E, Muñoz FJ, Montero M, Etxeberria E, Sesma MT, Ovecka M, Bahaji A, Ezquer I, Li J, Prat S, et al. Enhancing sucrose synthase activity in transgenic potato (Solanum tuberosum L.) tubers results in increased levels of starch, ADPglucose and UDPglucose and total yield. Plant Cell Physiol. 2009;50:1651–62.
Li J, Baroja-Fernández E, Bahaji A, Muñoz FJ, Ovecka M, Montero M, Sesma MT, Alonso-Casajús N, Almagro G, Sánchez-López AM, et al. Enhancing sucrose synthase activity results in increased levels of starch and ADP-glucose in maize (Zea mays L.) seed endosperms. Plant Cell Physiol. 2013;54:282–94.
Xu SM, Brill E, Llewellyn DJ, Furbank RT, Ruan YL. Overexpression of a potato sucrose synthase gene in cotton accelerates leaf expansion, reduces seed abortion, and enhances fiber production. Mol Plant. 2012;5:430–41.
Baud S, Vaultier MN, Rochat C. Structure and expression profile of the sucrose synthase multigene family in Arabidopsis. J Exp Bot. 2004;55:397–409.
Bieniawska Z, Paul Barratt DH, Garlick AP, Thole V, Kruger NJ, Martin C, Zrenner R, Smith AM. Analysis of the sucrose synthase gene family in Arabidopsis. Plant J. 2007;49:810–28.
Barratt DH, Derbyshire P, Findlay K, Pike M, Wellner N, Lunn J, Feil R, Simpson C, Maule AJ, Smith AM. Normal growth of Arabidopsis requires cytosolic invertase but not sucrose synthase. Proc Natl Acad Sci U S A. 2009;106:13124–9.
Barratt DH, Barber L, Kruger NJ, Smith AM, Wang TL, Martin C. Multiple, distinct isoforms of sucrose synthase in pea. Plant Physiol. 2001;127:655–64.
Chen A, He S, Li F, Li Z, Ding M, Liu Q, Rong J. Analyses of the sucrose synthase gene family in cotton: structure, phylogeny and expression patterns. BMC Plant Biol. 2012;12:85.
Islam MZ, Hu XM, Jin LF, Liu YZ, Peng SA. Genome-wide identification and expression profile analysis of citrus sucrose synthase genes: investigation of possible roles in the regulation of sugar accumulation. PLoS One. 2014;9:e113623.
An X, Chen Z, Wang J, Ye M, Ji L, Wang J, Liao W, Ma H. Identification and characterization of the Populus sucrose synthase gene family. Gene. 2014;539:58–67.
Zhu X, Wang M, Li X, Jiu S, Wang C, Fang J. Genome-wide analysis of the sucrose synthase gene family in grape (Vitis vinifera): structure, evolution, and expression profiles. Genes (Basel). 2017;8:111.
Li F, Hao C, Yan L, Wu B, Qin X, Lai J, Song Y. Gene structure, phylogeny and expression profile of the sucrose synthase gene family in cacao (Theobroma cacao L.). J Genet. 2015;94:461–72.
Angiosperm Phylogeny Group. An update of the angiosperm phylogeny group classification for the orders and families of flowering plants: APG IV. Bot J Linn Soc. 2016;181:1–20.
Iorizzo M, Ellison S, Senalik D, Zeng P, Satapoomin P, Huang J, Bowman M, Iovene M, Sanseverino W, Cavagnaro P, et al. A high-quality carrot genome assembly provides new insights into carotenoid accumulation and asterid genome evolution. Nat Genet. 2016;48:657–66.
Potato Genome Sequencing Consortium. Genome sequence and analysis of the tuber crop potato. Nat. 2011;475:189–95.
Schmutz J, Cannon SB, Schlueter J, Ma J, Mitros T, Nelson W, Hyten DL, Song Q, Thelen JJ, Cheng J, et al. Genome sequence of the palaeopolyploid soybean. Nat. 2010;463:178–83.
Tomato Genome Consortium. The tomato genome sequence provides insights into fleshy fruit evolution. Nat. 2012;485:635–41.
Kumar S, Stecher G, Tamura K. MEGA7: molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol Biol Evol. 2016;33:1870–4.
Whelan S, Goldman N. A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach. Mol Biol Evol. 2001;18:691–9.
Sánchez D, Ganfornina MD, Gutiérrez G, Marin A. Exon-intron structure and evolution of the Lipocalin gene family. Mol Biol Evol. 2003;20:775–83.
Shao ZQ, Xue JY, Wu P, Zhang YM, Wu Y, Hang YY, Wang B, Chen JQ. Large-scale analyses of angiosperm nucleotide-binding site-leucine-rich repeat genes reveal three anciently diverged classes with distinct evolutionary patterns. Plant Physiol. 2016;170:2095–109.
Jiao Y, Wickett NJ, Ayyampalayam S, Chanderbali AS, Landherr L, Ralph PE, Tomsho LP, Hu Y, Liang H, Soltis PS, et al. Ancestral polyploidy in seed plants and angiosperms. Nat. 2011;473:97–100.
Jiao Y, Li J, Tang H, Paterson AH. Integrated syntenic and phylogenomic analyses reveal an ancient genome duplication in monocots. Plant Cell. 2014;26:2792–802.
Jaillon O, Aury JM, Noel B, Policriti A, Clepet C, Casagrande A, Choisne N, Aubourg S, Vitulo N, Jubin C, et al. The grapevine genome sequence suggests ancestral hexaploidization in major angiosperm phyla. Nat. 2007;449:463–7.
Tang H, Wang X, Bowers JE, Ming R, Alam M, Paterson AH. Unraveling ancient hexaploidy through multiply-aligned angiosperm gene maps. Genome Res. 2008;18:1944–54.
Jiao Y, Leebens-Mack J, Ayyampalayam S, Bowers JE, McKain MR, McNeal J, Rolf M, Ruzicka DR, Wafula E, Wickett NJ, et al. A genome triplication associated with early diversification of the core eudicots. Genome Biol. 2012;13:R3.
Vanneste K, Baele G, Maere S, Van de Peer Y. Analysis of 41 plant genomes supports a wave of successful genome duplications in association with the cretaceous-Paleogene boundary. Genome Res. 2014;24:1334–47.
Lohaus R, Van de Peer Y. Of dups and dinos: evolution at the K/Pg boundary. Curr Opin Plant Biol. 2016;30:62–9.
Wan H, Wu L, Yang Y, Zhou G, Ruan YL. Evolution of sucrose metabolism: the dichotomy of invertases and beyond. Trends Plant Sci. 2018;23:163–77.
Xu X, Feng Y, Fang S, Xu J, Wang X, Guo W. Genome-wide characterization of the beta-1,3-glucanase gene family in Gossypium by comparative analysis. Sci Rep. 2016;6:29044.
Petryszak R, Keays M, Tang YA, Fonseca NA, Barrera E, Burdett T, Füllgrabe A, Fuentes AM, Jupp S, Koskinen S, et al. Expression atlas update--an integrated database of gene and protein expression in humans, animals and plants. Nucleic Acids Res. 2016;44:D746–52.
Zheng Y, Anderson S, Zhang Y, Garavito RM. The structure of sucrose synthase-1 from Arabidopsis thaliana and its functional implications. J Biol Chem. 2011;286:36108–18.
Schmolzer K, Gutmann A, Diricks M, Desmet T, Nidetzky B. Sucrose synthase: a unique glycosyltransferase for biocatalytic glycosylation process development. Biotechnol Adv. 2016;34:88–111.
Winter H, Huber SC. Regulation of sucrose metabolism in higher plants: localization and regulation of activity of key enzymes. Crit Rev Biochem Mol Biol. 2000;35:253–89.
Etxeberria E, Gonzalez P. Evidence for a tonoplast-associated form of sucrose synthase and its potential involvement in sucrose mobilization from the vacuole. J Exp Bot. 2003;54:1407–14.
Subbaiah CC, Palaniappan A, Duncan K, Rhoads DM, Huber SC, Sachs MM. Mitochondrial localization and putative signaling function of sucrose synthase in maize. J Biol Chem. 2006;281:15625–35.
Duncan KA, Huber SC. Sucrose synthase oligomerization and F-actin association are regulated by sucrose concentration and phosphorylation. Plant Cell Physiol. 2007;48:1612–23.
Hagemann M, Marin K. Salt-induced sucrose accumulation is mediated by sucrose-phosphate-synthase in cyanobacteria. J Plant Physiol. 1999;155:424–30.
Lunn JE. Evolution of sucrose synthesis. Plant Physiol. 2002;128:1490–500.
Braun DM, Wang L, Ruan YL. Understanding and manipulating sucrose phloem loading, unloading, metabolism, and signalling to enhance crop yield and food security. J Exp Bot. 2014;65:1713–35.
Komatsu A, Moriguchi T, Koyama K, Omura M, Akihama T. Analysis of sucrose synthase genes in citrus suggests different roles and phylogenetic relationships. J Exp Bot. 2002;53:61–71.
Zhang D, Xu B, Yang X, Zhang Z, Li B. The sucrose synthase gene family in Populus: structure, expression, and evolution. Tree Genet Genomes. 2010;7:443–56.
Tang H, Bowers JE, Wang X, Paterson AH. Angiosperm genome comparisons reveal early polyploidy in the monocot lineage. Proc Natl Acad Sci U S A. 2010;107:472–7.
Marana C, Garcia-Olmedo F, Carbonero P. Differential expression of two types of sucrose synthase-encoding genes in wheat in response to anaerobiosis, cold shock and light. Gene. 1990;88:167–72.
Zeng Y, Wu Y, Avigne WT, Koch KE. Differential regulation of sugar-sensitive sucrose synthases by hypoxia and anoxia indicate complementary transcriptional and posttranscriptional responses. Plant Physiol. 1998;116:1573–83.
Sturm A, Lienhard S, Schatt S, Hardegger M. Tissue-specific expression of two genes for sucrose synthase in carrot (Daucus carota L.). Plant Mol Biol. 1999;39:349–60.
Klok EJ, Wilson IW, Wilson D, Chapman SC, Ewing RM, Somerville SC, Peacock WJ, Dolferus R, Dennis ES. Expression profile analysis of the low-oxygen response in Arabidopsis root cultures. Plant Cell. 2002;14:2481–94.
Wang H, Sui X, Guo J, Wang Z, Cheng J, Ma S, Li X, Zhang Z. Antisense suppression of cucumber (Cucumis sativus L.) sucrose synthase 3 (CsSUS3) reduces hypoxic stress tolerance. Plant Cell Environ. 2014;37:795–810.
Jiang Q, Hou J, Hao C, Wang L, Ge H, Dong Y, Zhang X. The wheat (T. aestivum) sucrose synthase 2 gene (TaSus2) active in endosperm development is associated with yield traits. Funct Integr Genomics. 2011;11:49–61.
Hou J, Jiang Q, Hao C, Wang Y, Zhang H, Zhang X. Global selection on sucrose synthase haplotypes during a century of wheat breeding. Plant Physiol. 2014;164:1918–29.
Jiang Y, Guo W, Zhu H, Ruan YL, Zhang T. Overexpression of GhSusA1 increases plant biomass and improves cotton fiber yield and quality. Plant Biotechnol J. 2012;10:301–12.
Cho J-I, Kim H-B, Kim C-Y, Hahn T-R, Jeon J-S. Identification and characterization of the duplicate Rice sucrose synthase genes OsSUS5 and OsSUS7 which are associated with the plasma membrane. Mol Cells. 2011;31:553–61.
We are grateful to Yongxia Zhang, Qingquan Liu, Yinjie Wang and Weilin Wang for the precious assistance and comments in improving our research.
This work was supported by the National Natural Science Foundation of China (31701497), and the Jiangsu Key Laboratory for the Research and Utilization of Plant Resources (JSPKLB201801 and JSPKLB201832) (China). The funders had no role in study design, data collection, analysis and interpretation, or preparation of the manuscript.
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Figure S1. C-terminal amino acid alignment of SUS genes.
Figure S2. Intron/exon structural organization of angiosperms SUS genes. Blue boxes denote exons within coding regions, and the gray lines connecting them represent introns. Boxes of other colors represent fused exons, and the numbers above them indicate which exons are fused. Red boxes indicate the splitting of the exons. Due to the complexity of the 3′ end of the SUS III subfamily genes, we do not show the fusion of their exons.
Figure S3. Intron/exon structural organization of SUS genes from P. patens and S. moellendorffii. Blue boxes denote exons within coding regions, and the gray lines connecting them represent introns. Red numbers on the upper left of the gray lines represent the intron phase of corresponding introns. Boxes of other colors represent fused exons, and the numbers above them indicate which exons are fused.
Figure S4. C-terminal amino acid alignment of SUS genes from A. thaliana, O. sativa, P. patens and S. moellendorffii.
Figure S5. Amino acid alignment of GT-B domain of SUS genes. The 16 active sites (His-287, Gly-302, Gly-303, Gln-304, Arg-382, His-438, Met-578, Arg-580, Gln-648, Asn-654, Glu-675, Phe-677, Gly-678, Leu-679, Thr-680, Glu-683) identified in the GT-B domain of AtSUS1 were colored in orange.
Table S1. Exon length (bp) statistics of angiosperms SUS genes.
Table S2. Ks values of SUS gene pairs within subfamily of G. max.
Table S3. Estimates of Ka/Ks (ω) values of SUS genes.
Table S4. Estimates of Ka/Ks (ω) values of three domains of SUS genes.
Table S5. Expression patterns of Arabidopsis SUS genes under hypoxia. The expression data of six SUS genes in Arabidopsis comes from Gene Expression Omnibus (GEO) DataSets (GSE119327). We analyzed the expression of six SUS genes in Arabidopsis under hypoxia and found that only AtSUS1 and AtSUS4 were induced by hypoxia. adj. P.Val: P-value after adjustment for multiple testing. logFC: Log2-fold change between two experimental conditions.
About this article
Cite this article
Xu, X., Yang, Y., Liu, C. et al. The evolutionary history of the sucrose synthase gene family in higher plants. BMC Plant Biol 19, 566 (2019). https://0-doi-org.brum.beds.ac.uk/10.1186/s12870-019-2181-4
- Sucrose synthase
- Gene family
- Expression pattern