- Research article
- Open Access
Identification and structural characterization of FYVE domain-containing proteins of Arabidopsis thaliana
BMC Plant Biology volume 10, Article number: 157 (2010)
FYVE domains have emerged as membrane-targeting domains highly specific for phosphatidylinositol 3-phosphate (PtdIns(3)P). They are predominantly found in proteins involved in various trafficking pathways. Although FYVE domains may function as individual modules, dimers or in partnership with other proteins, structurally, all FYVE domains share a fold comprising two small characteristic double-stranded β-sheets, and a C-terminal α-helix, which houses eight conserved Zn2+ ion-binding cysteines. To date, the structural, biochemical, and biophysical mechanisms for subcellular targeting of FYVE domains for proteins from various model organisms have been worked out but plant FYVE domains remain noticeably under-investigated.
We carried out an extensive examination of all Arabidopsis FYVE domains, including their identification, classification, molecular modeling and biophysical characterization using computational approaches. Our classification of fifteen Arabidopsis FYVE proteins at the outset reveals unique domain architectures for FYVE containing proteins, which are not paralleled in other organisms. Detailed sequence analysis and biophysical characterization of the structural models are used to predict membrane interaction mechanisms previously described for other FYVE domains and their subtle variations as well as novel mechanisms that seem to be specific to plants.
Our study contributes to the understanding of the molecular basis of FYVE-based membrane targeting in plants on a genomic scale. The results show that FYVE domain containing proteins in plants have evolved to incorporate significant differences from those in other organisms implying that they play a unique role in plant signaling pathways and/or play similar/parallel roles in signaling to other organisms but use different protein players/signaling mechanisms.
The FYVE lipid-binding domains were named after the first letter of the four proteins in which they were originally discovered: Fab1, YOTB, Vac1, and EEA1 . FYVE proteins have primarily been associated with functions related to endosomal trafficking e.g. Hrs is involved in sorting of down-regulated receptor molecules in early endosomes , Vacuolar protein sorting mutant 27 phenotype (Vps27p) in endosome maturation , EEA1 in endocytic membrane fusion  and regulation of endosome-to-TGN retrograde transport via phosphatidylinositol 3-phosphate 5-kinase (PIKfyve) . However, they may play other important roles in cell signaling as exemplified by Faciogenital dysplasia 1 in cytoskeletal regulation , Fab1p in regulation of membrane homeostasis [7–9] and Smad Anchor for Receptor Activation (SARA)  as well as endofin in growth factor signaling [11–13]. Structurally, FYVE domains share a fold comprising of two small double-stranded β-sheets and a C-terminal α-helix as deduced from experimentally solved structures such as the crystal structure of the FYVE domain from yeast Vps27p . The fold is stabilized by eight Zn2+ coordinating cysteines residues, which bind Zn2+ in pairs such that the first and third pairs bind one zinc atom, while the second and fourth pairs bind the other zinc atom . The FYVE domains have been characterized as phosphoinositide-binding domains that are highly specific for the phosphatidylinositol 3 phosphate (PtdIns(3)P) [15–18]. This ligand recognition is Zn2+-dependent  and stems primarily from a conserved ligand-binding motif, i.e. (R/K)(R/K)HHCR surrounding the third and fourth cysteine residues . Mutagenesis of either the cysteines involved in Zn2+ coordination or the ligand-binding conserved residues result in decreased affinity for PtdIns(3)P [15, 19–21].
The PtdIns(3)P-binding signature contains three classic conserved regions: the N-terminal WxxD, the central R(R/K)HHCR and the C-terminal R(V/I)C motifs . Combined they drive the PtdIns(3)P specific membrane recruitment of FYVE domains. However, there are several factors in addition to PtdIns(3)P-binding that are thought to contribute to the membrane affinity of FYVE domains: nonspecific electrostatic interactions between the basic face of the domain and the anionic membrane surface [22–24], hydrophobic interactions between the residues located in the "turret loop" near the PtdIns(3)P binding pocket and the membrane bilayer [14, 23–26], dimerization [19, 27] and pH . In additional to working out the structural and functional role of various amino acids comprising the binding motifs, it has also been shown that the binding of PtdIns(3)P to the ligand-binding pocket of FYVE domains neutralizes nearby basic residues to reduce the local positive potential and allow conserved hydrophobic residues to penetrate the membrane interface enhancing membrane attachment [22, 24, 25]. Recently, a molecular dynamics simulations study explored the interactions of the EEA1-FYVE domain and verified that it undergoes a decrease in dynamic flexibility upon binding to its PtdIns(3)P ligand and a phospholipid bilayer .
The PtdIns(3)P-binding FYVE domains are well conserved in various organisms and have been studied extensively in different model organisms except plants. Plants possess several FYVE domain-containing proteins and PtdIns(3)P has been shown to be present in various compartments  as well as membranes  of plant cells. It is possible to envision that plant cells utilize the same or highly similar lipid-binding and membrane-targeting mechanisms  for FYVE domains given that both the FYVE domains and type III PI3-kinase, which makes PtdIns(3)P, are present in plant cells . However some recent reports suggest that PtdIns(3)P may not be the only known phosphoinositide ligand recognized by plant FYVE domains, for example, the FYVE of EEA1 has been shown to be capable of binding to PtdIns(5)P [32, 33].
We have undertaken a comprehensive examination of all FYVE domains of the model plant Arabidopsis thaliana (At) to understand the structural basis for the mechanism of their function and to explore their similarities and differences with respect to other organisms. We describe the 15 different FYVE domain-containing proteins that are expressed in Arabidopsis, all of which are largely unexplored. Our detailed sequence analysis and biophysical characterization of the structural models of the FYVE domains in Arabidopsis suggest membrane interaction mechanisms and their subtleties. Moreover, the study also reveals unique biophysical properties of plant FYVE domains, a new binding motif specific only to the variant class of plant FYVE domains and novel domain architectures unique to plant FYVE proteins.
Identification, characterization and chromosomal localization of FYVE domain-containing proteins encoded in the Arabidopsis genome
The total number of FYVE domain-containing proteins seems to be directly correlated with the total estimated number of genes for a given organism, e.g. 27 FYVE encoding genes in a total of 42,000 in H. sapiens, 13 in a total of 18,000 in C. elegans and 5 in a total of 6,000 in S. cerevisiae . We identified 15 AtFYVE proteins in the Arabidopsis protein sequence database i.e. TAIR first genome release (version TAIR 6.0, Nov 2005). Later genome releases built upon the gene structures of TAIR6 release as well as community input regarding missing and incorrectly annotated genes and they do not contain any new genes encoding FYVE proteins. Our finding of 15 FYVE proteins encoded within predicted 25,500 genes  of the Arabidopsis genome falls in line with the above observation. The initial identification was done using an automated pipeline . Later, the total number of AtFYVE proteins and their individual accession numbers were verified through manual searches performed in various databases. The 15 FYVE domains present in various Arabidopsis proteins (representing the entire family of AtFYVE proteins) aligned with human EEA1 FYVE domain (PDB: 1JOC chain A ) are shown in Fig. 1A. Fig. 1B displays the schematic localization of the 15 AtFYVE proteins within the Arabidopsis genome. The 15 identified sequences of AtFYVE proteins are dispersed throughout the Arabidopsis genome, being located on all chromosomes except chromosome 2 (Fig. 1B). The disagreement of our total with previously reported totals of nine , over ten  and most recently, sixteen  FYVE domains stems from misannotations. For example, AT1G61620, AT1G66040, AT1G66050 and AT5G39550 proteins are all annotated as FYVE proteins but do not actually possess FYVE domains based on various sequence analysis methods.
Domain Architecture of Arabidopsis FYVE proteins
On the basis of domain architecture of the proteins, we propose five classes of AtFYVE proteins (Fig. 2). Class I comprises two out of four documented Arabidopsis Fab1p homologues expressed in plants, i.e. AT3G14270 and AT4G33240 [40, 41]. The other two Fab1 homologues do not contain a FYVE domain [40, 41]. Class I members, i.e. AT3G14270 and AT4G33240, contain a FYVE domain, followed by Fab1_TCP(chaparonin-like) and PIPKc domains. AT3G14270 and AT4G33240 are annotated in NCBI database as "phosphatidylinositol-4-phosphate 5-kinase family proteins" while in UniProtKB/TrEMBL as "putative uncharacterized proteins." Our Blast analysis reveals similarity of both class I members to ppk-3 (C. elegans), Fab1p (S. cerevisiae), and phosphatidylinositol-3-phosphate 5-kinase type III (H. sapiens) (see supplementary material). Ppk-3 and Fab1p proteins share domain architecture identical to class I members and phosphatidylinositol-3-phosphate 5-kinase type III protein has an additional DEP domain (see supplementary material). Class II is represented by two sequences, AT3G43230 and AT1G29800, which possess two domains: a FYVE and a Domain of Unknown Function (DUF500). Class III comprises the AT1G61690 protein and class IV comprises the AT1G20110 protein. Both classes are unique in that they contain only a FYVE domain but they differ in the placement of the FYVE domain (N-terminus versus C-terminus) and also their biophysical properties (this study). UniProtKB/TrEMBL annotates function for class II-IV as putative uncharacterized. The representation of class II-IV members in the literature is full of contradictions. They are not mentioned in the classification by Drobak and Heras  and AT1G29800 of class II together with AT1G61690 of class III are omitted from the classification by Jensen et al . Moreover, class IV protein was identified as AtAAF79901 and shown to contain a FYVE domain followed by a plant specific SGNH-plant-lipase-like domain . Our analysis of the sequence suggests, however, that class IV protein is over 300 amino acids shorter than AtAAF79901, and it does not contain a SGNH-plant-lipase-like domain. Class II sequences, seem to contain an additional DUF500 domains not represented by van Leeuwen . Class V is the largest class. It includes nine AtFYVE proteins, which share similar domain architecture, i.e. Pleckstrin Homology of Phospholipase C (PH_PLC), followed by Regulator of Chromosome Condensation 1 (RCC1) regions/blades (overlapping with Alpha Tubulin Suppressor 1 (ATS1)) and FYVE domains. In addition, seven out of nine class V proteins are characterized by the presence of a DZC motif found near the C-terminus DZC. UniProtKB/TrEMBL annotates function for class V members as either disease resistance protein-like, e.g. AT5G42140 and AT4G14370, Ran GTPase binding/chromatin binding/zinc ion binding, e.g. AT1G65920, AT1G69710, AT3G23270 and AT5G12350, or putative uncharacterized, e.g. AT3G47660, AT1G76950, and AT5G19420.
The SMART database recognizes between three and five RCC1 regions within class V AtFYVE proteins, whereas the CD-search identifies additionally yeast domain with similarity to human RCC1 domain, ATS1 domain, overlapping the RCC1 blades (Fig. 2). In some cases, only the ATS1 domain is detected by the CD-search or the number of RCC1 blades does not correspond to the number obtained from SMART database (data not shown). These inconsistencies prompted further enquiry into the number and nature of the putative RCC1 repeats identified in class V of AtFYVE proteins. Up to now, RCC1 and RCC1-like domains that have been described are within cytoplasmic proteins associated with membrane structures, e.g. endosomes (Alsin)  and Golgi apparatus (HERC1) . Fig. 3 shows an internal sevenfold sequence repeat of 51-68 residues present in the solved structure of human RCC1  aligned with putative RCC1 regions of class V AtFYVE proteins. In human RCC1, one half of the first sequence repeat, the C and D repeats, is made from the N-terminal end of the protein, and the other half, the A and B repeats, is made from the C-terminal end . It has been suggested that this arrangement stabilize the circular arrangement of secondary structural elements through a molecular clasp mechanism similar to a belt closure . Our data show that putative RCC1 blades of AtFYVE proteins align well with six of human seven RCC1 blades. In fact, the seven highly conserved residues, i.e. four glycines, a tyrosine, a leucine and a cis-proline, identified in human RCC1 repeats are also mostly conserved among putative AtRCC1 blades (boxed residues). However, it appears that the first blade of human RCC1 shares little or no primary and/or secondary sequence similarity with most putative AtRCC1 blades. The first blade of putative AtRCC1 may not even be a potential repeat for at least seven out of nine class V AtFYVE proteins because they share a low sequence similarity with human RCC1 in the corresponding region as compared to other regions.
Molecular models of the Arabidopsis FYVE domains and their biophysical properties
We built homology models of all FYVE domains present in 15 AtFYVE proteins listed in Fig. 1. Since electrostatic forces play critical roles in protein-membrane interactions and numerous membrane-mediated biological phenomena, we mapped the charge distribution on the surface of each AtFYVE domain (Fig. 4). In Fig. 4, we have constructed an electrostatic profile panel showing the location of negatively (red) and positively (blue) charged regions on the surface of AtFYVE domains. The electrostatic profile of class I AT3G14270-FYVE model, which has a net charge of +7 (including zinc ions), bears a resemblance to the electrostatic profile of D. melanogaster Hrs-FYVE (PDB: 1DVP). The model of the other class I member, AT4G33240-FYVE, exhibits weaker positive potential. Both members of class II AtFYVE domains share a potential profile that is similar to the profile observed for H. sapiens EEA1-FYVE (PDB: 1HYI). Class II AT3G43230-FYVE has a net charge of +6 and AT1G29880-FYVE has a net charge of +8. Intriguingly, the model of class III AT1G61690-FYVE has a very strong positive potential similar to that observed also for all models of class V AtFYVE domains, M. musculus 19 protein-FYVE (PDB: 1WFK) and for H. sapiens 27 isoform β-FYVE (PDB: 1X4U). Their overall net charge is highly positive, but varies from +9 to +16. The electrostatic profile of class IV AtFYVE model shows the weakest positive potential observed among AtFYVE domain models. Additional electrostatic profiles for the alternative models, their PDB coordinate files and verification profiles are available online (see supplementary material).
Sequence motifs of the Arabidopsis FYVE domains
AtFYVE domains can be divided into two distinct groups based on different consensus sequences identified via CLUSTALW multiple sequence alignment (Fig. 5). Fig. 5A depicts AtFYVE domains, which belong to class I-IV. These Arabidopsis domains were previously referred to as classic FYVE domains because they contain three classic conserved regions: the N-terminal WxxD, the central R(R/K)HHCR and the C-terminal R(V/I)C motifs  implicated in binding the phosphoinositide ligand PtdIns(3)P. Class I-IV AtFYVE proteins have a classic FYVE domain (Fig. 5A) with a conserved motif for PtdIns(3)P-binding that is found in FYVE domains of H. sapiens , S. cerevisiae, C. elegans  and various other organisms, e.g. P. troglodytes, M. musculus, R. norvegicus, C. familiaris, B. taurus, G. Gallus. Class V AtFYVE domains do not share the N-terminal WxxD motif. Instead they have a WxxG motif, only a G residue or residues that share no similarity to the WxxD or WxxG motifs (Fig. 5). Moreover, the central R(R/K)HHCR motif is replaced by a (K/R)(R/K)HNCY motif, which is atypical and hence the name "variant binding motif" of FYVE domains .
We observe that the variable turret loop prior to the R(R/K)HHCR motif, which is associated with membrane penetration of the FYVE domain, and the putative dimerization interface region are made up of residues, which are quite diverse in the various class I-IV FYVE domains. Despite the observed differences in residues, however, all class I-IV AtFYVE domains share at least one hydrophobic residue within the turret loop and highly hydrophobic dimerization interface regions. AT1G29800-FYVE and AT3G43230-FYVE have an insertion of an additional hydrophobic residue within the turret loop. Class V AtFYVE domains contain a conserved phenylalanine residue in the second position (with the exception of AT3G47660-FYVE) and a conserved arginine in the last position within the turret loop. As in the case of class I-IV AtFYVE domains, the putative dimerization interface region of class V AtFYVE domains is highly hydrophobic. Unlike class I-IV AtFYVE domains, however, class V AtFYVE domains dimerization interface regions seems highly conserved with at least three absolutely conserved residues, i.e. AxxAP.
FYVE domains have the potential to bind headgroups of both PtdIns(3)P and PtdIns(5)P
Preliminary docking studies depicted in Fig. 5 and Fig. 6 show that class I-IV AtFYVE domains have a potential to bind headgroups of both PtdIns(3)P and PtdIns(5)P using the same set of residues previously identified to bind the headgroup of PtdIns(3)P in other FYVE domains, i.e. the RHHxR motif and the arginine residue of RVC motif (Fig. 5). Class V AtFYVE domains use the variant signature of residues, i.e. xRKxHNxY motif, and a (L/F/P)YR motif, which overlaps the classic RVC motif, to potentially bind headgroups of PtdIns(3)P and PtdIns(5)P (Fig. 5B). In addition to the variant residues, our data indicate that a (H/K/N)xx(S/T)(S/N)(K/R)K motif located immediately prior to the dimerization region is also used by class V AtFYVE domains to recognize either headgroup (Fig. 5B).
Proteins that contain FYVE zinc finger domains have so far been known as effectors of PtdIns(3)P playing a major role in endocytic and vesicular trafficking [46–48]. PtdIns(3)P is a phosphoinositide that is present at very low levels in plant cells [49–52]. It is synthesized by phosphatidylinositol 3-kinase (PI3K). Both PtdIns(3)P and PI3K are essential for normal plant growth  and have been implicated in diverse physiological functions, including root nodule formation , auxin-induced production of reactive oxygen species (ROS) and root gravitropism , root hair curling and Rhizobium infection in M. truncatula , maintenance of the processes essential for root hair cell elongation , increased plasma membrane endocytosis and the intracellular production of ROS in the salt tolerance response , stomatal closing movement [57, 58], and possibly cytokinesis . If we envision plant FYVE domains as being potential effectors of PtdIns(3)P, they could play important roles in various physiological processes. In this study we have modeled the structure of all AtFYVE domains and predicted their membrane targeting behavior based on the biophysical profiles of the modeled structures.
Based on the domain architecture and homology to proteins of known function, we have classified AtFYVE proteins into five distinct classes (Fig. 2). Similar domain based classifications previously performed for FYVE domain-containing proteins in H. sapiens, C. elegans and S. cerevisiae genomes [34, 45] suggested a certain degree of correspondence among the different FYVE proteins in various organisms . However, AtFYVE proteins are striking in showing no obvious similarities or correspondence to the FYVE proteins included in these domain architecture-based classifications. More specifically, only one class of AtFYVE proteins corresponds to what was reported in other organisms, i.e. class I in our classification and the corresponding PIKfvye, MmPIKfyve and ScFab1p groups in the other classifications [34, 40, 45]. Even that correspondence, however, is partial since the Arabidopsis counterparts lack the disheveled, Egl-10, and pleckstrin (DEP) domain observed in mammals and worms [34, 40, 45, 59]. The remaining members of AtFYVE proteins class II-V are unique and exhibit completely different domains suggesting that FYVE domains in plants play a unique role in plant signaling pathways and/or play similar/parallel roles in signaling as other organisms but use different protein players or signaling mechanisms.
The two class I sequences are homologues of the PIKfyve/Fab1 family of phosphatidylinositol phosphate 5-kinases that phosphorylate the D-5 position in phosphatidylinositol (PtdIns) and PtdIns(3)P to make PtdIns(5)P and PtdIns(3,5)P2, respectively . PIKfyve/Fab1 proteins bind PtdIns(3)P with high specificity through their FYVE domains [15, 60] and are known to participate in several aspects of endosomal trafficking functions , transduction of osmotic shock signals  and other cellular functions in mammals and yeast  as well as in plants [63–65]. Recently, the two class I AtFYVE, PIKfyve proteins, were found to participate in vacuolar rearrangement essential for successful pollen development  and our molecular models provide the structural insight into their mode of function. Both members possess the complete classic signature for PtdIns(3)P-binding and the conserved hydrophobic motif suggesting that they likely bind membranes using the general mechanism of non-specific electrostatic interactions, followed by membrane penetration of hydrophobic residues close to the PtdIns(3)P-binding pocket facilitated by an electrostatic switch coupled with specific interactions with PtdIns(3)P as proposed by previous computational modeling studies . These studies have shown that all human FYVE domains have electrostatic equipotential profiles similar to those of Hrs and EEA1 FYVE domains. This electrostatic polarity seems to be characteristic for class I AtFYVE domains and their S. cerevisiae and C. elegans homologues (Fig. 4 and Fig. S1 (Supplementary material)). Despite the overall electrostatic profile similarity, AT3G14270-FYVE has a higher net charge (+7 at pH 6.5; Zn ions included) than AT4G33240-FYVE (+3 at pH 6.5; Zn ions included) (Fig 3). Based on the net charge difference, we predict that AT4G33240-FYVE will have a reduced non-specific electrostatic contribution to membrane targeting. Moreover, we predict that its hydrophobic contribution will also be reduced because the conserved hydrophobic motif of AT4G33240-FYVE possesses a valine residue instead of a leucine residue found in AT3G14270-FYVE (Fig. 4). Additionally, FYVE domain dimerization might be important for functional membrane association of AT4G33240-FYVE.
Class II-IV proteins have untouched sequences in terms of functional assignment, which remain annotated as "putative uncharacterized proteins" in various sequence databases. All of them share the complete/nearly complete conserved PtdIns(3)P-binding motif and a large basic binding pocket except for class IV AT1G20110-FYVE, which has a significantly reduced basic surface patch in the potential ligand-binding pocket (Fig. 4; class II-IV domains have net charges of +6, +8, +11, and +2, respectively). Class II FYVE domains possess a classic FYVE domain electrostatic profile but their binding signature is missing the first of the arginines in the R(R/K)HHCR motif, which is known to recognize the 1-phosphate of PtdIns(3)P headgroup . Even though this residue doesn't participate in the direct recognition of the 3-phosphate, mutational studies suggest that substitution of this arginine substantially reduces the FYVE domain's affinity for PtdIns(3)P-containing membranes and potential for membrane localization. The altered signature may slightly reduce the local basic charge in the vicinity of the hydrophobic motif and lower the barrier to membrane penetration. In this class, we predict a classic FYVE domain membrane-targeting behavior with subtle differences that could be verified using mutational studies. Class III FYVE domain on the contrary has the full binding signature and an electrostatic equipotential profile similar to those of Hrs and EEA1 FYVE domains. We predict that this domain will localize to PtdIns(3)P-containing membranes using the classic mechanism of action of previously studied FYVE domains with a strong contribution from non-specific electrostatic interactions.
Class IV AtFYVE domain has the most reduced basic surface patch and the lowest net charge of +2 among AtFYVE domains. Hydrophobic contribution through membrane insertion will likely be an important component of membrane binding for this class, similar to FENS-FYVE , which localizes to endosomal membrane  even though it has a weaker positive potential than other known FYVE domains .
Class V proteins are the most interesting class of the FYVE domain-containing proteins although much remains to be understood about their function. Out of the 18 human RCC1 superfamily proteins, none corresponds, in their domain architecture to class V FYVE proteins . The closest match, the PAM protein, has 3 RCC1 repeats and a FYVE domain but in a different order and accompanied by domains other than domains found in class V AtFYVE proteins . In contrast to the traditional seven canonical repeats found is most RCC1-like proteins, there are six RCC1 repeats in some proteins such as WBSCR16, Nek9, RPGR  and some AtFYVE proteins (this study). Since β-propellers (including RCC1 repeats) could be made of a variable number of blades and are thought to evolve by blade duplication and deletion , there could be three alternative explanations for the absence of the first canonical RCC1 repeat in some class V AtFYVE proteins: 1) the second half of blade 1 and the first half of blade 7 engage with one another to form a symmetrical 6-bladed β-propeller; 2) an "open" ring-propeller forms as known for the C-terminal domain of ParC subunit  and suggested for the short-form of Alsin ; or 3) the first repeat is a non-canonical RCC1 repeat as seen in other proteins . Therefore, despite the sequence differences, it is possible that the 6 RCC1 repeats found in some AtFYVE adapt a β-propeller structure similar to β-propeller structures found in proteins from other organisms.
Previously, it has been suggested that association with membrane(s) may be crucial for the functioning of this class of AtFYVE proteins given the presence of two phosphoinositide-binding domains, i.e. PH and FYVE domains . Experimental data suggest that class V AT1G65920 PH domain binds to PtdIns(4,5)P2 while its FYVE domain binds to PtdIns(3)P as well as PtdIns(5)P . The various members of class V AtFYVE domains show a high degree of sequence conservation within an enrichment of basic residues throughout the length of the FYVE domain (Fig. 5). The most striking feature of these FYVE domains is the presence of a variant phosphoinositide-binding motif (Fig. 5B), which seems to be unique to plants as is the overall domain architecture of these proteins (; Fig. 5B). When the variant (K/R)(R/K)HNCY motif of class V FYVE domains is used to search for other FYVE domains, only sequences from plants are retrieved, e.g. Q1SA17 (M. truncatula), Q1SIN6 (M. truncatula), Q84RS2 (M. sativa), Q5JL00 (O. sativa) Q5N8I7 (O. sativa) Q6AV10 (O. sativa) Q6L5B2 (O. sativa) Q259N3 (O. sativa), Q5XWP1 (S. tuberosum), Q60CZ5 (S. tuberosum), Q60CZ5 (S. demissum) or Q5EWZ4 (T. turgidum).
The obvious question that comes to mind is whether this variant signature is responsible for an altered binding specificity in this class of FYVE proteins and therefore associated with a novel pattern of membrane/sub-cellular targeting. Within mammalian cells, FYVE domains are highly conserved and seem to select PtdIns(3)P over other phosphoinositides . Despite the conservation in the overall mechanism, there are significant differences in the specificity and affinity of individual FYVE domains towards phosphoinositides. In fact, EEA1 has affinity for PtdIns(5)P as well, perhaps because PtdIns(3)P and PtdIns(5)P are similar in all aspects except having the phosphomonoester in a different position . Consequently, PtdIns(5)P has been shown to induce small but important chemical shift changes similar to those induced by PtdIns(3)P in the binding motif residues with the exception of one arginine, which remains practically unaltered by PtdIns(5)P . PtdIns(3)P specific recognition by the FYVE domain seems to involve indirect recognition of this specific ligand by exclusion of alternatively phosphorylated phosphoinositides: the two residues implicated in this are the aspartic acid of the N-terminal WxxD motif and the second histidine of the central HHCR motif . Both of these motifs are substituted in class V variant AtFYVE domains (Fig. 5) by the WxxG and HNCY motifs, respectively. This opens up the possibility that class V FYVE domains may have the potential to interact equally or better with phosphoinositide ligands other than PtdIns(3)P. Our preliminary docking analysis of classic as well as variant motif-containing AtFYVE domains seem to suggest that both have the potential to interact with PtdIns(3)P and PtdIns(5)P headgroups using practically the same set of residues (Fig. 5). Additionally, our analysis reveals a highly conserved putative ligand-association motif located immediately prior to the dimerization region present only within the class V proteins (Fig. 5). Class V AtFYVE domains are also different in exhibiting very large basic surface patches with prominent hydrophobic motifs. These patches are the largest observed among FYVE domains classified to date [71, 72]. We predict that class V AtFYVE domains target to the membrane with highly significant contributions from non-specific electrostatics and hydrophobic interactions, coupled with specific interactions with PtdIns(3)P and/or PtdIns(5)P using the variant binding residues and an additional conserved motif specific to this class of FYVE domains.
Based on experimental studies, it has been suggested that the strength of the positive potential and the identity of the hydrophobic residues near the binding site may be two key factors, which are critical in determining which FYVE domains act alone, undergo dimerization or require additional partners before anchoring to the membrane . For example, SARA-FYVE was predicted and verified experimentally to associate with the membrane with significant contributions from non-specific electrostatic and hydrophobic interactions given its net charge of +12 (zinc ions included) as well as the presence of phenylalanine at the conserved hydrophobic position [22, 71, 73]. Our data suggests that AtFYVE domains engage in both non-specific electrostatic and PtdIns(3)P-induced hydrophobic interactions for membrane localization, the contribution differing for individual domains as described earlier. Additionally, dimerization may play an important role in the membrane recruitment of FYVE domains [21, 74] and it appears that the free energy contributions to the membrane association are additive for each monomer of the EEA1-FYVE dimer . The dimer interface regions of AtFYVE domains are longer and more hydrophobic (Fig. 5) than the equivalent region of EEA1-FYVE and predicted region of SARA-FYVE [72, 75] suggesting that all AtFYVE domains have the potential to dimerize and associate with membrane(s) as dimers.
Overall, AtFYVE proteins are quite distinct from other organisms, exhibiting unique domain architectures, biophysical properties as well as altered binding motifs. The biophysical profiles of the modeled FYVE domains in Arabidopsis suggest membrane-targeting mechanisms ranging from the previously described classic modes to the novel binding mode of the class V FYVE domains, which seem to be found only in plants. Our predictions provide a foundation for designing directed mutational studies to confirm these behaviors, which is crucial to the understanding of the role of these domains in important plant signaling pathways, something that has so far not been explored.
Arabidopsis FYVE proteins
The accession numbers of the AtFYVE proteins were identified using a computational pipeline for automated high-throughput modeling , which run against Arabidopsis protein sequence database (TAIR6_pep_20051108). The AtFYVE protein sequences corresponding to the identified accession numbers were retrieved from KEGG GENES [76, 77] and verified for presence of FYVE domain with SMART [78–80].
To verify the total number and individual accession numbers of AtFYVE proteins obtained with pipeline, three additional methods were employed: 1) search of publicly available sequence databases: Swiss-Prot/TrEMBL [81, 82], NCBI [83, 84] and UniProt [85–87]; 2) query performed by the Arabidopsis Information Resource (TAIR) for BLASTn 2.2.14 ; and 3) MOTIF search  using a manually derived pattern specific to AtFYVE domains in PROSITE format offered by GenomeNet service .
Domain architecture analyses
Each AtFYVE protein sequence was analyzed by searching against Pfam , SMART [78–80], Conserved Domain Database v2.10 and CD-Search [90–94] and Clusters of Orthologous Groups [95, 96]. All sequences were then subgrouped according to consensus domain architecture.
There is no single homology modeling program/routine that has been singled out as the best method for comparative modeling . To generate high-quality models for the AtFYVE domains, we implemented a number of programs to create many different alternative alignments and models followed by a quality assessment and a selection process. We used two separate approaches: automated and manual. The automated approach involved the use of a high-throughput computational pipeline, which uses its own built in alignment, modeling and evaluation methods  as well as Pudge for modeling and evaluation . The manual approach is based on choosing several alternative options for each step in the process of creating the homology models as previously detailed by Singh and Murray . The scheme involves the use of multiple approaches at each step: 1) choice of a suitable structural template, 2) alignment of the template and target sequences, 3) model building, and 4) model evaluation and refinement using 3D-JIGSAW [100–102], Modeller 8v1 [103, 104], NEST , LOOPP [106–108], HOMER , CPH , PHYRE , manual editing using GeneDoc , guided by Verify3 D [113, 114] and Prosa . Loop refinement and side chain conformations were performed using individual modeling programs whenever available. In addition, loop refinement was done with Loopy  and the prediction of side-chain conformations with SCWRL3.0  and SCAP [118–120].
Analysis of the models
The models were analyzed for their sequence, structural and biophysical properties. The analyses of biophysical properties including the electrostatics, hydrophobicity and shape of each model were conducted using the surface property analysis tools in the program GRASP . The pKa values of ionizable amino acid side chains in AtFYVE domains as well as total charges were computed using the automated system H++ [122–124], which is based on solutions to the Poisson-Boltzmann equation. The calculations were performed using default settings. The reported total charges was calculated at pH 6.5 because EEA1-FYVE was estimated to exist in bound state at low pH of 6.0-6.6 and only half of the protein was estimated to remain active at the cytostolic pH of 7.3 .
Ins(1,3)P2, Ins(1,4)P2, Ins(1,4,5)P3, Ins(1,3,5)P3, Ins(1,3,4)P3, and Ins(1,3,4,5)P4 ligands were extracted from their corresponding PDB files. Ins(1,5)P2 ligands were created from Ins(1,4,5)P3 ligands in Chimera  and energy minimized. Hydrogens were added to all ligands using Chimera . Gasteiger charges were calculated for all ligands.
Phosphoinositides docking and analysis of resulting interactions
Rigid and flexible docking was performed using DOCK 6.1  and DOCK 6.1 suite programs. A molecular surface of the receptor was created with DMS [127, 128]. Spheres were generated with Sphgen_cpp v1.2, which was modified by Andrew Magis from its original version called Sphgen . The resulting file was edited to include only spheres grouped within the first cluster. Grids were generated with GRID [129, 130]. Contact scores and energy scores were calculated using an energy cutoff distance of 5.0 A. Our docking technique was validated by docking Ins(1,3)P2 of known FYVE domains into their corresponding solved structures. Although FYVE domains are suggested to bind only Ins(1,3)P2 and Ins(1,5)P2, we also docked Ins(1,4)P2, Ins(1,4,5)P3, Ins(1,3,5)P3, Ins(1,3,4)P3, and Ins(1,3,4,5)P4 as controls. Following the initial validation we used our approach to dock three Ins(1,3)P2 and three Ins(1,5)P2 ligands using rigid and flexible docking scenarios with the predictive models of AtFYVE domains. In the end, each predictive model was subjected to twelve docking runs, six for each headgroup. A given residue is reported to interact with the headgroup only if it does so 50% or more of the time (i.e. 3 or more times) as evaluated by the Ligand-Protein Contacts (LPC) server .
Electronic supplementary material
The sequences and coordinate files representing our models for all AtFYVE domains as well as other supplementary information (GRASP images and structure verification plots, and alignment files) are available at the following website: http://userhome.brooklyn.cuny.edu/ssingh/arabidopsis/FYVE/fyve.html.
- AT :
Alpha Tubulin Suppressor 1
Domain of Unknown Function 500
Early Endosomal Antigen 1
Formation of aploid and binucleate cells
Fucoxanthin Chlorophyll a/c-Binding
Fab1, YOTB, Vac1, and EEA1
Hepatocyte growth factor-regulated tyrosine kinase substrate
Pleckstrin Homology of Phospholipase C
PhosphatidylInositol 3-phosphate 5-Kinase
- PI 3-kinase:
PH, RCC1 and FYVE
PhosphatIdylinositol 3 Phosphate
PhosphatIdylinositol 5 Phosphate
Regulator of Chromosome Condensation 1
Smad Anchor for Receptor Activation
Vacuolar protein sorting mutant 27 phenotype
Vacuolar protein sorting mutant 34
Reactive Oxygen Species.
Stenmark H, Aasland R, Toh BH, D'Arrigo A: Endosomal localization of the autoantigen EEA1 is mediated by a zinc-binding FYVE finger. J Biol Chem. 1996, 271 (39): 24048-24054. 10.1074/jbc.271.39.24048.
Petiot A, Faure J, Stenmark H, Gruenberg J: PI3P signaling regulates receptor sorting but not transport in the endosomal pathway. J Cell Biol. 2003, 162 (6): 971-979. 10.1083/jcb.200303018.
Wurmser AE, Gary JD, Emr SD: Phosphoinositide 3-Kinases and Their FYVE Domain-containing Effectors as Regulators of Vacuolar/Lysosomal Membrane Trafficking Pathways. J Biol Chem. 1999, 274 (14): 9129-9132. 10.1074/jbc.274.14.9129.
Simonsen A, Lippe R, Christoforidis S, Gaullier JM, Brech A, Callaghan J, Toh BH, Murphy C, Zerial M, Stenmark H: EEA1 links PI(3)K function to Rab5 regulation of endosome fusion. Nature. 1998, 394 (6692): 494-498. 10.1038/28879.
Rutherford AC, Traer C, Wassmer T, Pattni K, Bujny MV, Carlton JG, Stenmark H, Cullen PJ: The mammalian phosphatidylinositol 3-phosphate 5-kinase (PIKfyve) regulates endosome-to-TGN retrograde transport. J Cell Sci. 2006, 119 (Pt 19): 3944-3957. 10.1242/jcs.03153.
Estrada L, Caron E, Gorski JL: Fgd1, the Cdc42 guanine nucleotide exchange factor responsible for faciogenital dysplasia, is localized to the subcortical actin cytoskeleton and Golgi membrane. Hum Mol Genet. 2001, 10 (5): 485-495. 10.1093/hmg/10.5.485.
Cooke FT, Dove SK, McEwen RK, Painter G, Holmes AB, Hall MN, Michell RH, Parker PJ: The stress-activated phosphatidylinositol 3-phosphate 5-kinase Fab1p is essential for vacuole function in S. cerevisiae. Curr Biol. 1998, 8 (22): 1219-1222. 10.1016/S0960-9822(07)00513-1.
Gary JD, Wurmser AE, Bonangelino CJ, Weisman LS, Emr SD: Fab1p is essential for PtdIns(3)P 5-kinase activity and the maintenance of vacuolar size and membrane homeostasis. J Cell Biol. 1998, 143 (1): 65-79. 10.1083/jcb.143.1.65.
Odorizzi G, Babst M, Emr SD: Fab1p PtdIns(3)P 5-kinase function essential for protein sorting in the multivesicular body. Cell. 1998, 95 (6): 847-858. 10.1016/S0092-8674(00)81707-9.
Tsukazaki T, Chiang TA, Davison AF, Attisano L, Wrana JL: SARA, a FYVE domain protein that recruits Smad2 to the TGFbeta receptor. Cell. 1998, 95 (6): 779-791. 10.1016/S0092-8674(00)81701-8.
Seet LF, Hong W: Endofin, an endosomal FYVE domain protein. J Biol Chem. 2001, 276 (45): 42445-42454. 10.1074/jbc.M105917200.
Seet LF, Hong W: Endofin recruits clathrin to early endosomes via TOM1. J Cell Sci. 2005, 118 (Pt 3): 575-587. 10.1242/jcs.01628.
Seet LF, Liu N, Hanson BJ, Hong W: Endofin recruits TOM1 to endosomes. J Biol Chem. 2004, 279 (6): 4670-4679. 10.1074/jbc.M311228200.
Misra S, Hurley JH: Crystal structure of a phosphatidylinositol 3-phosphate-specific membrane-targeting motif, the FYVE domain of Vps27p. Cell. 1999, 97 (5): 657-666. 10.1016/S0092-8674(00)80776-X.
Burd CG, Emr SD: Phosphatidylinositol(3)-phosphate signaling mediated by specific binding to RING FYVE domains. Mol Cell. 1998, 2 (1): 157-162. 10.1016/S1097-2765(00)80125-2.
Gaullier JM, Simonsen A, D'Arrigo A, Bremnes B, Stenmark H, Aasland R: FYVE fingers bind PtdIns(3)P. Nature. 1998, 394 (6692): 432-433. 10.1038/28767.
Patki V, Lawe DC, Corvera S, Virbasius JV, Chawla A: A functional PtdIns(3)P-binding motif. Nature. 1998, 394 (6692): 433-434. 10.1038/28771.
Patki V, Virbasius J, Lane WS, Toh BH, Shpetner HS, Corvera S: Identification of an early endosomal protein regulated by phosphatidylinositol 3-kinase. Proc Natl Acad Sci USA. 1997, 94 (14): 7326-7330. 10.1073/pnas.94.14.7326.
Gaullier JM, Ronning E, Gillooly DJ, Stenmark H: Interaction of the EEA1 FYVE finger with phosphatidylinositol 3-phosphate and early endosomes. Role of conserved residues. J Biol Chem. 2000, 275 (32): 24595-24600. 10.1074/jbc.M906554199.
Kutateladze TG: Phosphatidylinositol 3-phosphate recognition and membrane docking by the FYVE domain. Biochim Biophys Acta. 2006, 1761 (8): 868-877.
Kutateladze TG, Ogburn KD, Watson WT, de Beer T, Emr SD, Burd CG, Overduin M: Phosphatidylinositol 3-phosphate recognition by the FYVE domain. Mol Cell. 1999, 3 (6): 805-811. 10.1016/S1097-2765(01)80013-7.
Diraviyam K, Stahelin RV, Cho W, Murray D: Computer modeling of the membrane interaction of FYVE domains. J Mol Biol. 2003, 328 (3): 721-736. 10.1016/S0022-2836(03)00325-5.
Kutateladze TG, Capelluto DG, Ferguson CG, Cheever ML, Kutateladze AG, Prestwich GD, Overduin M: Multivalent mechanism of membrane insertion by the FYVE domain. J Biol Chem. 2004, 279 (4): 3050-3057. 10.1074/jbc.M309007200.
Stahelin RV, Long F, Diraviyam K, Bruzik KS, Murray D, Cho W: Phosphatidylinositol 3-phosphate induces the membrane penetration of the FYVE domains of Vps27p and Hrs. J Biol Chem. 2002, 277 (29): 26379-26388. 10.1074/jbc.M201106200.
Kutateladze T, Overduin M: Structural mechanism of endosome docking by the FYVE domain. Science. 2001, 291 (5509): 1793-1796. 10.1126/science.291.5509.1793.
Mao Y, Nickitenko A, Duan X, Lloyd TE, Wu MN, Bellen H, Quiocho FA: Crystal structure of the VHS and FYVE tandem domains of Hrs, a protein involved in membrane trafficking and signal transduction. Cell. 2000, 100 (4): 447-456. 10.1016/S0092-8674(00)80680-7.
Hayakawa A, Hayes SJ, Lawe DC, Sudharshan E, Tuft R, Fogarty K, Lambright D, Corvera S: Structural basis for endosomal targeting by FYVE domains. J Biol Chem. 2004, 279 (7): 5958-5966. 10.1074/jbc.M310503200.
He J, Vora M, Haney RM, Filonov GS, Musselman CA, Burd CG, Kutateladze AG, Verkhusha VV, Stahelin RV, Kutateladze TG: Membrane insertion of the FYVE domain is modulated by pH. Proteins. 2009, 76 (4): 852-860. 10.1002/prot.22392.
Psachoulia E, Sansom MS: PX- and FYVE-mediated interactions with membranes: simulation studies. Biochemistry. 2009, 48 (23): 5090-5095. 10.1021/bi900435m.
Kim DH, Eu YJ, Yoo CM, Kim YW, Pih KT, Jin JB, Kim SJ, Stenmark H, Hwang I: Trafficking of phosphatidylinositol 3-phosphate from the trans-Golgi network to the lumen of the central vacuole in plant cells. Plant Cell. 2001, 13 (2): 287-301. 10.1105/tpc.13.2.287.
Welters P, Takegawa K, Emr SD, Chrispeels MJ: AtVPS34, a Phosphatidylinositol 3-Kinase of Arabidopsis thaliana, is an Essential Protein with Homology to a Calcium-Dependent Lipid Binding Domain. PNAS. 1994, 91 (24): 11398-11402. 10.1073/pnas.91.24.11398.
Jensen RB, La Cour T, Albrethsen J, Nielsen M, Skriver K: FYVE zinc-finger proteins in the plant model Arabidopsis thaliana: identification of PtdIns3P-binding residues by comparison of classic and variant FYVE domains. Biochem J. 2001, 359 (Pt 1): 165-173. 10.1042/0264-6021:3590165.
Heras B, Drobak BK: PARF-1: an Arabidopsis thaliana FYVE-domain protein displaying a novel eukaryotic domain structure and phosphoinositide affinity. J Exp Bot. 2002, 53 (368): 565-567. 10.1093/jexbot/53.368.565.
Stenmark H, Aasland R, Driscoll PC: The phosphatidylinositol 3-phosphate-binding FYVE finger. FEBS Lett. 2002, 513 (1): 77-84. 10.1016/S0014-5793(01)03308-7.
AGI: Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature. 2000, 408 (6814): 796-815. 10.1038/35048692.
Mirkovic N, Li Z, Parnassa A, Murray D: Strategies for high-throughput comparative modeling: Applications to leverage analysis in structural genomics and protein family organization. Proteins. 2006, 66 (4): 766-777. 10.1002/prot.21191.
Dumas JJ, Merithew E, Sudharshan E, Rajamani D, Hayes S, Lawe D, Corvera S, Lambright DG: Multivalent endosome targeting by homodimeric EEA1. Mol Cell. 2001, 8 (5): 947-958. 10.1016/S1097-2765(01)00385-9.
Drobak BK, Heras B: Nuclear phosphoinositides could bring FYVE alive. Trends Plant Sci. 2002, 7 (3): 132-138. 10.1016/S1360-1385(01)02213-0.
van Leeuwen W, Okresz L, Bogre L, Munnik T: Learning the lipid language of plant signalling. Trends Plant Sci. 2004, 9 (8): 378-384. 10.1016/j.tplants.2004.06.008.
Cooke FT: Phosphatidylinositol 3,5-bisphosphate: metabolism and function. Arch Biochem Biophys. 2002, 407 (2): 143-151. 10.1016/S0003-9861(02)00487-3.
Mueller-Roeber B, Pical C: Inositol phospholipid metabolism in Arabidopsis. Characterized and putative isoforms of inositol phospholipid kinase and phosphoinositide-specific phospholipase C. Plant Physiol. 2002, 130 (1): 22-46. 10.1104/pp.004770.
Otomo A, Hadano S, Okada T, Mizumura H, Kunita R, Nishijima H, Showguchi-Miyata J, Yanagisawa Y, Kohiki E, Suga E, et al: ALS2, a novel guanine nucleotide exchange factor for the small GTPase Rab5, is implicated in endosomal dynamics. Hum Mol Genet. 2003, 12 (14): 1671-1687. 10.1093/hmg/ddg184.
Rosa JL, Casaroli-Marano RP, Buckler AJ, Vilaro S, Barbacid M: p619, a giant protein related to the chromosome condensation regulator RCC1, stimulates guanine nucleotide exchange on ARF1 and Rab proteins. Embo J. 1996, 15 (16): 4262-4273.
Renault L, Nassar N, Vetter I, Becker J, Klebe C, Roth M, Wittinghofer A: The 1.7 A crystal structure of the regulator of chromosome condensation (RCC1) reveals a seven-bladed propeller. Nature. 1998, 392 (6671): 97-101. 10.1038/32204.
Stenmark H, Aasland R: FYVE-finger proteins--effectors of an inositol lipid. J Cell Sci. 1999, 112 (Pt 23): 4175-4183.
Corvera S, D'Arrigo A, Stenmark H: Phosphoinositides in membrane traffic. Curr Opin Cell Biol. 1999, 11 (4): 460-465. 10.1016/S0955-0674(99)80066-0.
Simonsen A, Wurmser AE, Emr SD, Stenmark H: The role of phosphoinositides in membrane transport. Curr Opin Cell Biol. 2001, 13 (4): 485-492. 10.1016/S0955-0674(00)00240-4.
Stenmark H, Gillooly DJ: Intracellular trafficking and turnover of phosphatidylinositol 3-phosphate. Semin Cell Dev Biol. 2001, 12 (2): 193-199. 10.1006/scdb.2000.0236.
Meijer HJG, Berrie CP, Lurisci C, Divecha N, Musgrave A, Munnik T: Identification of a new polyphosphoinositide in plants, phosphatidylinositol 5-phosphate and its accumulation upon osmotic stress. Biochem J. 2001, 360: 491-498. 10.1042/0264-6021:3600491.
Brearley CA, Hanke DE: 3- and 4-phosphorylated phosphatidylinositols in the aquatic plant Spirodela polyrhiza L. Biochem J. 1992, 283 (Pt 1): 255-260.
Munnik T, Irvine RF, Musgrave A: Rapid turnover of phosphatidylinositol 3-phosphate in the green alga Chlamydomonas eugametos: signs of a phosphatidylinositide 3-kinase signalling pathway in lower plants?. Biochem J. 1994, 298 (Pt 2): 269-273.
Munnik T, Musgrave A, De Vrije T: Rapid turnover of polyphosphoinositides in carnation flower petals. Planta. 1994, 193: 89-98. 10.1007/BF00191611.
Hong Z, Verma DP: A phosphatidylinositol 3-kinase is induced during soybean nodule organogenesis and is associated with membrane proliferation. Proc Natl Acad Sci USA. 1994, 91 (20): 9617-9621. 10.1073/pnas.91.20.9617.
Joo JH, Yoo HJ, Hwang I, Lee JS, Nam KH, Bae YS: Auxin-induced reactive oxygen species production requires the activation of phosphatidylinositol 3-kinase. FEBS Lett. 2005, 579 (5): 1243-1248. 10.1016/j.febslet.2005.01.018.
Peleg-Grossman S, Volpin H, Levine A: Root hair curling and Rhizobium infection in Medicago truncatula are mediated by phosphatidylinositide-regulated endocytosis and reactive oxygen species. J Exp Bot. 2007, 58 (7): 1637-1649. 10.1093/jxb/erm013.
Lee Y, Bak G, Choi Y, Chuang WI, Cho HT, Lee Y: Roles of phosphatidylinositol 3-kinase in root hair growth. Plant Physiol. 2008, 147 (2): 624-635. 10.1104/pp.108.117341.
Leshem Y, Seri L, Levine A: Induction of phosphatidylinositol 3-kinase-mediated endocytosis by salt stress leads to intracellular production of reactive oxygen species and salt tolerance. Plant J. 2007, 51 (2): 185-197. 10.1111/j.1365-313X.2007.03134.x.
Jung JY, Kim YW, Kwak JM, Hwang JU, Young J, Schroeder JI, Hwang I, Lee Y: Phosphatidylinositol 3- and 4-phosphate are required for normal stomatal movements. Plant Cell. 2002, 14 (10): 2399-2412. 10.1105/tpc.004143.
Shisheva A: PIKfyve: the road to PtdIns 5-P and PtdIns 3,5-P(2). Cell Biol Int. 2001, 25 (12): 1201-1206. 10.1006/cbir.2001.0803.
Sbrissa D, Ikonomov OC, Shisheva A: Phosphatidylinositol 3-phosphate-interacting domains in PIKfyve. Binding specificity and role in PIKfyve. Endomenbrane localization. J Biol Chem. 2002, 277 (8): 6073-6079. 10.1074/jbc.M110194200.
Shisheva A: PIKfyve: Partners, significance, debates and paradoxes. Cell Biol Int. 2008, 32 (6): 591-604. 10.1016/j.cellbi.2008.01.006.
Dove SK, Cooke FT, Douglas MR, Sayers LG, Parker PJ, Michell RH: Osmotic stress activates phosphatidylinositol-3,5-bisphosphate synthesis. Nature. 1997, 390 (6656): 187-192. 10.1038/36613.
Whitley P, Hinz S, Doughty J: Arabidopsis FAB1/PIKfyve proteins are essential for development of viable pollen. Plant Physiol. 2009, 151 (4): 1812-1822. 10.1104/pp.109.146159.
Zonia L, Munnik T: Osmotically induced cell swelling versus cell shrinking elicits specific changes in phospholipid signals in tobacco pollen tubes. Plant Physiol. 2004, 134 (2): 813-823. 10.1104/pp.103.029454.
Meijer HJG, Divecha N, van den Ende H, Musgrave A, Munnik T: Hyperosmotic stress induces rapid synthesis of phosphatidyl-D-inositol 3,5-bisphosphate in plant cells. Planta. 1999, 208: 294-298. 10.1007/s004250050561.
Ridley SH, Ktistakis N, Davidson K, Anderson KE, Manifava M, Ellson CD, Lipp P, Bootman M, Coadwell J, Nazarian A, et al: FENS-1 and DFCP1 are FYVE domain-containing proteins with distinct functions in the endosomal and Golgi compartments. J Cell Sci. 2001, 114 (Pt 22): 3991-4000.
Hadjebi O, Casas-Terradellas E, Garcia-Gonzalo FR, Rosa JL: The RCC1 superfamily: from genes, to function, to disease. Biochim Biophys Acta. 2008, 1783 (8): 1467-1479. 10.1016/j.bbamcr.2008.03.015.
Chaudhuri I, Soding J, Lupas AN: Evolution of the beta-propeller fold. Proteins. 2008, 71 (2): 795-803. 10.1002/prot.21764.
Hsieh TJ, Farh L, Huang WM, Chan NL: Structure of the topoisomerase IV C-terminal domain: a broken beta-propeller implies a role as geometry facilitator in catalysis. J Biol Chem. 2004, 279 (53): 55587-55593. 10.1074/jbc.M408934200.
Soares DC, Barlow PN, Porteous DJ, Devon RS: An interrupted beta-propeller and protein disorder: structural bioinformatics insights into the N-terminus of alsin. Journal of molecular modeling. 2009, 15 (2): 113-122. 10.1007/s00894-008-0381-1.
Itoh F, Divecha N, Brocks L, Oomen L, Janssen H, Calafat J, Itoh S, Dijke Pt P: The FYVE domain in Smad anchor for receptor activation (SARA) is sufficient for localization of SARA in early endosomes and regulates TGF-beta/Smad signalling. Genes Cells. 2002, 7 (3): 321-331. 10.1046/j.1365-2443.2002.00519.x.
Qin BY, Lam SS, Correia JJ, Lin K: Smad3 allostery links TGF-beta receptor kinase activation to transcriptional control. Genes Dev. 2002, 16 (15): 1950-1963. 10.1101/gad.1002002.
Panopoulou E, Gillooly DJ, Wrana JL, Zerial M, Stenmark H, Murphy C, Fotsis T: Early endosomal regulation of Smad-dependent signaling in endothelial cells. J Biol Chem. 2002, 277 (20): 18046-18052. 10.1074/jbc.M107983200.
Callaghan J, Simonsen A, Gaullier JM, Toh BH, Stenmark H: The endosome fusion regulator early-endosomal autoantigen 1 (EEA1) is a dimer. Biochem J. 1999, 338 (Pt 2): 539-543. 10.1042/0264-6021:3380539.
Blatner NR, Stahelin RV, Diraviyam K, Hawkins PT, Hong W, Murray D, Cho W: The molecular basis of the differential subcellular localization of FYVE domains. J Biol Chem. 2004, 279 (51): 53818-53827. 10.1074/jbc.M408408200.
Kanehisa M: The KEGG database. Novartis Found Symp. 2002, 247: 91-101. full_text. discussion 101-103, 119-128, 244-152.
Kanehisa M, Goto S, Hattori M, Aoki-Kinoshita KF, Itoh M, Kawashima S, Katayama T, Araki M, Hirakawa M: From genomics to chemical genomics: new developments in KEGG. Nucl Acids Res. 2006, 34 (suppl_1): D354-357. 10.1093/nar/gkj102.
Letunic I, Copley RR, Pils B, Pinkert S, Schultz J, Bork P: SMART 5: domains in the context of genomes and networks. Nucleic Acids Res. 2006, D257-260. 10.1093/nar/gkj079. 34 Database
Schultz J, Copley RR, Doerks T, Ponting CP, Bork P: SMART: a web-based tool for the study of genetically mobile domains. Nucleic Acids Res. 2000, 28 (1): 231-234. 10.1093/nar/28.1.231.
Schultz J, Milpetz F, Bork P, Ponting CP: SMART, a simple modular architecture research tool: Identification of signaling domains. PNAS. 1998, 95 (11): 5857-5864. 10.1073/pnas.95.11.5857.
Bairoch A, Boeckmann B: The SWISS-PROT protein sequence data bank: current status. Nucleic Acids Res. 1994, 22 (17): 3578-3580. 10.1093/nar/22.17.3626.
Boeckmann B, Blatter MC, Famiglietti L, Hinz U, Lane L, Roechert B, Bairoch A: Protein variety and functional diversity: Swiss-Prot annotation in its biological context. C R Biol. 2005, 328 (10-11): 882-899. 10.1016/j.crvi.2005.06.001.
Benson DA, Karsch-Mizrachi I, Lipman DJ, Ostell J, Wheeler DL: GenBank. Nucleic Acids Res. 2006, D16-20. 10.1093/nar/gkj157. 34 Database
Wheeler DL, Barrett T, Benson DA, Bryant SH, Canese K, Chetvernin V, Church DM, DiCuccio M, Edgar R, Federhen S, et al: Database resources of the National Center for Biotechnology Information. Nucleic Acids Res. 2006, D173-180. 10.1093/nar/gkj158. 34 Database
Apweiler R, Bairoch A, Wu CH, Barker WC, Boeckmann B, Ferro S, Gasteiger E, Huang H, Lopez R, Magrane M, et al: UniProt: the Universal Protein knowledgebase. Nucleic Acids Res. 2004, D115-119. 10.1093/nar/gkh131. 32 Database
Bairoch A, Apweiler R, Wu CH, Barker WC, Boeckmann B, Ferro S, Gasteiger E, Huang H, Lopez R, Magrane M, et al: The Universal Protein Resource (UniProt). Nucleic Acids Res. 2005, D154-159. 33 Database
Wu CH, Apweiler R, Bairoch A, Natale DA, Barker WC, Boeckmann B, Ferro S, Gasteiger E, Huang H, Lopez R, et al: The Universal Protein Resource (UniProt): an expanding universe of protein information. Nucleic Acids Res. 2006, D187-191. 10.1093/nar/gkj161. 34 Database
Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997, 25 (17): 3389-3402. 10.1093/nar/25.17.3389.
Bateman A, Coin L, Durbin R, Finn RD, Hollich V, Griffiths-Jones S, Khanna A, Marshall M, Moxon S, Sonnhammer EL, et al: The Pfam protein families database. Nucleic Acids Res. 2004, D138-141. 10.1093/nar/gkh121. 32 Database
Marchler-Bauer A, Anderson JB, Cherukuri PF, DeWeese-Scott C, Geer LY, Gwadz M, He S, Hurwitz DI, Jackson JD, Ke Z, et al: CDD: a Conserved Domain Database for protein classification. Nucleic Acids Res. 2005, D192-196. 33 Database
Marchler-Bauer A, Anderson JB, Derbyshire MK, DeWeese-Scott C, Gonzales NR, Gwadz M, Hao L, He S, Hurwitz DI, Jackson JD, et al: CDD: a conserved domain database for interactive domain family analysis. Nucleic Acids Res. 2007, D237-240. 10.1093/nar/gkl951. 35 Database
Marchler-Bauer A, Anderson JB, DeWeese-Scott C, Fedorova ND, Geer LY, He S, Hurwitz DI, Jackson JD, Jacobs AR, Lanczycki CJ, et al: CDD: a curated Entrez database of conserved domain alignments. Nucleic Acids Res. 2003, 31 (1): 383-387. 10.1093/nar/gkg087.
Marchler-Bauer A, Bryant SH: CD-Search: protein domain annotations on the fly. Nucleic Acids Res. 2004, W327-331. 10.1093/nar/gkh454. 32 Web Server
Marchler-Bauer A, Panchenko AR, Shoemaker BA, Thiessen PA, Geer LY, Bryant SH: CDD: a database of conserved domain alignments with links to domain three-dimensional structure. Nucleic Acids Res. 2002, 30 (1): 281-283. 10.1093/nar/30.1.281.
Tatusov RL, Fedorova ND, Jackson JD, Jacobs AR, Kiryutin B, Koonin EV, Krylov DM, Mazumder R, Mekhedov SL, Nikolskaya AN, et al: The COG database: an updated version includes eukaryotes. BMC Bioinformatics. 2003, 4: 41-10.1186/1471-2105-4-41.
Tatusov RL, Koonin EV, Lipman DJ: A genomic perspective on protein families. Science. 1997, 278 (5338): 631-637. 10.1126/science.278.5338.631.
Wallner B, Elofsson A: All are not equal: A benchmark of different homology modeling programs. Protein Sci. 2005, 14 (5): 1315-1327. 10.1110/ps.041253405.
Petrey D, Honig B: Protein structure prediction: inroads to biology. Mol Cell. 2005, 20 (6): 811-819. 10.1016/j.molcel.2005.12.005.
Singh SM, Murray D: Molecular modeling of the membrane targeting of phospholipase C pleckstrin homology domains. Protein Sci. 2003, 12 (9): 1934-1953. 10.1110/ps.0358803.
Bates PA, Kelley LA, MacCallum RM, Sternberg MJ: Enhancement of protein modeling by human intervention in applying the automatic programs 3D-JIGSAW and 3D-PSSM. Proteins. 2001, 39-46. 10.1002/prot.1168. Suppl 5
Bates PA, Sternberg MJ: Model building by comparison at CASP3: using expert knowledge and computer automation. Proteins. 1999, 47-54. 10.1002/(SICI)1097-0134(1999)37:3+<47::AID-PROT7>3.0.CO;2-F. Suppl 3
Contreras-Moreira B, Bates PA: Domain fishing: a first step in protein comparative modelling. Bioinformatics. 2002, 18 (8): 1141-1142. 10.1093/bioinformatics/18.8.1141.
Marti-Renom MA, Stuart AC, Fiser A, Sanchez R, Melo F, Sali A: Comparative protein structure modeling of genes and genomes. Annu Rev Biophys Biomol Struct. 2000, 29: 291-325. 10.1146/annurev.biophys.29.1.291.
Sali A, Blundell TL: Comparative protein modelling by satisfaction of spatial restraints. J Mol Biol. 1993, 234 (3): 779-815. 10.1006/jmbi.1993.1626.
Petrey D, Xiang Z, Tang CL, Xie L, Gimpelev M, Mitros T, Soto CS, Goldsmith-Fischman S, Kernytsky A, Schlessinger A, et al: Using multiple structure alignments, fast model building, and energetic analysis in fold recognition and homology modeling. Proteins. 2003, 53 (Suppl 6): 430-435. 10.1002/prot.10550.
Meller J, Elber R: Linear programming optimization and a double statistical filter for protein threading protocols. Proteins. 2001, 45 (3): 241-261. 10.1002/prot.1145.
Teodorescu O, Galor T, Pillardy J, Elber R: Enriching the sequence substitution matrix by structural information. Proteins. 2004, 54 (1): 41-48. 10.1002/prot.10474.
Tobi D, Elber R: Distance-dependent, pair potential for protein folding: results from linear optimization. Proteins. 2000, 41 (1): 40-46. 10.1002/1097-0134(20001001)41:1<40::AID-PROT70>3.0.CO;2-U.
Tosatto SCE: The Victor/FRST Function for Model Quality Estimation. Journal of Computational Biology. 2005, 12 (10): 1316-1327. 10.1089/cmb.2005.12.1316.
Lund O, Nielsen M, Lundegaard C, Worning P: CPHmodels 2.0: X3 M a Computer Program to Extract 3 D Models. Abstract at the CASP5 conference A102. 2002
Bennett-Lovsey RM, Herbert AD, Sternberg MJ, Kelley LA: Exploring the extremes of sequence/structure space with ensemble fold recognition in the program Phyre. Proteins. 2008, 70 (3): 611-625. 10.1002/prot.21688.
Nicholas K, Nicholas H, Deerfield D: GeneDoc: Analysis and Visualization of Genetic Variation. EMBNEWNEWS. 1997, 14:
Bowie JU, Luthy R, Eisenberg D: A method to identify protein sequences that fold into a known three-dimensional structure. Science. 1991, 253 (5016): 164-170. 10.1126/science.1853201.
Luethy R, Bowie JU, Eisenberg D: Assessment of protein models with three-dimensional profiles. Nature. 1992, 356 (6364): 83-85. 10.1038/356083a0.
Sippl MJ: Boltzmann's principle, knowledge-based mean fields and protein folding. An approach to the computational determination of protein structures. J Comput Aided Mol Des. 1993, 7 (4): 473-501. 10.1007/BF02337562.
Xiang Z, Soto CS, Honig B: Evaluating conformational free energies: the colony energy and its application to the problem of loop prediction. Proc Natl Acad Sci USA. 2002, 99 (11): 7432-7437. 10.1073/pnas.102179699.
Canutescu AA, Shelenkov AA, Dunbrack RL: A graph-theory algorithm for rapid protein side-chain prediction. Protein Sci. 2003, 12 (9): 2001-2014. 10.1110/ps.03154503.
Jacobson MP, Friesner RA, Xiang Z, Honig B: On the role of the crystal environment in determining protein side-chain conformations. J Mol Biol. 2002, 320 (3): 597-608. 10.1016/S0022-2836(02)00470-9.
Xiang Z, Honig B: Extending the accuracy limits of prediction for side-chain conformations. J Mol Biol. 2001, 311 (2): 421-430. 10.1006/jmbi.2001.4865.
Xiang Z, Steinbach PJ, Jacobson MP, Friesner RA, Honig B: Prediction of side-chain conformations on protein surfaces. Proteins. 2007, 66 (4): 814-823. 10.1002/prot.21099.
Nicholls A, Sharp KA, Honig B: Protein folding and association: insights from the interfacial and thermodynamic properties of hydrocarbons. Proteins. 1991, 11 (4): 281-296. 10.1002/prot.340110407.
Bashford D, Karplus M: pKa's of ionizable groups in proteins: atomic detail from a continuum electrostatic model. Biochemistry. 1990, 29 (44): 10219-10225. 10.1021/bi00496a010.
Gordon JC, Myers JB, Folta T, Shoja V, Heath LS, Onufriev A: H++: a server for estimating pKas and adding missing hydrogens to macromolecules. Nucleic Acids Res. 2005, W368-371. 10.1093/nar/gki464. 33 Web Server
Myers J, Grothaus G, Narayanan S, Onufriev A: A simple clustering algorithm can be accurate enough for use in calculations of pKs in macromolecules. Proteins. 2006, 63 (4): 928-938. 10.1002/prot.20922.
Pettersen EF, Goddard TD, Huang CC, Couch GS, Greenblatt DM, Meng EC, Ferrin TE: UCSF Chimera--a visualization system for exploratory research and analysis. J Comput Chem. 2004, 25 (13): 1605-1612. 10.1002/jcc.20084.
Kuntz ID, Blaney JM, Oatley SJ, Langridge R, Ferrin TE: A geometric approach to macromolecule-ligand interactions. J Mol Biol. 1982, 161 (2): 269-288. 10.1016/0022-2836(82)90153-X.
Richards FM: Areas, volumes, packing and protein structure. Annu Rev Biophys Bioeng. 1977, 6: 151-176. 10.1146/annurev.bb.06.060177.001055.
Ferrin TE, Huang CC, Jarvis LE, Langridge R: The MIDAS display system. J Mol Graph. 1988, 6 (1): 13-27. 10.1016/0263-7855(88)80054-7.
Shoichet BK, Bodian DL, Kuntz ID: Molecular docking using shape descriptors. J Comp Chem. 1992, 13 (3): 380-397. 10.1002/jcc.540130311.
Meng EC, Shoichet BK, Kuntz ID: Automated docking with grid-based energy evaluation. J Comp Chem. 1992, 13: 505-524. 10.1002/jcc.540130412.
Sobolev V, Sorokine A, Prilusky J, Abola EE, Edelman M: Automated analysis of interatomic contacts in proteins. Bioinformatics. 1999, 15 (4): 327-332. 10.1093/bioinformatics/15.4.327.
Wallace AC, Laskowski RA, Thornton JM: LIGPLOT: A program to generate schematic diagrams of protein-ligand interactions. Prot Eng. 1995, 8: 127-134. 10.1093/protein/8.2.127.
The authors wish to thank Diana Murray and Antonina Silkov for access to computational resources and helpful discussions. This work was financially supported by the National Science Foundation (NSF Grant 0618233).
EW carried out the modeling and sequence/structure analyses and drafted the manuscript. SMS conceived of the study, participated in its design and coordination, guided the analyses and refined the manuscript. All authors read and approved the final manuscript.
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.
About this article
Cite this article
Wywial, E., Singh, S.M. Identification and structural characterization of FYVE domain-containing proteins of Arabidopsis thaliana. BMC Plant Biol 10, 157 (2010). https://0-doi-org.brum.beds.ac.uk/10.1186/1471-2229-10-157
- Domain Architecture
- FYVE Domain
- Hydrophobic Motif
- Putative Uncharacterized Protein
- Plant Signaling Pathway