Open Access

A new scheme to discover functional associations and regulatory networks of E3 ubiquitin ligases

BMC Systems BiologyBMC series – open, inclusive and trusted201610(Suppl 1):S3

https://doi.org/10.1186/s12918-015-0244-1

Published: 11 January 2016

Abstract

Background

Protein ubiquitination catalyzed by E3 ubiquitin ligases play important modulatory roles in various biological processes. With the emergence of high-throughput mass spectrometry technology, the proteomics research community embraced the development of numerous experimental methods for the determination of ubiquitination sites. The result is an accumulation of ubiquitinome data, coupled with a lack of available resources for investigating the regulatory networks among E3 ligases and ubiquitinated proteins. In this study, by integrating existing ubiquitinome data, experimentally validated E3 ligases and established protein-protein interactions, we have devised a strategy to construct a comprehensive map of protein ubiquitination networks.

Results

In total, 41,392 experimentally verified ubiquitination sites from 12,786 ubiquitinated proteins of humans have been obtained for this study. Additional 494 E3 ligases along with 1220 functional annotations and 28588 protein domains were manually curated. To characterize the regulatory networks among E3 ligases and ubiquitinated proteins, a well-established network viewer was utilized for the exploration of ubiquitination networks from 40892 protein-protein interactions. The effectiveness of the proposed approach was demonstrated in a case study examining E3 ligases involved in the ubiquitination of tumor suppressor p53. In addition to Mdm2, a known regulator of p53, the investigation also revealed other potential E3 ligases that may participate in the ubiquitination of p53.

Conclusion

Aside from the ability to facilitate comprehensive investigations of protein ubiquitination networks, by integrating information regarding protein-protein interactions and substrate specificities, the proposed method could discover potential E3 ligases for ubiquitinated proteins. Our strategy presents an efficient means for the preliminary screen of ubiquitination networks and overcomes the challenge as a result of limited knowledge about E3 ligase-regulated ubiquitination.

Keywords

Ubiquitination Ubiquitin E3 ubiquitin ligase Protein-protein interaction Ubiquitination network

Introduction

Protein ubiquitination involves a series of enzymatic reactions such as E1 activation, E2 conjugation, and E3 ligation, resulting in the conjugation of single or multiple ubiquitin proteins at a target lysine residue [1]. Numerous substrate proteins with ubiquitination sites have been characterized to date, owing to the emergence of high-throughput mass spectrometry-based proteomics approaches [2]–[4]. Identified to play key roles in transcriptional regulation, signal transduction, development, apoptosis, endocytosis, cell proliferation and cancers, ubiquitination of the lysine residue has been regarded as an essential mediator of various biological processes [5]–[7]. Among the enzymes that catalyze protein ubiquitination, E3 ligases are particularly important for the recognition of substrate sites to facilitate ubiquitin-mediated protein degradation [8]. The relationships between E3 ligase and substrates are complex. Multiple substrates could be targeted by a single E3 ligase; alternatively, multiple E3 ligases could catalyze the ubiquitination of a single substrate [9]. These substrate-enzyme correlations could be used to construct E3-specific regulatory networks and map to the associated cellular pathways, making possible the characterization of complex cellular processes and functional analysis of E3-sbustrate relationships [9]. This approach has allowed the discovery of the role that anaphase-promoting complex (APC)/cyclosome plays in modulating key targets of the cell cycle, such as cyclins and their related E3 ligases [10]–[12].

To date, a significant amount of research efforts have been invested towards the characterization of E3 structures and examination of the mechanisms underlying E3-mediated regulatory networks, as well as E3-related diseases [13]–[21]. Based on their catalytic mechanisms in the ubiquitination process, E3 ligases can be classified into three major types: the HECT (homologous to E6-AP C-terminus), the RING (really interesting new gene), and U-box domain types [22]. The HECT-type is responsible for catalyzing the attachment of ubiquitin to substrate proteins. In contrast, the RING-type and U-box-type, similar in both structure and function, facilitate the interaction between an E2 enzyme and the target proteins. Regardless of the types, the significance of E3-mediated ubiquitination is obvious from their association with diseases [23]. Several studies have suggested that the inhibition of E3 ligases may cause growth suppression or cell death, as evidenced by the over-expression of Mdm2/Hdm2, IAPs, and SCF in various human cancers [24]. Therefore, regulation of E3 ligase activities and functions may be a promising approach for cancer treatments.

Many databases and tools have been developed to aid in the study of E3 ligases. For example, E3Miner [25] offers a text mining approach to identify ubiquitin-protein ligases, whereas E3Net [9] allows users to search through a collection of 1671 E3-substrate relationships among 493 E3s and 1277 substrates in 42 organisms. In contrast, by analyzing protein sequence similarities, domains, and distributions across different species, Sakiyama et al. [26] constructed a useful database for the exploration of proteins involved in the ubiquitin signaling cascade. Unfortunately, the present accumulation of large-scale ubiquitinome data demands for the development of tools that investigate the regulatory networks of E3 ligases and their substrates. Here, we present a new strategy that utilizes an interactive network viewer to assist with the discovery of novel protein ubiquitination networks. Furthermore, to effectively investigate the relationships between E3 ligases and their substrates, metabolic pathways and protein-protein interactions (PPIs) were integrated to construct comprehensive protein ubiquitination networks. The ability of the proposed method to identify E3 ligase-mediated ubiquitination networks and their biological significance was demonstrated by case studies. The results indicated that, despite the current limited knowledge about regulatory relationship between E3 ligases and ubiquitinated proteins, our approach could uncover potential E3 ligase-substrate relationships based on based on protein-protein interaction information and substrate site specificities.

Materials and method

Construction of the protein ubiquitination networks involved collection of E3 ligase and ubiquitinated protein data, integration of ubiquitinated proteins’ functional data, computational identification of ubiquitination sites based on substrate motifs, as well as network construction using protein-protein interactions and metabolic pathways (Fig. 1). A network viewer was employed to provide a visualization of the ubiquitination regulatory network, with implemented functional information, for a group of proteins of interest. The detailed workflow is described as follows.
Fig. 1

System flow of protein ubiquitination network construction

Data collection of E3 ubiquitin ligases and ubiquitinated proteins

Experimentally validated E1 activating, E2 conjugating, and E3 ligating enzyme data were obtained from various sources. From UUCD-Version 1.0 [27], seven distinct E1 activating enzymes were collected. From E3Net, UUCD [27], hUbiquitome [28], and UniProtKB [29], 494 non-redundant E3 ubiquitin ligases and their biological functions were extracted. In addition, a total of 46 non-redundant E2 conjugating enzymes were collected from UUCD [27], hUbiquitome [28] and UniProtKB [29]. Experimentally verified ubiquitination sites from dbPTM [30]–[32] were also included. Next, search keywords, such as “ubiquitinated”, “ubiquitination”, “ubiquitylated”, or “ubiquitylation”, were entered on the PubMed database to extract ubiquitinated protein data from research articles. Specifically, full texts of the matched research articles were manually reviewed to ensure that the exact ubiquitinated peptide and modified lysine residue information were extracted. Finally, redundant data were removed, generating a total of 41,392 ubiquitinated lysines from 12,786 ubiquitinated human proteins.

Characterization of protein ubiquitination sites

To characterize the amino acid composition of protein ubiquitination sites, WebLogo [33], [34] was utilized to generate the relative frequency of the corresponding amino acid at each position around the ubiquitination sites as represented by the graphical sequence logo. As well, to further discriminate the amino acid composition of ubiquitinated sites from their non-ubiquitinated counterparts, TwoSampleLogo [35] was adopted to display statistically significant differences in position-specific symbol compositions. The inherent complexity of large-scale ubiquitinome data may make it difficult to uncover conserved motifs. To overcome this problem, MDDLogo [36] was applied to identify potential motifs for the curated protein ubiquitination sites. MDDLogo is a program that uses the maximal dependence decomposition (MDD) approach to discover conserved motifs from groups of aligned signal sequences through a recursive process that divides the data sets into tree-like subgroups. The effectiveness of MDDLogo has been demonstrated in the identification of substrate motifs for phosphorylation [37]–[40], S-nitrosylation [41], O-GlcNAcylation sites [42], S-glutathionylation [43], as well as ubiquitin conjugation sites [2].

Data integration for functional investigation of ubiquitinated proteins

To investigate the biological significance of ubiquitinated proteins, various biological databases, such as Gene Ontology (GO) [44], InterPro [45], as well as KEGG Diseases and Pathways [46], were incorporated. To provide comprehensive functional annotations of proteins associated with ubiquitination, the ubiquitinated proteins were classified according to their molecular functions, biological processes, and cellular components. Since ubiquitination is known to regulate the cellular localization, interactions, and degradation of proteins [47]–[49], the biological roles of ubiquitination sites within a specific protein domain could be inferred from the functional annotation of the domain. For this purpose, essential protein family, domain, and functional site information was obtained from InterPro [45], a database which integrates data from various sources such as the PROSITE [50], PRINTS [51], Pfam [52], and ProDom [53].

Network construction using protein-protein interactions and metabolic pathways

Substantial evidence supports the role that protein ubiquitination plays in the regulation of cellular processes. Thus, by integrating experimentally validated mammalian E3 ubiquitin ligases and their functional information, we hoped to provide a foundation for navigating ubiquitination regulatory networks in mammals. To facilitate the exploration of regulatory relationships between E3 ligases and their ubiquitinated substrates, associated metabolic pathways and protein-protein interactions (PPIs) were included for the comprehensive construction of protein ubiquitination networks. The human metabolic pathways were extracted from KEGG [54]. Experimentally verified PPIs were obtained from over ten PPI databases (Additional file 1: Table S1). Potential PPIs predicted based on co-regulation, co-occurrence in the literature, co-expression, and genomic context were curated from the STRING database [55]; each interaction included a confidence score calculated by the STRING built-in function.

Next, a graph theory [56], [57] approach has been adopted to illustrate the relationships between E3 ligases and substrates. Specifically, we use a directed and cyclic graph G = ( V , E ) to symbolize a protein ubiquitination network, where x , y V and ( x , y ) E. The E3 ligases and substrate proteins were represented by x and y, respectively, and protein ubiquitination was denoted by (x, y) E to indicate the recognition of a specific substrate y by E3 ligase x (Additional file 2: Figure S1). Due to limited knowledge about ubiquitinated substrates that are recognized by E3 ligases, (x, y) could also represent a type of protein-protein interaction between E3 ligase x and ubiquitinated protein y. We used V to refer to all human proteins and E, to all experimentally confirmed PPIs. Cytoscape [58], a publicly available network viewer, was employed for the visualization of regulatory networks among E3 ligases and ubiquitinated substrates.

Results and discussion

Data statistics in this investigation

Data used for building the protein ubiquitination networks in this study were experimentally validated and supported with 39,814 research articles. Over 500 research articles were manually reviewed via a text mining method. In total, 41,392 ubiquitination sites from 12,786 ubiquitinated proteins in humans were extracted from 406 literatures. After removing redundant data among heterogeneous online resources, 494 experimentally verified human E3 ubiquitin ligases remained in the resulting data. PPIs between E3 ligases and ubiquitinated proteins were retrieved to deduce potential regulatory relationships between E3 ligases and ubiquitinated substrates to compensate for the limited information about E3 ligase targets. As shown in Table 1, 9,271 physical PPIs between 426 E3 ligases and 2,649 ubiquitinated proteins were curated. In particular, by incorporating the substrate motifs identified by the MDDLogo ubiquitination site prediction method [36], potential substrates of E3 ligases could be inferred from the 27,227 PPIs between E3 ligases and other proteins. Moreover, 377,117 PPIs that appeared to involve ubiquitinated proteins could be integrated for the investigation of their functional associations in the context of ubiquitination.
Table 1

Data statistics in this work

Data content

Number of records

Ubiquitinated protein (potential E3 substrates)

12,786

Ubiquitination sites

41,392

E3 ubiquitin ligases

494

PPIs between E3 ligases and other proteins

27,227

PPIs between E3 ligases and ubiquitinated proteins

9,271

E3 ligases interacting with ubiquitinated proteins

436

Ubiquitinated proteins interacting with E3 ligases

2,649

Supported articles

39,814

Substrate specificities of human ubiquitination sites

The entropy plots generated by the sequence logo was used to graphically visualize the amino acid sequences flanking the substrate sites (at position 0). This allows for the easy observation of amino acid conservation surrounding the ubiquitination sites. Figure 2a shows Leu (L), Glu (E), and Ala (A) to be the most conserved amino acid residues as indicated by the position-specific amino aicd composition around the ubiquitinated lysines. Furthermore, using TwoSampleLogo, the differences in position-specific amino acid composition between ubiquitinated and non-ubiquitinated sites were revealed (Fig. 2b). The residues surrounding the ubiquitination sites were significantly enriched with Ala (A), Asp (D), Glu (E), Leu (L), Gly (G) and Thr (T), and depleted in Cys (C), His (H), Arg (R), Trp (W) and Met (M) (p < 0.005).
Fig. 2

Amino acid composition of protein ubiquitination sites. a The frequency plot of ubiquitinated sites. b The compositional biases of amino acids around ubiquitination sites compared to the non-ubiquitination sites

To overcome the difficulty of discovering conserved motifs from large-scale ubiquitinome data, the MDDLogo clustering method was adopted to search for substrate motifs from the curated human ubiquitination sites using a 13-mer window length. MDDLogo identified a total of nine subgroups containing conserved motifs from non-homologous human ubiquitination sites (Additional file 3: Table S2). While subgroup 1 (241 ubiquitination sites) contained the conserved amino acid composition at positions +3 and +5, the conserved motif of subgroup 2 included Arginine (R), Lysine (K), Phenylalanine (F), Tyrosine (Y) and Tryptophan (W) residues at position +5. The conserved motifs of Subgroups 3 and 8 comprised Glutamic acid (E), Aspartic acid (D), Glutamine (Q) and Asparagine (N) residues at positions +3 and -2, respectively. In contrast, the remaining subgroups consisted of Phenylalanine (F), Tyrosine (Y) and Tryptophan (W) residues at various specific positions in their conserved motifs. Thus, substrate motifs for ubiquitination sites may be determined by the position-specific conservation of Phenylalanine (F), Tyrosine (Y) and Tryptophan (W) residues. Furthermore, MDDLogo could be utilized to identify putative ubiquitination sites and potential interaction between E3 ligase and ubiquitinated proteins based on substrate motif conservation.

Functional associations of E3 ligases and ubiquitinated proteins

Distributions of GO annotations for E3 ligases and ubiquitinated proteins categorized according to their corresponding biological processes, molecular functions and cellular components are provided in (Additional file 4: Table S3 and Additional file 5: Table S4), respectively. Following the InterPro annotation, the most abundant protein domain for E3 ligases appeared to belong to the “Zinc finger, RING-type RNA” (Table 2). In a genome-wide study of E3 ligases, it was suggested that the mammalian genomes encode more than 600 potential RING finger E3s [59]. E3 ligases containing the RING finger domain facilitate the interaction between an E2 enzyme and a substrate to mediate the transfer of ubiquitin from E2 to the target [60], [61]. On the other hand, those with the HECT domain are involved in the regulation of cellular trafficking, immune response, cellular growth and proliferation [62]. The HECT domain containing E3 ligases form a catalytic intermediate with ubiquitin and is responsible for the catalysis of two reactions: 1) transesterification reaction, in which ubiquitin from the cysteine residue at the E2 active site is transferred to another cysteine residue in the HECT domain [60]; 2) the subsequent attack of a substrate lysine on the thioester of the ubiquitin-bound HECT domain [63] (Additional file 6: Figure S2). Whereas the C-terminus of the HECT domain is more conserved, the N-terminus, the part that mediates substrate targeting, is more diverse [62].
Table 2

The distribution of top 20 functional domains for human E3 ligases

No.

InterPro ID

Domain terms

Number of proteins

% Total

1

IPR001680

WD40 repeat

287

57.5150 %

2

IPR006652

Kelch repeat type 1

68

13.6273 %

3

IPR000408

Regulator of chromosome condensation, RCC1

55

11.0220 %

4

IPR001841

Zinc finger, RING-type

54

10.8216 %

5

IPR000315

Zinc finger, B-box

48

9.6192 %

6

IPR003877

SPla/RYanodine receptor SPRY

38

7.6152 %

7

IPR013069

BTB/POZ

28

5.6112 %

8

IPR000569

HECT

28

5.6112 %

9

IPR001202

WW/Rsp5/WWP

28

5.6112 %

10

IPR020683

Ankyrin repeat-containing domain

27

5.4108 %

11

IPR001496

SOCS protein, C-terminal

25

5.0100 %

12

IPR002867

Zinc finger, C6HC-type

23

4.6092 %

13

IPR018957

Zinc finger, C3HC4 RING-type

22

4.4088 %

14

IPR001258

NHL repeat

21

4.2084 %

15

IPR011705

BTB/Kelch-associated

15

3.0060 %

16

IPR002110

Ankyrin repeat

14

2.8056 %

17

IPR000571

Zinc finger, CCCH-type

11

2.2044 %

18

IPR001876

Zinc finger, RanBP2-type

11

2.2044 %

19

IPR011016

Zinc finger, RING-CH-type

11

2.2044 %

20

IPR001452

Src homology-3 domain

10

2.0040 %

According to the annotation information on InterPro, approximately 70 % of established ubiquitination sites are mapped to specific functional domains, suggesting that ubiquitination may modulate a variety of biological functions. The top 50 InterPro functional domains containing ubiquitinated sites in humans are given in Table 3. It appeared that most ubiquitination sites could be found in the MHC class I (alpha chain) protein domains. It has been reported that viral proteins could induce the degradation of the histocompatibility complex (MHC) class I protein in the endoplasmic reticulum and at the cell surface by ubiquitinating the MHC class I domain [64]. The immunoglobulin C1-set domain, or classical Ig-like domains that resemble the antibody constant domain, is another domain found to be enriched with ubiquitinated sites. Interestingly, these domains were found exclusively in mediators of immune response, including various T-cell receptors, MHC class I and II complexes [65].
Table 3

Distribution of the top 50 functional domains covering ubiquitination sites

No.

Domain (InterPro) ID

Domain (InterPro) terms

Number of sites

% Total

1

IPR001039

MHC class I, alpha chain, alpha1/alpha2

1625

3.6574 %

2

IPR003597

Immunoglobulin C1-set

1321

2.9732 %

3

IPR001680

WD40 repeat

1043

2.3475 %

4

IPR002017

Spectrin repeat

1012

2.2777 %

5

IPR010579

MHC class I, alpha chain, C-terminal

729

1.6408 %

6

IPR003961

Fibronectin, type III

469

1.0556 %

7

IPR000504

RNA recognition motif domain

419

0.9431 %

8

IPR000719

Protein kinase, catalytic domain

338

0.7607 %

9

IPR013098

Immunoglobulin I-set

232

0.5222 %

10

IPR020683

Ankyrin repeat-containing domain

230

0.5177 %

11

IPR017868

Filamin/ABP280 repeat-like

212

0.4772 %

12

IPR006209

EGF-like domain

179

0.4029 %

13

IPR001715

Calponin homology domain

177

0.3984 %

14

IPR010630

Neuroblastoma breakpoint family (NBPF) domain

168

0.3781 %

15

IPR001452

Src homology-3 domain

167

0.3759 %

16

IPR006652

Kelch repeat type 1

158

0.3556 %

17

IPR000408

Regulator of chromosome condensation, RCC1

157

0.3534 %

18

IPR000225

Armadillo

156

0.3511 %

19

IPR004088

K Homology domain, type 1

154

0.3466 %

20

IPR001650

Helicase, C-terminal

146

0.3286 %

21

IPR002126

Cadherin

145

0.3264 %

22

IPR000048

IQ motif, EF-hand binding site

143

0.3219 %

23

IPR000083

Fibronectin, type I

143

0.3219 %

24

IPR000626

Ubiquitin

142

0.3196 %

25

IPR000008

C2 calcium-dependent membrane targeting

135

0.3038 %

26

IPR001806

Small GTPase superfamily

125

0.2813 %

27

IPR001245

Serine-threonine/tyrosine-protein kinase catalytic domain

125

0.2813 %

28

IPR000433

Zinc finger, ZZ-type

123

0.2768 %

29

IPR001478

PDZ domain

122

0.2746 %

30

IPR001849

Pleckstrin homology domain

120

0.2701 %

31

IPR011545

DNA/RNA helicase, DEAD/DEAH box type, N-terminal

112

0.2521 %

32

IPR019787

Zinc finger, PHD-finger

111

0.2498 %

33

IPR001440

Tetratricopeptide TPR-1

111

0.2498 %

34

IPR001202

WW/Rsp5/WWP

110

0.2476 %

35

IPR018108

Mitochondrial substrate/solute carrier

110

0.2476 %

36

IPR001781

Zinc finger, LIM-type

106

0.2386 %

37

IPR001101

Plectin repeat

104

0.2341 %

38

IPR002049

EGF-like, laminin

100

0.2251 %

39

IPR002429

Cytochrome c oxidase subunit II C-terminal

96

0.2161 %

40

IPR011759

Cytochrome C oxidase subunit II, transmembrane domain

96

0.2161 %

41

IPR003439

ABC transporter-like

91

0.2048 %

42

IPR001487

Bromodomain

91

0.2048 %

43

IPR018502

Annexin repeat

88

0.1981 %

44

IPR000980

SH2 domain

86

0.1936 %

45

IPR003008

Tubulin/FtsZ, GTPase domain

77

0.1733 %

46

IPR001421

ATPase, F0 complex, subunit 8, mitochondrial, Metazoan

75

0.1688 %

47

IPR013069

BTB/POZ

75

0.1688 %

48

IPR003959

ATPase, AAA-type, core

74

0.1666 %

49

IPR007087

Zinc finger, C2H2

74

0.1666 %

50

IPR000197

Zinc finger, TAZ-type

74

0.1666 %

Network analysis for a group of interested proteins

To allow users to efficiently search for the proteins of their interest, a convenient interactive network viewer was implemented in the proposed method implemented an interactive network viewer. An example of constructing a protein ubiquitination network using our approach is illustrated in (Additional file 7: Figure S3). The network was built with four E3 ligases, 14 ubiquitinated proteins and three other proteins. While the established interactions between the four E3 ligases and 14 ubiquitinated proteins were immediately recognized, three other proteins interacting with two of the E3 ligases were also revealed as potential ubiquitinated substrates. For instance, E3 ligase MDM2 was predicted to target Forkhead box protein O3 (FOXO3) for ubiquitination. This is consistent with a recent study supporting MDM2 to be an E3 ligase responsible for the ubiquitin-mediated degradation of FOXO3 [66]. As well, our approach could provide the potential ubiquitination sites and the corresponding substrate motifs for a specific protein. Furthermore, for a specific E3 ligase and their interacting ubiquitinated proteins, the analysis could even be extended to exploring their functional associations and creating a comprehensive ubiquitin regulatory network.

A case study of the discovered E3 ligases associated with the regulation of p53

In cases where information is limited with respect to the interaction between an E3 ligase and its corresponding substrates, our strategy could still identify the potential E3 ligases that may target a specific ubiquitinated protein. A case study is shown in Fig. 3, demonstrating the ability of the proposed method to construct an interaction map for the ubiquitination of tumor suppressor p53 (TP53). The resulting network is consistent with the literature. As a transcription factor, the tumor suppressor protein p53 responds to stress such as DNA damage by inducing cell cycle arrest and apoptosis [67]. Recent evidence has established that MDM2, a RING oncoprotein and a negative regulator of p53 [24], modulates the proteasomal degradation of p53 via a RING-finger-dependent manner [68]–[72]. Yet, our approach discovered other E3 ligases that may also regulate the ubiquitination of p53. Thus, the proposed strategy has the ability to uncover potential substrates for a specific E3 ligase, as well as potential E3 ligases for ubiquitinated proteins.
Fig. 3

A case study of the discovered E3 ligases associated with the regulation of tumor suppressor p53 (TP53)

Conclusion

In an attempt to characterize the regulatory role protein ubiquitination plays in a variety of biological processes, we combined the information of E3 ligases, ubiquitinated proteins, and protein-protein interactions to construct a comprehensive network of E3 ligases and their ubiquitinated substrates. Designed to serve as not only a meaningful platform for investigating E3-substrate regulatory networks but also a new strategy to uncover potential E3 ligases for ubiquitinated substrates, the proposed approach allows for the efficient characterization of protein ubiquitination networks from large-scale ubiquitinome data. With access to more updated data, the proposed scheme can be further refined for the study of E1 activating enzymes, E2 conjugating enzymes, and E3 ubiquitin ligases. Also, recent publications regarding the structural environment of experimentally validated ubiquitination sites based on protein tertiary structures [73]–[76] could be incorporated to infer the functional interactions between the enzymes and substrates. Finally, confirmed functional annotations of ubiquitination sites could be extracted from the literature via a more advanced information retrieval system to collect more adequate information required for further functional analyses.

Additional files

Declarations

Acknowledgements

The authors sincerely appreciate the Ministry of Science and Technology of Taiwan for financially supporting this research under Contract Number of MOST 103-2221-E-155-020-MY3, MOST 103-2633-E-155-002 and MOST104-2221-E-155-036-MY2.

Declarations

Publication charge for this work was funded by MOST grant under contract number of MOST 103-2221-E-155-020-MY3 and MOST 104-2221-E-155-036-MY2 to TYL.

Authors’ Affiliations

(1)
Department of Computer Science and Engineering, Yuan Ze University
(2)
Innovation Center for Big Data and Digital Convergence, Yuan Ze University
(3)
Department of Obstetrics and Gynecology, Hsinchu Mackay Memorial Hospital
(4)
Mackay Junior College of Medicine, Nursing and Management
(5)
Department of Medicine, Mackay Medical College

References

  1. Hershko A, Ciechanover A: The ubiquitin system. Annu Rev Biochem. 1998, 67: 425-79. 10.1146/annurev.biochem.67.1.425.View ArticlePubMedGoogle Scholar
  2. Nguyen VN, Huang KY, Huang CH, Chang TH, Bretana N, Lai K, et al: Characterization and identification of ubiquitin conjugation sites with E3 ligase recognition specificities. BMC bioinformatics. 2015, 16 (Suppl 1): S1-10.1186/1471-2105-16-S1-S1.View ArticlePubMedPubMed CentralGoogle Scholar
  3. Wagner SA, Beli P, Weinert BT, Nielsen ML, Cox J, Mann M, et al: A proteome-wide, quantitative survey of in vivo ubiquitylation sites reveals widespread regulatory roles. Molecular & cellular proteomics: MCP. 2011, 10 (10): M111 013284-10.1074/mcp.M111.013284.View ArticlePubMed CentralGoogle Scholar
  4. Lee TY, Chen SA, Hung HY, Ou YY: Incorporating distant sequence features and radial basis function networks to identify ubiquitin conjugation sites. Plos One. 2011, 6 (3): e17331-10.1371/journal.pone.0017331.View ArticlePubMedPubMed CentralGoogle Scholar
  5. Hurley JH, Lee S, Prag G: Ubiquitin-binding domains. Biochem J. 2006, 399: 361-72. 10.1042/BJ20061138.View ArticlePubMedPubMed CentralGoogle Scholar
  6. Hicke L, Schubert HL, Hill CP: Ubiquitin-binding domains. Nat Rev Mol Cell Bio. 2005, 6 (8): 610-21. 10.1038/nrm1701.View ArticleGoogle Scholar
  7. Peng JM, Schwartz D, Elias JE, Thoreen CC, Cheng DM, Marsischky G, et al: A proteomics approach to understanding protein ubiquitination. Nat Biotechnol. 2003, 21 (8): 921-6. 10.1038/nbt849.View ArticlePubMedGoogle Scholar
  8. Wilkinson KD: The discovery of ubiquitin-dependent proteolysis. Proc Natl Acad Sci U S A. 2005, 102 (43): 15280-2. 10.1073/pnas.0504842102.View ArticlePubMedPubMed CentralGoogle Scholar
  9. Han Y, Lee H, Park JC, Yi GS: E3Net: a system for exploring E3-mediated regulatory networks of cellular functions. Mol Cell Proteomics. 2012, 11 (4): O111.014076-10.1074/mcp.O111.014076.View ArticlePubMedGoogle Scholar
  10. Zhang J, Wan L, Dai X, Sun Y, Wei W: Functional characterization of anaphase promoting complex/cyclosome (APC/C) E3 ubiquitin ligases in tumorigenesis. Biochim Biophys Acta. 2014, 1845 (2): 277-93.PubMedPubMed CentralGoogle Scholar
  11. Manchado E, Eguren M, Malumbres M: The anaphase-promoting complex/cyclosome (APC/C): cell-cycle-dependent and -independent functions. Biochem Soc Trans. 2010, 38 (Pt 1): 65-71. 10.1042/BST0380065.View ArticlePubMedGoogle Scholar
  12. Acquaviva C, Pines J: The anaphase-promoting complex/cyclosome: APC/C. J Cell Sci. 2006, 119 (Pt 12): 2401-4. 10.1242/jcs.02937.View ArticlePubMedGoogle Scholar
  13. Spratt DE, Walden H, Shaw GS: RBR E3 ubiquitin ligases: new structures, new insights, new questions. Biochem J. 2014, 458 (3): 421-37. 10.1042/BJ20140006.View ArticlePubMedPubMed CentralGoogle Scholar
  14. Paul I, Ghosh MK. The E3 ligase CHIP: insights into its structure and regulation. Biomed Res Int. 2014.Google Scholar
  15. Duplan V, Rivas S. E3 ubiquitin-ligases and their target proteins during the regulation of plant innate immunity. Front Plant Sci. 2014;5.Google Scholar
  16. Snoek BC, de Wilt LH, Jansen G, Peters GJ: Role of E3 ubiquitin ligases in lung cancer. World journal of clinical oncology. 2013, 4 (3): 58-69. 10.5306/wjco.v4.i3.58.View ArticlePubMedPubMed CentralGoogle Scholar
  17. Plechanovova A, Jaffray EG, Tatham MH, Naismith JH, Hay RT: Structure of a RING E3 ligase and ubiquitin-loaded E2 primed for catalysis. Nature. 2012, 489 (7414): 115-20. 10.1038/nature11376.View ArticlePubMedPubMed CentralGoogle Scholar
  18. Metzger MB, Hristova VA, Weissman AM: HECT and RING finger families of E3 ubiquitin ligases at a glance. J Cell Sci. 2012, 125 (3): 531-7. 10.1242/jcs.091777.View ArticlePubMedPubMed CentralGoogle Scholar
  19. Bernassola F, Karin M, Ciechanover A, Melino G: The HECT family of E3 ubiquitin ligases: Multiple players in cancer development. Cancer Cell. 2008, 14 (1): 10-21. 10.1016/j.ccr.2008.06.001.View ArticlePubMedGoogle Scholar
  20. Scheffner M, Staub O. HECT E3s and human disease. BMC Biochem. 2007;8.Google Scholar
  21. Mazzucotelli E, Belloni S, Marone D, De Leonardis AM, Guerra D, Fonzo N, et al: The E3 ubiquitin ligase gene family in plants: regulation by degradation. Curr Genomics. 2006, 7 (8): 509-22. 10.2174/138920206779315728.View ArticlePubMedPubMed CentralGoogle Scholar
  22. Robinson PA, Ardley HC: Ubiquitin-protein ligases. J Cell Sci. 2004, 117 (22): 5191-4. 10.1242/jcs.01539.View ArticlePubMedGoogle Scholar
  23. Sun Y: Targeting E3 ubiquitin ligases for cancer therapy. Cancer Biol Ther. 2003, 2 (6): 623-9. 10.4161/cbt.2.6.677.View ArticlePubMedGoogle Scholar
  24. Manfredi JJ: The Mdm2-p53 relationship evolves: Mdm2 swings both ways as an oncogene and a tumor suppressor. Genes Dev. 2010, 24 (15): 1580-9. 10.1101/gad.1941710.View ArticlePubMedPubMed CentralGoogle Scholar
  25. Lee H, Yi GS, Park JC: E3Miner: a text mining tool for ubiquitin-protein ligases. Nucleic Acids Res. 2008, 36 (Web Server issue): W416-22. 10.1093/nar/gkn286.View ArticlePubMedPubMed CentralGoogle Scholar
  26. Sakiyama T, Kawashima S, Yoshizawa AC, Kanehisa M: The construction of a database for ubiquitin signaling cascade. Genome Informatics. 2003, 14: 653-4.Google Scholar
  27. Gao T, Liu Z, Wang Y, Cheng H, Yang Q, Guo A, et al: UUCD: a family-based database of ubiquitin and ubiquitin-like conjugation. Nucleic Acids Res. 2013, 41 (Database issue): D445-51. 10.1093/nar/gks1103.View ArticlePubMedGoogle Scholar
  28. Du YP, Xu NF, Lu M, Li TT. hUbiquitome: a database of experimentally verified ubiquitination cascades in humans. Database-Oxford. 2011.Google Scholar
  29. Boeckmann B, Bairoch A, Apweiler R, Blatter MC, Estreicher A, Gasteiger E, et al: The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003. Nucleic Acids Res. 2003, 31 (1): 365-70. 10.1093/nar/gkg095.View ArticlePubMedPubMed CentralGoogle Scholar
  30. Lee TY, Huang HD, Hung JH, Huang HY, Yang YS, Wang TH: dbPTM: an information repository of protein post-translational modification. Nucleic Acids Res. 2006, 34 (Database issue): D622-7. 10.1093/nar/gkj083.View ArticlePubMedGoogle Scholar
  31. Lu CT, Huang KY, Su MG, Lee TY, Bretana NA, Chang WC, et al: dbPTM 3.0: an informative resource for investigating substrate site specificity and functional association of protein post-translational modifications. Nucleic Acids Res. 2013, 41 (D1): D295-305. 10.1093/nar/gks1229.View ArticlePubMedGoogle Scholar
  32. Su MG, Huang KY, Lu CT, Kao HJ, Chang YH, Lee TY: topPTM: a new module of dbPTM for identifying functional post-translational modifications in transmembrane proteins. Nucleic Acids Res. 2014, 42 (Database issue): D537-45. 10.1093/nar/gkt1221.View ArticlePubMedGoogle Scholar
  33. Crooks GE, Hon G, Chandonia JM, Brenner SE: WebLogo: a sequence logo generator. Genome Res. 2004, 14 (6): 1188-90. 10.1101/gr.849004.View ArticlePubMedPubMed CentralGoogle Scholar
  34. Schneider TD, Stephens RM: Sequence logos: a new way to display consensus sequences. Nucleic Acids Res. 1990, 18 (20): 6097-100. 10.1093/nar/18.20.6097.View ArticlePubMedPubMed CentralGoogle Scholar
  35. Vacic V, Iakoucheva LM, Radivojac P: Two sample logo: a graphical representation of the differences between two sets of sequence alignments. Bioinformatics. 2006, 22 (12): 1536-7. 10.1093/bioinformatics/btl151.View ArticlePubMedGoogle Scholar
  36. Lee TY, Lin ZQ, Hsieh SJ, Bretana NA, Lu CT: Exploiting maximal dependence decomposition to identify conserved motifs from a group of aligned signal sequences. Bioinformatics. 2011, 27 (13): 1780-7. 10.1093/bioinformatics/btr291.View ArticlePubMedGoogle Scholar
  37. Wong YH, Lee TY, Liang HK, Huang CM, Wang TY, Yang YH, et al: KinasePhos 2.0: a web server for identifying protein kinase-specific phosphorylation sites based on sequences and coupling patterns. Nucleic Acids Res. 2007, 35 (Web Server issue): W588-94. 10.1093/nar/gkm322.View ArticlePubMedPubMed CentralGoogle Scholar
  38. Huang HD, Lee TY, Tzeng SW, Horng JT: KinasePhos: a web tool for identifying protein kinase-specific phosphorylation sites. Nucleic Acids Res. 2005, 33 (Web Server issue): W226-9. 10.1093/nar/gki471.View ArticlePubMedPubMed CentralGoogle Scholar
  39. Lee TY, Bretana NA, Lu CT: PlantPhos: using maximal dependence decomposition to identify plant phosphorylation sites with substrate site specificity. BMC Bioinformatics. 2011, 12: 261-10.1186/1471-2105-12-261.View ArticlePubMedPubMed CentralGoogle Scholar
  40. Bretana NA, Lu CT, Chiang CY, Su MG, Huang KY, Lee TY, et al: Identifying protein phosphorylation sites with kinase substrate specificity on human viruses. PLoS One. 2012, 7 (7): 10.1371/journal.pone.0040694. Article ID e40694Google Scholar
  41. Lee TY, Chen YJ, Lu TC, Huang HD: SNOSite: exploiting maximal dependence decomposition to identify cysteine S-nitrosylation with substrate site specificity. PLoS One. 2011, 6 (7): 10.1371/journal.pone.0021849. Article ID e21849Google Scholar
  42. Wu HY, Lu CT, Kao HJ, Chen YJ, Chen YJ, Lee TY: Characterization and identification of protein O-GlcNAcylation sites with substrate specificity. BMC bioinformatics. 2014, 15 (Suppl 16): S1-10.1186/1471-2105-15-S16-S1.View ArticleGoogle Scholar
  43. Chen YJ, Lu CT, Huang KY, Wu HY, Chen YJ, Lee TY: GSHSite: exploiting an iteratively statistical method to identify s-glutathionylation sites with substrate specificity. PLoS One. 2015, 10 (4): 10.1371/journal.pone.0118752. Article ID e0118752Google Scholar
  44. Gene Ontology C, Blake JA, Dolan M, Drabkin H, Hill DP, Li N, et al: Gene ontology annotations and resources. Nucleic Acids Res. 2013, 41 (Database issue): D530-5. 10.1093/nar/gks1050.View ArticleGoogle Scholar
  45. Hunter S, Jones P, Mitchell A, Apweiler R, Attwood TK, Bateman A, et al: InterPro in 2011: new developments in the family and domain prediction database. Nucleic Acids Res. 2011, 40 (Database issue): D306-12.PubMedPubMed CentralGoogle Scholar
  46. Kanehisa M, Goto S, Sato Y, Furumichi M, Tanabe M: KEGG for integration and interpretation of large-scale molecular data sets. Nucleic Acids Res. 2012, 40 (Database issue): D109-14. 10.1093/nar/gkr988.View ArticlePubMedGoogle Scholar
  47. Mukhopadhyay D, Riezman H: Proteasome-independent functions of ubiquitin in endocytosis and signaling. Science. 2007, 315 (5809): 201-5. 10.1126/science.1127085.View ArticlePubMedGoogle Scholar
  48. Schnell JD, Hicke L: Non-traditional functions of ubiquitin and ubiquitin-binding proteins. The Journal of biological chemistry. 2003, 278 (38): 35857-60. 10.1074/jbc.R300018200.View ArticlePubMedGoogle Scholar
  49. Glickman MH, Ciechanover A: The ubiquitin-proteasome proteolytic pathway: destruction for the sake of construction. Physiol Rev. 2002, 82 (2): 373-428. 10.1152/physrev.00027.2001.View ArticlePubMedGoogle Scholar
  50. Bairoch A: PROSITE: a dictionary of sites and patterns in proteins. Nucleic Acids Res. 1991, 19 (Suppl): 2241-5. 10.1093/nar/19.suppl.2241.View ArticlePubMedPubMed CentralGoogle Scholar
  51. Attwood TK, Beck ME, Bleasby AJ, Parry-Smith DJ: PRINTS--a database of protein motif fingerprints. Nucleic Acids Res. 1994, 22 (17): 3590-6.PubMedPubMed CentralGoogle Scholar
  52. Sonnhammer EL, Eddy SR, Durbin R: Pfam: a comprehensive database of protein domain families based on seed alignments. Proteins. 1997, 28 (3): 405-20. 10.1002/(SICI)1097-0134(199707)28:3<405::AID-PROT10>3.0.CO;2-L.View ArticlePubMedGoogle Scholar
  53. Corpet F, Gouzy J, Kahn D: The ProDom database of protein domain families. Nucleic Acids Res. 1998, 26 (1): 323-6. 10.1093/nar/26.1.323.View ArticlePubMedPubMed CentralGoogle Scholar
  54. Ogata H, Goto S, Sato K, Fujibuchi W, Bono H, Kanehisa M: KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 1999, 27 (1): 29-34. 10.1093/nar/27.1.29.View ArticlePubMedPubMed CentralGoogle Scholar
  55. von Mering C, Huynen M, Jaeggi D, Schmidt S, Bork P, Snel B: STRING: a database of predicted functional associations between proteins. Nucleic Acids Res. 2003, 31 (1): 258-61. 10.1093/nar/gkg034.View ArticlePubMedPubMed CentralGoogle Scholar
  56. Lee TY, Bo-Kai Hsu J, Chang WC, Huang HD: RegPhos: a system to explore the protein kinase-substrate phosphorylation network in humans. Nucleic Acids Res. 2011, 39 (Database issue): D777-87. 10.1093/nar/gkq970.View ArticlePubMedGoogle Scholar
  57. Huang KY, Wu HY, Chen YJ, Lu CT, Su MG, Hsieh YC, et al: RegPhos 2.0: an updated resource to explore protein kinase-substrate phosphorylation networks in mammals. Database : the journal of biological databases and curation. 2014, 2014 (0): bau034-10.1093/database/bau034.View ArticlePubMedGoogle Scholar
  58. Kohl M, Wiese S, Warscheid B: Cytoscape: software for visualization and analysis of biological networks. Methods Mol Biol. 2010, 696: 291-303. 10.1007/978-1-60761-987-1_18.View ArticleGoogle Scholar
  59. Li W, Bengtson MH, Ulbrich A, Matsuda A, Reddy VA, Orth A, et al: Genome-wide and functional annotation of human E3 ubiquitin ligases identifies MULAN, a mitochondrial E3 that regulates the organelle’s dynamics and signaling. Plos One. 2008, 3 (1): 10.1371/journal.pone.0001487. Article ID e1487Google Scholar
  60. Berndsen CE, Wolberger C: New insights into ubiquitin E3 ligase mechanism. Nat Struct Mol Biol. 2014, 21 (4): 301-7. 10.1038/nsmb.2780.View ArticlePubMedGoogle Scholar
  61. Zheng N, Wang P, Jeffrey PD, Pavletich NP: Structure of a c-Cbl-UbcH7 complex: RING domain function in ubiquitin-protein ligases. Cell. 2000, 102 (4): 533-9. 10.1016/S0092-8674(00)00057-X.View ArticlePubMedGoogle Scholar
  62. Rotin D, Kumar S: Physiological functions of the HECT family of ubiquitin ligases. Nat Rev Mol Cell Biol. 2009, 10 (6): 398-409. 10.1038/nrm2690.View ArticlePubMedGoogle Scholar
  63. Huibregtse JM, Scheffner M, Beaudenon S, Howley PM: A family of proteins structurally and functionally related to the E6-AP ubiquitin-protein ligase. Proc Natl Acad Sci U S A. 1995, 92 (11): 5249-10.1073/pnas.92.11.5249-a.View ArticlePubMedPubMed CentralGoogle Scholar
  64. Burr ML, Boname JM, Lehner PJ: Studying ubiquitination of MHC class I molecules. Methods Mol Biol. 2013, 960: 109-25. 10.1007/978-1-62703-218-6_9.View ArticlePubMedGoogle Scholar
  65. Cresswell P, Ackerman AL, Giodini A, Peaper DR, Wearsch PA: Mechanisms of MHC class I-restricted antigen processing and cross-presentation. Immunol Rev. 2005, 207: 145-57. 10.1111/j.0105-2896.2005.00316.x.View ArticlePubMedGoogle Scholar
  66. Chou CC, Lee KH, Lai IL, Wang D, Mo X, Kulp SK, et al: AMPK reverses the mesenchymal phenotype of cancer cells by targeting the Akt-MDM2-Foxo3a signaling axis. Cancer Res. 2014, 74 (17): 4783-95. 10.1158/0008-5472.CAN-14-0135.View ArticlePubMedPubMed CentralGoogle Scholar
  67. Lee JT, Gu W: The multiple levels of regulation by p53 ubiquitination. Cell Death Differ. 2010, 17 (1): 86-92. 10.1038/cdd.2009.77.View ArticlePubMedPubMed CentralGoogle Scholar
  68. Haupt Y, Maya R, Kazaz A, Oren M: Mdm2 promotes the rapid degradation of p53. Nature. 1997, 387 (6630): 296-9. 10.1038/387296a0.View ArticlePubMedGoogle Scholar
  69. Kubbutat MHG, Jones SN, Vousden KH: Regulation of p53 stability by Mdm2. Nature. 1997, 387 (6630): 299-303. 10.1038/387299a0.View ArticlePubMedGoogle Scholar
  70. Honda R, Tanaka H, Yasuda H: Oncoprotein MDM2 is a ubiquitin ligase E3 for tumor suppressor p53. Febs Lett. 1997, 420 (1): 25-7. 10.1016/S0014-5793(97)01480-4.View ArticlePubMedGoogle Scholar
  71. Fang SY, Jensen JP, Ludwig RL, Vousden KH, Weissman AM: Mdm2 is a RING finger-dependent ubiquitin protein ligase for itself and p53. J Biol Chem. 2000, 275 (12): 8945-51. 10.1074/jbc.275.12.8945.View ArticlePubMedGoogle Scholar
  72. Honda R, Yasuda H: Activity of MDM2, a ubiquitin Ligase, toward p53 or itself is dependent on the RING finger domain of the ligase. Oncogene. 2000, 19 (11): 1473-6. 10.1038/sj.onc.1203464.View ArticlePubMedGoogle Scholar
  73. Su MG, Lee TY: Incorporating substrate sequence motifs and spatial amino acid composition to identify kinase-specific phosphorylation sites on protein three-dimensional structures. BMC bioinformatics. 2013, 14 (Suppl 16): S2-10.1186/1471-2105-14-S16-S2.View ArticlePubMedPubMed CentralGoogle Scholar
  74. Lee TY, Chen YJ, Lu CT, Ching WC, Teng YC, Huang HD: dbSNO: a database of cysteine S-nitrosylation. Bioinformatics. 2012, 28 (17): 2293-5. 10.1093/bioinformatics/bts436.View ArticlePubMedGoogle Scholar
  75. Chen YJ, Lu CT, Su MG, Huang KY, Ching WC, Yang HH, et al: dbSNO 2.0: a resource for exploring structural environment, functional and disease association and regulatory network of protein S-nitrosylation. Nucleic Acids Res. 2015, 43 (Database issue): D503-11. 10.1093/nar/gku1176.View ArticlePubMedGoogle Scholar
  76. Chen YJ, Lu CT, Lee TY, Chen YJ: dbGSH: a database of S-glutathionylation. Bioinformatics. 2014, 30 (16): 2386-8. 10.1093/bioinformatics/btu301.View ArticlePubMedGoogle Scholar

Copyright

© Huang et al. 2015

This article is published under license to BioMed Central Ltd. Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.