
Table 1 A summary of protein interaction and GO annotation data used in the training and testing of the hub classifiers.

From: The use of Gene Ontology terms for predicting highly-connected 'hub' nodes in protein-protein interaction networks

| Training/Testing set | E. coli | S. cerevisiae | D. melanogaster | H. sapiens | Total of 4 species |
|---|---|---|---|---|---|
| # of proteins | 2860 | 5397 | 6935 | 6592 | 21784 |
| # of hubs (10% of total proteins) | 286 | 535 | 628 | 620 | 2069 |
| # of non-hubs (90% of total proteins) | 2574 | 4862 | 6307 | 5972 | 19715 |
| # of protein interactions | 13888 | 37167 | 19994 | 19115 | 90164 |
| Minimum # of interactions per hub | 20 | 33 | 16 | 13 | |
| # of proteins with at least one GO term | 1378 | 4738 | 5931 | 5097 | 17144 |
| # of proteins without any GO term | 1482 | 659 | 1004 | 1495 | 4640 |
| % of proteins with at least one GO term | 48.18% | 87.79% | 85.52% | 77.32% | 78.70% |
| # of different GO terms – process | 30 | 41 | 48 | 49 | 50 |
| # of different GO terms – function | 21 | 37 | 38 | 37 | 40 |
| # of different GO terms – component | 4 | 27 | 31 | 29 | 35 |
| # of different GO terms – total | 55 | 105 | 117 | 115 | 125 |

  1. The top part of the table lists the proteins, hubs, and protein interactions for each of the four species, and the bottom part lists GO annotation coverage and the number of unique GO terms in each annotation category.
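
The table defines hubs as roughly the top 10% of proteins and reports a minimum number of interactions per hub, which reads like a degree cutoff. The sketch below shows one way such a cutoff could be computed from an interaction list. It is an illustration under stated assumptions, not the authors' procedure: the function name `select_hubs`, its parameters, and the tie rule (proteins with the same degree are admitted or rejected together, so the hub set may fall slightly short of 10%) are all hypothetical. The toy network is invented for the example.

```python
from collections import Counter


def select_hubs(interactions, all_proteins, hub_fraction=0.10):
    """Return (hubs, min_degree) for a degree-based hub cutoff.

    Assumption (not taken from the source): groups of proteins with equal
    degree are admitted together, highest degree first, as long as the hub
    set stays within `hub_fraction` of all proteins.
    """
    # Treat interactions as undirected and drop duplicate pairs.
    pairs = {frozenset(p) for p in interactions}

    degree = Counter()
    for pair in pairs:
        if len(pair) != 2:
            continue  # skip self-interactions in this sketch
        for protein in pair:
            degree[protein] += 1

    max_hubs = int(len(all_proteins) * hub_fraction)

    hubs, min_degree = set(), None
    for d in sorted(set(degree.values()), reverse=True):
        group = {p for p, k in degree.items() if k == d}
        if len(hubs) + len(group) > max_hubs:
            break  # admitting this tie group would exceed the hub budget
        hubs |= group
        min_degree = d
    return hubs, min_degree


# Toy network: 20 proteins, so at most 2 hubs at a 10% fraction.
toy_proteins = [f"P{i}" for i in range(20)]
toy_interactions = [
    ("P0", "P1"), ("P0", "P2"), ("P0", "P3"),  # P0 has degree 3
    ("P4", "P5"), ("P4", "P6"),                # P4 has degree 2
    ("P7", "P8"), ("P7", "P9"),                # P7 has degree 2
    ("P1", "P0"),                              # duplicate of ("P0", "P1")
]
toy_hubs, toy_min_degree = select_hubs(toy_interactions, toy_proteins)
print(toy_hubs, toy_min_degree)
```

In the toy network, P4 and P7 are tied at degree 2, and admitting both would give three hubs against a budget of two, so only P0 is selected and the minimum hub degree is 3. A rule of this kind would be consistent with hub counts that sit at or just below 10% of the proteins (for example 535 of 5397 in S. cerevisiae), but the table alone does not confirm how ties were handled.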