Table 3 Top 10 tokens from pathway names in representative databases

From: PathNER: a tool for systematic identification of biological pathway mentions in the literature

 

| Rank | BioCarta Token | BioCarta Freq | KEGG Token | KEGG Freq | PID Token | PID Freq | Reactome Token | Reactome Freq | WikiPathways Token | WikiPathways Freq | PO Token | PO Freq |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| #1 | pathway | 6.09% | metabolism | 6.01% | signalling | 3.66% | activation | 1.68% | signalling | 5.38% | pathway | 23.34% |
| #2 | signalling | 4.63% | pathway | 3.63% | pathway | 2.56% | signalling | 1.63% | pathway | 4.55% | signalling | 9.52% |
| #3 | regulation | 1.79% | signalling | 3.50% | activation | 1.27% | metabolism | 1.18% | metabolism | 3.11% | altered | 3.02% |
| #4 | cell | 1.65% | biosynthesis | 3.25% | events | 1.23% | synthesis | 1.06% | regulation | 1.38% | metabolic | 2.88% |
| #5 | role | 1.06% | cell | 1.50% | regulation | 1.17% | regulation | 0.95% | cell | 1.31% | mediated | 1.69% |
| #6 | receptor | 1.06% | acid | 1.38% | mediated | 1.03% | mediated | 0.90% | receptor | 1.04% | biosynthetic | 1.39% |
| #7 | activation | 0.99% | cancer | 1.25% | receptor | 1.02% | transport | 0.86% | activity | 0.83% | degradation | 0.79% |
| #8 | kinase | 0.86% | infection | 1.00% | cell | 0.73% | receptor | 0.80% | synthesis | 0.83% | drug | 0.78% |
| #9 | gene | 0.73% | disease | 0.88% | metabolism | 0.64% | complex | 0.69% | cycle | 0.83% | factor | 0.78% |
| #10 | cycle | 0.66% | degradation | 0.75% | synthesis | 0.63% | receptors | 0.69% | proteins | 0.83% | acid | 0.72% |