Skip to main content

Advertisement

Table 3 Top 10 tokens from pathway names in representative databases

From: PathNER: a tool for systematic identification of biological pathway mentions in the literature

  BioCarta KEGG PID Reactome WikiPathways PO
Rank Token Freq Token Freq Token Freq Token Freq Token Freq Token Freq
#1 pathway 6.09% metabolism 6.01% signalling 3.66% activation 1.68% signalling 5.38% pathway 23.34%
#2 signalling 4.63% pathway 3.63% pathway 2.56% signalling 1.63% pathway 4.55% signalling 9.52%
#3 regulation 1.79% signalling 3.50% activation 1.27% metabolism 1.18% metabolism 3.11% altered 3.02%
#4 cell 1.65% biosynthesis 3.25% events 1.23% synthesis 1.06% regulation 1.38% metabolic 2.88%
#5 role 1.06% cell 1.50% regulation 1.17% regulation 0.95% cell 1.31% mediated 1.69%
#6 receptor 1.06% acid 1.38% mediated 1.03% mediated 0.90% receptor 1.04% biosynthetic 1.39%
#7 activation 0.99% cancer 1.25% receptor 1.02% transport 0.86% activity 0.83% degradation 0.79%
#8 kinase 0.86% infection 1.00% cell 0.73% receptor 0.80% synthesis 0.83% drug 0.78%
#9 gene 0.73% disease 0.88% metabolism 0.64% complex 0.69% cycle 0.83% factor 0.78%
#10 cycle 0.66% degradation 0.75% synthesis 0.63% receptors 0.69% proteins 0.83% acid 0.72%