Table 3 Statistics of the ten 7-mers that were identified to be important for high-affinity 12-mers through Round 1.

From: Modeling DNA affinity landscape through two-round support vector regression with weighted degree kernels

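| Rank | 7-mer | Freq. | MIN | MAX | Average | Standard Deviation |
|------|---------|-------|-------|--------|---------|--------------------|
| 1 | ATGACTC | 419 | 8.49 | 409.08 | 39.31 | 43.04 |
| 2 | TGACTCA | 990 | 8.49 | 567.81 | 56.66 | 54.61 |
| 3 | GTGACTC | 446 | 9.83 | 648.79 | 74.46 | 96.84 |
| 4 | TGAGTCA | 453 | 14.52 | 303.87 | 63.66 | 54.64 |
| 5 | TATGACT | 224 | 8.74 | 896.78 | 112.54 | 190.25 |
| 6 | GACTCAT | 392 | 8.49 | 963.28 | 167.26 | 254.46 |
| 7 | ATGAGTC | 504 | 15.60 | 975.18 | 276.01 | 292.93 |
| 8 | TGACTAA | 327 | 14.67 | 821.67 | 192.02 | 199.69 |
| 9 | TACTCAC | 847 | 9.65 | 975.05 | 437.92 | 336.43 |
| 10 | GACTAAT | 808 | 14.67 | 984.67 | 528.74 | 300.75 |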

  1. The seven columns list, respectively, the rank of importance, the nucleotide sequence, the number of 12-mer sequences that contain this 7-mer, and the minimum, maximum, mean, and standard deviation of the Kd values over all such 12-mers. The six 7-mers in bold are the ones whose Kd values are less dispersed than those of the rest.
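
The per-7-mer statistics described in the note can be sketched as follows. This is a minimal illustration, not the authors' code. The function name `kmer_stats`, the dictionary layout, and the toy Kd values are hypothetical. It assumes a 12-mer "contains" a 7-mer when the 7-mer occurs as a substring on the given strand, and it uses the sample standard deviation; the source does not say whether the sample or population form was used.

```python
from statistics import mean, stdev


def kmer_stats(kd_by_12mer, kmer):
    """Summarize Kd over all sequences that contain `kmer` as a substring.

    Returns None when no sequence contains the k-mer.
    """
    values = [kd for seq, kd in kd_by_12mer.items() if kmer in seq]
    if not values:
        return None
    return {
        "freq": len(values),   # number of 12-mers containing the k-mer
        "min": min(values),
        "max": max(values),
        "mean": mean(values),
        # Assumption: sample standard deviation (needs at least two values).
        "std": stdev(values) if len(values) > 1 else 0.0,
    }


# Toy values for illustration only; they are NOT taken from the paper.
toy = {
    "AATGACTCATTT": 10.0,
    "CCATGACTCAGG": 20.0,
    "GGGTGACTCACC": 30.0,
    "ACGTACGTACGT": 900.0,
}

stats = kmer_stats(toy, "TGACTCA")
print(stats)
```

With the real data, each row of the table would correspond to one call of such a function over the full set of measured 12-mers.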