Rejection sampling versus random sampling. Average publication distributions for 10,000 random samples taken from the population of protein-coding genes, rand
(grey) and 10,000 randomised samples taken to match the publication count distribution of the HIV sample, rand
(blue). The rand
samples match the HIV publication distribution with a p-value of 0.43 (chi-squared).