Reproducibility is essential to reliable scientific discovery in
high-throughput experiments. In this work we propose a unified approach to
measure the reproducibility of findings identified from replicate experiments
and identify putative discoveries using reproducibility. Unlike the usual
scalar measures of reproducibility, our approach creates a curve, which
quantitatively assesses when the findings are no longer consistent across
replicates.
Large-scale statistical analysis of data sets associated with genome
sequences plays an important role in modern biology. A key component of such
statistical analyses is the computation of $p$-values and confidence bounds for
statistics defined on the genome. Currently such computation is commonly
achieved through ad hoc simulation measures. The method of randomization, which
is at the heart of these simulation procedures, can significantly affect the
resulting statistical conclusions.