Colloquium: Statistical challenges with next-generation DNA sequencing
by Professor Terry Speed
Abstract: For the past decade, microarrays have been the assays of choice for a wide range of high-throughput biological assays. Recent improvements in the efficiency, quality, and cost of DNA sequencing are prompting biologists rapidly to abandon microarrays in favor of so-called next-generation sequencers, e.g., Applied Biosystems' SOLiD, Helicos BioSciences' HeliScope, Illumina's Genome Analyzer, and Roche's 454 Life Sciences sequencing
systems. These high-throughput sequencing technologies have already been applied to studying genome-wide transcription levels (mRNA-Seq), transcription factor binding sites (ChIP-Seq), chromatin structure, DNA methylation status, copy number and
several other tasks. Such studies have overcome many longstanding limitations of microarray-based studies, but of course new technologies raise familiar as well as novel statistical and computational
challenges. In this talk I'll outline some of the topics that are currently interesting me in this area.