School Seminars and Colloquia

Data prediction competitions: far more than just a bit of fun

Miscellaneous Seminar

by Anthony Goldbloom

Institution: Founder of
Date: Tue 27th July 2010
Time: 5:45 PM
Location: Staff Tea Room/Russell Love Theatre, Richard Berry Building, The University of Melbourne

Abstract: SSAI – Victorian Branch Meeting

5:45pm – light refreshments in the Staff Tea Room, Richard Berry Building, University of Melbourne.

6:15pm – seminar in the Russell Love Theatre, Richard Berry Building, University of Melbourne.


Kaggle is a global platform for data prediction competitions, allowing researchers and companies to post their problems and have them scrutinised by the world's statisticians and computer scientists.
By exposing a problem to a wide range of analysts and techniques, data prediction competitions turn out to be a great way to get the most out of a dataset, given its inherent noise and richness. For example, Kaggle has been running a bioinformatics competition requiring participants to pick markers in HIV's genetic sequence that predict a change in viral load (a measure of the severity of infection). Within a week and a half, the best submission had already outdone the best methods in the scientific literature.

