Open Source Data Mining Workshop
on Frequent Pattern Mining Implementations

in conjunction with ACM SIGKDD 2005

Keynote Talk

Geoff Webb: Finding the Real Patterns

Pattern discovery typically explores a massive space of potential patterns to identify those that satisfy some user-specified set of criteria. This process entails a huge risk (in many cases a near certainty) that many of the patterns found will be false discoveries: patterns that satisfy the specified criteria with respect to the sample data but do not satisfy those criteria with respect to the population from which those data are drawn. This talk discusses the problem of false discoveries and presents techniques for avoiding them.
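
To give a rough sense of why the risk approaches certainty, the sketch below works through the standard multiple-testing arithmetic. It is an illustration only, not a technique taken from the talk: it assumes every candidate pattern is assessed with an independent significance test at level alpha and that none of the candidates holds in the population. The function names are hypothetical, and the Bonferroni adjustment is shown as one well-known correction, not necessarily one the talk presents.

```python
# Illustrative arithmetic only. Assumptions (not from the abstract):
# each candidate pattern gets an independent significance test at level
# `alpha`, and no candidate pattern is real in the population.

def expected_false_discoveries(num_tests: int, alpha: float) -> float:
    """Expected number of patterns that pass the test by chance alone."""
    return num_tests * alpha


def familywise_error_rate(num_tests: int, alpha: float) -> float:
    """Probability that at least one pattern passes the test by chance."""
    return 1.0 - (1.0 - alpha) ** num_tests


def bonferroni_threshold(num_tests: int, alpha: float) -> float:
    """Per-test level that keeps the family-wise error rate at or below alpha."""
    return alpha / num_tests


if __name__ == "__main__":
    alpha = 0.05
    for num_tests in (1, 100, 10_000, 1_000_000):
        print(
            f"tests={num_tests:>9}  "
            f"expected false discoveries={expected_false_discoveries(num_tests, alpha):.2f}  "
            f"P(at least one)={familywise_error_rate(num_tests, alpha):.4f}  "
            f"Bonferroni per-test level={bonferroni_threshold(num_tests, alpha):.2e}"
        )
```

Under these assumptions the number of chance discoveries grows linearly with the size of the search space, which is why a search over millions of candidate patterns needs either a much stricter per-test level or some other safeguard.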
