Geoff Webb: Finding the Real Patterns
Pattern discovery typically explores a massive space of potential patterns
to identify those that satisfy some user-specified set of criteria. This process
entails a huge risk (in many cases a near certainty) that many patterns
will be false discoveries. These are patterns that satisfy the specified
criteria with respect to the sample data but do not satisfy those criteria
with respect to the population from which those data are drawn.
This talk discusses the problem of false discoveries and presents
techniques for avoiding them.
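
To make the risk concrete, here is a minimal sketch in Python. It is an illustration only and is not taken from the talk. It builds a sample in which no pattern holds in the population (every feature is generated independently of the target), tests each candidate pattern with a one-sided Fisher exact test, and counts how many pass the criterion "p <= alpha". The function names, the data sizes, and the choice of test are all assumptions made for this example. The Bonferroni correction is used only as one familiar safeguard; the abstract does not say which techniques the talk covers.

```python
import random
from math import comb


def fisher_one_sided(a, b, c, d):
    """One-sided Fisher exact p-value for the 2x2 table [[a, b], [c, d]].

    Returns the hypergeometric probability of at least `a` co-occurrences
    given the table's row and column totals.
    """
    n = a + b + c + d
    row1, col1 = a + b, a + c
    upper = min(row1, col1)
    tail = sum(comb(row1, k) * comb(n - row1, col1 - k)
               for k in range(a, upper + 1))
    return tail / comb(n, col1)


def count_discoveries(n_rows=200, n_patterns=500, alpha=0.05, seed=0):
    """Count patterns that look significant in pure noise.

    Hypothetical setup: each candidate pattern is "feature co-occurs with
    target more often than chance". Features are drawn independently of
    the target, so every pattern that passes is a false discovery.
    Returns (count at alpha, count at the Bonferroni level alpha / n_patterns).
    """
    rng = random.Random(seed)
    target = [rng.random() < 0.5 for _ in range(n_rows)]
    p_values = []
    for _ in range(n_patterns):
        feature = [rng.random() < 0.5 for _ in range(n_rows)]
        a = sum(f and t for f, t in zip(feature, target))
        b = sum(f and not t for f, t in zip(feature, target))
        c = sum(t and not f for f, t in zip(feature, target))
        d = n_rows - a - b - c
        p_values.append(fisher_one_sided(a, b, c, d))
    uncorrected = sum(p <= alpha for p in p_values)
    corrected = sum(p <= alpha / n_patterns for p in p_values)
    return uncorrected, corrected


if __name__ == "__main__":
    uncorrected, corrected = count_discoveries()
    print("passed at alpha:", uncorrected)
    print("passed at alpha / n_patterns:", corrected)
```

Every pattern counted in the first number satisfies the criterion on the sample but not in the population, which is the definition of a false discovery given above. Dividing alpha by the number of patterns tested is a blunt correction: it bounds the chance of any false discovery, at the cost of reduced power to detect real patterns.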