---

* News

Seminar

Permutation Methods for Model Validation

Malik Magdon-Ismail


September 15, 2010
Lally 102 - 11:00 a.m. to 12:00 p.m.

Abstract:


We give a permutation approach to validation and model selection in data mining, machine learning and statistical inference. We define permutation complexity measures in learning, and give uniform bounds on the out-sample error, similar to a VC-style bounds. We also show concentration of measure for permutation complexity, which directly means that the methods are algorithmically efficient. The theoretical novelty is that we develop methods for analyzing *dependent* sampling schemes (e.g. sampling without replacement), and show how to get concentration of measure in such a dependent sampling setting. We demonstrate the practical value of these techniques with extensive model selection experiments in real as well as synthetic data.

Hosted by: Dr. Elliot Anshelevich (x6491)

Last updated: September 13, 2010


---

---