My research interests include designing algorithms for genetic datasets and data mining.
My main focus recently has been on developing methods that identify tagging SNPs in genome-wide datasets. This much smaller set of SNPs can then be used to predict a new individual's genotypes at all the SNPs in the original dataset. To put it simply, it's a compression problem. For further details please see Tagging SNPs.
Another project, which worked really well but sadly did not culminate in a publication, was the efficient computation of pairwise Pearson correlation coefficients for SNPs. The code is optimized for speed. For details and to download the software please see LD.
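To give a flavour of the computation, here is a minimal sketch of pairwise Pearson correlation across SNPs. It is not the optimized software mentioned above: the function name `pairwise_pearson`, the use of NumPy, and the input layout (individuals in rows, SNPs in columns, genotypes coded as 0/1/2 allele counts) are all assumptions made for illustration.

```python
import numpy as np

def pairwise_pearson(genotypes):
    """Return the SNP-by-SNP Pearson correlation matrix.

    genotypes: array-like of shape (n_individuals, n_snps),
    assumed here to hold 0/1/2 allele counts (illustrative coding).
    """
    g = np.asarray(genotypes, dtype=float)
    # Centre each SNP column on its mean.
    centered = g - g.mean(axis=0)
    # Euclidean norm of each centred column.
    norms = np.sqrt((centered ** 2).sum(axis=0))
    # SNPs with no variation have zero norm; divide by 1 instead to
    # avoid a division by zero, then mark those entries undefined below.
    safe = np.where(norms > 0, norms, 1.0)
    z = centered / safe
    # One matrix product yields every pairwise correlation at once.
    r = z.T @ z
    constant = norms == 0
    r[constant, :] = np.nan
    r[:, constant] = np.nan
    return r

if __name__ == "__main__":
    toy = [[0, 0, 2],
           [1, 1, 1],
           [2, 2, 0]]
    print(pairwise_pearson(toy))
```

Squaring the entries of the returned matrix gives r squared, a quantity commonly used to summarize linkage disequilibrium between pairs of SNPs. Doing the work as a single matrix product, rather than looping over pairs, is one simple way to get speed; a genome-wide dataset would likely need blocking or a windowed approach, since the full matrix grows quadratically with the number of SNPs.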
Previously I worked extensively on parallel algorithms, in particular for data mining on distributed systems. I have also dabbled a little in the sensor network domain.