---

* News

Colloquia

Comparative Analysis of Molecular Interaction Networks

Mehmet Koyuturk
Department of Computer Science
Purdue University

Monday, April 9, 2007

Emergence of high-throughput experiments and resulting databases capture relationships and interactions between biomolecules. These interactions enable modeling and analysis of a cell from a systems perspective - generally using network models. In this talk, we focus on development of computational tools and statistical models for comparative analysis of molecular interaction networks. These tools target understanding of functional modularity in the cell by extracting novel information from massive amounts of interaction data, through integration of cellular organization, functional hierarchy, and evolutionary conservation.
We first discuss the problem of identifying conserved sub-networks in a collection of interaction networks belonging to diverse species. The main algorithmic challenges here stem from the NP-hard subgraph isomorphism problem that underlies frequent subgraph discovery. Three decades of research into theoretical aspects of this problem has highlighted the futility of syntactic approaches, thus motivating use of semantic information. Using a biologically motivated homolog contraction technique for relating proteins across species, we render this problem tractable. We experimentally show that the proposed method can be used as a pruning heuristic that accelerates existing techniques significantly, as well as a standalone tool that conveys significant biological insights at near-interactive rates.
With a view to understanding the conservation and divergence of modular substructures, we also develop network alignment techniques, grounded in theoretical models of network evolution. Through graph-theoretic modeling of evolutionary events in terms of matches, mismatches, and duplications, we reduce the alignment problem to a graph optimization problem and develop heuristics to solve this problem efficiently. In order to assess the statistical significance of the patterns identified by our algorithms, we probabilistically analyze the distribution of highly connected and conserved subgraphs in random graphs. Our methods and algorithms are implemented on various platforms and tested extensively on a comprehensive collection of molecular interaction data, illustrating their effectiveness in terms of providing novel biological insights as well as computational efficiency.

This is joint work with Yohan Kim, Shankar Subramaniam (University of California, San Diego), Wojciech Szpankowski, and Ananth Grama (Purdue University) and is supported by the National Institutes of Health.


---

---