Protein Folding Pathways

Yogesh A. girdhar

girdhy(AT)cs(DOT)rpi(DOT)edu

I developed the following simulator to simulate protein folding pathways. Given a protein in its native configuration, the simulator generates thousands of possible folding pathways(from different initial configuration) which the protein takes while it is folding. This program has applications in molecular drug design and studying diseases which are caused due to protein mis-folding.

The simulator is based on Probabilistic Roadmaps (PRM) method of motion planning. Its working can be divided into the following 3 steps:


Sample Generation

Samples are generated by the following procedure:




Graph Generation

Once we have the samples (c-space), we now try to connect them as following:



Distance Matrix Error is used as the distance function above. Another possible function could be euclidean distance in c-space.

DME is calculate as:

DME = Sum( | a_ij - b-ij | ) / (N*(N-1)/2)

a_ij is the distance between atom i and atom j in configuration a.
b_ij is the distance between atom i and atom j in configuration b.
N is the number of atoms




Path Generation

One we have the Graph(list of edges and vertices), we simply run Dijkstra's single source shortest path algorithm to get all possible paths leading to the native. Dijkstra's algorithms minimizes the edge weights; as a result we get paths with lowest energy climb.


Some Results

I ran the simulator on 2GB1 protein fragment with the following parameters:

Here are some randomly chosen output paths(you need VMD to see these):

1 2 3 4 5
Some animated GIFs(they are big): 1 2
Note: The above animated GIF's are in stereoscopic. Try to combine the two images into one and you will see a 3d version of the image!

References