Identifying overlapping functional modules of protein interactions graph

Introduction

In this project, I tried implementing one of the most efficient algorithms for large graphs clustering, the SR-MCL, in python and checked how it works on both big and small graphs. Due to this, the implemented code was then run on a real-world dataset, weighted yeast proteins interactome, to check how long it takes to reach an end and converges with a graph of size 1e3 nodes and 1e4 edges!

Methodology

The SR-MCL algorithm is an improvement to the MCL (Markov Clustering Algorithm). The MCL is a practical algorithm for clustering biological networks, for instance, clustering protein-protein interaction (PPI) networks to identify functional modules. But it has some limitations and problems with bridge nodes.

A few years later, the R-MCL (Regularized MCL) introduced by Yu-Keng Shih and Srinivasan Parthasarathy in 2010 solved some of the MCL problems and improved execution time. However, still, the problem with bridge nodes remains. Two years later, the same people came up with the new idea of SR-MCL. The main idea behind it was to make the R-MCL algorithm not focus on bridges by softening the weights of the canonical flow matrix after each step.

Result on a small random graph

After testing its performance to see how it works, I created a random graph with 50 nodes and some edges between them.

Then by running the algorithm on them, I colored each cluster a different color, But the bridge nodes colored yellow to be identified!

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
ClusteredRandomGraph.png		ClusteredRandomGraph.png
LICENSE		LICENSE
README.md		README.md
RandomGraph.png		RandomGraph.png
SR-MCL.ipynb		SR-MCL.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Identifying overlapping functional modules of protein interactions graph

Introduction

Methodology

Result on a small random graph

Reference

About

Releases

Packages

Languages

License

arabporr/SR-MCL_Graph_Clustering

Folders and files

Latest commit

History

Repository files navigation

Identifying overlapping functional modules of protein interactions graph

Introduction

Methodology

Result on a small random graph

Reference

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages