motifFinder-Bioinformatics

This project involves developing a "motif finding" program and testing the program on a set of synthetic datasets.

The first step is to build a benchmark of the collection of synthetic datasets, and each dataset contains a set of DNA sequences, into which a "motif" has been "planted".

The second step is to write a program to read the dataset generated in the first step and find the "motif" planted in the first step. We employed the Gibbs sampling algorithm for implementing the motif finder.

Lastly, we evaluated the performance of the motif finder using three metrics:
(1) Relative Entropy: difference between the motif our algorithm found and the planted motif.
(2) Overlapping Sites: the correct sites our algorithm found.
(3) Running time: how long our algorithm takes to find the motif.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
README.md		README.md
evaluationOverlapping.py		evaluationOverlapping.py
generateDataset.py		generateDataset.py
motifFindingGibbs.py		motifFindingGibbs.py
relativeEntropy.py		relativeEntropy.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

motifFinder-Bioinformatics

About

Releases

Packages

Languages

chuankaizhao/motifFinder-Bioinformatics

Folders and files

Latest commit

History

Repository files navigation

motifFinder-Bioinformatics

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages