Skip to content

Operon model to predict operon membership between two genes within prokaryotes

Notifications You must be signed in to change notification settings

makman09/operon_model_prediction_for_prokaryotes

Repository files navigation

Operon membership prediction from intergenic distance of two genes

We are using an operon model on the Agro Bacterium genome. We use the operon model to predict whether two genes are likely within the same operon based of the distance between the two genes and use that prediction to infer functional relationship between two genes if they're are predicted within the same operon.

We used a naive bayesian model built off of positive control and negative control.

We used a naive bayesian algorithm to build our operon model.

alt-text

Our posterior probability is p(h1) is 0.60 which is referenced from a paper linked below discussing operons across prokaryotes in which the average prior probability of an operon pair is 0.60.

Operons Across Prokaryotes link

Positive Control

It's a set of distances between genes within the same operon from the E.coli genome.

Negative control

It's a set of distances between genes at the border of neighboring operons from the E.coli genome.

Threshold

I picked a threshold of 0.60 for predicting operon membership in order to assume very little false positives as possible. I also wanted a threshold that has high sensitivity while having good accuracy based off the curves generated below.

alt-text alt-text alt-text alt-text

About

Operon model to predict operon membership between two genes within prokaryotes

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published