Molecule and drug Discovery Application For Three Diseases

Main phases that i worked on for this projects:

Data Preparation

In this phase, we meticulously handled the data. The key steps involved are:

Data Import: We imported the ChEMBL datasets relevant to our three target diseases.
Data Cleaning: Extensive cleaning was performed to ensure data quality and consistency.
Feature Engineering: Relevant features were extracted from the datasets to facilitate modeling.

The modeling phase had two distinct objectives:

We explored two deep learning algorithms: Variational Autoencoder (VAE) and MolGann.
After thorough evaluation, VAE was selected as the preferred algorithm and applied to all three datasets.

We employed various machine learning algorithms to predict two key aspects:
- PIC50 Value and Bioactivity Class: Predicting the PIC50 value for bioactivit and Classifying compounds based on their bioactivity.
The best-performing model for each dataset was identified and saved for deployment using Streamlit.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
1_🏠_HomePage.py		1_🏠_HomePage.py
Acetylcholinesterase.txt		Acetylcholinesterase.txt
Aromatase Data.txt		Aromatase Data.txt
Design.png		Design.png
IBD Data.txt		IBD Data.txt
IBD_model1.pkl		IBD_model1.pkl
IBD_model2.pkl		IBD_model2.pkl
README.md		README.md
acetylcholinesterase_model1.pkl		acetylcholinesterase_model1.pkl
acetylcholinesterase_model2.pkl		acetylcholinesterase_model2.pkl
acetylcholinesterase_model3.pkl		acetylcholinesterase_model3.pkl
aromatase_model1.pkl		aromatase_model1.pkl
aromatase_model2.pkl		aromatase_model2.pkl
bioactivity_prediction_app.ipynb		bioactivity_prediction_app.ipynb
checkpoint		checkpoint
descriptor_list.csv		descriptor_list.csv
descriptor_list3.csv		descriptor_list3.csv
descriptor_list4.csv		descriptor_list4.csv
descriptors_output.csv		descriptors_output.csv
descriptors_output2.csv		descriptors_output2.csv
example_acetylcholinesterase.txt		example_acetylcholinesterase.txt
exemple aromatase.txt		exemple aromatase.txt
exemple inflammatory.txt		exemple inflammatory.txt
gen.png		gen.png
homeimage.png		homeimage.png
image2.png		image2.png
imagegen.png		imagegen.png
logo.png		logo.png
pred.png		pred.png
requirements.txt		requirements.txt