Code and data for the final project of the Computational Structural Biology for Lead Discovery course (MSc in Computational Biology-UPM).
The aim of this project was to create a computational pipeline for the identification of drug candidates for JAK-family kinases, which are a group of proteins involved in inflamatory diseases such as rheumatoid arthritis or psoriasis.
The lead discovery pipeline proposed in this work is divided in three modules, which must be performed in chronological order:
- Ligand based virtual screening (LBVS)
- Pharamcophore based virtual screening (PBVS)
- Structure based virtual screening (PBVS)
All the data and Jupyter notebook's scripts used for each module in the present work can be found in their correspondent folders.
The structural and chemical data used for the project has been retrieved from several biochemical databases using the web server or their API.
- Drug candidates: ChEMBL and PubChem
- LBVS compounds: SwissModel
- Protein structures: PDB
- Pharmacophore: Pharmit and PharmaGist
Ana Solbas and Natalia García