Data-driven Generation of Perturbation Networks for Relative Binding Free Energy Calculations

Source code to reproduce the results of the accompanying preprint.

Starting from all publicly available relative binding free energy (RBFE) benchmarking datasets, we created a training domain ('RBFE-Space') that represents every perturbation present in these datasets by grafting it onto a common benzene scaffold. After running RBFE simulations for this new set, we used it to train ML models that predict the standard error of the mean (SEM) of the free energy across quintuplicate simulations. We then adapted LOMAP to use these predicted SEM values in place of its native LOMAP-score, which yields a data-driven method for generating RBFE networks.
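To illustrate the idea of building a network from predicted SEM values, here is a minimal sketch that selects the lowest-SEM perturbations connecting every ligand, using Kruskal's minimum spanning tree algorithm. It is a deliberately simplified stand-in and does not reproduce LOMAP's own network construction or the modifications made in this repository. The ligand names, the SEM values and the function name minimum_sem_spanning_tree are all hypothetical.

```python
def minimum_sem_spanning_tree(ligands, predicted_sem):
    """Return the lowest-SEM set of edges that connects all ligands (Kruskal)."""
    parent = {lig: lig for lig in ligands}

    def find(node):
        # Follow parent pointers to the component root, compressing the path.
        while parent[node] != node:
            parent[node] = parent[parent[node]]
            node = parent[node]
        return node

    network = []
    # Visit candidate perturbations from lowest to highest predicted SEM.
    for (a, b), sem in sorted(predicted_sem.items(), key=lambda item: item[1]):
        root_a, root_b = find(a), find(b)
        if root_a != root_b:  # the edge joins two previously separate components
            parent[root_a] = root_b
            network.append((a, b, sem))
    return network


# Hypothetical predicted SEM values (kcal/mol) for every ligand pair.
ligands = ["lig_a", "lig_b", "lig_c", "lig_d"]
predicted_sem = {
    ("lig_a", "lig_b"): 0.10,
    ("lig_a", "lig_c"): 0.45,
    ("lig_a", "lig_d"): 0.80,
    ("lig_b", "lig_c"): 0.15,
    ("lig_b", "lig_d"): 0.60,
    ("lig_c", "lig_d"): 0.20,
}

network = minimum_sem_spanning_tree(ligands, predicted_sem)
for a, b, sem in network:
    print(f"{a} -> {b}  predicted SEM = {sem:.2f}")
```

A spanning tree contains no cycles, so it offers no cycle-closure checks. Practical RBFE networks usually add redundant edges, which is one reason this sketch should not be read as a replacement for LOMAP.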

To reproduce, install the provided conda environment (conda_env.yml) on a Linux machine with at least one CUDA-capable GPU. The dependencies are specified in conda_env.yml.

Main steps to reproduce:

1. Run _01_SETUP_BENZENE_TRAINSET.ipynb to generate the list of transformations in RBFE-Space.
2. Run _02_SETUP_BSS_FOLDERS_TRAINSET.ipynb to set up RBFE input files using BioSimSpace (BSS). BSS can set up simulations for SOMD, Amber and GROMACS, or export the files needed for other RBFE implementations.
3. Run all RBFE simulations on a cluster.
4. Collect the SEM values from the simulations in the format of ANALYSIS/perturbation_networks/input/fepspace_sems_full_balanced.csv.
5. Sequentially run all Python scripts and notebooks in ANALYSIS/perturbation_networks/ to reproduce the majority of the RBFE network generation figures used in the paper.
6. The remaining figures can be reproduced using the notebooks in ANALYSIS/fepspace_vs_free_vs_bound/ and ANALYSIS/lambda_spacing/.
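For the SEM collection step, the following sketch shows one way to compute the SEM from five replicate free energy estimates per perturbation and write the results as CSV. The free energy values, the perturbation names and the column names "perturbation" and "sem" are assumptions made for illustration. The actual layout of fepspace_sems_full_balanced.csv should be checked in the repository before reusing this.

```python
import csv
import io
import math
import statistics


def standard_error_of_mean(values):
    """Sample standard deviation divided by the square root of the sample size."""
    return statistics.stdev(values) / math.sqrt(len(values))


# Hypothetical relative free energies (kcal/mol) from five replicate runs.
replicates = {
    "lig_a~lig_b": [1.0, 1.2, 0.8, 1.1, 0.9],
    "lig_b~lig_c": [-0.5, -0.1, -0.9, -0.3, -0.7],
}

sems = {name: standard_error_of_mean(vals) for name, vals in replicates.items()}

# Write to an in-memory buffer; swap in open(path, "w", newline="") for a file.
buffer = io.StringIO()
writer = csv.writer(buffer)
writer.writerow(["perturbation", "sem"])  # assumed column names
for name, sem in sems.items():
    writer.writerow([name, f"{sem:.4f}"])

print(buffer.getvalue())
```

statistics.stdev uses the n - 1 denominator, which is the usual choice when the five replicates are treated as a sample rather than the full population.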

Please note that some files (e.g. simulation outputs) are not included in this repository because of GitHub storage restrictions. Feel free to open an issue with any questions about this work.

Authors:

J. Scheen
M. Mackey
J. Michel

Repository: michellab/RBFENN
