UMM Discovery

Overview

UMM Discovery is a fully unsupervised deep learning method that clusters cellular images with similar phenotypes, based solely on their intensity values. It is a modification of the Deep Clustering framework developed by Caron et al. (2019). Based on the findings of Godinez et al. (2017), we use an updated version of their Deep Neural Network (DNN) architecture, called Multi-Scale-Net. During training, UMM Discovery applies two batch correction methods, Typical Variation Normalization (TVN) (Ando et al., 2017) and ComBat (Johnson et al., 2007), which significantly improve the results and yield more representative embeddings.
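To give a feel for the kind of batch correction mentioned above, the following is a minimal sketch of a PCA whitening step fitted on control embeddings, which is the first stage of TVN as it is commonly described. It is not the code in correction.py: the function names, the `eps` parameter, and the synthetic data are all assumptions made for illustration, and any later alignment stage is omitted.

```python
import numpy as np

def fit_whitening(controls, eps=1e-6):
    """Fit a PCA whitening transform on control embeddings (rows = samples)."""
    mean = controls.mean(axis=0)
    centered = controls - mean
    cov = centered.T @ centered / (len(controls) - 1)
    eigvals, eigvecs = np.linalg.eigh(cov)
    # Scale each principal axis to unit variance; eps guards tiny eigenvalues.
    transform = eigvecs / np.sqrt(eigvals + eps)
    return mean, transform

def apply_whitening(embeddings, mean, transform):
    """Apply a transform fitted on controls to any set of embeddings."""
    return (embeddings - mean) @ transform

# Hypothetical data: 500 control and 200 treated embeddings of dimension 8.
rng = np.random.default_rng(0)
mixing = np.diag(np.linspace(0.5, 3.0, 8)) + 0.1 * rng.normal(size=(8, 8))
controls = rng.normal(size=(500, 8)) @ mixing
treated = rng.normal(loc=1.0, size=(200, 8)) @ mixing

mean, transform = fit_whitening(controls)
controls_w = apply_whitening(controls, mean, transform)
treated_w = apply_whitening(treated, mean, transform)
```

After this step the control embeddings have approximately identity covariance, so variation that is typical among controls no longer dominates distances between treated samples.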

Link to resources

UMM Discovery makes use of:

DeepCluster, by Facebook, Inc, available under a Creative Commons Attribution-Noncommercial license.
ComBat by brent
Multi-Scale-Net from Godinez et al. (2017)

Prerequisites and dependencies

All Requirements of DeepCluster
Multicore-TSNE
HDBScan
patsy
anndata
sklearn
seaborn
umap
matplotlib

Installation

The easiest way to install all dependencies is with conda.

$ conda env create -f environment.yml

ComBat

Clone the ComBat GitHub repository from brentb and copy its combat.py file into the UMM Discovery directory.

Data

The method can be applied to any cellular image dataset. To do so, change how the images are loaded in my_dataset.py. If the number of input channels differs, the Multi-Scale-Net input shape may also need to be changed (see model.py). In the paper, UMM Discovery is evaluated on the BBBC021 cellular dataset, available from the Broad Bioimage Benchmark Collection.
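As a rough illustration of what adapting the image loading can involve, here is a minimal sketch of a map-style dataset that stacks one file per channel into a single array. It does not reproduce the interface of my_dataset.py: the class name, the `load_image` callable, and the file names are hypothetical, and a stand-in loader replaces real image reading so the sketch runs without data.

```python
import numpy as np

class MultiChannelImages:
    """Map-style dataset: each item stacks one file per channel into (C, H, W)."""

    def __init__(self, channel_paths, load_image):
        # channel_paths: one tuple of file paths per image, one path per channel.
        # load_image: callable mapping a path to a 2-D array (hypothetical;
        # substitute a reader suited to your own image format).
        self.channel_paths = channel_paths
        self.load_image = load_image

    def __len__(self):
        return len(self.channel_paths)

    def __getitem__(self, index):
        channels = [
            np.asarray(self.load_image(path), dtype=np.float32)
            for path in self.channel_paths[index]
        ]
        return np.stack(channels, axis=0)

def fake_loader(path):
    """Stand-in loader that returns a constant 4x4 image for any path."""
    return np.full((4, 4), float(len(path)))

# Hypothetical three-channel image made of three separate files.
dataset = MultiChannelImages(
    [("a_dapi.tif", "a_tubulin.tif", "a_actin.tif")], fake_loader
)
image = dataset[0]
n_channels = image.shape[0]
```

The leading dimension of each item is the channel count, which is the value the network input would have to match if your data has a different number of channels.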

Running UMM Discovery

Start a Jupyter session on your local machine or GPU cluster:

$ jupyter notebook

Then open the notebook UMM_discovery_BBBC021.ipynb.

Within the notebook, change the parameters (e.g. dataset path and output path) to suit your setup, then run the cells.

Reference

If you use this code, please cite the following paper:

Rens Janssens, Xian Zhang, Audrey Kauffmann, Antoine de Weck, Eric Y. Durand. "Fully unsupervised deep mode of action learning for phenotyping high-content cellular images" doi: https://doi.org/10.1101/2020.07.22.215459

@article {Janssens2020.07.22.215459,
    author = {Janssens, Rens and Zhang, Xian and Kauffmann, Audrey and de Weck, Antoine and Durand, Eric Y.},
    title = {Fully unsupervised deep mode of action learning for phenotyping high-content cellular images},
    elocation-id = {2020.07.22.215459},
    year = {2020},
    doi = {10.1101/2020.07.22.215459},
    URL = {https://www.biorxiv.org/content/early/2020/07/23/2020.07.22.215459},
    eprint = {https://www.biorxiv.org/content/early/2020/07/23/2020.07.22.215459.full.pdf},
    journal = {bioRxiv}
}

