# Feature Re-calibration based Multiple Instance Learning for Whole Slide Image Classification

Official PyTorch implementation of "Feature Re-calibration based Multiple Instance Learning for Whole Slide Image Classification" (MICCAI 2022, arXiv).

Philip Chikontwe, Soo Jeong Nam, Heounjeong Go, Mee-jeong Kim, Hyun-Jung Sung, Sang Hyun Park
**Abstract:** Whole slide image (WSI) classification is a fundamental task for the diagnosis and treatment of diseases, but curation of accurate labels is time-consuming and limits the application of fully-supervised methods. To address this, multiple instance learning (MIL) is a popular method that poses classification as a weakly supervised learning task with slide-level labels only. While current MIL methods apply variants of the attention mechanism to re-weight instance features with stronger models, scant attention is paid to the properties of the data distribution. In this work, we propose to re-calibrate the distribution of a WSI bag (instances) by using the statistics of the max-instance (critical) feature. We assume that in binary MIL, positive bags have larger feature magnitudes than negatives; thus we can enforce the model to maximize the discrepancy between bags with a metric feature loss that models positive bags as out-of-distribution. To achieve this, unlike existing MIL methods that use single-batch training modes, we propose balanced-batch sampling to effectively use the feature loss, i.e., (+/-) bags simultaneously. Further, we employ a position encoding module (PEM) to model spatial/morphological information, and perform pooling by multi-head self-attention (PSMA) with a Transformer encoder. Experimental results on existing benchmark datasets show our approach is effective and improves over state-of-the-art MIL methods.
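At a high level, FRMIL re-calibrates a bag's instance features using the critical (max-scoring) instance and trains with a magnitude-based feature loss over balanced (+/-) batches. The snippet below is only a conceptual sketch of that idea: the function names, the shift operation, and the margin value are illustrative assumptions, not the exact code in this repository (see `models/mil_ss.py` and `train.py` for the actual implementation).

```python
import torch
import torch.nn.functional as F

def recalibrate(bag_feats, inst_scores):
    """Shift a bag's instance features by its critical (max-scoring) instance.

    bag_feats   : (N, D) instance features of one WSI bag
    inst_scores : (N,)   per-instance scores from an instance-level classifier
    Conceptual sketch only; the repository's implementation may differ.
    """
    crit = bag_feats[inst_scores.argmax()].detach()   # critical instance feature
    return bag_feats - crit                           # re-calibrated bag

def feature_magnitude_loss(pos_feat, neg_feat, margin=1.0):
    """Hinge-style loss pushing positive-bag feature magnitudes above negative ones.

    pos_feat / neg_feat : pooled features of a (+) and a (-) bag drawn in the
    same balanced batch. The margin is an illustrative hyper-parameter.
    """
    return F.relu(neg_feat.norm(p=2) - pos_feat.norm(p=2) + margin)
```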
## Requirements

- Ubuntu 20
- Python 3.7
- CUDA 11.0
- PyTorch 1.7.1
## Installation

Create and activate a conda environment, then install the dependencies:

```bash
conda create --name FRMIL python=3.7
conda activate FRMIL
pip install -r requirements.txt
```
## Repository Structure

```
FRMIL/
├── checkpoints/   # default model checkpoint save location (includes pre-trained weights)
├── common/        # common utilities and functions
├── configs/       # configuration files for pre-processing
├── datasets/      # split and library files for a given dataset
├── models/        # network definitions
├── dataloaders/   # data loaders
├── scripts/       # train/test scripts
├── wsi_tools/     # collection of pre-processing scripts
├── train.py       # main training script (requires a config file)
├── test.py        # main testing script
└── plots_cm16.py  # plots feature magnitudes w/ & w/o max-norm or using FRMIL (see the sketch below)
```
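For context, the feature magnitude visualized by `plots_cm16.py` can be thought of as the L2 norm of instance features, with the max-norm variant scaling each bag by its largest instance norm. The snippet below is a generic illustration of that computation, not the plotting script itself.

```python
import torch

def bag_magnitudes(bag_feats, max_norm=False):
    """Per-instance feature magnitudes of one bag (optionally max-normalized).

    bag_feats: (N, D) instance features, e.g. loaded from a saved bag with torch.load(...)
    Illustrative helper; plots_cm16.py defines the actual plotting logic.
    """
    mags = bag_feats.norm(p=2, dim=1)      # (N,) instance magnitudes
    if max_norm:
        mags = mags / mags.max()           # scale by the critical instance's magnitude
    return mags
```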
## Camelyon16 (CM16)

- Download the pre-computed features for Camelyon16: Link
- Unzip the features and modify the `DATA` variable in the scripts, i.e., `scripts/train/wsi_frmil.sh` and `scripts/test/wsi_frmil.sh`, to re-train from scratch (see other options).

### Training

Run `bash scripts/train/wsi_frmil.sh`

### Testing

- Pre-trained weights are stored in `checkpoints/cm16/`.

Run `bash scripts/test/wsi_frmil.sh`
### Model

- See `models/mil_ss.py`; a rough sketch of the multi-head attention pooling idea is shown below.
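The following is a minimal sketch of pooling by multi-head self-attention with a Transformer encoder, in the spirit of Set Transformer's PMA block: a learnable seed query attends over the encoder-contextualized instances to produce one bag embedding. The class name, layer sizes, and block composition are assumptions for illustration; the actual FRMIL model is defined in `models/mil_ss.py`.

```python
import torch
import torch.nn as nn

class AttnPool(nn.Module):
    """Sketch: pool a bag of instance features into a single bag embedding
    via a learnable seed query and multi-head attention (PMA-style)."""

    def __init__(self, dim=512, num_heads=8):
        super().__init__()
        self.seed = nn.Parameter(torch.randn(1, 1, dim))            # learnable query (1, 1, D)
        self.encoder = nn.TransformerEncoderLayer(d_model=dim, nhead=num_heads)
        self.attn = nn.MultiheadAttention(dim, num_heads)

    def forward(self, bag):
        # bag: (N, 1, D) instance features of one WSI, sequence-first layout
        ctx = self.encoder(bag)                                     # contextualize instances
        query = self.seed.expand(-1, ctx.size(1), -1)               # (1, 1, D)
        pooled, weights = self.attn(query, ctx, ctx)                # (1, 1, D), (1, 1, N)
        return pooled.squeeze(0), weights
```

For example, `AttnPool()(torch.randn(1000, 1, 512))` returns a `(1, 512)` bag embedding together with the per-instance attention weights.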
## Pre-processing (CM16)

- Download the entire Camelyon16 WSI dataset (CM16).
- Pre-processing employs the `wsi_tools` modules and configuration files (see `./config/cm16_tools.yml`).
- Modify the files in `/datasets/wsi/cm16` with your actual data paths for all classes (example files included).
- Modify the paths for each step in `cm16_tools.yml`, i.e., the global save location for extracted WSI tissue masks, etc. (see `./config/cm16_tools.yml`).
- Run:

```bash
python wsi_tools/xml2mask.py --config /path/to/config
python wsi_tools/sample_spot.py --config /path/to/config
```
Note: repeat for each class type, i.e., {normal, tumor}, and also modify the save paths in the sample-spot configuration section. `sample_spot.py` saves a library file of all patch locations for patch extraction (e.g., `tumor.pth` and `normal.pth`).
- Once patch locations have been sampled, run:

```bash
python wsi_tools/create_lib.py --config /path/to/config
```

This combines the library files for each class into a single file.
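Conceptually, the merge is just loading the per-class library files and saving one combined object. The keys and output name below are hypothetical placeholders; the real library format is defined by `wsi_tools/sample_spot.py` and `create_lib.py`.

```python
import torch

# Hypothetical library layout: one dict per class with slide paths, patch
# coordinates and labels. The actual keys used by this repository may differ.
normal = torch.load('normal.pth')
tumor = torch.load('tumor.pth')

combined = {
    'slides': list(normal['slides']) + list(tumor['slides']),
    'coords': list(normal['coords']) + list(tumor['coords']),
    'labels': [0] * len(normal['slides']) + [1] * len(tumor['slides']),
}
torch.save(combined, 'cm16_lib.pth')  # hypothetical output name
```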
- Extract the patches. Run:

```bash
python wsi_tools/patch_gen.py --split 'train'
python wsi_tools/patch_gen.py --split 'test'
```

Ensure the library root and patch save locations are modified in `patch_gen.py`.
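For reference, patch extraction from a WSI at given coordinates typically looks like the generic OpenSlide sketch below; the coordinate convention, level, and patch size are assumptions, and `wsi_tools/patch_gen.py` may differ in its details.

```python
import os
from openslide import OpenSlide

def save_patches(slide_path, coords, out_dir, level=0, size=256):
    """Read and save square patches from a WSI at the given level-0 coordinates.

    Generic OpenSlide sketch; not necessarily how wsi_tools/patch_gen.py does it.
    """
    os.makedirs(out_dir, exist_ok=True)
    slide = OpenSlide(slide_path)
    for i, (x, y) in enumerate(coords):
        patch = slide.read_region((x, y), level, (size, size)).convert('RGB')
        patch.save(os.path.join(out_dir, f'{i}_{x}_{y}.png'))
    slide.close()
```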
- Create the WSI features (instance bags). Run:

```bash
python wsi_tools/compute_features.py --config /path/to/config
```

- Modify the type of features (imagenet, simclr). Employing SimCLR features requires training a model on the extracted patches; use `train_simclr.py` and `scripts/train/wsi_simclr.sh`. A rough sketch of generic ImageNet feature extraction is shown below.
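The snippet below sketches ImageNet-pretrained feature extraction for one slide's patch folder, saving the resulting bag as a `.pth` tensor. The backbone choice (ResNet-18), transforms, and file naming are assumptions; `wsi_tools/compute_features.py` and its configuration define the actual behavior.

```python
import os
import torch
from PIL import Image
from torchvision import models, transforms

# ImageNet-pretrained backbone with the classifier head removed (assumed ResNet-18).
backbone = models.resnet18(pretrained=True)
backbone.fc = torch.nn.Identity()
backbone.eval()

preprocess = transforms.Compose([
    transforms.Resize(256),
    transforms.CenterCrop(224),
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225]),
])

@torch.no_grad()
def slide_to_bag(patch_dir, out_path):
    """Encode every patch of one slide and save the (N, D) bag tensor."""
    feats = []
    for name in sorted(os.listdir(patch_dir)):
        img = preprocess(Image.open(os.path.join(patch_dir, name)).convert('RGB'))
        feats.append(backbone(img.unsqueeze(0)).squeeze(0))
    torch.save(torch.stack(feats), out_path)   # e.g. one .pth bag per WSI
```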
- Optionally, use the DSMIL single-scale SimCLR-trained model to extract features, i.e., download the weights (x20) and place them in `/checkpoints/dsmil_models/model-v0.pth`. Use this model with the option `dsmil_cm16` in the compute-features section of the configuration.
- Train your new model. The extracted bags will be placed in the desired save location following the format below:

```
CM16/
├── train/
│   ├── 0/  XX_WSI_ID_1.pth, XX_WSI_ID_2.pth, ..., XX_WSI_ID_N.pth
│   └── 1/  XX_WSI_ID_1.pth, XX_WSI_ID_2.pth, ..., XX_WSI_ID_N.pth
└── test/
    ├── 0/  XX_WSI_ID_1.pth, XX_WSI_ID_2.pth, ..., XX_WSI_ID_N.pth
    └── 1/  XX_WSI_ID_1.pth, XX_WSI_ID_2.pth, ..., XX_WSI_ID_N.pth
```
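Given that layout, a bag dataset can be as simple as the sketch below; the class name and the assumption that each `.pth` file holds an `(N, D)` instance-feature tensor are illustrative, and the repository's actual loaders live in `dataloaders/`.

```python
import os
import torch
from torch.utils.data import Dataset

class BagDataset(Dataset):
    """Loads one (features, label) bag per WSI from the CM16/{train,test}/{0,1}/ layout."""

    def __init__(self, root, split='train'):
        self.items = []
        for label in (0, 1):
            folder = os.path.join(root, split, str(label))
            for name in sorted(os.listdir(folder)):
                self.items.append((os.path.join(folder, name), label))

    def __len__(self):
        return len(self.items)

    def __getitem__(self, idx):
        path, label = self.items[idx]
        feats = torch.load(path)          # assumed (N, D) instance features
        return feats, torch.tensor(label)
```

With such a dataset, the balanced (+/-) batches described in the paper can be formed by a sampler that draws positive and negative bags in equal proportion (see `dataloaders/` for the loaders actually used).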
- Run `bash scripts/train/wsi_frmil.sh`
- The test results will be recorded in `./checkpoints/results_cm16.txt`.
## Acknowledgements

Our implementation builds upon several existing publicly available codebases:
- Weakly Supervised Temporal Action Loc (AAAI)
- SetTransformer (ICML)
- DSMIL (CVPR)
- TransMIL (NeurIPS)
- RENET (ICCV)
## Citation

If you find this code useful, please consider citing our paper:

```bibtex
@inproceedings{chikontwe2022feature,
  title={Feature Re-calibration Based Multiple Instance Learning for Whole Slide Image Classification},
  author={Chikontwe, Philip and Nam, Soo Jeong and Go, Heounjeong and Kim, Meejeong and Sung, Hyun Jung and Park, Sang Hyun},
  booktitle={International Conference on Medical Image Computing and Computer-Assisted Intervention},
  pages={420--430},
  year={2022},
  organization={Springer}
}
```