This is the PyTorch implementation of the paper "SwinDepth: Unsupervised Depth Estimation using Monocular Sequences via Swin Transformer and Densely Cascaded Network" (ICRA 2023). [paper]
We provide pre-trained weights and evaluation code for a simple visualization of depth estimation results on the KITTI dataset.
Download the pre-trained weights from here and place them in the "./checkpoints/best" folder.
conda create -n ht_dcmnet python=3.8.5
conda activate ht_dcmnet
conda install pytorch torchvision cudatoolkit=11.1 -c pytorch -c nvidia
pip install -r requirements.txt
Our experiments were conducted with PyTorch 1.9.0, CUDA 11.2, Python 3.8.5, and Ubuntu 18.04. We use 4 NVIDIA RTX 3090 GPUs for training, but you can still run our code on GPUs with less memory by reducing the batch_size. A simple visualization can be done on a GPU with 3GB of memory, and CPU-only execution also works.
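After installing the environment, a quick sanity check (not part of the repository, just a convenience snippet) confirms that PyTorch sees your GPUs:

```python
import torch

# Print the installed PyTorch/CUDA versions and the visible GPUs.
print(f"PyTorch {torch.__version__}, CUDA {torch.version.cuda}")
print(f"CUDA available: {torch.cuda.is_available()}")
for i in range(torch.cuda.device_count()):
    print(f"GPU {i}: {torch.cuda.get_device_name(i)}")
```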
You can simply visualize the depth estimation results on some images from KITTI with:
python test_simple.py --image_path=./test_images/
You can check depth estimation results with other images from KITTI or your own datasets by adding test images to the folder named "test_images". You can run the code without a GPU by using the --no_cuda flag.
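Under the hood, this kind of test script typically loads the trained encoder/decoder, resizes the input image, predicts a disparity map, and colorizes it. Below is a minimal sketch of that flow; `load_model` and the `("disp", 0)` output key are hypothetical stand-ins following monodepth2-style conventions, not the repository's exact API:

```python
import numpy as np
import torch
import torch.nn.functional as F
from PIL import Image
import matplotlib.cm as cm

# Hypothetical helper standing in for the repository's model-loading code;
# assumed to return (encoder, depth_decoder, feed_height, feed_width).
from my_model import load_model

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
encoder, depth_decoder, feed_h, feed_w = load_model("./checkpoints/best", device)

# Preprocess: resize to the network's input resolution and scale to [0, 1].
img = Image.open("./test_images/example.jpg").convert("RGB")
orig_w, orig_h = img.size
x = torch.from_numpy(np.array(img.resize((feed_w, feed_h))) / 255.0)
x = x.permute(2, 0, 1).unsqueeze(0).float().to(device)

with torch.no_grad():
    # Assumption: the decoder returns a monodepth2-style dict of disparity maps.
    disp = depth_decoder(encoder(x))[("disp", 0)]
    disp = F.interpolate(disp, (orig_h, orig_w), mode="bilinear", align_corners=False)

# Colorize the disparity map and save it next to the input image.
d = disp.squeeze().cpu().numpy()
d_norm = (d - d.min()) / (d.max() - d.min() + 1e-8)
Image.fromarray((cm.magma(d_norm)[..., :3] * 255).astype(np.uint8)).save("example_disp.jpg")
```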
You can download the entire raw KITTI dataset by running:
wget -i splits/kitti_archives_to_download.txt -P /YOUR/DATA/PATH/
KITTI images are converted from the .png to the .jpg extension with this command for fast load times during training:
find /YOUR/DATA/PATH/ -name '*.png' | parallel 'convert -quality 92 -sampling-factor 2x2,1x1,1x1 {.}.png {.}.jpg && rm {}'
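If ImageMagick and GNU parallel are not available, the same conversion can be done with a short Python script. This is a sketch using Pillow; the quality and chroma-subsampling settings mirror the command above, and like that command it deletes the original .png files:

```python
import pathlib
from PIL import Image

data_root = pathlib.Path("/YOUR/DATA/PATH")

# Convert every .png under the KITTI root to .jpg and remove the original,
# matching the ImageMagick command above (quality 92, 4:2:0 subsampling).
for png in data_root.rglob("*.png"):
    Image.open(png).convert("RGB").save(
        png.with_suffix(".jpg"), quality=92, subsampling=2)
    png.unlink()
```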
The commands above result in the following data_path structure:
/YOUR/DATA/PATH
|----2011_09_26
|    |----2011_09_26_drive_0001_sync
|    |    |----.......
|    |    |----image_02
|    |    |    |----data
|    |    |    |    |----0000000000.jpg
|    |    |    |    |----.......
|    |    |    |----timestamps.txt
|    |    |----.......
|    |----.........
|----2011_09_28
|----.........
For training, you have to pre-train the Swin Transformer encoder on the ImageNet-1k dataset.
You can either simply download the ImageNet-pretrained encoder weights here, named '104checkpoint.pth', or train the Swin Transformer yourself with the official PyTorch code. Then, place the pretrained weights in the ./checkpoints/imagenet folder.
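For reference, loading an ImageNet checkpoint into an encoder usually looks like the sketch below. `SwinEncoder` is a hypothetical stand-in for the repository's encoder class, and the 'model' key follows the convention of the official Swin Transformer checkpoints:

```python
import torch

from networks import SwinEncoder  # assumption: hypothetical encoder class name

encoder = SwinEncoder()
ckpt = torch.load("./checkpoints/imagenet/104checkpoint.pth", map_location="cpu")
# Official Swin checkpoints nest the weights under a 'model' key.
state_dict = ckpt.get("model", ckpt)
# strict=False tolerates classifier-head keys that the depth encoder does not use.
missing, unexpected = encoder.load_state_dict(state_dict, strict=False)
print(f"missing keys: {len(missing)}, unexpected keys: {len(unexpected)}")
```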
The depth estimation network is trained by running:
python train.py --data_path=/YOUR/DATA/PATH --log_dir=./checkpoints --model_name=ht_dcmnet --num_epochs=40 --batch_size=12
Before evaluation, you should prepare ground truth depth maps by running:
python export_gt_depth.py --data_path /YOUR/DATA/PATH --split eigen
The following example command evaluates the best weights:
python evaluate_depth.py --data_path=/YOUR/DATA/PATH --load_weights_folder ./checkpoints/best/
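The evaluation reports the standard Eigen-split depth metrics. For reference, here is a minimal NumPy implementation of those metrics, following the definitions shared across monodepth-style codebases:

```python
import numpy as np

def compute_depth_errors(gt, pred):
    """Standard KITTI depth metrics, computed over valid ground-truth pixels."""
    # Threshold accuracies: fraction of pixels within 1.25, 1.25^2, 1.25^3.
    thresh = np.maximum(gt / pred, pred / gt)
    a1 = (thresh < 1.25).mean()
    a2 = (thresh < 1.25 ** 2).mean()
    a3 = (thresh < 1.25 ** 3).mean()

    abs_rel = np.mean(np.abs(gt - pred) / gt)          # absolute relative error
    sq_rel = np.mean(((gt - pred) ** 2) / gt)          # squared relative error
    rmse = np.sqrt(np.mean((gt - pred) ** 2))          # root mean squared error
    rmse_log = np.sqrt(np.mean((np.log(gt) - np.log(pred)) ** 2))

    return abs_rel, sq_rel, rmse, rmse_log, a1, a2, a3
```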
- Monodepth2 - https://github.com/nianticlabs/monodepth2
- timm - https://github.com/rwightman/pytorch-image-models
- mmsegmentation - https://github.com/open-mmlab/mmsegmentation