Vision Datasets Statistics

In computer vision, we have a lot of datasets that are used for different tasks (i.e, classification, detection, segmentation, tracking...). These datasets in a form of images and/or videos are used to train and test models.

Here we take a look at the statistics for datasets used for the 3 main tasks in computer vision (Detection, Segmentation, and Tracking). While attempting to do so we provide converters to standardized formats and PyTorch data loader implementations for specific datasets.

We are going to look at the statistics of each dataset and perform a comparison in the end. To do this we will need to load each dataset and extract it's statistics programmatically. We will also need to visualize the statistics in a way that is easy to understand. Last but not least the statistics will be compared.

Task based datasets lookup table

Task	Detection Based	Instance Segmentation	Multi Object Tracking	Video Instance Segmentation
Dataset	✓ COCO ✓ SkyData ✓ VisDrone ✓ KAIST ✓ VHR-10 ✓ DOTA ✓ VEDAI ✓ KITTI	✓ COCO ✓ SkyData ✓ VHR-10	✓ SkyData ✓ VisDrone-MOT ✓ MOT-17 ✓ MOT-20 ✓ DanceTrack ✓ TAO	✓ SkyData ✓ Youtube-VIS 2019 ✓ Youtube-VIS 2021

Getting Started

For more details on how to use the repo, please refer to the docs

# Get REPO
#1. clone and setup up the repo
!git clone https://github.com/ozerlabs-proxy/vision-datasets-
stats.git
#2. cd into the repo
cd vision-datasets-stats
#3.
#we require conda to be installed
#alternatively you can any other env
conda env create -f environment.yaml
conda activate VisionStats
#4. follow along the notebooks

Stats

There is a number of stats about datasets that can be generated. These may vary depending on the task, for most we will derive the following:

-	Detection	Tracking
Stats	✓ Number of images ✓ Number of objects ✓ Number of classes ✓ Number of instances per class ✓ Average number of instances per image ✓ Average number of instances per class	✓ Number of videos ✓ Number of tracks ✓ Number of categories ✓ average track length ✓ average number of tracks per video ✓ average number of tracks per category ✓ video lengths ✓ min-max resolutions ✓ areas stats ...

Contribution

We are open to contributions, if you have a dataset that you would like to add to the list, please do so by following the steps in the contribution guide.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
bases		bases
notebooks		notebooks
resources		resources
scripts		scripts
utils		utils
.gitignore		.gitignore
ReadMe.md		ReadMe.md
generate_stats.py		generate_stats.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Vision Datasets Statistics

Task based datasets lookup table

Getting Started

Stats

Contribution

About

Releases

Packages

Languages

ozerlabs-proxy/vision-datasets-stats

Folders and files

Latest commit

History

Repository files navigation

Vision Datasets Statistics

Task based datasets lookup table

Getting Started

Stats

Contribution

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages