tensorrt_yolov5 💯

This project aims to produce tensorrt engine for yolov5, and calibrate the model for INT8.

Env

Ubuntu 18.04
Tesla T4
CUDA 10.2
Driver 450.80.02
tensorrt 7.0.0.11

Run method

1. generate wts

cd tensorrt_yolov5
git clone -b v4.0 https://github.com/ultralytics/yolov5.git
cd ./yolov5
cp ../gen_wts.py .
wget https://github.com/ultralytics/yolov5/releases/download/v4.0/yolov5s.pt
// or you can download it from url
python gen_wts.py
// a file 'yolov5s.wts' will be generated

2. modify parameters

You can change some key parameter in yolov5.cpp just like below.

#define USE_FP16 // set USE_INT8 or USE_FP16 or USE_FP32
#define DEVICE 0  // GPU id
#define NMS_THRESH 0.45
#define CONF_THRESH 0.5
#define BATCH_SIZE 1
bool save_txt = false;  // save detection result into txt files
bool save_img = true;  // whether save the image results

Notice

if you set USE_INT8 model, you must creat calibration_dataset, and put your dataset image in it. At least about 500 images can generate calibtate table.

save_txt means you can save detect result of every image, so that you can calculate the mAP of the model with mAP

3. generate engine

// put yolov5s.wts into ./weights
mkdir weights
cp ./yolov5/yolov5s.wts ./weights

// update CLASS_NUM in yololayer.h if your model is trained on custom dataset
mkdir build
cd build
cmake ..
make
./yolov5 -s

After this step, you can get tensorrt engine named yolov5s4.0_batch1.engine according your batch size.

4. test images

You can set test image folder for below command.

./yolov5 -d [image folder]

It will generate test result in ./experiment/images folder.

Result

1. speed result

	BatchSize	Latency,ms	Throughput (1000/latency*batchsize)	Latency Speedup (TRT latency/original latency)	Throughput Speedup (TRT throughput/original throughput)
PyTorch	1	20	50
	8	17	470
	16	18	888
	32	19	1684
TensorRT	1	4.9	204	0.245	4.08x
	8	4.1	1951	0.241	4.14x
	16	3.8	4210	0.211	4.73x
	32	2.2	14545	0.115	8.63x

2. mAP result

We test the mAP of modle in COCO_VAL_2014 dataset with 40k+ images. The result is shownd as below.
You can calculate your model result with method of https://github.com/Cartucho/mAP#create-the-predicted-objects-files, the only thing you should do is save infer result by set save_txt to true in yolov5.cpp file.

	FP32	FP16
PyTorch	36.8	-
TensorRT	39.34	39.34

TODO

Support for yolov5-v4.0 m/l/x
Support for mAP test
Comparison for tensorrt acceleration effect
Run in deepstream project
QAT will increase in the future

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.gitignore		.gitignore
CMakeLists.txt		CMakeLists.txt
README.md		README.md
calibrator.cpp		calibrator.cpp
calibrator.h		calibrator.h
common.hpp		common.hpp
cuda_utils.h		cuda_utils.h
gen_wts.py		gen_wts.py
logging.h		logging.h
utils.h		utils.h
yololayer.cu		yololayer.cu
yololayer.h		yololayer.h
yolov5.cpp		yolov5.cpp

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

tensorrt_yolov5 💯

Env

Run method

1. generate wts

2. modify parameters

3. generate engine

4. test images

Result

1. speed result

2. mAP result

TODO

Reference

Contributor

About

Releases

Packages

Languages

XiaoPengZong/tensorrt_yolov5

Folders and files

Latest commit

History

Repository files navigation

tensorrt_yolov5 💯

Env

Run method

1. generate wts

2. modify parameters

3. generate engine

4. test images

Result

1. speed result

2. mAP result

TODO

Reference

Contributor

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages