The following is adapted from Scene-Graph-Benchmark.
- Download the VG images part1 (9 Gb) part2 (5 Gb). Extract these images to the file
datasets/vg/VG_100K
. - Download the scene graphs and extract them to
datasets/vg/VG-SGG-with-attri.h5
.
- Download the GQA images Full (20.3 Gb). Extract these images to the file
datasets/gqa/images
. - In order to achieve a representative split like VG150, we use the protocol provided by SHA-GCL. You can download the annotation file from this link, and put all three files to
datasets/gqa/
.