Hello!

I would like to ask a question regarding the image quantization. I don't really understand why you divide the bounding box coordinates by `max_image_size` (= 512) instead of by the `patch_image_size`.

Assuming a bounding box `[x1, y1, x2, y2]` in an image with width `w` and height `h`, it seems to me that each coordinate should be quantized as, e.g., `x1 / w * (num_bins - 1)`. For example, for a bounding box `[120, 200, 150, 220]` with `w = 600` and `h = 800`, the quantized `x1` would be `120 / 600 * (num_bins - 1)`.

Could you also explain the choice behind the value of `max_image_size`?

Thanks :)
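The quantization scheme proposed in the question can be sketched as follows. This is a minimal illustration of normalizing by the image's own width and height, not the repository's actual implementation; the function name `quantize_box` is made up for this example.

```python
def quantize_box(box, w, h, num_bins=1000):
    """Map a pixel-space box [x1, y1, x2, y2] to discrete bin indices
    by normalizing x-coordinates by the image width and y-coordinates
    by the image height (the scheme proposed in the question)."""
    x1, y1, x2, y2 = box
    return [
        round(x1 / w * (num_bins - 1)),
        round(y1 / h * (num_bins - 1)),
        round(x2 / w * (num_bins - 1)),
        round(y2 / h * (num_bins - 1)),
    ]

# The example from the question: box [120, 200, 150, 220] in a 600x800 image.
print(quantize_box([120, 200, 150, 220], w=600, h=800))  # [200, 250, 250, 275]
```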
Maybe it's just a coordinate normalization operation applied consistently in both training and prediction.
However, when `bin2coord` is used, the decoded coordinates can fall outside the image, since `task.cfg.max_image_size >= task.cfg.patch_image_size`.
The code in question: `OFA/utils/transforms.py`, lines 240 to 243 at commit `a36b91c`.