Regarding bounding box values during training and testing. #38

xiankgx · 2020-09-05T04:49:42Z

During training, we feed in batches of images of the same dimensions, 512 by default in the code. The distance from activated pixels to the rotated box boundaries is limited to the range [0, 512] due to the use of a sigmoid activation function. During testing, however, the model input is not restricted to the size of the images used during training; it is only resized so that its dimensions are divisible by 32. I'm wondering what effect this has when the test image dimensions are very different from the training image dimensions. Do you think it would be better to squash (resize) images to 512 during testing? @SakuraRiven
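To make the two test-time options concrete, here is a minimal sketch that only computes target sizes; it does not touch the model. The function names (`divisible_by_32_size`, `squashed_size`, `resize_ratios`) are hypothetical and not taken from the repository, and rounding down to the nearest multiple of 32 is an assumption about how "divisible by 32" is achieved.

```python
def round_down_to_multiple(value, base=32):
    """Largest multiple of `base` that is <= value, but never below `base`."""
    return max(base, (value // base) * base)


def divisible_by_32_size(width, height, base=32):
    """Option A (assumed behaviour): keep the size, only snap each side to a multiple of 32."""
    return round_down_to_multiple(width, base), round_down_to_multiple(height, base)


def squashed_size(width, height, side=512):
    """Option B: squash every image to side x side, ignoring aspect ratio."""
    return side, side


def resize_ratios(original, resized):
    """Per-axis scale factors (ratio_w, ratio_h) needed to map boxes back to the original image."""
    return resized[0] / original[0], resized[1] / original[1]


if __name__ == "__main__":
    original = (1280, 720)  # example (width, height)
    for name, fn in (("divisible by 32", divisible_by_32_size), ("squash to 512", squashed_size)):
        resized = fn(*original)
        print(name, resized, resize_ratios(original, resized))
```

For a 1280x720 input, option A gives 1280x704 with almost no distortion, while option B gives 512x512 with different horizontal and vertical ratios, so text is stretched as well as shrunk.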

SakuraRiven · 2020-09-07T08:21:22Z

I guess by "dimensions" you actually mean "scale"? The scales in the ICDAR2015 train and test sets are similar, so we can run inference directly. In fact, we can adjust the train scale and the test scale to align them for better performance.
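One way to align the test scale with the training scale could look like the sketch below. It is an illustration under assumptions, not the repository's code: `aligned_test_size` and `train_long_side` are hypothetical names, and using the longer image side as a proxy for text scale is a simplification.

```python
def aligned_test_size(width, height, train_long_side, base=32):
    """Rescale so the longer side matches `train_long_side` (assumed proxy for
    the training scale), keep the aspect ratio, then snap both sides down to
    a multiple of `base` so the network strides divide evenly."""
    scale = train_long_side / max(width, height)
    new_w = max(base, int(width * scale // base) * base)
    new_h = max(base, int(height * scale // base) * base)
    return new_w, new_h


def map_point_to_original(x, y, original, resized):
    """Map a predicted point from resized-image coordinates back to the original image."""
    ratio_w = resized[0] / original[0]
    ratio_h = resized[1] / original[1]
    return x / ratio_w, y / ratio_h


if __name__ == "__main__":
    original = (2560, 1440)  # a test image twice as large as the assumed training scale
    resized = aligned_test_size(*original, train_long_side=1280)
    print(resized)  # (1280, 704)
    print(map_point_to_original(640, 352, original, resized))  # (1280.0, 720.0)
```

Unlike squashing to a fixed square, this keeps the aspect ratio close to the original, so only the overall scale changes.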
