Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Regarding bounding box values during training and testing. #38

Open
xiankgx opened this issue Sep 5, 2020 · 1 comment
Open

Regarding bounding box values during training and testing. #38

xiankgx opened this issue Sep 5, 2020 · 1 comment

Comments

@xiankgx
Copy link

xiankgx commented Sep 5, 2020

During training, we fit in batches of images of the same dimensions for training, 512 in code by default. During training the position of activated pixels to the rotated box boundaries is limited to the range [0, 512] due to the use of a sigmoid activation function. However, during testing, the model input is not restricted to the size of images used during training, instead, only resized to be divisible by 32. I'm wondering what's the effect of this when the test image dimensions are very different than the training image dimensions. Do you think it's better to squashed resized images to 512 during testing? @SakuraRiven

@SakuraRiven
Copy link
Owner

I guess "dimensions" is actually "scale" ? The scales in ICDAR2015 train/test are similar so we could directly inference. In fact, we can adjust the train scale and test scale to align them for better performance.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants