Does reducing batch size affect convergence? #1

Open
pinakinathc opened this issue Dec 20, 2020 · 1 comment

Comments

@pinakinathc

Due to an 11 GB GPU memory limit, I am forced to reduce the batch size for image2text from 128 to 8.
After training for a while, I was unable to get a converged model that replicates the results in the paper.

Hence, my question is:

  1. How much GPU memory did you use?
  2. Does using a smaller batch size lead to failure in training?
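
One common workaround when GPU memory forces a small per-step batch is gradient accumulation, which keeps the effective batch size at 128 while only holding micro-batches of 8 in memory. Below is a minimal PyTorch sketch assuming a generic training loop; the `model`, `optimizer`, `loader`, and `loss_fn` names are placeholders and not taken from this repository.

```python
# Gradient-accumulation sketch (PyTorch) -- a generic training loop,
# not this repo's actual code.
import torch

MICRO_BATCH = 8    # per-step batch that fits in 11 GB
ACCUM_STEPS = 16   # 8 * 16 = effective batch size of 128

def train_one_epoch(model, optimizer, loader, loss_fn, device):
    model.train()
    optimizer.zero_grad()
    for step, (images, captions) in enumerate(loader):
        images, captions = images.to(device), captions.to(device)
        loss = loss_fn(model(images), captions)
        # Scale the loss so the accumulated gradient matches a single
        # update over MICRO_BATCH * ACCUM_STEPS samples
        # (valid for losses averaged per sample).
        (loss / ACCUM_STEPS).backward()
        if (step + 1) % ACCUM_STEPS == 0:
            optimizer.step()
            optimizer.zero_grad()
```

Note that this matches large-batch gradients only for per-sample-averaged losses, and batch-norm statistics are still computed over the micro-batch, so it may not exactly reproduce large-batch training behavior.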
@pinakinathc
Author

@s-mahajan can you upload the trained checkpoints as mentioned in the README.md file?
