CIFAR-10 training settings #15

fugokidi · 2021-11-20T05:45:14Z

First of all, thank you for the interesting work.
I was experimenting the one with patch size 1 and kernel size 9 with CIFAR-10 with the following training settings:

--model tiny_convmixer
 -b 64 -j 8 
--opt adamw 
--epochs 200 
--sched onecycle 
--amp 
--input-size 3 32 32 
--lr 0.01 
--aa rand-m9-mstd0.5-inc1 
--cutmix 0.5 
--mixup 0.5 
--reprob 0.25 
--remode pixel 
--num-classes 10
--warmup-epochs 0
--opt-eps 1e-3
--clip-grad 1.0
--scale 0.75 1.0
--weight-decay 0.01
--mean 0.4914 0.4822 0.4465
--std 0.2471 0.2435 0.2616

I could get only 95.89%. I am supposed to get 96.03% according to Table 4 in the paper.
Can you please let me know any setting I missed? Thank you again.

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CIFAR-10 training settings #15

CIFAR-10 training settings #15

fugokidi commented Nov 20, 2021

CIFAR-10 training settings #15

CIFAR-10 training settings #15

Comments

fugokidi commented Nov 20, 2021