-
Notifications
You must be signed in to change notification settings - Fork 52
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
lower mIoU than the result of the paper #25
Comments
Note that the labels in NYUv2 are not perfect and the results may vary a bit. Therefore, we repeated each training several times and used a grid search for the learning rate ranging from 0.005 to 0.04 for NYUv2. Finally, we picked the highest value. |
Thanks for your information. I thought you used grid search and set the best hyperparameters as default values, right? That's why I feel confused. |
The best hyperparameters vary depending on the actual network configuration (network depth, block type, fusion, upsampling, ...). For the default hyperparameters, we picked the values that were best for most of the runs. We have run the default configuration once again on one of our GPU nodes used for the paper (2080TI, CUDA 10.1.243, driver 460.39) during the night and got a mIoU of 49.93 at epoch 452. |
Hi Daniel Thank you for your reply. Both of us used the same network configuration (network depth, block type, fusion, upsampling, ...) since we run default configuration. Please find the attached output files. I try to run multiple times now and see if there is a performance boost. |
I just checked the results of another runs - epoch 257. It seems that the cuda version might affect the results. However the best result within epoch 500 is still around 49.0 or 49.3%, which is less than 49.9% and 50.3%. I attached the output files output_49.zip The only difference is the cuda version :
Do you use other tricks which are not included in the source code or paper? |
Hi,
I used the .yml file to create conda environment and default setting to train NUYv2 and the result 47. 48 is much lower than the result of the paper 50.3.
Does anyone get a similar result as follows?
mIoU test kv1: 0.4748908361165817
mIoU test: 0.4748908361165817
0.4748908361165817
What do you think is the reason for this?
Do you use the maximum or average in the paper ? Do you use different hyperparameters?
# or use the default arguments python train.py \ --dataset nyuv2 \ --dataset_dir ./datasets/nyuv2 \ --pretrained_dir ./trained_models/imagenet \ --results_dir ./results
The text was updated successfully, but these errors were encountered: