-
Notifications
You must be signed in to change notification settings - Fork 94
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
replicating results of leaderboard #8
Comments
Dear Hussein, |
Dear Hussein, By the way, I wonder if I can change the dataset in bdlb or not? I mean I want to use bdlb for my own dataset instead of using the "Diabetic Retinopathy Diagnosis benchmark"? I am really looking forward to your answer. Maryam |
I spent a few dozen hours with this codebase. I was not able to replicate, or even come close to replicating, the results for either the real world or medium size datasets. FYI, there are a number of bugs in the code that have to be fixed to get things to work on the real world data set. But even after these are fixed, It's still difficult, if not impossible to replicate the results of the leaderboard. I believe the authors are aware of this, and there was some talk of fixing the code to make things easier to replicate. Cheers! |
Hi there, Thank you in advance for helping me. |
I've been trying to replicate the results of your leaderboard, but I found a number of things confusing (based on the "medium" data in the linked colab):
for deterministic:
(accuracy with pink the deterministic)
and for mc_dropout:
with numbers (first is mc_dropout and second is deterministic)
In your paper mc_dropout outperformed the deterministic approach by a quite a bit, I didn't expect the deterministic approach to perform so badly, these results seem a bit more sensible but not to this other extent, can you find the reason for this discrepancy?
for mc_dropout
here is a colab to replicate the above
also recommend updating your linked colab with the proper required packages as in it's current form it does not run
The text was updated successfully, but these errors were encountered: