Replies: 1 comment 1 reply
-
@Yuyn1234 did you try and verify it to be incorrect? it worked when I originally tested it |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I notice that there is a function named "no_weight_decay" which returns the "pos_embed" and "cls_token". It is obivous that weight decay should not be applied to "pos_embed" and "cls_token". But in the function named "add_weight_decay", you judge it by "name in skip_list" where name belongs to model.named_parameters() and skip_list is [pos_embed, cls_token]. I think it actually does not work because name will be "model. pos_embed" or something else, not "pos_embed".
Beta Was this translation helpful? Give feedback.
All reactions