You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
less of a feature request and more of a question - or, at most, a documentation request.
I am trying to understand whether or why stratified bootstrapping is a valid method to obtain CIs. (Not a statistician here.) I have understood that the standard bootstrap is proven to approximate a valid CI under certain assumptions, but I can for the life of me not find any reference discussing or showing that this is still true for y-stratified sampling. Intuitively, it feels like this approach should be underestimating the width of the CI since we're keeping a core property of the test set (the class ratio) fixed. Is this not the case? Could you point out any references that discuss this, or provide an argument for why this is a valid method? (I looked through the pROC paper but there it is only stated that stratified sampling is used, with no further discussion or justification of this choice. The cited references also don't seem to discuss this, or did I overlook something?) Sorry, probably a trivial question for a statistician...
Thank you for an excellent package in any case!
The text was updated successfully, but these errors were encountered:
Hi there,
less of a feature request and more of a question - or, at most, a documentation request.
I am trying to understand whether or why stratified bootstrapping is a valid method to obtain CIs. (Not a statistician here.) I have understood that the standard bootstrap is proven to approximate a valid CI under certain assumptions, but I can for the life of me not find any reference discussing or showing that this is still true for y-stratified sampling. Intuitively, it feels like this approach should be underestimating the width of the CI since we're keeping a core property of the test set (the class ratio) fixed. Is this not the case? Could you point out any references that discuss this, or provide an argument for why this is a valid method? (I looked through the pROC paper but there it is only stated that stratified sampling is used, with no further discussion or justification of this choice. The cited references also don't seem to discuss this, or did I overlook something?) Sorry, probably a trivial question for a statistician...
Thank you for an excellent package in any case!
The text was updated successfully, but these errors were encountered: