Best Practice for Including Reward-Bench with Local Modifications in Our Repo #198
Comments
Having used the new setup recently, it's all somewhat in flux, but I am personally trying to avoid copying or depending on a fork. My first thought would be:
Yeah I mostly agree with @sanderland, but I understand if it's a research project / lesson on using open source code. Specifics:
Operationally, it's not that easy to extract changes from a flattened repo. Normally, you want to make a new fork and re-apply them. I don't really have much time to do that, but would love to see the additions. I'm sure Claude/ChatGPT can whip up some quick bash scripts for creating git diffs from a specific commit. Let me know what you think @hank0316
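For anyone in the same situation, here is a minimal sketch of the "extract the changes from a flattened copy" step, written in Python rather than bash. It assumes you have a clean checkout of upstream reward-bench at the commit the copy was taken from (the thread only mentions v0.1.0-dev, so the exact commit is something you would need to identify yourself) and the flattened, modified copy in a second directory. The function name `tree_diff` and its parameters are hypothetical, and the output is a plain unified diff meant for review, not necessarily a patch that `git apply` will accept as-is.

```python
import difflib
from pathlib import Path


def tree_diff(upstream_dir, modified_dir, suffixes=(".py",)):
    """Return one unified-diff string covering every changed, added, or removed file."""
    upstream, modified = Path(upstream_dir), Path(modified_dir)

    def rel_files(root):
        # Relative paths of all matching files, skipping anything under .git
        return {
            p.relative_to(root).as_posix()
            for p in root.rglob("*")
            if p.is_file()
            and p.suffix in suffixes
            and ".git" not in p.relative_to(root).parts
        }

    def read_lines(path):
        # A file missing on one side is treated as empty, so it shows up
        # in the diff as fully added or fully removed.
        if not path.is_file():
            return []
        return path.read_text(encoding="utf-8").splitlines(keepends=True)

    chunks = []
    for rel in sorted(rel_files(upstream) | rel_files(modified)):
        old = read_lines(upstream / rel)
        new = read_lines(modified / rel)
        # unified_diff yields nothing for identical files
        chunks.extend(
            difflib.unified_diff(old, new, fromfile=f"a/{rel}", tofile=f"b/{rel}")
        )
    return "".join(chunks)
```

A call such as `print(tree_diff("reward-bench-upstream", "our-repo/reward-bench"))` (both paths are placeholders) gives a per-file list of local modifications, which can then be re-applied by hand on a fresh fork and opened as PRs.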
Thank you guys @natolambert and @sanderland for the replies. My thoughts:
I appreciate your guidance on these modifications. Please let me know if there are specific procedures or additional insights required for integrating these changes. Thanks again for your time and help!
Yup @hank0316 opening PR(s) is best. I'll provide feedback from there.
Hi Nathan,
I’m currently preparing to release a new repository that contains the code used in my paper. As part of our experiments, we made some slight modifications to the reward-bench code (we're using the v0.1.0-dev version).
The changes include:
load_in_8bit
actually does, and I just want to load our RM in fp16
).
I'm reaching out to ask about the best practice for including reward-bench with our changes in our repo. At the moment, we’ve removed the .git directory and committed the entire reward-bench codebase to our repository.
Is there a better approach to incorporate reward-bench while maintaining our local modifications? Any advice would be much appreciated!
Thank you for your time and help!
Best regards,
Tzu-Han