Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

About the generation config of GPT-J #10

Open
AOZMH opened this issue Feb 4, 2024 · 0 comments
Open

About the generation config of GPT-J #10

AOZMH opened this issue Feb 4, 2024 · 0 comments

Comments

@AOZMH
Copy link

AOZMH commented Feb 4, 2024

Hi, thanks for the amazing project!

Up to now, this repo does not contain the evaluation scripts on GPT-J. Noticing that the MQUAKE dataset is built and filtered such that all single-hop facts are 100% answerable by GPT-J,I suppose that aligning with the generation configuration of GPT-J in your experiments is crutial in reproducing your results or conducting further investigations.

Hence, could you please shed some lights on the config you used over GPT-J? (I understand the limited bandwith, but maybe some references to the conventional settings you used might also be welcomed :)

Thanks.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant