[Request] Please generate CodeLlama-7b-Python-hf-q4f16_1-metal.so #16

Open
Shekhars opened this issue Aug 27, 2023 · 6 comments

Comments

@Shekhars

It's missing from the latest update. Could you please add it so that M1/M2 Macs can run CodeLlama? Thanks

@sangelone
Contributor

sangelone commented Aug 27, 2023

The 13B would also be helpful 😄

@Sing-Li
Contributor

Sing-Li commented Aug 28, 2023

And a vulkan.dll too, please: many users are on Windows and want to try CodeLlama 🙏

@sangelone
Contributor

Not sure if it fully resolves this request, but I at least built and added CodeLlama-13b-Python-hf-q4f16_1-metal in a PR: #18

@Sing-Li
Contributor

Sing-Li commented Aug 28, 2023

Thanks @sangelone! How did you do it? Which instructions did you follow, and did you have to modify any config or source files? Some of us may be able to replicate it and help build some of the other missing targets. 🙏

@sangelone
Contributor

I just followed the docs using the appropriate model weights from HF on an M2 machine. It took a few hours to get all the dependencies installed and working in a Conda env, but then it was pretty straightforward to do the build itself, maybe half an hour more at that point.

@Sing-Li
Contributor

Sing-Li commented Aug 29, 2023

Thanks @sangelone! Got the Windows DLL for 13b done. The 34b model's raw weights are 300GB in size 😱 I don't have enough fast NVMe storage to work on it 😞
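
For anyone helping to build the remaining targets, here is a rough sketch of how the library names in this thread appear to be put together: model name, quantization code, then target, joined with hyphens. The helper `lib_filename` and its extension table are hypothetical and are inferred only from the names mentioned above (`...-metal.so` and a Vulkan `.dll` on Windows); this is not part of any build tool, and other targets or platforms may use different extensions.

```python
# Hypothetical helper: composes a prebuilt library filename from its parts.
# The pattern is inferred from names in this thread, not from official docs.

# Assumption: only the two target/extension pairs mentioned in the thread.
EXTENSIONS = {
    "metal": "so",    # Apple Silicon (M1/M2) build
    "vulkan": "dll",  # Windows build
}


def lib_filename(model: str, quantization: str, target: str) -> str:
    """Return '<model>-<quantization>-<target>.<ext>' for a known target."""
    if target not in EXTENSIONS:
        raise ValueError(f"unknown target: {target!r}")
    return f"{model}-{quantization}-{target}.{EXTENSIONS[target]}"


if __name__ == "__main__":
    # The 7B library requested in this issue.
    print(lib_filename("CodeLlama-7b-Python-hf", "q4f16_1", "metal"))
    # The 13B Windows build.
    print(lib_filename("CodeLlama-13b-Python-hf", "q4f16_1", "vulkan"))
```

Keeping the extension table explicit makes an unsupported target fail loudly instead of producing a filename that nothing will load.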
