Sackware-current now use python 3.12.7
, but i did not upgrade my system so far.
So I have not test yet script for new python version.
Script is based on GratisStudio HowTo for windows. It is tested on Slackware64 current systems without issues. If you found a bug please open an issue.
python3.11--> {numpy,sentencepiece,gguf}
GPT4All or ollama (if you use ollama gguf models can work whith zed IDE)
git lfs
========= OPTIONAL:=============
Vulkan SDK (AMD GPU Support)
Cuda toolkit (Nvidia GPU Support)
********************************
Normally all other needs should be by default in your distro, if not..when script fail read what is missing and install from your distro package manager.
- Make script executable if not...
chmod +x quantizing_ai_models.sh
- Find the LL model you want from https://huggingface.co
Copy ONLY the provided url for git clone, example:
- Now exexute script in terminal following by model url example:
./quantizing_ai_models.sh https://huggingface.co/Cadenza-Labs/dolphin-llama3-8B-sleeper-agent-standard-l
-
Just answer questions when needed and wait for results...
-
If you have success 👊 you can now load your model.gguf using gpt4all app or ollama.
Warning: Running this script as root is not recommended.
mistral llama llama3 ggml