In this Repo you can learn how to Load LLM in CPU and GPU. Basic Parameters like Temperature, max token, device type, etc.
Reference:
LLM : https://www.promptingguide.ai/introduction/settings
HF: https://huggingface.co/models
Ctransformers: https://pypi.org/project/ctransformers/
LLama-cpp-python: https://pypi.org/project/llama-cpp-python/