SnakModel is a 7B-parameter, autoregressive language model designed specifically for Danish. It is released both as an instruction-tuned variant and as a base version for further fine-tuning. Our models build on Llama 2, which we continuously pre-train on a diverse collection of Danish corpora comprising 350M documents and 13.6B words, before tuning on 3.7M Danish instruction-answer pairs.
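Since SnakModel is a Llama 2 derivative, it should load with the standard Hugging Face `transformers` causal-LM API. Below is a minimal sketch; the repository ID `NLPnorth/snakmodel-7b-instruct`, the dtype, and the plain-text prompt format are assumptions, so check the model cards linked below for the published IDs and any recommended prompt template.

```python
# Minimal sketch: loading a SnakModel checkpoint with Hugging Face transformers.
# The model ID below is an assumption; see the model cards for the actual IDs.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "NLPnorth/snakmodel-7b-instruct"  # assumed Hub repository ID

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # 7B parameters; expect ~16 GB of GPU memory
    device_map="auto",
)

# Danish instruction: "Write a short poem about Copenhagen."
prompt = "Skriv et kort digt om København."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(
    **inputs,
    max_new_tokens=128,
    do_sample=True,
    temperature=0.7,
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```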
Developers
- 🧭 NLPnorth research unit at the IT University of Copenhagen, Denmark.
- 💬 SnakModeller:
  - SnakModel-7B (base): The base LM trained on Danish text completion, plus its intermediate checkpoints.
  - SnakModel-7B (instruct): An instruction-tuned variant of the base model, plus its intermediate checkpoints.
- ⚙️ Model Training Dynamics:
  - Research Paper: coming in Q1 2025.
  - Codebase: coming soon to this repository.
- 🇩🇰 Cultural Awareness Evaluation:
  - Research Paper: coming in Q1 2025 (pre-print coming soon).
  - Codebase: coming soon to this repository.
  - Web-based LLM Evaluation Interface: coming soon.