0.10.0 - Phi2
Major Changes
- Update binaries feb 2024 by @martindevans in #479
- Add CLBLAST native library to native libraries build by @jasoncouture in #468
- Introduced a new
BatchedExecutor
by @martindevans in #503 - Swapped
StatelessExecutor
to usellama_decode
! by @martindevans in #445 - LLamaToken Struct by @martindevans in #404
Bug Fixes
- KernelMemory EmbeddingMode bug correction by @zsogitbe in #485
- Normalize Embeddings by @martindevans in #507
- StreamingTextDecoder Fix & Tests by @martindevans in #428
- Tokenizer Fixes For Issue 430 by @martindevans in #433
Other Changes
- Use llama instead of libllama in
[DllImport]
by @jasoncouture in #465 - Updated Examples by @vikramvee in #502
- Added new file types to quantisation by @martindevans in #495
- Smaller Unit Test Model by @martindevans in #496
- Using
AddRange
inLLamaEmbedder
by @martindevans in #499 - Small KV Cache Handling Improvements by @martindevans in #500
- Added increment and decrement operators to
LLamaPos
by @martindevans in #501 - Swapped
GetEmbeddings
tollama_decode
by @martindevans in #474 - kv_cache_instance_methods by @martindevans in #454
- Removed
IModelParams
andIContextParams
setters. by @martindevans in #472 - Managed
LLamaBatch
by @martindevans in #442 - Check Model Path Exists by @martindevans in #437
- Model Metadata Loading Cleanup by @martindevans in #438
- Added a check for EOS token in LLamaStatelessExecutor by @martindevans in #434
- Update README.md by @Oceania2018 in #427
- Gpu layer count change by @Kaotic3 in #424
- Improved exceptions in IModelParams for unknown KV override types. by @martindevans in #416
New Contributors
- @Kaotic3 made their first contribution in #424
- @Oceania2018 made their first contribution in #427
- @jasoncouture made their first contribution in #465
- @zsogitbe made their first contribution in #485
- @vikramvee made their first contribution in #502
Full Changelog: 0.9.1...v0.10.0