

Add CodeLlama-13b-Python-hf-q4f16_1-metal #18

Merged: 1 commit into mlc-ai:main on Aug 30, 2023

Conversation

@sangelone (Contributor) commented on Aug 28, 2023

Addresses (in part) #16. Total param size is 6.8 GB.

Build output:

Downloaded weights to dist/models/CodeLlama-13b-Python-hf
Using path "dist/models/CodeLlama-13b-Python-hf" for model "CodeLlama-13b-Python-hf"
/Users/catalyst/Workspace/mlc-ai-package-self-runner/_work/package/package/tvm/src/runtime/metal/metal_device_api.mm:167: Intializing Metal device 0, name=Apple M2
Host CPU dection:
  Target triple: arm64-apple-darwin22.6.0
  Process triple: arm64-apple-darwin22.6.0
  Host CPU: apple-m1
Target configured: metal -keys=metal,gpu -max_function_args=31 -max_num_threads=256 -max_shared_memory_per_block=32768 -max_threads_per_block=1024 -thread_warp_size=32
Host CPU dection:
  Target triple: arm64-apple-darwin22.6.0
  Process triple: arm64-apple-darwin22.6.0
  Host CPU: apple-m1
Automatically using target for weight quantization: metal -keys=metal,gpu -max_function_args=31 -max_num_threads=256 -max_shared_memory_per_block=32768 -max_threads_per_block=1024 -thread_warp_size=32
Start computing and quantizing weights... This may take a while.
Finish computing and quantizing weights.
Total param size: 6.820138931274414 GB
Start storing to cache dist/CodeLlama-13b-Python-hf-q4f16_1/params
All finished, 163 total shards committed, record saved to dist/CodeLlama-13b-Python-hf-q4f16_1/params/ndarray-cache.json
Finish exporting chat config to dist/CodeLlama-13b-Python-hf-q4f16_1/params/mlc-chat-config.json
Save a cached module to dist/CodeLlama-13b-Python-hf-q4f16_1/mod_cache_before_build.pkl.
Finish exporting to dist/CodeLlama-13b-Python-hf-q4f16_1/CodeLlama-13b-Python-hf-q4f16_1-metal.so
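As a back-of-the-envelope check on the logged size, the sketch below estimates the quantized footprint. Both inputs are assumptions, not confirmed in this PR: roughly 13.0e9 parameters for the 13B model, and a q4f16_1 layout of 4-bit weights plus one fp16 scale per group of 32 weights.

```python
# Rough size estimate for a 13B model under q4f16_1 quantization.
# Assumed: ~13.0e9 parameters; 4-bit weights with one fp16 scale
# amortized over each group of 32 weights.
params = 13.0e9
bits_per_weight = 4 + 16 / 32        # weight bits + amortized scale bits
total_bytes = params * bits_per_weight / 8
gib = total_bytes / 2**30
print(f"{gib:.2f} GiB")              # prints "6.81 GiB"
```

Under these assumptions the estimate lands close to the 6.820138931274414 GB reported in the build log above, which suggests the quantized artifact size is dominated by the packed weights plus per-group scales.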

@Sing-Li (Contributor) commented on Aug 30, 2023

@CharlieFRuan is it possible to merge this 🙏

@MasterJH5574 MasterJH5574 merged commit 74a341e into mlc-ai:main Aug 30, 2023
@MasterJH5574 (Member)

Merged. Thanks @sangelone!
