
Enable pack hpu and from_quantized #7

Draft
HolyFalafel wants to merge 3 commits into main

Conversation


@HolyFalafel commented on Jul 31, 2024

Revert "Removed hpu pack until we'll implement it in HPU"
This reverts commit 92a8d41.
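
For context, the HPU `pack` step being restored here packs groups of 4-bit quantized weights into int32 words, as GPTQ-style kernels expect. Below is a minimal sketch of that idea, assuming weights are already quantized to integers in 0..15; it is an illustration only, not the `pack()` implementation this PR re-enables:

```python
import torch

def pack_int4(intweight: torch.Tensor) -> torch.Tensor:
    """Pack [rows, cols] integer values in 0..15 into [rows // 8, cols] int32
    words (8 nibbles per word). Illustrative sketch, not the repository's pack()."""
    rows, cols = intweight.shape
    assert rows % 8 == 0, "rows must be a multiple of 8 for 4-bit packing"
    packed = torch.zeros(rows // 8, cols, dtype=torch.int32)
    for i in range(8):
        # Row 8*j + i of the input lands in nibble i of output word j.
        packed |= (intweight[i::8].to(torch.int32) & 0xF) << (4 * i)
    return packed
```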

  • Enabled test_quantization and AutoGPTQForCausalLM.from_quantized
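
A minimal usage sketch of the re-enabled `AutoGPTQForCausalLM.from_quantized` path follows; the checkpoint path is a placeholder, and `device="hpu"` is an assumption about how the Gaudi device is selected (it presumes an environment where `habana_frameworks.torch` has registered the HPU backend):

```python
from transformers import AutoTokenizer
from auto_gptq import AutoGPTQForCausalLM

# Placeholder path to an already GPTQ-quantized checkpoint.
quantized_model_dir = "path/to/quantized-llama"

tokenizer = AutoTokenizer.from_pretrained(quantized_model_dir)

# Load the packed int4 weights; device="hpu" assumes a Gaudi environment
# where the Habana PyTorch bridge is installed and importable.
model = AutoGPTQForCausalLM.from_quantized(quantized_model_dir, device="hpu")

inputs = tokenizer("Auto-GPTQ on HPU:", return_tensors="pt").to("hpu")
output = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```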

HolyFalafel and others added 3 commits July 30, 2024 11:11
* Supporting llama int4 quantization using AutoGPTQ

* Running only PT code (similar to cuda_old) on HPU

* Testing convert_from_int4

* Started cleanup

* code cleanup

* Added weight reshape in preprocessing
Added llama7b generation hpu test

* Changed reshape to match matmul (still not accurate) and fixed q4 test

* Fixing zero points

* Update pack function

* Fixed accuracy

* Uncommented exllama

* Marlin test fix + added hpu bias test

* Review comments

* Removed hpu pack until we'll implement it in HPU

---------

Co-authored-by: yan tomsinsky <ytomsinsky@habana.ai>
@HolyFalafel changed the title from "Dev/danny/enable pack hpu and test" to "Enable pack hpu and from_quantized" on Jul 31, 2024