A curated list for Efficient Large Language Models
Updated Nov 17, 2024 - Python
- Pruner-Zero: Evolving Symbolic Pruning Metric from Scratch for LLMs
- [ICLR 2024] Jaiswal, A., Gan, Z., Du, X., Zhang, B., Wang, Z., & Yang, Y. Compressing LLMs: The Truth Is Rarely Pure and Never Simple.
- LLM Inference on AWS Lambda
- Papers on LLM compression
- [CAAI AIR'24] Minimize Quantization Output Error with Bias Compensation
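The bias-compensation entry above lends itself to a short illustration. The sketch below is a generic, minimal NumPy example of the underlying idea (not the listed repo's implementation): after quantizing a linear layer's weights, the mean output error over calibration data is absorbed into the bias, b' = b + E[(W - Wq)x]. All array shapes and the 4-bit symmetric quantizer are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy linear layer y = x @ W.T + b with made-up calibration activations.
W = rng.normal(size=(8, 16))
b = rng.normal(size=8)
x = rng.normal(size=(256, 16))

def quantize(w, bits=4):
    # Symmetric uniform quantization (illustrative, per-tensor scale).
    scale = np.abs(w).max() / (2 ** (bits - 1) - 1)
    return np.round(w / scale) * scale

Wq = quantize(W)

# Bias compensation: fold the mean output error into the bias,
# b' = b + E[(W - Wq) x].
err = x @ (W - Wq).T            # per-sample output error from quantization
b_comp = b + err.mean(axis=0)

y_ref = x @ W.T + b
mse_plain = np.mean((x @ Wq.T + b - y_ref) ** 2)
mse_comp = np.mean((x @ Wq.T + b_comp - y_ref) ** 2)
```

Since subtracting the per-output mean of the error can only reduce its average squared magnitude, `mse_comp` is never larger than `mse_plain` on the calibration set, at zero inference cost.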