Skip to content
@ModelTC

ModelTC

Model Infra

Pinned Loading

  1. MQBench MQBench Public

    Model Quantization Benchmark

    Shell 766 140

  2. United-Perception United-Perception Public

    United Perception

    Python 430 65

  3. NNLQP NNLQP Public

    Python 34 3

  4. Dipoorlet Dipoorlet Public

    Offline Quantization Tools for Deploy.

    Python 116 16

  5. lightllm lightllm Public

    LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

    Python 2.6k 206

Repositories

Showing 10 of 39 repositories
  • llmc Public

    [EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".

    ModelTC/llmc’s past year of commit activity
    Python 325 Apache-2.0 34 2 0 Updated Nov 23, 2024
  • lightllm Public

    LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

    ModelTC/lightllm’s past year of commit activity
    Python 2,622 Apache-2.0 206 61 6 Updated Nov 22, 2024
  • general-sam-py Public

    Python bindings for general-sam and some utilities

    ModelTC/general-sam-py’s past year of commit activity
    Python 3 Apache-2.0 0 0 2 Updated Nov 18, 2024
  • mtc-token-healing Public

    Token healing implementation in Rust

    ModelTC/mtc-token-healing’s past year of commit activity
    Rust 3 Apache-2.0 0 0 3 Updated Nov 18, 2024
  • general-sam Public

    A general suffix automaton implementation in Rust with Python bindings

    ModelTC/general-sam’s past year of commit activity
    Rust 4 Apache-2.0 0 0 1 Updated Oct 18, 2024
  • EasyLLM Public

    Built upon Megatron-Deepspeed and HuggingFace Trainer, EasyLLM has reorganized the code logic with a focus on usability. While enhancing usability, it also ensures training efficiency.

    ModelTC/EasyLLM’s past year of commit activity
    Python 41 Apache-2.0 7 0 0 Updated Sep 18, 2024
  • DeepSpeed Public Forked from microsoft/DeepSpeed

    DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

    ModelTC/DeepSpeed’s past year of commit activity
    Python 0 Apache-2.0 4,295 0 0 Updated Sep 13, 2024
  • opencompass Public Forked from open-compass/opencompass

    OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

    ModelTC/opencompass’s past year of commit activity
    Python 1 Apache-2.0 446 0 0 Updated Sep 6, 2024
  • xtuner Public Forked from InternLM/xtuner

    An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

    ModelTC/xtuner’s past year of commit activity
    Python 0 Apache-2.0 316 0 0 Updated Aug 22, 2024
  • InternVL Public Forked from OpenGVLab/InternVL

    [CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的可商用开源多模态对话模型

    ModelTC/InternVL’s past year of commit activity
    Python 0 MIT 480 0 0 Updated Aug 16, 2024

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…