Skip to content
Change the repository type filter

All

    Repositories list

    • vllm-fork

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      Apache License 2.0
      4.7k442029Updated Nov 29, 2024Nov 29, 2024
    • Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)
      Python
      Apache License 2.0
      203304Updated Nov 29, 2024Nov 29, 2024
    • Python
      Apache License 2.0
      9203Updated Nov 29, 2024Nov 29, 2024
    • Provides the examples to write and build Habana custom kernels using the HabanaTools
      C++
      191831Updated Nov 21, 2024Nov 21, 2024
    • Reference models for Intel(R) Gaudi(R) AI Accelerator
      Jupyter Notebook
      81155812Updated Nov 21, 2024Nov 21, 2024
    • AutoGPTQ

      Public
      An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
      Python
      MIT License
      488002Updated Nov 19, 2024Nov 19, 2024
    • TOWL

      Public
      HTML
      Apache License 2.0
      1200Updated Nov 19, 2024Nov 19, 2024
    • Fairseq

      Public
      Python
      MIT License
      1300Updated Nov 18, 2024Nov 18, 2024
    • NIC drivers (Ethernet, IBverbs and common) for the NIC IP that is inside Intel's data-center GPU
      C
      Other
      2000Updated Nov 12, 2024Nov 12, 2024
    • Intel® Gaudi® Software is an implementation of the runtime and graph compiler for Gaudi3
      C++
      Other
      3410Updated Nov 11, 2024Nov 11, 2024
    • Tensors and Dynamic neural networks in Python with strong GPU acceleration
      Python
      Other
      23k104Updated Nov 11, 2024Nov 11, 2024
    • tpc_llvm

      Public
      TPC-CLANG compiler that compiles a TPC C programming language which is used in HabanaLabs Deep-Learning Accelerators
      42523Updated Nov 11, 2024Nov 11, 2024
    • HABANA device plugin for Kubernetes
      Go
      Apache License 2.0
      3405Updated Nov 11, 2024Nov 11, 2024
    • Python
      MIT License
      3603Updated Nov 11, 2024Nov 11, 2024
    • C
      Other
      0105Updated Nov 11, 2024Nov 11, 2024
    • Tutorials for running models on First-gen Gaudi and Gaudi2 for Training and Inference. The source files for the tutorials on https://developer.habana.ai/
      Jupyter Notebook
      365632Updated Nov 6, 2024Nov 6, 2024
    • SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
      Python
      Apache License 2.0
      257000Updated Oct 31, 2024Oct 31, 2024
    • Ongoing research training transformer models at scale
      Python
      Other
      2.4k400Updated Oct 15, 2024Oct 15, 2024
    • Intel Gaudi's Megatron DeepSpeed Large Language Models for training
      Python
      Other
      2.4k1301Updated Oct 14, 2024Oct 14, 2024
    • HCL

      Public
      C++
      2800Updated Oct 13, 2024Oct 13, 2024
    • C++
      BSD 3-Clause "New" or "Revised" License
      1100Updated Oct 13, 2024Oct 13, 2024
    • Setup and Installation Instructions for Habana binaries, docker image creation
      Python
      Apache License 2.0
      132355Updated Oct 11, 2024Oct 11, 2024
    • DeepSpeed

      Public
      DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
      Python
      Apache License 2.0
      4.1k1202Updated Oct 10, 2024Oct 10, 2024
    • slurm

      Public
      Slurm: A Highly Scalable Workload Manager
      C
      Other
      671201Updated Sep 29, 2024Sep 29, 2024
    • hccl_demo

      Public
      C++
      Apache License 2.0
      101502Updated Sep 3, 2024Sep 3, 2024
    • rdma-core

      Public
      RDMA core userspace libraries and daemons
      C
      Other
      691100Updated Jul 24, 2024Jul 24, 2024
    • papers

      Public
      Academic papers by Habana research team
      2100Updated Jul 20, 2024Jul 20, 2024
    • Jupyter Notebook
      Apache License 2.0
      3801Updated Jul 17, 2024Jul 17, 2024
    • Thunk library for HabanaLabs kernel driver
      C
      Other
      8400Updated Jun 16, 2024Jun 16, 2024
    • Habana container runtime
      Go
      Apache License 2.0
      2602Updated Jun 6, 2024Jun 6, 2024