Skip to content
@neuralmagic

Neural Magic

Neural Magic helps developers in accelerating machine learning performance using automated model sparsification techniques and inference technologies.

Pinned Loading

  1. nm-vllm nm-vllm Public

    Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python 234 9

  2. deepsparse deepsparse Public

    Sparsity-aware deep learning inference runtime for CPUs

    Python 2.9k 169

  3. sparseml sparseml Public

    Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models

    Python 2k 140

  4. docs docs Public

    Top-level directory for documentation and general content

    MDX 120 7

  5. examples examples Public

    Notebooks using the Neural Magic libraries 📓

    Jupyter Notebook 39 6

  6. sparsezoo sparsezoo Public

    Neural network model repository for highly sparse and sparse-quantized models with matching sparsification recipes

    Python 362 23

Repositories

Showing 10 of 47 repositories
  • nm-vllm Public Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    neuralmagic/nm-vllm’s past year of commit activity
    Python 234 3,275 0 24 Updated Jul 12, 2024
  • guidellm Public
    neuralmagic/guidellm’s past year of commit activity
    Python 2 Apache-2.0 0 0 5 Updated Jul 11, 2024
  • compressed-tensors Public

    A safetensors extension to efficiently store sparse quantized tensors on disk

    neuralmagic/compressed-tensors’s past year of commit activity
    Python 11 Apache-2.0 0 0 11 Updated Jul 11, 2024
  • transformers Public Forked from huggingface/transformers

    🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.

    neuralmagic/transformers’s past year of commit activity
    Python 9 Apache-2.0 25,932 0 13 Updated Jul 11, 2024
  • nm-vllm-utils Public

    Various utilities for use with nm-vllm

    neuralmagic/nm-vllm-utils’s past year of commit activity
    Makefile 0 Apache-2.0 0 0 6 Updated Jul 9, 2024
  • evalplus Public Forked from evalplus/evalplus

    NeuralMagic fork of EvalPlus (Rigourous evaluation of LLM-synthesized code - NeurIPS 2023)

    neuralmagic/evalplus’s past year of commit activity
    Python 0 Apache-2.0 87 0 0 Updated Jul 9, 2024
  • alpaca_eval Public Forked from tatsu-lab/alpaca_eval

    An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

    neuralmagic/alpaca_eval’s past year of commit activity
    Jupyter Notebook 0 Apache-2.0 207 0 0 Updated Jul 9, 2024
  • nm-actions Public

    Neural Magic GHA

    neuralmagic/nm-actions’s past year of commit activity
    0 Apache-2.0 0 0 1 Updated Jul 8, 2024
  • yolov5 Public Forked from ultralytics/yolov5

    YOLOv5 in PyTorch > ONNX > CoreML > TFLite

    neuralmagic/yolov5’s past year of commit activity
    Python 20 GPL-3.0 16,040 0 6 Updated Jul 8, 2024
  • helm-charts Public

    Helm charts for deploying NM VLLM

    neuralmagic/helm-charts’s past year of commit activity
    Python 3 Apache-2.0 0 0 3 Updated Jul 6, 2024