Neural Magic's Projects
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
Hackathon 2022
CLIP-like model evaluation
A safetensors extension to efficiently store sparse quantized tensors on disk
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Sparsity-aware deep learning inference runtime for CPUs
Repo for building and packaging a 1-click app for DigitalOcean
Top-level directory for documentation and general content
Notebooks using the Neural Magic libraries 📓
Helm charts for deploying NM VLLM
Reference implementations of MLPerf™ inference benchmarks
⚡ Building applications with LLMs through composability ⚡
A framework for few-shot evaluation of autoregressive language models.
Neural Magic GHA
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
Neural Magic Docker
A high-throughput and memory-efficient inference and serving engine for LLMs
Various utilities for use with nm-vllm
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXt, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
Framework-agnostic sliced/tiled inference + interactive UI + error analysis plots
Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models
Neural network model repository for highly sparse and sparse-quantized models with matching sparsification recipes
ML model optimization product to accelerate inference.
LLM training code for MosaicML foundation models
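The sparsity-aware inference runtime listed above exploits the fact that pruned networks contain mostly zero weights. A minimal conceptual sketch of that idea (not the runtime's actual implementation, which uses compiled CPU kernels): store only the nonzero weights and skip zeros entirely, so compute scales with the number of nonzeros rather than the dense size.

```python
# Hypothetical sketch of sparsity-aware compute, in plain Python for clarity.
# Real runtimes operate on compressed tensor formats with vectorized kernels.

def to_sparse(weights):
    """Compress a dense weight row into (index, value) pairs for nonzeros."""
    return [(i, w) for i, w in enumerate(weights) if w != 0.0]

def sparse_dot(sparse_weights, activations):
    """Dot product that touches only the stored nonzero weights."""
    return sum(w * activations[i] for i, w in sparse_weights)

dense = [0.0, 2.0, 0.0, 0.0, -1.0, 0.0, 0.5, 0.0]  # row with 5 of 8 zeros
acts = [1.0, 3.0, 2.0, 4.0, 5.0, 6.0, 8.0, 7.0]

sparse = to_sparse(dense)          # only 3 (index, value) pairs are stored
result = sparse_dot(sparse, acts)  # 2*3 + (-1)*5 + 0.5*8 = 5.0
```

The same principle underlies the sparse-quantized storage formats mentioned above: pairing nonzero indices with low-precision values shrinks both the on-disk footprint and the work done at inference time.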