Pinned repositories
- vllm-project/llm-compressor: Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM (a minimal usage sketch follows this list)
- neuralmagic/compressed-tensors: A safetensors extension to efficiently store sparse quantized tensors on disk
- neuralmagic/deepsparse: Sparsity-aware deep learning inference runtime for CPUs
- neuralmagic/sparseml: Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models (see the recipe sketch after this list)
- neuralmagic/sparsezoo: Neural network model repository for highly sparse and sparse-quantized models with matching sparsification recipes
- neuralmagic/sparsify: ML model optimization product to accelerate inference
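
To illustrate how llm-compressor is typically applied, here is a minimal one-shot quantization sketch in the style of the project's documentation. The model id, calibration dataset, and output directory are illustrative assumptions, and exact import paths and parameter names can differ between llm-compressor versions.

```python
# Minimal sketch: one-shot W4A16 (GPTQ-style) quantization with llm-compressor.
# Model id, dataset, and output_dir are example values, not prescribed ones;
# check the installed llm-compressor version's docs for the exact API.
from llmcompressor.modifiers.quantization import GPTQModifier
from llmcompressor.transformers import oneshot

oneshot(
    model="TinyLlama/TinyLlama-1.1B-Chat-v1.0",  # example Hugging Face model id
    dataset="open_platypus",                     # example calibration dataset
    recipe=GPTQModifier(targets="Linear", scheme="W4A16", ignore=["lm_head"]),
    output_dir="TinyLlama-1.1B-Chat-v1.0-W4A16", # example output path
    max_seq_length=2048,
    num_calibration_samples=512,
)
```

The compressed checkpoint is saved in the compressed-tensors format, which vLLM can then load for accelerated inference.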
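
The "few lines of code" claim for sparseml refers to its recipe-driven training integration: a YAML recipe describes the pruning/quantization schedule, and a manager wraps the optimizer so an existing training loop stays unchanged. The sketch below uses a placeholder recipe path and a stand-in model; the exact PyTorch integration API may vary by SparseML version.

```python
# Sketch of SparseML's recipe-driven PyTorch integration (assumed API).
# "recipe.yaml" is a hypothetical path to a sparsification recipe; the model,
# optimizer, and steps_per_epoch are stand-ins for an existing training setup.
import torch
from sparseml.pytorch.optim import ScheduledModifierManager

model = torch.nn.Linear(128, 10)                         # stand-in model
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)  # stand-in optimizer
steps_per_epoch = 100                                    # batches per epoch in the real loop

manager = ScheduledModifierManager.from_yaml("recipe.yaml")
optimizer = manager.modify(model, optimizer, steps_per_epoch=steps_per_epoch)

# ... run the usual training loop here; the recipe schedules pruning/quantization ...

manager.finalize(model)  # remove sparsification hooks once training completes
```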