#

sparsity

Here are 131 public repositories matching this topic...

intel / neural-compressor

SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime

sparsity pruning quantization knowledge-distillation auto-tuning int8 low-precision quantization-aware-training post-training-quantization awq int4 large-language-models gptq smoothquant sparsegpt fp4 mxformat

Updated Nov 26, 2024
Python

sparseml

neuralmagic / sparseml

Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models

nlp sparsity tensorflow keras pytorch deep-learning-algorithms image-classification deep-learning-library pruning object-detection transfer-learning automl computer-vision-algorithms onnx deep-learning-models sparsification pruning-algorithms smaller-models sparsification-recipes

Updated Aug 1, 2024
Python

pytorch / ao

PyTorch native quantization and sparsity for training and inference

training sparsity cuda inference optimizer pytorch transformer offloading llama quantization mx brrr dtypes float8

Updated Nov 26, 2024
Python

PaddlePaddle / PaddleSlim

PaddleSlim is an open-source library for deep model compression and architecture search.

sparsity compression detection transformer segmentation pruning quantization nas bert tensorrt distillation ernie yolov5 yolov6 yolov7

Updated Nov 20, 2024
Python

tensorflow / model-optimization

A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.

machine-learning sparsity compression deep-learning tensorflow optimization keras ml pruning quantization model-compression quantized-training quantized-neural-networks quantized-networks

Updated Nov 21, 2024
Python

openvinotoolkit / nncf

Neural Network Compression Framework for enhanced OpenVINO™ inference

nlp sparsity compression deep-learning tensorflow transformers pytorch classification pruning object-detection quantization semantic-segmentation bert hawq onnx openvino mmdetection mixed-precision-training quantization-aware-training

Updated Nov 26, 2024
Python

Eric-mingjie / network-slimming

Network Slimming (Pytorch) (ICCV 2017)

sparsity deep-learning pytorch convolutional-neural-networks channel-pruning

Updated Nov 6, 2020
Python

vllm-project / llm-compressor

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

sparsity compression quantization

Updated Nov 26, 2024
Python

Bobo-y / flexible-yolov5

More readable and flexible yolov5 with more backbone(gcn, resnet, shufflenet, moblienet, efficientnet, hrnet, swin-transformer, etc) and (cbam，dcn and so on), and tensorrt

sparsity backbone pytorch resnet object-detection gcn tensorrt neck qat shufflenet yolov3 cbam hrnet dcnv2 yolov5 moblienet swin-transformer triton-server ptq

Updated Aug 19, 2024
Python

FMInference / H2O

[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.

sparsity high-throughput heavy-hitters kv-cache gpt-3 large-language-models

Updated Aug 1, 2024
Python

wenwei202 / caffe

Caffe for Sparse and Low-rank Deep Neural Networks

deep-neural-networks sparsity acceleration compression caffe low-rank-approximation sparse-convolution

Updated Mar 8, 2020
C++

intel / neural-speed

An innovative library for efficient LLM inference via low-bit quantization

Updated Aug 30, 2024
C++

mehtadushy / SelecSLS-Pytorch

Reference ImageNet implementation of SelecSLS CNN architecture proposed in the SIGGRAPH 2020 paper "XNect: Real-time Multi-Person 3D Motion Capture with a Single RGB Camera". The repository also includes code for pruning the model based on implicit sparsity emerging from adaptive gradient descent methods, as detailed in the CVPR 2019 paper "On i…

sparsity deep-learning efficient cnn pytorch imagenet pruning siggraph pytorch-implementation cvpr2019 efficient-architectures

Updated Jul 23, 2020
Python

bwohlberg / sporco

Sparse Optimisation Research Code

python sparsity optimization cuda admm sparse-coding dictionary-learning optimization-algorithms robust-pca fista convolutional-sparse-coding total-variation sparse-representations convolutional-dictionary-learning total-variation-minimization plug-and-play-priors

Updated Apr 29, 2024
Python

dcmocanu / sparse-evolutionary-artificial-neural-networks

Always sparse. Never dense. But never say never. A Sparse Training repository for the Adaptive Sparse Connectivity concept and its algorithmic instantiation, i.e. Sparse Evolutionary Training, to boost Deep Learning scalability on various aspects (e.g. memory and computational time efficiency, representation and generalization power).

deep-neural-networks sparsity deep-learning scalability randomization neuroevolution deep-learning-algorithms classification evolutionary-algorithms artificial-neural-networks restricted-boltzmann-machine complex-networks deep-learning-papers multi-layer-perceptron generative-models sparse-neural-networks scalable-deep-learning sparse-training sparse-evolutionary-training adaptive-sparse-connectivity

Updated Jul 21, 2021
Python

IntelLabs / SkimCaffe

Caffe for Sparse Convolutional Neural Network

sparsity caffe intel convolution pruning winograd

Updated Dec 16, 2022
C++

The-Learning-And-Vision-Atelier-LAVA / SMSR

[CVPR 2021] Exploring Sparsity in Image Super-Resolution for Efficient Inference

sparsity super-resolution efficient-inference

Updated Oct 18, 2021
Python

vene / sparse-structured-attention

Sparse and structured neural attention mechanisms

deep-neural-networks sparsity deep-learning sparse segmentation deeplearning attention-mechanism fused-lasso attention-mechanisms

Updated Aug 31, 2020
Python

jack-willturner / deep-compression

Learning both Weights and Connections for Efficient Neural Networks https://arxiv.org/abs/1506.02626

sparsity deep-learning pytorch pruning

Updated Nov 10, 2022
Jupyter Notebook

lucaslie / torchprune

A research library for pytorch-based neural network pruning, compression, and more.

machine-learning sparsity compression deep-learning pytorch neural-networks pruning tensor-decomposition neural-architecture-search weight-pruning sparsification filter-pruning tinyml generalization-ability coresets pruning-algorithms

Updated Nov 28, 2022
Shell

Improve this page

Add a description, image, and links to the sparsity topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the sparsity topic, visit your repo's landing page and select "manage topics."