quantized-networks

Here are 4 public repositories matching this topic...

A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.

QKeras: a quantization deep learning library for Tensorflow Keras

An acceleration library that supports arbitrary bit-width combinatorial quantization operations

research cuda mlsys quantized-networks llm-inference

model compression and optimization for deployment for Pytorch, including knowledge distillation, quantization and pruning.(知识蒸馏，量化，剪枝)

Add a description, image, and links to the quantized-networks topic page so that developers can more easily learn about it.

To associate your repository with the quantized-networks topic, visit your repo's landing page and select "manage topics."