A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.
-
Updated
Dec 16, 2024 - Python
A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.
QKeras: a quantization deep learning library for Tensorflow Keras
An acceleration library that supports arbitrary bit-width combinatorial quantization operations
model compression and optimization for deployment for Pytorch, including knowledge distillation, quantization and pruning.(知识蒸馏,量化,剪枝)
Add a description, image, and links to the quantized-networks topic page so that developers can more easily learn about it.
To associate your repository with the quantized-networks topic, visit your repo's landing page and select "manage topics."