Compress Transformers for faster inference using techniques like Knowledge Distillation, Quantization, ONNX Conversion and Pruning (Sparsification)
-
Notifications
You must be signed in to change notification settings - Fork 0
subhasisj/Model-Compression-Techniques
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
Compress Transformers for faster inference using techniques like Knowledge Distillation, Quantization, ONNX Conversion and Pruning (Sparsification)
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published