Quantizers

Quantizers is a library that provides an easy-to-use interface for quantizing LLMs into various formats by using YAML configs.

Supported Operating Systems

Linux
Windows
macOS

Supported Quantizations

Installation

To get started, clone the repo recursively:

git clone https://github.com/PygmalionAI/quantizers.git
cd quantizers
git submodule update --init --recursive
python3 -m pip install -e .
python3 -m pip install -r requirements.txt

To build with GPU support (currently for imatrix only), run this instead:

LLAMA_CUBLAS=1 python3 -m pip install -e .

Usage

Only GGUF is supported for now. You will need a YAML config file. An example is provided in the examples directory.

Once you've filled out your YAML file, run:

quantizers examples/gguf/config.yaml

Contribution

At the moment, we don't accept feature contributions until we've finished supporting all the planned quantization methods. PRs for bug fixes and OS support are welcome!

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
examples/gguf		examples/gguf
quantizers		quantizers
third_party		third_party
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE		LICENSE
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Quantizers

Supported Operating Systems

Supported Quantizations

Installation

Usage

Contribution

About

Releases

Packages

Languages

License

Nexesenex/quantizers_nq

Folders and files

Latest commit

History

Repository files navigation

Quantizers

Supported Operating Systems

Supported Quantizations

Installation

Usage

Contribution

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages