Can I implement block quantization through tensorflow-quantization? #4283

lqq-feel · 2024-12-15T03:45:47Z

I want to implement block quantization with tensorflow-quantization. What process should I follow? Or could you provide a simple example?
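For context, block (block-wise) quantization generally means splitting a tensor into fixed-size blocks and computing one scale per block instead of one per tensor or per channel. The snippet below is a minimal plain-Python sketch of that idea under symmetric INT8 assumptions. It is not the tensorflow-quantization API, and `quantize_blockwise` and `dequantize_blockwise` are hypothetical helper names used only for illustration.

```python
# Illustrative only: block-wise symmetric quantization of a flat list of weights.
# These helpers are hypothetical; they are NOT part of tensorflow-quantization,
# TensorRT, or TensorRT-LLM.

def quantize_blockwise(weights, block_size, num_bits=8):
    """Quantize `weights` symmetrically, with one scale per block of `block_size` values."""
    qmax = 2 ** (num_bits - 1) - 1  # 127 for INT8
    q_values, scales = [], []
    for start in range(0, len(weights), block_size):
        block = weights[start:start + block_size]
        amax = max(abs(w) for w in block)
        scale = amax / qmax if amax > 0 else 1.0  # avoid dividing by zero
        scales.append(scale)
        # Round to the nearest integer and clamp to the signed range.
        q_values.extend(max(-qmax, min(qmax, round(w / scale))) for w in block)
    return q_values, scales


def dequantize_blockwise(q_values, scales, block_size):
    """Map integers back to floats using the scale of the block each one belongs to."""
    return [q * scales[i // block_size] for i, q in enumerate(q_values)]


# Two blocks with very different magnitudes, so each block gets its own scale.
weights = [0.02, -0.01, 0.03, 0.015, 4.0, -2.5, 1.0, 3.2]
q, scales = quantize_blockwise(weights, block_size=4)
restored = dequantize_blockwise(q, scales, block_size=4)
```

With a single per-tensor scale, the large value 4.0 would set the scale for everything, and the small values in the first block would all quantize to 0 or 1. Per-block scales keep their resolution, at the cost of storing one scale per block.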

lix19937 · 2024-12-16T05:20:05Z

Maybe this helps: https://github.com/NVIDIA/TensorRT/tree/release/10.7/tools/tensorflow-quantization

lqq-feel · 2024-12-16T11:37:56Z

> Maybe this helps: https://github.com/NVIDIA/TensorRT/tree/release/10.7/tools/tensorflow-quantization

I have read through it before, but I did not find an example that implements block quantization, so I do not know whether it actually supports block quantization.

lix19937 · 2024-12-17T05:12:42Z

trt-llm has done some work on LoRA, which includes block-wise quantization.

lqq-feel · 2024-12-17T11:58:27Z

> trt-llm has done some work on LoRA, which includes block-wise quantization.

Can trt-llm do block-wise quantization on a non-LLM model, such as a TensorFlow GraphDef model?
