AutoGPTQ-triton

An easy-to-use model quantization package with user-friendly APIs, based on the GPTQ algorithm.

Stars: 15
Forks: 3
Language: Python
Last Updated: Apr 26, 2023
