SqueezeLLM

[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization

Stars: 573

Forks: 36

Language: Python

Last Updated: May 15, 2024
