jp6/cu124/: auto-gptq versions
Because this project isn't in the mirror_whitelist, no releases from root/pypi are included.
Latest version on stage is: 0.7.1+cu124
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
Index | Version | Documentation |
---|---|---|
jp6/cu124 | 0.7.1+cu124 | |