jp6/cu122/: autoawq versions
Because this project isn't in the mirror_whitelist
,
no releases from root/pypi are included.
Latest version on stage is: 0.2.4+cu122
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.
Index | Version | Documentation |
---|---|---|
jp6/cu122 | 0.2.4+cu122 |