proxy.golang.org : github.com/intel/auto-round
Advanced Quantization Algorithm for LLMs and VLMs, with support for CPU, Intel GPU, CUDA and HPU.
Registry
-
Source
- Documentation
- JSON
- codemeta.json
purl: pkg:golang/github.com/intel/auto-round
Keywords:
int4
, mxfp4
, nvfp4
, quantization
, rounding
, transformers
, vllm
License: Apache-2.0
Latest release: 1 day ago
First release: over 1 year ago
Stars: 668 on GitHub
Forks: 56 on GitHub
See more repository details: repos.ecosyste.ms
Last synced: 1 day ago