TOPIC

#nvfp4

Open source repositories tagged with #nvfp4, ranked by health score.

intel/auto-round · Python · health score: 87

A SOTA quantization algorithm for high-accuracy low-bit LLM inference, seamlessly optimized for CPU/XPU/CUDA, with multi-datatype support and full compatibility with vLLM, SGLang, and Transformers.

1.4k