TOPIC

#long-context

Open source repositories tagged with #long-context, ranked by health score.

KVarN is a native vLLM KV-cache quantization backend for your agents: 3-5x more context, throughput above FP16, and FP16-level accuracy. Calibration-free, one flag.

★ 389