flashinfer-ai/flashinfer
PythonApache-2.0activepopular
Health
FlashInfer: Kernel Library for LLM Serving
Health Breakdown
Activity25
Community25
Maintenance12
Popularity25
#attention#cuda#distributed-inference#gpu#jit#large-large-models#llm-inference#moe#nvidia#pytorch
Community
PythonApache 2.0
activepopular
★ 5.6k979 contributors3d ago