Open source repositories tagged with #high-performance-inference, ranked by health score.
A GPU cluster manager that configures and orchestrates inference engines like vLLM and SGLang for high-performance AI model deployment.