TOPIC

#decode

Open source repositories tagged with #decode, ranked by health score.

guqiong96/Lvllm
Python
Health: 89

LvLLM is a NUMA extension of vLLM that makes full use of CPU and memory resources to reduce GPU memory requirements. It combines GPU-parallel and NUMA-parallel execution and supports hybrid inference for large MoE models.

370