jjang-ai

jjang-ai/vmlx

Python · Apache-2.0 · active · rising
Health: 88

vMLX - JANGTQ Uber Compressed MLX Models - L2 Disk Cache (survives restart) + L1 Paged (super fast TTFT) + Hybrid SSM Scheduler + Continuous Batching + etc!

Stars: 529
Forks: 65
Open Issues: 26
Contributors: 65
Last Push: 0d ago

Health Breakdown

Activity: 25
Community: 25
Maintenance: 13
Popularity: 25
#anthropic-api #kvcache-compression #kvcache-optimization #kvcache-reuse #llm #lmstudio #macbook #mcp-server #mlx #mlxllm #mlxstudio #omlx #omlx-alternative #openai-api #openclaw #openclaw-agent #persistent-memory #prefix-cache #vmlx

More Python repos

sciencepal
sciencepal/sciencepal
Woooh, it's just me, myself and I
15499
WCGKING
WCGKING/BrandrdXMusic
ᴅɪꜱᴄᴏᴠᴇʀ ᴛʜᴇ ᴇɴᴄʜᴀɴᴛɪɴɢ ᴡᴏʀʟᴅ ᴏꜰ ᴍᴜꜱɪᴄ ᴡɪᴛʜ ⛦🦋ʙʀᴀɴᴅʀᴅ ❥! ᴛʜɪꜱ ᴘʏᴛʜᴏɴ-ʙᴀꜱᴇᴅ ᴛᴇʟᴇɢʀᴀᴍ ᴍᴜꜱɪᴄ ʙᴏᴛ, ᴘᴏᴡᴇʀᴇᴅ ʙʏ ᴘʏʀᴏɢʀᴀᴍ ᴠ2, ʙʀɪɴɢꜱ ʜᴀʀᴍᴏɴʏ ᴛᴏ ʏᴏᴜʀ ᴄʜᴀᴛꜱ. ᴇɴᴊᴏʏ ꜱᴇᴀᴍʟᴇꜱꜱ ᴍᴜꜱɪᴄ ꜱʜᴀʀɪɴɢ ᴀɴᴅ ꜱᴛʀᴇᴀᴍɪɴɢ ɪɴ ꜱᴛʏʟᴇ.
10395
haoyiyin
haoyiyin/basjoo
Open-source AI customer support platform — RAG knowledge base, multi-provider LLM agents, embeddable chat widget. FastAPI + Next.js + R2R + pgvector.
9994