Tags: #disaggregated-architecture
LLM Serving Platform
Python
5.2k
kvcache-ai/Mooncake
A KVCache-centric disaggregated architecture for high-performance LLM serving, powering leading AI services.