Tags: #llm-serving
- sgl-project/sglang (Python, 25.7k stars): AI Inference Framework. A high-performance serving framework designed to accelerate inference for large language models and multimodal AI models.
- vllm-project/vllm (Python, 76.3k stars): LLM Inference and Serving Engine. A high-throughput, memory-efficient open-source engine for fast, easy, and cost-effective serving of large language models.
- kvcache-ai/Mooncake (Python, 5.1k stars): LLM Serving Platform. A KVCache-centric disaggregated architecture for high-performance LLM serving, powering leading AI services.
- skyzh/tiny-llm (Python, 4.1k stars): Educational Course. A course for systems engineers that teaches LLM inference serving on Apple Silicon by building a simplified vLLM-like system from scratch with MLX.
- jina-ai/serve (Docker, 21.9k stars): AI Service Framework. A cloud-native framework for building, deploying, and scaling multimodal AI applications and services over gRPC, HTTP, and WebSockets.
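Several of the engines above (vLLM and SGLang among them) expose an OpenAI-compatible HTTP API when run as a server. A minimal client-side sketch of building such a request, using only the standard library; the base URL, port, and model name are placeholders and depend on how the server was launched:

```python
import json
import urllib.request


def build_completion_request(base_url: str, model: str, prompt: str,
                             max_tokens: int = 64) -> urllib.request.Request:
    """Build (but do not send) an OpenAI-style /v1/completions request."""
    payload = {"model": model, "prompt": prompt, "max_tokens": max_tokens}
    return urllib.request.Request(
        url=f"{base_url}/v1/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# Example: a server started locally on port 8000 (a common default; adjust as needed).
req = build_completion_request("http://localhost:8000", "my-model", "Hello")
print(req.full_url)
```

Sending the request with `urllib.request.urlopen(req)` returns a JSON body whose generated text sits under `choices[0].text`, matching the OpenAI completions schema these servers emulate.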