Tags: #inference-framework
AI/ML Serving Framework
NVIDIA GPUs
26.4k
sgl-project/sglang
SGLang is a high-performance serving framework for large language models and multimodal models, optimizing inference throughput and latency.
SGLang is a high-performance serving framework for large language models and multimodal models, optimizing inference throughput and latency.