Tags: #high-performance
vllm-project/vllm
A high-throughput and memory-efficient open-source engine designed for fast, easy, and cost-effective serving of large language models.
PaddlePaddle/FastDeploy
A high-performance inference and deployment toolkit for Large Language Models (LLMs) and Vision-Language Models (VLMs) based on PaddlePaddle.
kvcache-ai/Mooncake
A KVCache-centric disaggregated architecture for high-performance LLM serving, powering leading AI services.
qdrant/qdrant
Qdrant is a high-performance, massive-scale vector database and search engine designed for next-generation AI applications.
vortex-data/vortex
Vortex is a next-generation, high-performance, and extensible columnar file format and toolkit designed for blazing-fast data processing and storage.
mcmonkeyprojects/SwarmUI
A modular, high-performance web UI for AI image and video generation, emphasizing accessible powertools and extensibility.
rathole-org/rathole
A lightweight, high-performance, and secure reverse proxy written in Rust, designed to expose services behind NAT to the internet.
Stability-AI/StableSwarmUI
A modular, high-performance, and extensible web user interface for Stable Diffusion, emphasizing easy access to powerful tools for both beginners and advanced users.
questdb/questdb
QuestDB is a high-performance, open-source time-series database designed for blazingly fast data ingestion and low-latency SQL queries.