Ecosystem & Stack: kubernetes
ray-project/ray
A unified framework for scaling AI and Python applications from a laptop to a cluster, providing a distributed runtime and AI libraries.
OpenHands/OpenHands
An AI-driven development platform providing an SDK, CLI, GUI, and cloud services to build, run, and scale autonomous software agents for various development tasks.
onyx-dot-app/onyx
Onyx is an open-source AI platform providing a feature-rich interface for Large Language Models, enabling advanced AI chat with RAG, web search, and agentic capabilities.
langbot-app/LangBot
A production-grade, open-source platform for building and deploying intelligent, agentic instant messaging bots across various chat platforms.
archestra-ai/archestra
A secure, enterprise-grade AI platform providing guardrails, a private model registry, and orchestration for managing AI usage, costs, and data security.
open-webui/open-webui
A user-friendly, extensible, and feature-rich self-hosted AI platform supporting various LLM runners like Ollama and OpenAI-compatible APIs, with built-in RAG capabilities.
milvus-io/milvus
A high-performance, cloud-native vector database built for scalable Approximate Nearest Neighbor (ANN) search and AI applications.
agentscope-ai/agentscope
A production-ready, easy-to-use AI agent framework designed for building, running, and finetuning intelligent agents with transparent and trustworthy capabilities.
modular/modular
A unified, open platform for accelerating AI model serving and scaling GenAI deployments with industry-leading performance across various hardware.
kubesphere/kubesphere
A distributed operating system for cloud-native application management, built on Kubernetes, offering full-stack IT operations and streamlined DevOps workflows across multi-cloud, datacenter, and edge environments.
pinpoint-apm/pinpoint
Pinpoint is an open-source APM tool designed for large-scale distributed systems, offering real-time monitoring and code-level visibility to identify performance bottlenecks.
LazyAGI/LazyLLM
A low-code development tool for building and iteratively optimizing multi-agent LLM applications with agility and efficiency.
control-theory/gonzo
Gonzo is a Go-based terminal UI for real-time log analysis, offering interactive dashboards, AI-powered insights, and advanced filtering capabilities.
gpustack/gpustack
An open-source GPU cluster manager that orchestrates high-performance AI inference engines across diverse environments, optimizing model deployment and resource utilization.
kserve/kserve
KServe is a standardized, scalable, and multi-framework platform for deploying and serving both generative and predictive AI models on Kubernetes.
weaviate/weaviate
An open-source, cloud-native vector database enabling semantic search at scale by combining vector similarity with structured filtering and integrated AI models.
skypilot-org/skypilot
A unified system to run, manage, and scale AI workloads efficiently across any infrastructure, including Kubernetes, Slurm, and over 20 cloud providers, optimizing cost and resource availability.
flyteorg/flyte
Dynamic, resilient open-source orchestrator for building scalable and reproducible data and ML pipelines on Kubernetes.
SwanHubX/SwanLab
An open-source, modern-design AI training tracking and visualization tool that integrates with 50+ mainstream frameworks, simplifying experiment management for AI teams.
argoproj/argo-workflows
A container-native workflow engine for orchestrating parallel jobs and multi-step tasks on Kubernetes.
kubeflow/pipelines
Kubeflow Pipelines is a Kubernetes-native platform for building, deploying, and managing end-to-end machine learning workflows.
polyaxon/polyaxon
A comprehensive MLOps platform for managing, orchestrating, and scaling the entire machine learning and deep learning lifecycle.
casdoor/casdoor
An open-source, AI-first Identity and Access Management (IAM) solution providing comprehensive authentication, authorization, and user management with a web UI.
tensorchord/envd
envd simplifies the creation of reproducible, container-based development environments for AI/ML projects, streamlining setup and ensuring consistency.
kubeflow/trainer
A Kubernetes-native platform for scalable distributed AI model training and large language model (LLM) fine-tuning across various frameworks.
clearml/clearml
ClearML is an open-source MLOps/LLMOps solution that streamlines the entire AI workflow, from experiment management and data versioning to pipeline orchestration and model serving.
Netflix/maestro
A general-purpose workflow orchestrator providing a fully managed workflow-as-a-service for data and ML pipelines at scale.
bentoml/OpenLLM
Self-host and serve any open-source LLM as an OpenAI-compatible API endpoint with ease.
SeldonIO/seldon-core
An MLOps and LLMOps framework for deploying, managing, and scaling modular, data-centric AI applications and models on Kubernetes.
akuity/awesome-argo
A curated list of awesome projects and resources related to Argo, a CNCF graduated project for deploying and running applications and workloads on Kubernetes.
tencentmusic/cube-studio
A comprehensive, cloud-native, one-stop platform for machine learning, deep learning, and large language model development, covering the entire MLOps lifecycle.
ludwig-ai/ludwig
Ludwig is a low-code, declarative deep learning framework designed to simplify the building, training, and deployment of custom AI models, including LLMs and neural networks.
jenkinsci/kubernetes-plugin
A Jenkins plugin that enables dynamic provisioning and scaling of build agents in a Kubernetes cluster, optimizing resource utilization.
alibaba/OpenSandbox
A secure, fast, and extensible general-purpose sandbox runtime platform for AI agents, offering multi-language SDKs and flexible deployment.
crate/crate
CrateDB is a distributed SQL database combining the benefits of SQL with NoSQL scalability for real-time data storage and analysis.
vearch/vearch
A cloud-native distributed vector database designed for efficient similarity search of embedding vectors in AI applications.
zilliztech/attu
A modern, AI-native management tool for Milvus vector databases, offering multi-cluster management, data exploration, and AI-driven operations.
devflowinc/trieve
An all-in-one API platform providing advanced search, recommendations, and Retrieval-Augmented Generation (RAG) capabilities for developers.
dstackai/dstack
A vendor-agnostic unified control plane for GPU provisioning and orchestration across clouds, Kubernetes, and on-prem for AI/ML workloads.
predibase/lorax
A multi-LoRA inference server designed to efficiently serve thousands of fine-tuned Large Language Models on a single GPU, drastically cutting serving costs while maintaining high throughput and low latency.
IBM/mcp-context-forge
A unified AI gateway and proxy for federating MCP, A2A, and REST/gRPC APIs, offering centralized discovery, governance, and observability for AI agents and tools.
langwatch/langwatch
A comprehensive platform for end-to-end testing, simulation, evaluation, and monitoring of LLM-powered agents.
GetStream/Vision-Agents
A framework for building intelligent, low-latency multi-modal AI agents that can process real-time video and audio using various LLMs and vision models.
Eventual-Inc/Daft
A high-performance data engine for AI and multimodal workloads, processing diverse data types at scale with Python and Rust.
getumbrel/llama-gpt
A private, self-hosted, and offline ChatGPT-like chatbot powered by Llama 2 and Code Llama, ensuring 100% data privacy.
kitops-ml/kitops
KitOps is a CNCF open-source MLOps tool for packaging, versioning, and securely sharing AI/ML models, datasets, and code as OCI artifacts.
Kong/kong
A high-performance, extensible API and AI Gateway for orchestrating microservices, traditional APIs, and AI/LLM traffic with advanced features like routing, security, and plugins.
camunda/camunda
Orchestrates complex business processes across people, systems, and devices, offering scalable, on-demand process automation.
jina-ai/serve
A cloud-native framework for building, deploying, and scaling multimodal AI applications and services with gRPC, HTTP, and WebSockets.
HolmesGPT/holmesgpt
An open-source AI agent for investigating production incidents and finding root causes across any stack.
apache/dolphinscheduler
Apache DolphinScheduler is a modern, low-code data orchestration platform designed for agile creation of high-performance workflows and managing complex data pipeline dependencies.
instill-ai/instill-core
An end-to-end AI infrastructure platform for data, model, and pipeline orchestration, designed to streamline the development of versatile AI-first applications.
windmill-labs/windmill
An open-source developer platform that transforms scripts into webhooks, workflows, and UIs, accelerating the creation of internal tools and automation.
quickwit-oss/quickwit
Quickwit is a cloud-native, open-source search engine designed for high-performance observability data (logs, traces, metrics) with sub-second queries on cloud storage.
sorry-cypress/sorry-cypress
An open-source, free, self-hosted alternative to Cypress Dashboard, enabling unlimited parallel test execution and comprehensive test result management.
kubero-dev/kubero
A free and self-hosted PaaS that simplifies application deployment on Kubernetes for developers without specialized knowledge.
octelium/octelium
Octelium is a self-hosted, open-source unified zero-trust secure access platform that integrates VPN, ZTNA, API/AI gateway, and PaaS functionalities for secure access and deployment across various environments.
gravitational/teleport
Teleport provides secure, unified access to all infrastructure, including servers, Kubernetes, databases, and web applications, enforcing zero-trust principles.