Ecosystem & Stack: kubernetes

Distributed AI/ML Computing Framework
python
42.1k

ray-project/ray

A unified framework for scaling AI and Python applications from a laptop to a cluster, providing a distributed runtime and AI libraries.

AI Software Engineering Platform
Python
71.1k

OpenHands/OpenHands

An AI-driven development platform providing an SDK, CLI, GUI, and cloud services to build, run, and scale autonomous software agents for various development tasks.

Open Source AI Platform & Advanced AI Chat Application
docker
26.3k

onyx-dot-app/onyx

Onyx is an open-source AI platform providing a feature-rich interface for Large Language Models, enabling advanced AI chat with RAG, web search, and agentic capabilities.

Replaces:
Details
AI Bot Development Platform
Python
15.8k

langbot-app/LangBot

A production-grade, open-source platform for building and deploying intelligent, agentic instant messaging bots across various chat platforms.

Enterprise AI Platform
Docker
3.5k

archestra-ai/archestra

A secure, enterprise-grade AI platform providing guardrails, a private model registry, and orchestration for managing AI usage, costs, and data security.

Self-hosted AI Chat Platform
Docker
131.4k

open-webui/open-webui

A user-friendly, extensible, and feature-rich self-hosted AI platform supporting various LLM runners like Ollama and OpenAI-compatible APIs, with built-in RAG capabilities.

Replaces:
Details
Vector Database
Go
43.8k

milvus-io/milvus

A high-performance, cloud-native vector database built for scalable Approximate Nearest Neighbor (ANN) search and AI applications.

AI Agent Framework
kubernetes
23.7k

agentscope-ai/agentscope

A production-ready, easy-to-use AI agent framework designed for building, running, and finetuning intelligent agents with transparent and trustworthy capabilities.

AI Development & Deployment Platform
pip
25.9k

modular/modular

A unified, open platform for accelerating AI model serving and scaling GenAI deployments with industry-leading performance across various hardware.

Container Platform
kubernetes
16.9k

kubesphere/kubesphere

A distributed operating system for cloud-native application management, built on Kubernetes, offering full-stack IT operations and streamlined DevOps workflows across multi-cloud, datacenter, and edge environments.

Application Performance Management Tool
java
13.8k

pinpoint-apm/pinpoint

Pinpoint is an open-source APM tool designed for large-scale distributed systems, offering real-time monitoring and code-level visibility to identify performance bottlenecks.

AI Application Development Platform
Python
3.8k

LazyAGI/LazyLLM

A low-code development tool for building and iteratively optimizing multi-agent LLM applications with agility and efficiency.

Terminal-based Log Analysis Tool
Go
2.6k

control-theory/gonzo

Gonzo is a Go-based terminal UI for real-time log analysis, offering interactive dashboards, AI-powered insights, and advanced filtering capabilities.

GPU Cluster Management Platform
Docker
4.8k

gpustack/gpustack

An open-source GPU cluster manager that orchestrates high-performance AI inference engines across diverse environments, optimizing model deployment and resource utilization.

AI Inference Platform
kubernetes
5.3k

kserve/kserve

KServe is a standardized, scalable, and multi-framework platform for deploying and serving both generative and predictive AI models on Kubernetes.

Vector Database
Docker
16.0k

weaviate/weaviate

An open-source, cloud-native vector database enabling semantic search at scale by combining vector similarity with structured filtering and integrated AI models.

AI Infrastructure Management Platform
Python
9.8k

skypilot-org/skypilot

A unified system to run, manage, and scale AI workloads efficiently across any infrastructure, including Kubernetes, Slurm, and over 20 cloud providers, optimizing cost and resource availability.

MLOps Platform / Workflow Orchestrator
Kubernetes
6.9k

flyteorg/flyte

Dynamic, resilient open-source orchestrator for building scalable and reproducible data and ML pipelines on Kubernetes.

AI Experiment Tracking and Visualization Platform
Python
3.8k

SwanHubX/SwanLab

An open-source, modern-design AI training tracking and visualization tool that integrates with 50+ mainstream frameworks, simplifying experiment management for AI teams.

Kubernetes Workflow Engine
kubernetes
16.6k

argoproj/argo-workflows

A container-native workflow engine for orchestrating parallel jobs and multi-step tasks on Kubernetes.

Machine Learning Workflow Orchestration Platform
Kubernetes
4.1k

kubeflow/pipelines

Kubeflow Pipelines is a Kubernetes-native platform for building, deploying, and managing end-to-end machine learning workflows.

MLOps Platform
Kubernetes
3.7k

polyaxon/polyaxon

A comprehensive MLOps platform for managing, orchestrating, and scaling the entire machine learning and deep learning lifecycle.

Identity and Access Management (IAM) System
docker
13.3k

casdoor/casdoor

An open-source, AI-first Identity and Access Management (IAM) solution providing comprehensive authentication, authorization, and user management with a web UI.

Replaces:
Details
CLI Tool for AI/ML Development Environments
Docker
2.2k

tensorchord/envd

envd simplifies the creation of reproducible, container-based development environments for AI/ML projects, streamlining setup and ensuring consistency.

Distributed AI/ML Orchestration Platform
Kubernetes
2.1k

kubeflow/trainer

A Kubernetes-native platform for scalable distributed AI model training and large language model (LLM) fine-tuning across various frameworks.

MLOps Platform
Python
6.6k

clearml/clearml

ClearML is an open-source MLOps/LLMOps solution that streamlines the entire AI workflow, from experiment management and data versioning to pipeline orchestration and model serving.

Workflow Orchestration Platform
Java
3.8k

Netflix/maestro

A general-purpose workflow orchestrator providing a fully managed workflow-as-a-service for data and ML pipelines at scale.

LLM Serving Framework
Docker
12.3k

bentoml/OpenLLM

Self-host and serve any open-source LLM as an OpenAI-compatible API endpoint with ease.

MLOps Platform
Kubernetes
4.7k

SeldonIO/seldon-core

An MLOps and LLMOps framework for deploying, managing, and scaling modular, data-centric AI applications and models on Kubernetes.

Resource List
Kubernetes
2.4k

akuity/awesome-argo

A curated list of awesome projects and resources related to Argo, a CNCF graduated project for deploying and running applications and workloads on Kubernetes.

Cloud-Native MLOps Platform
Kubernetes
4.9k

tencentmusic/cube-studio

A comprehensive, cloud-native, one-stop platform for machine learning, deep learning, and large language model development, covering the entire MLOps lifecycle.

Low-Code AI/ML Framework
python
11.7k

ludwig-ai/ludwig

Ludwig is a low-code, declarative deep learning framework designed to simplify the building, training, and deployment of custom AI models, including LLMs and neural networks.

CI/CD Integration Plugin
Kubernetes
2.3k

jenkinsci/kubernetes-plugin

A Jenkins plugin that enables dynamic provisioning and scaling of build agents in a Kubernetes cluster, optimizing resource utilization.

AI Agent Sandbox Platform
Docker
10.0k

alibaba/OpenSandbox

A secure, fast, and extensible general-purpose sandbox runtime platform for AI agents, offering multi-language SDKs and flexible deployment.

Distributed SQL Database
docker
4.4k

crate/crate

CrateDB is a distributed SQL database combining the benefits of SQL with NoSQL scalability for real-time data storage and analysis.

Distributed Vector Database
kubernetes
2.3k

vearch/vearch

A cloud-native distributed vector database designed for efficient similarity search of embedding vectors in AI applications.

Vector Database Management Tool
Milvus
2.8k

zilliztech/attu

A modern, AI-native management tool for Milvus vector databases, offering multi-cluster management, data exploration, and AI-driven operations.

AI-powered Search & RAG Platform
Qdrant
2.6k

devflowinc/trieve

An all-in-one API platform providing advanced search, recommendations, and Retrieval-Augmented Generation (RAG) capabilities for developers.

GPU Orchestration Platform
git
2.1k

dstackai/dstack

A vendor-agnostic unified control plane for GPU provisioning and orchestration across clouds, Kubernetes, and on-prem for AI/ML workloads.

LLM Inference Server
Docker
3.8k

predibase/lorax

A multi-LoRA inference server designed to efficiently serve thousands of fine-tuned Large Language Models on a single GPU, drastically cutting serving costs while maintaining high throughput and low latency.

AI Gateway & API Management
Python
3.6k

IBM/mcp-context-forge

A unified AI gateway and proxy for federating MCP, A2A, and REST/gRPC APIs, offering centralized discovery, governance, and observability for AI agents and tools.

AI/LLM Observability and Evaluation Platform
Docker
3.2k

langwatch/langwatch

A comprehensive platform for end-to-end testing, simulation, evaluation, and monitoring of LLM-powered agents.

AI Agent Development Framework
Python
7.7k

GetStream/Vision-Agents

A framework for building intelligent, low-latency multi-modal AI agents that can process real-time video and audio using various LLMs and vision models.

High-Performance Data Engine
Python
5.4k

Eventual-Inc/Daft

A high-performance data engine for AI and multimodal workloads, processing diverse data types at scale with Python and Rust.

Self-hosted AI Chatbot
Docker
11.0k

getumbrel/llama-gpt

A private, self-hosted, and offline ChatGPT-like chatbot powered by Llama 2 and Code Llama, ensuring 100% data privacy.

Replaces:
Details
1.3k

kitops-ml/kitops

KitOps is a CNCF open-source MLOps tool for packaging, versioning, and securely sharing AI/ML models, datasets, and code as OCI artifacts.

API Gateway, AI Gateway, Microservices Orchestration Platform
Docker
43.2k

Kong/kong

A high-performance, extensible API and AI Gateway for orchestrating microservices, traditional APIs, and AI/LLM traffic with advanced features like routing, security, and plugins.

Business Process Management (BPM) Platform
Docker
4.1k

camunda/camunda

Orchestrates complex business processes across people, systems, and devices, offering scalable, on-demand process automation.

AI Service Framework
docker
21.9k

jina-ai/serve

A cloud-native framework for building, deploying, and scaling multimodal AI applications and services with gRPC, HTTP, and WebSockets.

AI-powered SRE Agent
kubernetes
2.2k

HolmesGPT/holmesgpt

An open-source AI agent for investigating production incidents and finding root causes across any stack.

Data Orchestration Platform
Docker
14.2k

apache/dolphinscheduler

Apache DolphinScheduler is a modern, low-code data orchestration platform designed for agile creation of high-performance workflows and managing complex data pipeline dependencies.

AI Infrastructure Platform
Docker
2.3k

instill-ai/instill-core

An end-to-end AI infrastructure platform for data, model, and pipeline orchestration, designed to streamline the development of versatile AI-first applications.

Developer Platform & Workflow Automation
Docker
16.3k

windmill-labs/windmill

An open-source developer platform that transforms scripts into webhooks, workflows, and UIs, accelerating the creation of internal tools and automation.

Cloud-Native Observability Search Engine
Kubernetes
11.1k

quickwit-oss/quickwit

Quickwit is a cloud-native, open-source search engine designed for high-performance observability data (logs, traces, metrics) with sub-second queries on cloud storage.

Test Automation Dashboard
Docker
2.8k

sorry-cypress/sorry-cypress

An open-source, free, self-hosted alternative to Cypress Dashboard, enabling unlimited parallel test execution and comprehensive test result management.

Self-hosted Platform as a Service (PaaS)
kubernetes
4.2k

kubero-dev/kubero

A free and self-hosted PaaS that simplifies application deployment on Kubernetes for developers without specialized knowledge.

Unified Secure Access Platform
Docker
3.7k

octelium/octelium

Octelium is a self-hosted, open-source unified zero-trust secure access platform that integrates VPN, ZTNA, API/AI gateway, and PaaS functionalities for secure access and deployment across various environments.

Zero Trust Access Platform
Go
20.2k

gravitational/teleport

Teleport provides secure, unified access to all infrastructure, including servers, Kubernetes, databases, and web applications, enforcing zero-trust principles.

OSS Alternative

Explore the best open source alternatives to commercial software.

© 2026 OSS Alternative. hotgithub.com - All rights reserved.