Tags: #observability

LLM Engineering Platform
ClickHouse
24.8k

langfuse/langfuse

An open-source LLM engineering platform for developing, monitoring, evaluating, and debugging AI applications.

AI Observability and LLM Development Platform
Python
18.8k

comet-ml/opik

An open-source platform for debugging, evaluating, and monitoring LLM applications, RAG systems, and agentic workflows from prototype to production.

AI Agent Orchestration Platform
10.2k

aden-hive/hive

Hive is a production-grade runtime harness for multi-agent AI systems, providing state management, fault recovery, observability, and human oversight for reliable business process automation.

Infrastructure Monitoring and Observability Platform
Linux
78.4k

netdata/netdata

Netdata provides real-time, AI-powered full-stack observability for infrastructure, offering instant insights with zero configuration.

Generative AI Agent Framework
Python
16.3k

pydantic/pydantic-ai

A Python agent framework built by the Pydantic team to quickly and confidently develop production-grade Generative AI applications with a focus on type safety and observability.

LLMOps Platform
openai-sdk
11.2k

tensorzero/tensorzero

An open-source LLMOps platform unifying LLM gateway, observability, evaluation, optimization, and experimentation for robust AI application development.

AI Gateway
Go
3.1k

looplj/axonhub

An open-source AI gateway enabling seamless integration with 100+ LLMs using any SDK, offering failover, load balancing, cost control, and end-to-end tracing to prevent vendor lock-in.

Observability Platform
Docker
7.2k

apache/hertzbeat

An AI-powered open-source real-time observability system for unified metrics, logs, alerting, and notification.

Observability Data Pipeline
Rust
21.6k

vectordotdev/vector

A high-performance, end-to-end observability data pipeline that empowers users to collect, transform, and route all their logs and metrics with significant cost reduction and enhanced control.

LLM Agent Development Framework
2.2k

datapizza-labs/datapizza-ai

A Python framework for building reliable, observable, and production-ready Generative AI solutions and agents with minimal overhead.

AI Agent Development and Operations Platform
Docker
5.4k

coze-dev/coze-loop

Coze Loop is an open-source, full-lifecycle management platform for AI agent development, debugging, evaluation, and monitoring.

AI/MLOps Platform
Python
25.3k

mlflow/mlflow

An open-source AI engineering platform for managing the complete lifecycle of AI applications, including agents, LLMs, and ML models, from debugging and evaluation to monitoring and optimization.

Backend Orchestration Platform
Docker
15.3k

iii-hq/iii

iii unifies diverse backend tools like API frameworks, task queues, and schedulers into a single engine with Function, Trigger, and Worker primitives, simplifying distributed backend development.

AI Workflow Platform
Node.js
14.4k

triggerdotdev/trigger.dev

Trigger.dev is an open-source platform for building and deploying durable, long-running AI agents and workflows in TypeScript, offering features like retries, queues, and elastic scaling without serverless timeouts.

Container Platform
Kubernetes
16.9k

kubesphere/kubesphere

A distributed operating system for cloud-native application management, built on Kubernetes, offering full-stack IT operations and streamlined DevOps workflows across multi-cloud, datacenter, and edge environments.

AI/ML Observability Platform
Python
9.3k

Arize-ai/phoenix

An open-source platform for comprehensive AI/ML model observability, evaluation, and debugging.

AI Agent Observability Platform
Python
5.5k

AgentOps-AI/agentops

A Python SDK and platform for comprehensive observability, monitoring, and cost management of AI agents and LLM applications.

ML/LLM Observability Framework
Python
7.4k

evidentlyai/evidently

An open-source Python library for evaluating, testing, and monitoring ML and LLM systems across experiments and production environments.

AI Agent Development and Deployment Toolkit
Python
6.2k

GoogleCloudPlatform/agent-starter-pack

Accelerates the development and deployment of production-ready AI agents on Google Cloud with pre-built templates, CI/CD, evaluation, and observability.

Data Orchestration Platform
Python
15.3k

dagster-io/dagster

A cloud-native data pipeline orchestrator designed for the development, production, and observation of data assets, featuring integrated lineage, observability, and a declarative programming model.

LLMOps Platform
Node.js
3.2k

pezzolabs/pezzo

An open-source, developer-first LLMOps platform for streamlining prompt design, version management, observability, and collaboration in AI operations.

Educational Resource
Docker
18.8k

NirDiamant/agents-towards-production

An open-source playbook offering end-to-end, code-first tutorials for building and deploying production-grade GenAI agents from prototype to enterprise scale.

AI Database Integration Server
Go
14.0k

googleapis/mcp-toolbox

Connects AI agents, IDEs, and applications to enterprise databases via an open-source Model Context Protocol (MCP) server, offering prebuilt tools and a framework for custom, secure data interaction.

AI Agent Engineering Platform
Node.js
8.1k

VoltAgent/voltagent

An end-to-end AI Agent Engineering Platform providing an open-source TypeScript framework for building intelligent agents and a console for operations and observability.

AI Gateway & API Management
Python
3.6k

IBM/mcp-context-forge

A unified AI gateway and proxy for federating MCP, A2A, and REST/gRPC APIs, offering centralized discovery, governance, and observability for AI agents and tools.

Data Repository / Transparency Tool
14.7k

elder-plinius/CL4R1T4S

A public repository of leaked and extracted system prompts from major AI models and agents, promoting transparency and observability into their hidden behaviors and biases.

AI/LLM Observability and Evaluation Platform
Docker
3.2k

langwatch/langwatch

A comprehensive platform for end-to-end testing, simulation, evaluation, and monitoring of LLM-powered agents.

Workflow Orchestration Framework
Python
22.2k

PrefectHQ/prefect

Prefect is a Python-based workflow orchestration framework designed to build resilient and dynamic data pipelines, automating complex data processes with features like scheduling, caching, and retries.

AI-powered SRE Agent
Kubernetes
2.2k

HolmesGPT/holmesgpt

An open-source AI agent for investigating production incidents and finding root causes across any stack.

Observability Platform
OpenTelemetry
26.6k

SigNoz/signoz

SigNoz is an open-source, OpenTelemetry-native observability platform that unifies logs, traces, and metrics to monitor applications and troubleshoot issues efficiently.

Cloud-Native Observability Search Engine
Kubernetes
11.1k

quickwit-oss/quickwit

Quickwit is a cloud-native, open-source search engine designed for high-performance observability data (logs, traces, metrics) with sub-second queries on cloud storage.

Observability Platform
Rust
18.6k

openobserve/openobserve

An open-source, cost-effective observability platform for logs, metrics, traces, and RUM, offering significant storage cost savings and high performance.


© 2026 OSS Alternative. hotgithub.com - All rights reserved.