Tags: #observability

LLM Engineering Platform

26.1k

langfuse/langfuse

An open-source LLM engineering platform providing observability, metrics, evaluations, and prompt management for developing and debugging AI applications.

llm observability ai

Details

AI Observability and MLOps Platform

Python

19.1k

comet-ml/opik

An open-source platform for comprehensive tracing, evaluation, and optimization of LLM applications, RAG systems, and agentic workflows.

llm observability evaluation

Details

AI Agent Orchestration Platform

10.1k

aden-hive/hive

Hive is a production-grade runtime harness for multi-AI agents, providing state management, fault recovery, observability, and human oversight for reliable business process automation.

ai-agents multi-agent production-ai

Details

Infrastructure Monitoring Platform

Docker

78.6k

netdata/netdata

An open-source, real-time infrastructure monitoring platform providing instant, per-second insights with zero configuration and AI-powered anomaly detection.

observability monitoring real-time

Replaces:

Datadog New Relic...

Details

LLMOps Platform

rust

11.3k

tensorzero/tensorzero

An open-source LLMOps platform unifying LLM gateway, observability, evaluation, optimization, and experimentation for robust AI application development.

llmops llm-gateway observability

Details

AI Gateway

3.4k

looplj/axonhub

AxonHub is an open-source AI gateway that enables seamless integration with over 100 LLMs using any SDK, featuring built-in failover, load balancing, cost control, and end-to-end tracing.

ai gateway llm load balancing

Details

Observability Platform

Docker

7.2k

apache/hertzbeat

An AI-powered open-source real-time observability system for unified metrics, logs, alerting, and notification.

observability monitoring alerting

Replaces:

Datadog New Relic...

Details

Observability Data Pipeline

Rust

21.7k

vectordotdev/vector

A high-performance, end-to-end observability data pipeline that empowers users to collect, transform, and route all their logs and metrics with significant cost reduction and enhanced control.

observability data-pipeline logs

Replaces:

Splunk Elastic Stack

Details

Gen AI Agent Development Framework

Python

2.2k

datapizza-labs/datapizza-ai

A Python framework for building reliable, predictable, and observable Generative AI agents with minimal overhead.

genai llm framework

Details

AI Agent Development and Operations Platform

docker

5.4k

coze-dev/coze-loop

Coze Loop is an open-source platform providing full-lifecycle management for AI agents, covering development, debugging, evaluation, and monitoring to streamline their creation and operation.

ai-agent llm-ops prompt-engineering

Details

AI Engineering Platform

Python

25.8k

mlflow/mlflow

An open-source AI engineering platform for debugging, evaluating, monitoring, and optimizing production-quality AI applications, including agents, LLMs, and ML models.

ai engineering llm mlops

Details

AI Workflow Orchestration Platform

Node.js

14.8k

triggerdotdev/trigger.dev

An open-source platform for building and deploying fully-managed, long-running AI agents and workflows with built-in durability, observability, and elastic scaling using TypeScript.

ai-workflow serverless typescript

Replaces:

AWS Lambda Vercel

Details

Container Platform

kubernetes

16.9k

kubesphere/kubesphere

A distributed operating system for cloud-native application management, leveraging Kubernetes as its kernel for multi-cloud, datacenter, and edge environments.

kubernetes container platform devops

Replaces:

Red Hat OpenShift VMware Tanzu

Details

AI/ML Observability Platform

Python

9.4k

Arize-ai/phoenix

An open-source platform for debugging, evaluating, and monitoring AI/ML models and pipelines.

ai ml observability

Replaces:

Commercial AI Observability Platforms

Details

AI Agent Observability Platform

python

5.5k

AgentOps-AI/agentops

A Python SDK and platform providing comprehensive observability, monitoring, and evaluation tools for AI agents, from prototype to production.

ai agents observability llm

Details

AI Agent Development & Deployment Toolkit

Python

6.3k

GoogleCloudPlatform/agent-starter-pack

A Python package offering production-ready templates and infrastructure for deploying GenAI agents on Google Cloud, now superseded by `agents-cli` for ongoing development.

ai-agents google-cloud genai

Details

Data Orchestration Platform

Python

15.4k

dagster-io/dagster

A cloud-native data pipeline orchestrator designed for the development, production, and observation of data assets, featuring integrated lineage, observability, and a declarative programming model.

orchestration data pipeline etl

Replaces:

Talend Data Integration Informatica PowerCenter...

Details

LLMOps Platform

Docker

3.2k

pezzolabs/pezzo

An open-source, developer-first LLMOps platform for streamlined prompt design, version management, observability, and AI operations.

llmops prompt management observability

Details

AI Gateway & API Management Platform

Python

3.6k

IBM/mcp-context-forge

A unified open-source AI gateway and proxy for federating MCP, A2A, and REST/gRPC APIs, offering centralized governance, discovery, and observability for AI clients and agents.

ai gateway api gateway agent orchestration

Details

AI System Prompt Repository

25.8k

elder-plinius/CL4R1T4S

A public repository of leaked and extracted system prompts from major AI models and agents, promoting transparency and observability in AI systems.

ai transparency system prompts llm

Details

LLMOps Platform

4.1k

Agenta-AI/agenta

An open-source LLMOps platform integrating prompt management, evaluation, and observability to accelerate reliable LLM application development.

llmops prompt engineering llm evaluation

Details

LLM Operations Platform

Node.js

3.2k

langwatch/langwatch

A unified platform for end-to-end LLM evaluation, AI agent testing, monitoring, and optimization, designed to streamline the development and deployment of reliable AI systems.

llm-evaluation ai-agent-testing observability

Details

AI-powered SRE Agent

kubernetes

2.3k

HolmesGPT/holmesgpt

An open-source AI agent for investigating production incidents and finding root causes across any stack.

ai agent sre incident management

Details

Observability Platform

OpenTelemetry

26.7k

SigNoz/signoz

SigNoz is an open-source, OpenTelemetry-native observability platform that unifies logs, traces, and metrics to monitor applications and troubleshoot issues efficiently.

observability apm opentelemetry

Replaces:

Datadog New Relic

Details

Cloud-Native Observability Search Engine

Kubernetes

11.1k

quickwit-oss/quickwit

A cloud-native, open-source search engine optimized for fast, cost-effective observability data (logs, traces, metrics) on cloud storage.

observability search-engine cloud-native

Replaces:

Datadog Elasticsearch...

Details

Observability Platform

18.8k

openobserve/openobserve

An open-source, cost-effective observability platform for logs, metrics, traces, and RUM, offering 140x lower storage costs and single binary deployment as an alternative to commercial solutions.

observability logs metrics

Replaces:

Datadog Splunk...

Details

AI Agent Backend Template

FastAPI

2.2k

wassim249/fastapi-langgraph-agent-production-ready-template

A production-ready FastAPI template for building scalable and secure AI agent applications with LangGraph, handling common infrastructure challenges.

fastapi langgraph ai-agent

Details

AI/Database Integration Server

15.1k

googleapis/mcp-toolbox

Connects AI agents, IDEs, and applications to enterprise databases via an open-source MCP server, offering prebuilt tools and a framework for custom, secure data interaction.

ai database mcp

Details