Tags: #observability
langfuse/langfuse
An open-source LLM engineering platform providing observability, metrics, evaluations, and prompt management for developing and debugging AI applications.
comet-ml/opik
An open-source platform for comprehensive tracing, evaluation, and optimization of LLM applications, RAG systems, and agentic workflows.
aden-hive/hive
Hive is a production-grade runtime harness for multi-AI agents, providing state management, fault recovery, observability, and human oversight for reliable business process automation.
netdata/netdata
An open-source, real-time infrastructure monitoring platform providing instant, per-second insights with zero configuration and AI-powered anomaly detection.
tensorzero/tensorzero
An open-source LLMOps platform unifying LLM gateway, observability, evaluation, optimization, and experimentation for robust AI application development.
looplj/axonhub
AxonHub is an open-source AI gateway that enables seamless integration with over 100 LLMs using any SDK, featuring built-in failover, load balancing, cost control, and end-to-end tracing.
apache/hertzbeat
An AI-powered open-source real-time observability system for unified metrics, logs, alerting, and notification.
vectordotdev/vector
A high-performance, end-to-end observability data pipeline that empowers users to collect, transform, and route all their logs and metrics with significant cost reduction and enhanced control.
datapizza-labs/datapizza-ai
A Python framework for building reliable, predictable, and observable Generative AI agents with minimal overhead.
coze-dev/coze-loop
Coze Loop is an open-source platform providing full-lifecycle management for AI agents, covering development, debugging, evaluation, and monitoring to streamline their creation and operation.
mlflow/mlflow
An open-source AI engineering platform for debugging, evaluating, monitoring, and optimizing production-quality AI applications, including agents, LLMs, and ML models.
triggerdotdev/trigger.dev
An open-source platform for building and deploying fully-managed, long-running AI agents and workflows with built-in durability, observability, and elastic scaling using TypeScript.
kubesphere/kubesphere
A distributed operating system for cloud-native application management, leveraging Kubernetes as its kernel for multi-cloud, datacenter, and edge environments.
Arize-ai/phoenix
An open-source platform for debugging, evaluating, and monitoring AI/ML models and pipelines.
AgentOps-AI/agentops
A Python SDK and platform providing comprehensive observability, monitoring, and evaluation tools for AI agents, from prototype to production.
GoogleCloudPlatform/agent-starter-pack
A Python package offering production-ready templates and infrastructure for deploying GenAI agents on Google Cloud, now superseded by `agents-cli` for ongoing development.
dagster-io/dagster
A cloud-native data pipeline orchestrator designed for the development, production, and observation of data assets, featuring integrated lineage, observability, and a declarative programming model.
pezzolabs/pezzo
An open-source, developer-first LLMOps platform for streamlined prompt design, version management, observability, and AI operations.
IBM/mcp-context-forge
A unified open-source AI gateway and proxy for federating MCP, A2A, and REST/gRPC APIs, offering centralized governance, discovery, and observability for AI clients and agents.
elder-plinius/CL4R1T4S
A public repository of leaked and extracted system prompts from major AI models and agents, promoting transparency and observability in AI systems.
Agenta-AI/agenta
An open-source LLMOps platform integrating prompt management, evaluation, and observability to accelerate reliable LLM application development.
langwatch/langwatch
A unified platform for end-to-end LLM evaluation, AI agent testing, monitoring, and optimization, designed to streamline the development and deployment of reliable AI systems.
HolmesGPT/holmesgpt
An open-source AI agent for investigating production incidents and finding root causes across any stack.
SigNoz/signoz
SigNoz is an open-source, OpenTelemetry-native observability platform that unifies logs, traces, and metrics to monitor applications and troubleshoot issues efficiently.
quickwit-oss/quickwit
A cloud-native, open-source search engine optimized for fast, cost-effective observability data (logs, traces, metrics) on cloud storage.
openobserve/openobserve
An open-source, cost-effective observability platform for logs, metrics, traces, and RUM, offering 140x lower storage costs and single binary deployment as an alternative to commercial solutions.
wassim249/fastapi-langgraph-agent-production-ready-template
A production-ready FastAPI template for building scalable and secure AI agent applications with LangGraph, handling common infrastructure challenges.