DEV Community

# observability

Gaining deep insights into system behavior through metrics, logs, and traces.

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Your Agent Passed Every Eval and Still Cost $4,000 a Day

Your Agent Passed Every Eval and Still Cost $4,000 a Day

1
Comments
5 min read
Why ClickHouse Merges and Mutations Are Difficult to Track in Production

Why ClickHouse Merges and Mutations Are Difficult to Track in Production

2
Comments
3 min read
When Your AI Agent Goes Silent: The Failure Patterns Most Developers Miss

When Your AI Agent Goes Silent: The Failure Patterns Most Developers Miss

Comments
5 min read
Fixing AI Observability: How I Added GenAI Semantic Support for RAG Embedding Spans in Mastra

Fixing AI Observability: How I Added GenAI Semantic Support for RAG Embedding Spans in Mastra

10
Comments
3 min read
OpenTelemetry CNCF Graduation: The Turning Point for Production AI Observability in Kubernetes

OpenTelemetry CNCF Graduation: The Turning Point for Production AI Observability in Kubernetes

Comments
3 min read
Ruby Reactor Now Has Middlewares and OpenTelemetry — Here's Why That Matters

Ruby Reactor Now Has Middlewares and OpenTelemetry — Here's Why That Matters

Comments
3 min read
Why Setting Up Observability Takes Forever (And What To Do About It)

Why Setting Up Observability Takes Forever (And What To Do About It)

Comments
4 min read
You Are Debugging a Distributed System With Single-Process Tools. That Is Why It Takes Days.

You Are Debugging a Distributed System With Single-Process Tools. That Is Why It Takes Days.

Comments
4 min read
Monitoring Video Aggregator Health with a Go Prometheus Exporter

Monitoring Video Aggregator Health with a Go Prometheus Exporter

Comments
11 min read
Troubleshooting Kubernetes Events with TKE and Tencent Cloud CLS

Troubleshooting Kubernetes Events with TKE and Tencent Cloud CLS

Comments
2 min read
hosted coding agents make observability a product feature

hosted coding agents make observability a product feature

Comments
6 min read
Real-Time Monitoring for AI Agents: Beyond Log Streaming

Real-Time Monitoring for AI Agents: Beyond Log Streaming

1
Comments
1 min read
Auditing What Your Email Agent Actually Did

Auditing What Your Email Agent Actually Did

Comments
5 min read
LLM-as-Judge Is Three Decisions

LLM-as-Judge Is Three Decisions

Comments
6 min read
Great Stack to Doesn't Work #9 — Distributed Tracing: "Why Does This Request Take 3 Seconds?"

Great Stack to Doesn't Work #9 — Distributed Tracing: "Why Does This Request Take 3 Seconds?"

1
Comments
9 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.