close

DEV Community

# observability

Gaining deep insights into system behavior through metrics, logs, and traces.

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
The 1.4 Seconds That Weren't on Any Span

The 1.4 Seconds That Weren't on Any Span

Comments
7 min read
I Reduced Our Alert Volume by 90% — Here's the Playbook

I Reduced Our Alert Volume by 90% — Here's the Playbook

Comments
2 min read
My verdict layer had two readers. Only one of them had eyes.

My verdict layer had two readers. Only one of them had eyes.

Image 2
Comments
6 min read
ML Observability on EKS: Logs, Metrics and Tracing Head-to-Head

ML Observability on EKS: Logs, Metrics and Tracing Head-to-Head

Comments
11 min read
LLM Observability in Production: From GPU Metrics to Response Quality

LLM Observability in Production: From GPU Metrics to Response Quality

Comments
11 min read
CloudWatch to OTel: Tearing Down the Observability Bridge Pattern

CloudWatch to OTel: Tearing Down the Observability Bridge Pattern

Comments
9 min read
Monitoring LLM costs in production: tokens, tenants, and alerts

Monitoring LLM costs in production: tokens, tenants, and alerts

Comments
9 min read
Good Architecture Includes Observability

Good Architecture Includes Observability

Comments
8 min read
Correlation IDs: Trace a Single Request Across Every Service in Your API

Correlation IDs: Trace a Single Request Across Every Service in Your API

Comments
3 min read
You Can't Reproduce Your Agent's Bugs—That's Why You Can't Fix Them

You Can't Reproduce Your Agent's Bugs—That's Why You Can't Fix Them

Image 2
Comments 2
6 min read
Evaluating LLM Output Quality In Production

Evaluating LLM Output Quality In Production

Image Image Image 6
Comments
10 min read
Scarab Diagnostic Field Test #033 - Prometheus Remote-Write Label Order Boundary

Scarab Diagnostic Field Test #033 - Prometheus Remote-Write Label Order Boundary

Image 1
Comments
5 min read
How do you know if your AI agent is working or just burning money?

How do you know if your AI agent is working or just burning money?

Image 1
Comments 1
3 min read
Structured Logging That Actually Helps Debugging at 3 AM

Structured Logging That Actually Helps Debugging at 3 AM

Comments
8 min read
Deploying Zabbix Open-Source Monitoring Platform on Ubuntu 24.04

Deploying Zabbix Open-Source Monitoring Platform on Ubuntu 24.04

Image Image Image 7
Comments
3 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.