Campaign Delivery Health
Real-time observability powered by Honeycomb + autonomous incident response by AWS DevOps Agent
Service Health (from Honeycomb)
| Service | P50 (ms) | P99 (ms) | Requests | Error Rate | Status |
|---|---|---|---|---|---|
| Loading from Honeycomb... | |||||
Recent DevOps Agent Investigations
Investigation Center
Trigger and monitor AWS DevOps Agent investigations using Honeycomb telemetry
Trigger New Investigation
Incident Simulation
Demonstrate the full Honeycomb β DevOps Agent β Resolution loop with realistic Zeta Global scenarios
Campaign Delivery Latency
Personalization service latency spike affecting real-time campaign delivery for email and push channels.
Identity Graph Failures
503 errors from identity graph service. Audience segmentation degraded, campaigns falling back to broad targeting.
Signal Ingestion Backpressure
Kafka consumer lag exceeding threshold. Real-time behavioral signals delayed, scoring engine using stale data.
Integration Architecture
How AWS DevOps Agent connects to Honeycomb for Zeta Global's observability stack
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β ZETA GLOBAL PLATFORM β
β β
β ββββββββββββββββ ββββββββββββββββ ββββββββββββββββ ββββββββββββββββ β
β β Campaign β β Identity β β Signal β β Delivery β β
β β Router β β Graph β β Ingestion β β Engine β β
β ββββββββ¬ββββββββ ββββββββ¬ββββββββ ββββββββ¬ββββββββ ββββββββ¬ββββββββ β
β β β β β β
β ββββββββββββββββββββ΄βββββββββββββββββββ΄βββββββββββββββββββ β
β β β
β OpenTelemetry SDK β
β (traces, metrics, logs) β
ββββββββββββββββββββββββββββββββββββββ¬βββββββββββββββββββββββββββββββββββββββββ
β
βΌ
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β HONEYCOMB β
β β
β βββββββββββββββ βββββββββββββββ βββββββββββββββ βββββββββββββββ β
β β Traces & β β SLOs & β β BubbleUp β β Triggers & β β
β β Spans β β Burn Alerts β β Analysis β β Webhooks β β
β βββββββββββββββ βββββββββββββββ βββββββββββββββ ββββββββ¬βββββββ β
β β β
β βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ β
β β Honeycomb MCP Server β β
β β (Exposes query, traces, SLOs to AI agents via Model Context Protocol) β
β ββββββββββββββββββββββββββββββββββββ¬βββββββββββββββββββββββββββββββββββ β
βββββββββββββββββββββββββββββββββββββββ¬βββββββββββββββββββββββββββββββββββββββ
β
βββββββββββββββββββββββββββββββββββββββ
β Webhook Trigger βββ MCP Connection β
ββββββββββ¬ββββββββββββββββββββ¬βββββββββ
β β β
βΌ βΌ βΌ
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β AWS DEVOPS AGENT β
β β
β βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ β
β β Agent Space β β
β β β β
β β 1. Receive alert from Honeycomb trigger β β
β β 2. Query Honeycomb traces via MCP (identify slow spans) β β
β β 3. Correlate with deployment data (GitHub/CodeDeploy) β β
β β 4. Analyze patterns using BubbleUp-style correlation β β
β β 5. Identify root cause β β
β β 6. Generate mitigation plan β β
β β 7. Post findings to Slack / create ticket β β
β βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ β
β β
β Connected Tools: β
β ββββββββββββ ββββββββββββ ββββββββββββ ββββββββββββ ββββββββββββ β
β βHoneycomb β β GitHub β β Slack β βCloudWatchβ β Runbooks β β
β β (MCP) β β (MCP) β β (MCP) β β (native) β β (MCP) β β
β ββββββββββββ ββββββββββββ ββββββββββββ ββββββββββββ ββββββββββββ β
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β
βΌ
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β OUTCOMES β
β β
β β’ MTTR reduced from 45 min β 4 min (75% reduction) β
β β’ On-call engineers wake up to root cause, not raw alerts β
β β’ Proactive recommendations prevent repeat incidents β
β β’ Full audit trail of investigation reasoning β
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
Key Integration Points
1. Honeycomb MCP Server
DevOps Agent connects to Honeycomb via the official MCP server. This gives the agent direct access to query traces, analyze SLOs, and run BubbleUp correlations β the same tools your engineers use daily.
Honeycomb MCP β DevOps Agent Space β Tool Registration
2. Webhook Triggers
Honeycomb Triggers fire when SLO budgets burn or error thresholds breach. These webhooks hit the DevOps Agent endpoint with HMAC authentication, starting an autonomous investigation immediately.
Honeycomb Trigger β HMAC Webhook β DevOps Agent Investigation
3. Trace Correlation
During investigation, DevOps Agent queries Honeycomb for the specific traces showing the anomaly. It uses BubbleUp-style analysis to find what changed β deployment version, feature flag, infrastructure shift.
Agent queries β Honeycomb API β Trace analysis β Root cause
4. Resolution Loop
Once root cause is identified, DevOps Agent can trigger rollbacks, scale resources, or create tickets. The resolution is verified by re-querying Honeycomb to confirm metrics return to normal.
Mitigation β Verify via Honeycomb β Close investigation