Error watch

Detection that knows the difference between weird and normal.

Static CloudWatch thresholds fire constantly because they don't know what your normal looks like. Error watch keeps a rolling 6-hour baseline per service, suppresses known recurring patterns, and fires only when something actually changed.

What you'll actually see
error watch · acme-api · last 24h · 3 surfaced · 412 suppressed
suppressed · known noise
  • 14:02 edge-cdn 502 burst — known upstream flap
  • 13:48 search-api timeout pattern — recurring
  • 13:31 auth-svc cold-start spike — within baseline
  • 13:14 primary-db slow-query — on follow-up cooldown
  • 12:45 session-svc 429 — within rate-limit baseline
surfaced · signal
  • 14:21 payments-api p99 11.4s — deploy v3.42.1 correlated
  • 12:08 users-api 5xx +820% — SQS depth ×4.2
  • 09:44 rds-prod-1 IOPS saturation — customer-facing

What changes for the on-call engineer

Alert fatigue, gone

412 known-noise blips suppressed today. Your team saw only 3 pages: the ones that mattered.

Baseline-aware

We know your Tuesday 10am traffic looks different from Saturday 3am. Static thresholds don't.

Memory of past lessons

Mark a finding as known noise once — Radar never wakes you for it again.

How it works

step · 01

Sample

Every 60 seconds, error patterns are extracted from your service log groups.
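
One plausible way to picture the sampling step, as a rough sketch rather than Radar's actual extractor: mask the variable parts of each error line (IDs, numbers) so that similar errors collapse into one pattern, then count patterns per window. The masking rules, the "ERROR" filter, and the names to_pattern and sample_patterns are all assumptions made for illustration.

```python
import re
from collections import Counter

# Hypothetical masking rules: replace variable tokens so similar errors group together.
# Order matters: UUIDs and hex literals are masked before bare digit runs.
_MASKS = [
    (re.compile(r"\b[0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{12}\b", re.I), "<uuid>"),
    (re.compile(r"\b0x[0-9a-f]+\b", re.I), "<hex>"),
    (re.compile(r"\d+"), "<n>"),
]


def to_pattern(message: str) -> str:
    """Reduce one log message to a pattern by masking its variable parts."""
    for regex, token in _MASKS:
        message = regex.sub(token, message)
    return message.strip()


def sample_patterns(log_lines):
    """Count error patterns in one sampling window (for example, 60 seconds of lines)."""
    # Assumption: error lines can be recognized by the literal substring "ERROR".
    return Counter(to_pattern(line) for line in log_lines if "ERROR" in line)
```

Calling sample_patterns on one window of lines returns a Counter keyed by pattern, which is the shape the comparison step needs.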

step · 02

Compare

Rate, novelty, and concentration are compared against the 6-hour rolling baseline.
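
The comparison can be sketched as follows, under stated assumptions. The page does not define the three measures, so this example picks simple ones: rate is the current error count relative to the baseline's mean per-sample count, novelty is the share of current errors whose pattern never appears in the baseline, and concentration is the share taken by the single most frequent pattern. The class name RollingBaseline and the result keys are hypothetical.

```python
from collections import Counter, deque

WINDOW_SAMPLES = 6 * 60  # 6-hour baseline at one sample per minute


class RollingBaseline:
    """Keeps the most recent per-minute pattern counts and compares new samples to them."""

    def __init__(self, size: int = WINDOW_SAMPLES):
        # deque(maxlen=...) drops the oldest sample automatically once the window is full.
        self.samples = deque(maxlen=size)

    def add(self, sample: Counter) -> None:
        self.samples.append(sample)

    def compare(self, current: Counter) -> dict:
        total = sum(current.values())
        seen = set()
        baseline_total = 0
        for sample in self.samples:
            seen.update(sample)
            baseline_total += sum(sample.values())
        mean_rate = baseline_total / len(self.samples) if self.samples else 0.0

        if mean_rate > 0:
            rate_ratio = total / mean_rate
        else:
            # No baseline errors at all: any error is an unbounded increase.
            rate_ratio = float("inf") if total else 1.0

        novel = sum(count for pattern, count in current.items() if pattern not in seen)
        return {
            "rate_ratio": rate_ratio,
            "novelty": novel / total if total else 0.0,
            "concentration": max(current.values()) / total if total else 0.0,
        }
```

In this sketch a sample would be added to the baseline only after it has been compared, so a spike does not mask itself.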

step · 03

Gate

Multiple gates (volume, novelty, customer-impact, deploy proximity) decide: surface or suppress.
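
As an illustration of the gating step, here is a minimal sketch. The four gate names come from the list above, but how they combine is not stated on this page, so the rule here is an assumption: the volume gate must pass, plus at least one other gate. The thresholds, the Finding fields, and the decide function are likewise hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Finding:
    rate_ratio: float                      # current rate relative to baseline
    novelty: float                         # share of errors with unseen patterns, 0..1
    customer_facing: bool                  # whether the service is on a customer path
    minutes_since_deploy: Optional[float]  # None if no recent deploy is known


def decide(f: Finding, min_rate_ratio: float = 3.0, min_novelty: float = 0.2,
           deploy_window_min: float = 30.0):
    """Return ("surface" or "suppress", names of the gates that passed)."""
    gates = {
        "volume": f.rate_ratio >= min_rate_ratio,
        "novelty": f.novelty >= min_novelty,
        "customer_impact": f.customer_facing,
        "deploy_proximity": (f.minutes_since_deploy is not None
                             and f.minutes_since_deploy <= deploy_window_min),
    }
    passed = [name for name, ok in gates.items() if ok]
    # Assumed combination rule: volume is required, plus at least one other gate.
    surface = gates["volume"] and len(passed) >= 2
    return ("surface" if surface else "suppress"), passed
```

For example, decide(Finding(9.2, 0.0, False, 5.0)) surfaces the finding because both the volume and deploy-proximity gates pass, while a novel but low-volume blip is suppressed. Returning the list of passed gates keeps the decision explainable to whoever gets paged.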

Resolve incidents in 30 seconds, not 30 minutes.

Connect your AWS account in read-only mode and let Radar take the next page.