Sustained 87% IOPS utilization for 6 days. Burst balance trending toward zero. Recommend bump to db.r6g.2xlarge or move read traffic to a replica.
max_connections at 92% during peak. PgBouncer not deployed. Recommend pgbouncer or raising the cap before the next release.
p99 task CPU > 80% for 12 of last 14 weekday peaks. Suggested desired: 8 (currently 6).
swap activity climbing daily. r6i.xlarge would clear it.
p95 CPU < 5% for 30 days. Safe to drop to t3.micro and save ~$38/mo.
Storage growing 4%/wk. No action needed yet.
All groups still on 30-day retention as configured.
The boring stuff — saturation, drift, retention — that never trips an alarm but eventually causes the 3 a.m. page.
Radar tags each finding high / med / low / info, so you know what to fix this sprint vs next quarter.
Reviews land in your team channel with the actionable subset pinned. No portal to remember to check.
Pick a cadence per service: weekly, biweekly, monthly. Defaults to weekly on tier-1.
Radar pulls ECS, RDS, EC2, Atlas, and CloudWatch metrics over the period and runs them through the review prompt.
Past reviews are remembered. Recurring findings get flagged as such — no Groundhog Day.