When Your Cluster Is on Fire
12 prompts for the 2 AM PagerDuty alert. Paste your kubectl output, get a diagnosis and fix.
What's Inside
- CrashLoopBackOff Debugger — Analyzes pod events + logs to identify the exact failure
- OOMKilled Analyzer — Memory profiling recommendations with resource limit calculations
- Ingress 502/503 Resolver — Traces the request path from ingress → service → pod to find the break
- Certificate Rotation Planner — Generates cert-manager renewal strategy with zero-downtime steps
- Node Pressure Mitigator — Disk/memory/PID pressure diagnosis with eviction priority recommendations
- HPA Runaway Stopper — Analyzes scaling behavior and recommends stabilization window + target tuning
- + 6 more: network policy debugger, PV/PVC troubleshooter, RBAC permission tracer, helm upgrade rollback guide, resource quota optimizer, cluster upgrade pre-flight check