Site Reliability Engineering

SRE Tools

Error budget calculators, incident timeline analyzers, runbook format checkers, chaos engineering blast radius estimators, and SRE reference tools that run entirely in your browser.

🔒 Browser-only — no data sent ⚡ Zero account required 📦 16 free tools
slo
Error Budget Calculator
Calculate SLO error budget remaining, burn rate, and time to exhaustion.
incidents
Incident Timeline Analyzer
Calculate MTTD, MTTR, MTTM from incident event timestamps.
documentation
Runbook Format Checker
Check runbook content for completeness, actionability, and on-call usability.
chaos
Chaos Engineering Blast Radius Estimator
Estimate the blast radius of a chaos experiment from traffic percentage and dependency fan-out.
slo
SLI Definition Reference
Reference guide to SLI types — request success rate, latency, availability, and data freshness.
toil
Toil Calculator
Calculate weekly toil hours, annual cost, and automation ROI for SRE toil reduction.
slo
Dependency Reliability Calculator
Calculate composite system availability from dependency availability percentages.
reliability
Load Shedding Threshold Calculator
Calculate load shedding threshold, current headroom, and 503 budget from service capacity.
incidents
Postmortem Quality Checker
Check postmortem documents for blameless language, actionable items, and five-whys depth.
dora
Change Failure Rate Calculator
Calculate DORA change failure rate tier from deployment and failure counts.
slo
Availability SLA Calculator
Convert availability percentage to allowed downtime minutes and hours per window.
alerting
Capacity Alert Threshold Calculator
Calculate safe alert thresholds for CPU, memory, disk, and queue depth resources.
on call
On-Call Burden Calculator
Quantify per-engineer on-call hours, cost, and sustainability.
general
Incident Frequency Benchmark
Place your team's incidents-per-engineer rate on the Pagerduty State of Digital Operations distribution. Inputs: mont...
general
Retry Storm Impact Calculator
Compute retry amplification factor and effective load from baseline rate, retry count, backoff model, and failure con...
general
SLO Target Benchmark
Place an availability SLO target against published SaaS and infrastructure benchmark bands by customer impact and ser...