Site Reliability Engineering
SRE Tools
Error budget calculators, incident timeline analyzers, runbook format checkers, chaos engineering blast radius estimators, and SRE reference tools that run entirely in your browser.
🔒 Browser-only — no data sent
⚡ Zero account required
📦 16 free tools
slo
Error Budget Calculator→
Calculate SLO error budget remaining, burn rate, and time to exhaustion.
incidents
Incident Timeline Analyzer→
Calculate MTTD, MTTR, MTTM from incident event timestamps.
documentation
Runbook Format Checker→
Check runbook content for completeness, actionability, and on-call usability.
chaos
Chaos Engineering Blast Radius Estimator→
Estimate the blast radius of a chaos experiment from traffic percentage and dependency fan-out.
slo
SLI Definition Reference→
Reference guide to SLI types — request success rate, latency, availability, and data freshness.
toil
Toil Calculator→
Calculate weekly toil hours, annual cost, and automation ROI for SRE toil reduction.
slo
Dependency Reliability Calculator→
Calculate composite system availability from dependency availability percentages.
reliability
Load Shedding Threshold Calculator→
Calculate load shedding threshold, current headroom, and 503 budget from service capacity.
incidents
Postmortem Quality Checker→
Check postmortem documents for blameless language, actionable items, and five-whys depth.
dora
Change Failure Rate Calculator→
Calculate DORA change failure rate tier from deployment and failure counts.
slo
Availability SLA Calculator→
Convert availability percentage to allowed downtime minutes and hours per window.
alerting
Capacity Alert Threshold Calculator→
Calculate safe alert thresholds for CPU, memory, disk, and queue depth resources.
on call
On-Call Burden Calculator→
Quantify per-engineer on-call hours, cost, and sustainability.
general
Incident Frequency Benchmark→
Place your team's incidents-per-engineer rate on the Pagerduty State of Digital Operations distribution. Inputs: mont...
general
Retry Storm Impact Calculator→
Compute retry amplification factor and effective load from baseline rate, retry count, backoff model, and failure con...
general
SLO Target Benchmark→
Place an availability SLO target against published SaaS and infrastructure benchmark bands by customer impact and ser...