Home/Examples/MTTR Calculation for Q1 Production Incidents: Mean and Median Analysis
● Calculations run locally in your browser. Some site features, such as usage analytics or shared results, may use network requests.
Example — MTTR Calculator (SRE)
MTTR Calculation for Q1 Production Incidents: Mean and Median Analysis
Calculate Mean Time to Recover (MTTR) from Q1 production incident data. Compare mean vs median MTTR, identify outliers, and benchmark against DORA elite performance tiers.
The mean MTTR (47.5 min) is heavily skewed by the 180-minute database incident. Median MTTR (20 min) better represents typical recovery performance. DORA Elite teams achieve median MTTR < 1 hour — this team qualifies. The priority action is improving recovery from the database corruption class of incidents, which alone consumed 63% of Q1 downtime.
What to do next
Post-mortem the 180-minute database incident specifically: what would have made recovery faster? Automated PITR (Point-In-Time Recovery) automation, a tested database recovery runbook, and a recovery fire drill would likely cut that incident to 30-45 minutes. Set a goal: no single incident > 60 minutes in Q2.
Use the MTTR Calculator (SRE) to run this on your own input.
External site · Independent provider · We may receive a commission · Not a recommendation
Frequently asked questions
Should MTTR include detection time or just recovery time?
MTTR (Mean Time to Recover/Resolve) traditionally measures from incident detection (alert firing) to resolution. Some organizations measure MTTR from incident start (when the issue actually began, including the detection gap). The second approach produces higher MTTRs but is more honest about customer impact. Document your definition clearly — inconsistencies between teams make MTTR comparisons meaningless.
What is the relationship between MTTR and deployment frequency?
DORA research shows high-performing teams have both high deployment frequency AND low MTTR — these are correlated. Teams deploying frequently get better at incident response (more practice), have smaller change sets to diagnose, and have faster rollback paths. Teams that deploy infrequently deploy larger changes with more complex failure modes, resulting in longer MTTR. Improving deployment frequency is one of the highest-leverage MTTR improvement investments.