Question 1

Should MTTR include detection time or just recovery time?

Accepted Answer

MTTR (Mean Time to Recover, or Mean Time to Resolve) is traditionally measured from incident detection (the alert firing) to resolution. Some organizations instead measure it from incident start (when the issue actually began), which adds the detection gap. The second approach produces higher MTTR values but more honestly reflects customer impact. Whichever you choose, document the definition clearly: inconsistent definitions between teams make MTTR comparisons meaningless.
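To make the difference between the two definitions concrete, here is a minimal Python sketch that computes mean and median recovery time both ways. The incident timestamps, the `INCIDENTS` list, and the `durations_minutes` helper are hypothetical and exist only for illustration; they are not taken from any real incident log.

```python
from datetime import datetime
from statistics import mean, median

# Hypothetical incidents as (started, detected, resolved). Illustrative values only.
INCIDENTS = [
    (datetime(2024, 1, 5, 10, 0), datetime(2024, 1, 5, 10, 10), datetime(2024, 1, 5, 10, 40)),
    (datetime(2024, 2, 12, 14, 0), datetime(2024, 2, 12, 14, 5), datetime(2024, 2, 12, 14, 50)),
    (datetime(2024, 3, 20, 2, 0), datetime(2024, 3, 20, 3, 0), datetime(2024, 3, 20, 6, 0)),
]


def durations_minutes(incidents, from_start):
    """Return per-incident durations in minutes.

    from_start=False measures detection -> resolution (the traditional definition).
    from_start=True measures start -> resolution (includes the detection gap).
    """
    result = []
    for started, detected, resolved in incidents:
        origin = started if from_start else detected
        result.append((resolved - origin).total_seconds() / 60)
    return result


from_detection = durations_minutes(INCIDENTS, from_start=False)
from_start = durations_minutes(INCIDENTS, from_start=True)

print("From detection: mean", mean(from_detection), "median", median(from_detection))
print("From start:     mean", mean(from_start), "median", median(from_start))
```

With these made-up values, the detection-based figures are a mean of 85 minutes and a median of 45 minutes, while the start-based figures are 110 and 50 minutes. The single long incident pulls the mean well above the median under either definition, which is one reason to report both.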

Question 2

What is the relationship between MTTR and deployment frequency?

Accepted Answer

DORA research shows that high-performing teams combine high deployment frequency with low MTTR; the two metrics are correlated. Teams that deploy frequently get more practice at incident response, have smaller change sets to diagnose, and have faster rollback paths. Teams that deploy infrequently ship larger changes with more complex failure modes, which lengthens MTTR. Increasing deployment frequency is therefore one of the highest-leverage investments for reducing MTTR.

MTTR Calculation for Q1 Production Incidents: Mean and Median Analysis

Worked example

Frequently asked questions