The DORA Metrics
The DORA Metrics
In 2014, the DevOps Research and Assessment (DORA) programme began the largest longitudinal study of software delivery performance ever conducted. By 2023 the dataset covered over 36,000 professionals across thousands of organisations worldwide. The study's central finding: four metrics reliably distinguish high-performing teams from low-performing ones, and those metrics predict both software delivery speed and organisational profitability.
These four metrics — Deployment Frequency, Lead Time for Changes, Mean Time to Restore (MTTR), and Change Failure Rate — are now the industry-standard vocabulary for measuring an engineering team's delivery capability. Every major cloud provider (Google, AWS, Azure) tracks them internally. If you are joining a mature DevOps team, you will be asked about them in your first week.
Metric 1 — Deployment Frequency
Definition: How often does your team deploy to production?
Deployment frequency is a proxy for batch size. Teams that deploy once a quarter are batching months of work into a single risky release. Teams that deploy multiple times per day are shipping tiny increments — each deployment is so small that reverting it is trivial.
DORA performance bands (2023 report):
- Elite: On-demand (multiple deploys per day)
- High: Between once per day and once per week
- Medium: Between once per week and once per month
- Low: Less than once per month
Google's frontend teams deploy to production hundreds of times per day. Amazon deploys to production every 11.7 seconds on average. This is only possible because deployments are automated, tested, and limited in blast radius by feature flags and canary releases.
Metric 2 — Lead Time for Changes
Definition: How long does it take from a commit being merged to that code running in production?
Lead time measures the speed of your feedback loop. A short lead time means developers know quickly whether their code works in production. A long lead time means bugs sit in flight for days or weeks before discovery.
DORA performance bands (2023):
- Elite: Less than one hour
- High: Between one day and one week
- Medium: Between one week and one month
- Low: More than six months
Lead time is not just a speed metric — it is a risk metric. A six-month lead time means your deployment contains six months of untested interactions between changes. A one-hour lead time means you are shipping one small, well-understood change at a time.
Metric 3 — Mean Time to Restore (MTTR)
Definition: When a service incident occurs, how long does it take to restore service to users?
MTTR is the most direct measure of your team's operational resilience. It captures your ability to detect a problem, understand it, and fix or mitigate it quickly. The fix does not have to be a code change — it can be a rollback, a feature flag toggle, a config change, or a traffic reroute.
DORA performance bands (2023):
- Elite: Less than one hour
- High: Less than one day
- Medium: Between one day and one week
- Low: More than six months
Elite teams achieve sub-hour MTTR by maintaining three operational capabilities:
- Fast detection: Alerting that fires within seconds of user-visible impact (not when a human notices).
- Fast rollback: The ability to revert a deployment in under 5 minutes via a single CLI command or dashboard button.
- Feature flags: The ability to disable a broken feature for 0% of users without a deployment at all.
Metric 4 — Change Failure Rate
Definition: What percentage of deployments to production result in a degraded service or require a hotfix/rollback?
Change failure rate measures the quality of your delivery process. It is the ratio of bad deployments to total deployments. Note that a bad deployment is one that causes user-visible impact — not one that has a bug discovered in a code review.
DORA performance bands (2023):
- Elite / High: 0–15%
- Medium: 16–30%
- Low: 16–30% (same band, but correlates with lower frequency)
The most important insight from DORA research: high deployment frequency does not cause high failure rate. Elite performers deploy most often and have the lowest failure rate. The mechanism is batch size — small, frequent changes are easier to test, review, and reason about than large quarterly releases.
Collecting DORA Metrics in Practice
Most organisations instrument DORA metrics by combining three data sources: their CI/CD platform (deployment frequency and lead time), their incident management tool (MTTR), and their deployment log or change management system (change failure rate). The important thing is to automate the collection — manually counting incidents at the end of a quarter produces misleading numbers.
The Speed vs Stability Trade-off — A Myth
A common objection to aiming for high deployment frequency is "we cannot move faster because stability will suffer." DORA data consistently refutes this. Throughput (deployment frequency, lead time) and stability (MTTR, change failure rate) are positively correlated — teams that score high on one tend to score high on the other. The mechanism is again batch size: small changes are inherently less risky.
In Lesson 6 you will look at how these metrics connect to the broader story of infrastructure evolution — understanding where your system runs is as important as understanding how well you deliver changes to it.