DORA Metrics

metrics, collection

Four research-backed metrics for measuring software delivery performance, derived from the DORA programme and the book Accelerate, covering deployment frequency, lead time, failure rate, and recovery speed.

How fast, how often, how reliably, and how safely does our team ship software?

Structure

Deployment Frequency

How often you deploy to production

Elite: On demand
High: Daily–Weekly
Medium: Monthly
Low: < Monthly

Lead Time for Changes

Time from commit to production

Elite: < 1 hour
High: < 1 day
Medium: < 1 week
Low: < 6 months

Change Failure Rate

% of deployments causing failures

Elite: 0–5%
High: 5–10%
Medium: 10–15%
Low: 15–50%

Time to Restore

How long to recover from a failure

Elite: < 1 hour
High: < 1 day
Medium: < 1 week
Low: < 6 months

DORA Metrics by Google Cloud (DORA Team). All four metrics use the Metric entity type.

Entities & Properties

Metric (item): 21 properties

Added by this framework

DORA metric (dora_metric, enum)
Which of the four DORA metrics this measures
Values: Deployment Frequency, Lead Time For Changes, Change Failure Rate, Time To Restore

Performance tier (performance_tier, enum)
Where this metric sits on the DORA elite-to-low benchmark
Values: Elite, High, Medium, Low

Core properties (19)

designation (enum)
action (string)
unit_of_analysis (string)
statistical_function (enum)
formula (string)
impact_level (enum)
indicator_direction (enum)
metric_category (enum)
current_value (number)
target_value (number)
unit (string)
range_min (number)
range_max (number)
cadence (enum)
owner (string)
metric_health (enum)
guardrail_threshold_min (number)
guardrail_threshold_max (number)
guardrail_status (enum)

Deep Dive

DORA metrics are four quantitative measures of software delivery and operational performance: Deployment Frequency, Lead Time for Changes, Change Failure Rate, and Time to Restore Service. Together they let an engineering team place itself on a performance spectrum from low to elite, track improvement over time, and make the cost of slow or fragile delivery concrete to a non-technical audience.

Origin & evolution

The metrics come from the DevOps Research and Assessment programme, founded by Nicole Forsgren, Jez Humble, and Gene Kim. The research began in 2013 as an annual State of DevOps survey, co-produced with Puppet, drawing on thousands of responses from software practitioners worldwide. The goal was to move the conversation about DevOps maturity away from tool adoption ("are you using containers?") and onto observable outcomes ("how fast can you ship, and how often do you break things?"). The programme produced a large-scale dataset linking delivery practices to both software performance and organisational outcomes.

Forsgren, Humble, and Kim published the findings in Accelerate: The Science of Lean Software and DevOps in 2018 (IT Revolution Press). The book is notable for grounding its claims in statistical analysis of several years of survey data. It showed that high performers on the four metrics also outperformed on organisational outcomes: higher profitability, higher market share, and better employee satisfaction scores.

Google acquired DORA in 2018. The programme now publishes annual State of DevOps reports and maintains a reference site at dora.dev. In 2021 the DORA team added a fifth metric, Reliability (capturing availability, latency, and error-rate targets), reflecting that delivery speed means little if the system is unhealthy. The four original keys remain the primary reference in most engineering teams' measurement work.

How it works in practice

Each metric measures a different part of the delivery and recovery loop.

Deployment Frequency measures how often an organisation deploys code to production. Elite performers deploy on demand, multiple times per day. Low performers deploy monthly or less. The number is a proxy for batch size: teams deploying frequently are working in small, low-risk increments. High deployment frequency is both a goal and a prerequisite for the other metrics to be meaningful.
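As a rough illustration, Deployment Frequency can be computed from a list of production deployment dates. This is a minimal sketch: the function name `deployments_per_week`, the choice of a weekly rate, and the sample dates are assumptions made for the example and are not part of the DORA definitions.

```python
from datetime import date


def deployments_per_week(deploy_dates: list[date], window_start: date, window_end: date) -> float:
    """Average number of production deployments per week over an inclusive window."""
    days = (window_end - window_start).days + 1
    if days <= 0:
        raise ValueError("window_end must not precede window_start")
    # Count only deployments that fall inside the measurement window.
    in_window = [d for d in deploy_dates if window_start <= d <= window_end]
    return len(in_window) / (days / 7)


# Hypothetical data: one deployment in each of four consecutive weeks.
deploys = [date(2024, 1, 2), date(2024, 1, 9), date(2024, 1, 16), date(2024, 1, 23)]
rate = deployments_per_week(deploys, date(2024, 1, 1), date(2024, 1, 28))
print(rate)  # 1.0
```

Fixing the window explicitly keeps the number comparable from one measurement period to the next.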

Lead Time for Changes measures the elapsed time from a code commit to that code running in production. Elite performers measure this in hours. Low performers measure in months. Lead time reflects the efficiency of the whole pipeline: code review practices, build times, deployment automation, approval gates. Long lead times signal bottlenecks worth finding.
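One way to turn this into a number is to pair each change's commit time with the time it reached production and summarise the differences. The sketch below uses the median, which is an assumption on our part (the text above only specifies "elapsed time"); the name `median_lead_time` and the timestamps are hypothetical.

```python
from datetime import datetime, timedelta
from statistics import median


def median_lead_time(changes: list[tuple[datetime, datetime]]) -> timedelta:
    """Median of (deployed_at - committed_at) across a set of changes."""
    if not changes:
        raise ValueError("at least one change is required")
    seconds = [(deployed - committed).total_seconds() for committed, deployed in changes]
    return timedelta(seconds=median(seconds))


# Hypothetical (committed_at, deployed_at) pairs.
changes = [
    (datetime(2024, 3, 1, 9, 0), datetime(2024, 3, 1, 15, 0)),   # 6 hours
    (datetime(2024, 3, 2, 10, 0), datetime(2024, 3, 4, 10, 0)),  # 48 hours
    (datetime(2024, 3, 5, 8, 0), datetime(2024, 3, 5, 20, 0)),   # 12 hours
]
lead = median_lead_time(changes)
print(lead)  # 12:00:00
```

The median is less sensitive than the mean to a single change that sat in review for weeks, which is why it is used here.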

Change Failure Rate measures the percentage of deployments that cause a production incident or require a rollback. Elite performers target below 5 per cent. Low performers have reported 46 to 60 per cent in published DORA benchmarks, although the exact bands shift between report years. This metric distinguishes speed from recklessness. A team with high deployment frequency and a high failure rate is shipping fast but also breaking things frequently. The goal is high frequency paired with low failure rate.
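The arithmetic is a simple ratio. A minimal sketch follows, assuming that a "failed" deployment is one that caused an incident or was rolled back, as described above; the function name and the sample counts are hypothetical.

```python
def change_failure_rate(total_deployments: int, failed_deployments: int) -> float:
    """Percentage of deployments that caused an incident or required a rollback."""
    if total_deployments <= 0:
        raise ValueError("total_deployments must be positive")
    if not 0 <= failed_deployments <= total_deployments:
        raise ValueError("failed_deployments must be between 0 and total_deployments")
    return 100.0 * failed_deployments / total_deployments


# Hypothetical quarter: 50 deployments, 6 of which caused an incident or rollback.
cfr = change_failure_rate(50, 6)
print(cfr)  # 12.0
```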

Time to Restore Service measures how long it takes to recover from a production failure. Elite performers restore in under an hour. Low performers take between one week and one month. This metric reflects both the quality of incident response (runbooks, on-call practice, observability tooling) and the architectural properties that make recovery fast (the ability to roll back, feature-flag off a bad change, or deploy a hotfix quickly).
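A hedged sketch of the calculation, assuming each incident is recorded with a start time and a restoration time. It uses the mean, one of several reasonable summaries; the name `mean_time_to_restore` and the incident data are hypothetical.

```python
from datetime import datetime, timedelta


def mean_time_to_restore(incidents: list[tuple[datetime, datetime]]) -> timedelta:
    """Mean of (restored_at - started_at) across production incidents."""
    if not incidents:
        raise ValueError("at least one incident is required")
    total = sum((restored - started for started, restored in incidents), timedelta())
    return total / len(incidents)


# Hypothetical (started_at, restored_at) pairs: a 2-hour and a 10-hour outage.
incidents = [
    (datetime(2024, 5, 3, 14, 0), datetime(2024, 5, 3, 16, 0)),
    (datetime(2024, 5, 20, 1, 0), datetime(2024, 5, 20, 11, 0)),
]
mttr = mean_time_to_restore(incidents)
print(mttr)  # 6:00:00
```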

A worked example. A mid-size SaaS company runs a quarterly measurement. Deployment Frequency is once per week. Lead Time is four days from commit to production. Change Failure Rate is 12 per cent. Time to Restore is six hours. Against the bands listed under Structure, this profile sits between "medium" and "high": Lead Time and Change Failure Rate are medium, while Deployment Frequency and Time to Restore are high. The data prompts two conversations: the Change Failure Rate suggests test coverage or review discipline is weak, and the four-day Lead Time suggests a slow CI pipeline or a heavyweight approval process. The team targets both. After a quarter of work, Deployment Frequency has risen to twice weekly, Lead Time has dropped to eighteen hours, and Change Failure Rate has fallen to 6 per cent. Time to Restore is unchanged at six hours, which gives the team a concrete next thread to pull.
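The worked example can be checked metric by metric against the bands listed under Structure. The sketch below is illustrative only: the function names are hypothetical, and the handling of band edges and of "on demand" and "monthly" are assumptions flagged in the comments. Individual metrics can land in different bands, so an overall label is a judgement across the four.

```python
def tier_for_hours(hours: float) -> str:
    """Band for Lead Time for Changes or Time to Restore, given a duration in hours."""
    if hours < 1:
        return "Elite"
    if hours < 24:
        return "High"
    if hours < 24 * 7:
        return "Medium"
    return "Low"


def tier_for_failure_rate(percent: float) -> str:
    # The listed bands share their edges (5%, 10%, 15%); giving an edge value
    # to the better band is an assumption of this sketch.
    if percent <= 5:
        return "Elite"
    if percent <= 10:
        return "High"
    if percent <= 15:
        return "Medium"
    return "Low"


def tier_for_deploys_per_week(per_week: float) -> str:
    # Assumptions: "on demand" is read as more than one deploy per day,
    # and "monthly" as at least twelve deploys per year.
    if per_week > 7:
        return "Elite"
    if per_week >= 1:
        return "High"
    if per_week >= 12 / 52:
        return "Medium"
    return "Low"


def profile(per_week: float, lead_hours: float, failure_pct: float, restore_hours: float) -> dict[str, str]:
    return {
        "Deployment Frequency": tier_for_deploys_per_week(per_week),
        "Lead Time for Changes": tier_for_hours(lead_hours),
        "Change Failure Rate": tier_for_failure_rate(failure_pct),
        "Time to Restore": tier_for_hours(restore_hours),
    }


before = profile(per_week=1, lead_hours=4 * 24, failure_pct=12, restore_hours=6)
after = profile(per_week=2, lead_hours=18, failure_pct=6, restore_hours=6)
print(before)
print(after)
```

Under these assumptions the first measurement mixes High and Medium bands, and the second lands in High on all four.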

When to use it (and when not)

DORA metrics are most valuable in teams that have enough deployment volume to produce meaningful numbers, a baseline of observability (you can see when things fail and when they recover), and the organisational buy-in to act on what the data shows.

They suit engineering organisations at any scale. A team of five using the metrics as a retrospective tool gets value. A platform engineering team using them to benchmark fifty product squads gets more value at the cost of instrumentation investment.

They are less useful in contexts where deployments are genuinely rare by design (regulated industries with mandatory change advisory board processes, hardware-tied firmware releases), because the denominator is too small for Deployment Frequency and Change Failure Rate to be meaningful. In those environments the metrics can still inform the internal software pipeline while acknowledging that the final deployment gate has external constraints.
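To make the small-denominator problem concrete, one option is to put an uncertainty interval around an observed Change Failure Rate. The sketch below uses the Wilson score interval, a standard statistical technique that we are bringing in for illustration; it is not part of the DORA methodology, and the function name and sample counts are hypothetical.

```python
from math import sqrt


def wilson_interval(failures: int, deployments: int, z: float = 1.96) -> tuple[float, float]:
    """Approximate 95% Wilson score interval for a failure proportion (z = 1.96)."""
    if deployments <= 0:
        raise ValueError("deployments must be positive")
    if not 0 <= failures <= deployments:
        raise ValueError("failures must be between 0 and deployments")
    p = failures / deployments
    denom = 1 + z**2 / deployments
    centre = (p + z**2 / (2 * deployments)) / denom
    half_width = z * sqrt(p * (1 - p) / deployments + z**2 / (4 * deployments**2)) / denom
    return centre - half_width, centre + half_width


# Both samples have an observed failure rate of 25 per cent.
small = wilson_interval(1, 4)       # four deployments in the period
large = wilson_interval(100, 400)   # four hundred deployments in the period
print(small, large)
```

The interval for one failure in four deployments is far wider than for a hundred in four hundred, which is the sense in which a rare-deployment team's failure rate says little.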

Three failure modes are common. The first is measuring only Deployment Frequency, because it is the easiest to instrument, and ignoring the others. The second is treating a low Change Failure Rate as the goal at the expense of Deployment Frequency: a team that ships once a quarter can keep its failure rate low simply by taking very few risks. The third is gaming the numbers, for example marking incidents as "planned maintenance" to improve Change Failure Rate. The metrics work as a system, so optimise for all four together.

A further trap is presenting the numbers to leadership as a performance ranking of teams. DORA research is clear that the metrics are improvement tools, not league tables. Teams in different contexts (new product development versus a legacy codebase with twelve years of accumulated debt) are not comparable on the same scale.

In the Unified Product Graph

DORA metrics form a collection framework in the metrics category. All four measures map to the same entity type, reflecting that they are a family of related performance indicators and not a hierarchy:

Deployment Frequency is a metric entity, capturing the rate of production deployments over a measurement window.
Lead Time for Changes is a metric entity, capturing the cycle time from commit to production.
Change Failure Rate is a metric entity, capturing the proportion of deployments that cause an incident or rollback.
Time to Restore Service is a metric entity, capturing the mean recovery time from production incidents.

The Unified Product Graph models all four as metric nodes, which can carry target values, current values, and trend data as properties. Because each metric is a distinct entity, it can link to the capability or practice node that is expected to move it. A metric for Change Failure Rate can link to a feature node representing the investment in automated testing, making the causal chain explicit in the graph.
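The linkage described above might be sketched as follows. This is not the Unified Product Graph's actual schema or API: the classes `MetricNode` and `FeatureNode` and the `moved_by` link are hypothetical, and only the property names `dora_metric`, `performance_tier`, `current_value`, `target_value`, and `unit` are taken from the property list on this page.

```python
from dataclasses import dataclass, field


@dataclass
class FeatureNode:
    """Hypothetical node for a capability or investment expected to move a metric."""
    name: str


@dataclass
class MetricNode:
    """Hypothetical metric node carrying a few of the listed properties."""
    dora_metric: str
    performance_tier: str
    current_value: float
    target_value: float
    unit: str
    moved_by: list[FeatureNode] = field(default_factory=list)

    def gap_to_target(self) -> float:
        """Distance between the current and target values, in the metric's unit."""
        return self.current_value - self.target_value


testing = FeatureNode(name="Automated testing")
cfr_node = MetricNode(
    dora_metric="Change Failure Rate",
    performance_tier="Medium",
    current_value=12.0,
    target_value=5.0,
    unit="%",
    moved_by=[testing],
)
print(cfr_node.gap_to_target())             # 7.0
print([f.name for f in cfr_node.moved_by])  # ['Automated testing']
```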

Examples

Placing a team on the performance spectrum

An engineering org measures its four DORA metrics and finds it deploys fortnightly, takes nine days from commit to production, fails one in five changes, and needs a day to restore service, placing it in the low-to-medium band. Making those numbers concrete to leadership justifies investment in a CI pipeline and automated rollbacks, and the team tracks the same four metrics quarterly to show movement toward elite.

Catching a speed-stability trade-off

A team that pushes Deployment Frequency from weekly to daily watches its Change Failure Rate climb from 8 per cent to 22 per cent, with Time to Restore creeping up too. Reading the four metrics together, rather than celebrating throughput alone, shows that the team is shipping faster by shipping more breakage, so it invests in test coverage to bring the failure rate back down without losing the cadence.
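A comparison of this kind can be automated as a simple check between two measurement periods. The sketch below is illustrative: the `Snapshot` type and function name are hypothetical, daily deployment is taken as seven per week, and the Time to Restore figures are invented, since the example gives no numbers for them.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Snapshot:
    deploys_per_week: float
    change_failure_rate_pct: float
    hours_to_restore: float


def speed_stability_tradeoff(before: Snapshot, after: Snapshot) -> bool:
    """True when throughput rose while at least one stability metric got worse."""
    faster = after.deploys_per_week > before.deploys_per_week
    less_stable = (
        after.change_failure_rate_pct > before.change_failure_rate_pct
        or after.hours_to_restore > before.hours_to_restore
    )
    return faster and less_stable


# Weekly to daily deployment, failure rate 8% to 22%; restore times are hypothetical.
weekly = Snapshot(deploys_per_week=1, change_failure_rate_pct=8, hours_to_restore=4.0)
daily = Snapshot(deploys_per_week=7, change_failure_rate_pct=22, hours_to_restore=5.0)
print(speed_stability_tradeoff(weekly, daily))  # True
```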

In short

A fitness tracker for engineering teams: four numbers that together tell you whether you are shipping fast, shipping safely, and recovering quickly when things go wrong.

When to use

When you need to know how often you deploy to production (Deployment Frequency)
When you need to know how long a change takes to get from commit to production (Lead Time for Changes)
When setting up a measurement framework from scratch
When current metrics aren't capturing what actually matters
When you need four key metrics for software delivery performance: deployment frequency, lead time, failure rate, and recovery speed

When not to use

When you don't have the instrumentation to track the metrics
For feature delivery decisions; metrics inform strategy, not sprint execution

Origin

DORA Team (Google Cloud)

