What is the purpose of a Test Plan?

Test plans prevent post-hoc rationalisation. By documenting what "success" looks like before running an experiment, teams avoid interpreting ambiguous results as confirmation of whatever they hoped to prove. In the Unified Product Graph, test plans connect hypotheses to experiments.

Where does the concept of a Test Plan come from?

Test plans originate in software QA and were adapted for product validation by lean startup practitioners. The concept of planning tests before running them (defining method, sample size, and success criteria upfront) draws from scientific methodology and statistical hypothesis testing.

What is an example of a Test Plan?

Onboarding guided tour test plan: Method: A/B test. Control: current onboarding. Variant: 3-step guided tour. Sample: 500 users per variant. Success: variant activation rate > control by ≥ 5pp at 95% confidence.

📝 Test Plan

A structured plan for testing a hypothesis

Validation · core · type: 'test_plan' · interface: BaseNode

Description

A test plan is the written specification for validating a single hypothesis: the method you will use, the metric that counts as success, the sample you will run it against, and the threshold at which you stop and decide. It exists because a hypothesis on its own is just a sentence. The plan is what turns a belief into something a team can run, observe, and be wrong about on purpose.

Origin & evolution

The discipline traces to the lean startup movement's insistence that beliefs be tested cheaply before they are built expensively. The clearest codification arrived with David J. Bland and Alexander Osterwalder's Testing Business Ideas (2019), which catalogued 44 experiment types organised by cost, time, and strength of evidence, each framed as a way to attack a stated hypothesis under desirability, feasibility, or viability.

Bland's earlier work, developed alongside Jeff Gothelf and Josh Seiden, sharpened the targeting logic. Assumptions mapping ranks a team's beliefs by how risky each one is and how little evidence supports it, so that testing effort lands on the assumption whose failure would sink the whole idea. That assumption became known as the riskiest assumption, and testing it first is the economic case for writing a test plan at all: you spend on evidence where the payoff of being wrong early is highest.

The thinking has settled on a separation that older "test everything" habits blurred. A test plan is scoped to one hypothesis and one decision. It names success before the test runs, which is the discipline that stops teams from reading whatever result they get as confirmation.

How it works in practice

A subscription team believes that solo founders will pay £19 a month for an automated bookkeeping feature. That belief is the riskiest assumption on their map: high impact, thin evidence. The test plan reads as follows. Method: a pricing page with a live checkout, driven by a £400 ad spend. Sample: 600 visitors from the solo-founder segment. Success criterion: at least 4% click through to checkout and 1.5% complete payment intent. Decision rule: below 1.5%, the feature is reframed or dropped; at or above, it proceeds to a built prototype.

The test runs for nine days. Click-through lands at 5.1%, but payment-intent completion stalls at 0.7%. The pre-committed rule does its job: interest is real, willingness to pay at £19 is not. The team learns this for £400 rather than for a quarter of engineering.
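The pre-committed rule in this example can be written as data plus a small function, so the verdict is mechanical once the numbers arrive. The following is a minimal TypeScript sketch and not part of the Unified Product Graph schema: the names `PreCommittedRule`, `ObservedRates`, and `decide` are hypothetical, and it assumes, as the decision rule above states, that the decision hinges on payment intent while click-through is reported alongside it.

```typescript
// Hypothetical shapes for illustration only; not part of the graph schema.
interface PreCommittedRule {
  minClickThroughRate: number;  // 0.04 means 4% click through to checkout
  minPaymentIntentRate: number; // 0.015 means 1.5% complete payment intent
}

interface ObservedRates {
  clickThroughRate: number;
  paymentIntentRate: number;
}

type Decision = "proceed_to_prototype" | "reframe_or_drop";

interface Verdict {
  interestConfirmed: boolean;         // click-through threshold met
  willingnessToPayConfirmed: boolean; // payment-intent threshold met
  decision: Decision;
}

// Applies the rule exactly as written before the run. No threshold is
// adjusted after the data arrives.
function decide(rule: PreCommittedRule, observed: ObservedRates): Verdict {
  const interestConfirmed =
    observed.clickThroughRate >= rule.minClickThroughRate;
  const willingnessToPayConfirmed =
    observed.paymentIntentRate >= rule.minPaymentIntentRate;
  return {
    interestConfirmed,
    willingnessToPayConfirmed,
    decision: willingnessToPayConfirmed
      ? "proceed_to_prototype"
      : "reframe_or_drop",
  };
}

// Thresholds and observed rates taken from the worked example above.
const rule: PreCommittedRule = {
  minClickThroughRate: 0.04,
  minPaymentIntentRate: 0.015,
};
const verdict = decide(rule, {
  clickThroughRate: 0.051,
  paymentIntentRate: 0.007,
});
console.log(verdict.decision); // "reframe_or_drop"
```

Keeping the thresholds in a value that is created before the run, and never edited afterwards, is one way to make the "name success first" discipline visible in code review.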

Test plan vs. its neighbours

Experiment plan designs the experiment itself: variables, control, how the run is instrumented. A test plan starts from a hypothesis and asks whether to keep believing it. One specifies the apparatus; the other specifies the verdict.
Research plan runs a discovery study to understand a problem or a user, with open questions and qualitative analysis. A test plan runs validation against a claim you already hold, with a success threshold set in advance. Research asks "what is going on here"; a test plan asks "is this specific belief true enough to build on".
Experiment run is the execution: the actual nine days, the actual 600 visitors, the recorded numbers. The test plan is the document that the run instantiates, written before any data exists.

In the graph

In the Unified Product Graph, test_plan sits in the validation region as the bridge between a belief and its evidence. A hypothesis connects down to it via hypothesis_planned_via_test_plan, and the plan connects forward to its execution via test_plan_ran_as_experiment_run. Both edges are hierarchical, which encodes the real dependency: a plan that validates no hypothesis is busywork, and a plan that never ran as an experiment is an intention with no outcome. The structure makes the riskiest-assumption discipline queryable, because you can ask which hypotheses still lack a plan and which plans still lack a run.
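The two queries mentioned above can be sketched over a plain in-memory list of nodes and edges. This is an illustrative TypeScript sketch under stated assumptions: the storage model, the `from`/`to` edge fields, the helper `lackingChild`, and the node type strings `hypothesis` and `experiment_run` are hypothetical, since only `test_plan` and the two edge names appear in this entry. Edges are assumed to point from parent to child.

```typescript
// Assumed node type names; only "test_plan" is confirmed by this entry.
type NodeType = "hypothesis" | "test_plan" | "experiment_run";

type EdgeType =
  | "hypothesis_planned_via_test_plan"
  | "test_plan_ran_as_experiment_run";

interface GraphNode {
  id: string;
  type: NodeType;
  title: string;
}

interface GraphEdge {
  type: EdgeType;
  from: string; // parent node id (assumed direction)
  to: string;   // child node id
}

// Returns every node of `nodeType` that has no outgoing edge of `edgeType`.
function lackingChild(
  nodes: GraphNode[],
  edges: GraphEdge[],
  nodeType: NodeType,
  edgeType: EdgeType
): GraphNode[] {
  return nodes.filter(
    (n) =>
      n.type === nodeType &&
      !edges.some((e) => e.type === edgeType && e.from === n.id)
  );
}

// Made-up sample data: h1 is planned and run, h2 is planned but not run,
// h3 has no plan at all.
const nodes: GraphNode[] = [
  { id: "h1", type: "hypothesis", title: "Guided tour lifts activation" },
  { id: "h2", type: "hypothesis", title: "Solo founders pay £19 a month" },
  { id: "h3", type: "hypothesis", title: "Teams want shared dashboards" },
  { id: "p1", type: "test_plan", title: "Onboarding A/B test" },
  { id: "p2", type: "test_plan", title: "Pricing page test" },
  { id: "r1", type: "experiment_run", title: "Onboarding A/B run" },
];
const edges: GraphEdge[] = [
  { type: "hypothesis_planned_via_test_plan", from: "h1", to: "p1" },
  { type: "hypothesis_planned_via_test_plan", from: "h2", to: "p2" },
  { type: "test_plan_ran_as_experiment_run", from: "p1", to: "r1" },
];

const unplanned = lackingChild(nodes, edges, "hypothesis", "hypothesis_planned_via_test_plan");
const unrun = lackingChild(nodes, edges, "test_plan", "test_plan_ran_as_experiment_run");
console.log(unplanned.map((n) => n.id)); // ["h3"]
console.log(unrun.map((n) => n.id));     // ["p2"]
```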

Properties

Type-specific fields on BaseNode

plan_type (string): Test type
sample_size (number): Participants or observations
duration (string): Run duration, for example "2 weeks"
success_criteria (string): Criteria determining whether the test passes

Inherited from BaseNode (6 fields)

id (string, required): Unique identifier (UUID)
type (NodeType, required): Discriminator for the entity type
title (string, required): Display name
description (string): Optional detailed description
status (string): Lifecycle status
tags (string[]): Freeform tags for filtering
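The field lists above can be read as a TypeScript interface. The sketch below is an interpretation rather than the published type definitions. It assumes that fields not marked required are optional, that `NodeType` can be approximated as `string`, and that `status` holds the lifecycle phase name; the `isTestPlan` helper and the placeholder UUID are hypothetical.

```typescript
// Assumption: the real NodeType is probably a union of string literals.
type NodeType = string;

interface BaseNode {
  id: string;           // Unique identifier (UUID)
  type: NodeType;       // Discriminator for the entity type
  title: string;        // Display name
  description?: string; // Optional detailed description
  status?: string;      // Lifecycle status
  tags?: string[];      // Freeform tags for filtering
}

interface TestPlan extends BaseNode {
  type: "test_plan";
  plan_type?: string;        // Test type
  sample_size?: number;      // Participants or observations
  duration?: string;         // Run duration, for example "2 weeks"
  success_criteria?: string; // Criteria determining whether the test passes
}

// Narrows any node to TestPlan using the discriminator field.
function isTestPlan(node: BaseNode): node is TestPlan {
  return node.type === "test_plan";
}

// The onboarding example from this entry, expressed as a node.
const onboardingTourPlan: TestPlan = {
  id: "00000000-0000-4000-8000-000000000001", // placeholder UUID
  type: "test_plan",
  title: "Onboarding guided tour test plan",
  status: "drafted", // assumed to match the initial lifecycle phase
  plan_type: "A/B test",
  sample_size: 500, // per variant in the example; the field does not say which
  success_criteria:
    "variant activation rate > control by ≥ 5pp at 95% confidence",
};

console.log(isTestPlan(onboardingTourPlan)); // true
```

Because `success_criteria` is a free-text string, nothing in the type stops a vague criterion from being stored; the discipline still has to come from the author of the plan.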

Lifecycle

4 phases. Initial phase: drafted.

Relationships

2 edge types connected to this entity.

Parents

Entities that can contain this type

Hypothesis (hypothesis_planned_via_test_plan)

Children

Entities this type can contain

Experiment Run (test_plan_ran_as_experiment_run)

Graph Position

1 parent → 📝 Test Plan → 1 child

Usage Guidance

Define all of the following before running the test: what you're testing, how you'll test it, what sample size is needed to detect the effect you care about at your chosen significance level, and what result would confirm or refute the hypothesis. Treat ambiguous results as "inconclusive", not "positive".
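For a test that compares two conversion rates, the required sample size can be estimated before the run with the standard normal-approximation formula for two proportions. The sketch below is a rough planning aid, not a substitute for a proper power analysis. The function name `sampleSizePerVariant`, the 30% baseline rate, and the defaults (two-sided 95% confidence, 80% power) are assumptions chosen for illustration.

```typescript
// Approximate per-variant sample size for detecting a difference between two
// proportions, using the normal approximation:
//   n = (zAlpha + zBeta)^2 * (p1(1 - p1) + p2(1 - p2)) / (p2 - p1)^2
// zAlpha = 1.96 corresponds to two-sided 95% confidence,
// zBeta = 0.8416 corresponds to 80% power.
function sampleSizePerVariant(
  baselineRate: number,
  minDetectableLift: number, // absolute lift, 0.05 means 5 percentage points
  zAlpha: number = 1.96,
  zBeta: number = 0.8416
): number {
  const p1 = baselineRate;
  const p2 = baselineRate + minDetectableLift;
  const variance = p1 * (1 - p1) + p2 * (1 - p2);
  const n =
    (Math.pow(zAlpha + zBeta, 2) * variance) / Math.pow(minDetectableLift, 2);
  return Math.ceil(n);
}

// Assumed baseline activation rate of 30%, looking for a 5pp lift.
const perVariant = sampleSizePerVariant(0.3, 0.05);
console.log(perVariant); // 1374
```

The result is sensitive to the assumed baseline and to the smallest lift worth detecting, which is a further reason to fix both in the plan before any data exists.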

Think of it as...

Test Plan is like a scientific lab notebook: it forces you to state what you believe, test it, and record what actually happened, so you learn whether your bet was right.

Anti-Patterns

Defining success criteria after the results are in invites moving the goalposts until the data appears to confirm the favoured conclusion.
Planning a test with no clear hypothesis produces activity that generates data nobody knows how to interpret.
Choosing a sample or duration too small to detect the effect guarantees an inconclusive result dressed up as a negative.
Bundling several changes into one test makes any result impossible to attribute to a single cause.

Examples

Onboarding guided tour test plan

Method: A/B test. Control: current onboarding. Variant: 3-step guided tour. Sample: 500 users per variant. Success: variant activation rate > control by ≥ 5pp at 95% confidence.
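One way to apply this success criterion once the run is finished is a pooled two-proportion z-test combined with the 5pp lift threshold. The sketch below is one possible reading of "≥ 5pp at 95% confidence", not a definitive one: it passes only when the observed lift meets the threshold and the z statistic reaches 1.96, and it reports a sufficient but non-significant lift as inconclusive. The names `Arm`, `Outcome`, and `evaluateAbTest` are hypothetical, and the counts are made up.

```typescript
type Outcome = "pass" | "inconclusive" | "fail";

interface Arm {
  users: number;     // users exposed to this arm
  activated: number; // users who activated
}

// Pooled two-proportion z-test plus a minimum absolute lift.
function evaluateAbTest(
  control: Arm,
  variant: Arm,
  minLift: number = 0.05,  // 5 percentage points
  zCritical: number = 1.96 // two-sided 95% confidence
): Outcome {
  const pControl = control.activated / control.users;
  const pVariant = variant.activated / variant.users;
  const lift = pVariant - pControl;

  const pooled =
    (control.activated + variant.activated) / (control.users + variant.users);
  const standardError = Math.sqrt(
    pooled * (1 - pooled) * (1 / control.users + 1 / variant.users)
  );
  const z = lift / standardError;

  if (lift >= minLift && z >= zCritical) return "pass";
  // Lift looks large enough but the sample cannot separate it from noise.
  if (lift >= minLift) return "inconclusive";
  return "fail";
}

// Made-up counts with 500 users per variant, as in the example plan.
const control: Arm = { users: 500, activated: 150 };
console.log(evaluateAbTest(control, { users: 500, activated: 180 })); // "pass"
console.log(evaluateAbTest(control, { users: 500, activated: 176 })); // "inconclusive"
console.log(evaluateAbTest(control, { users: 500, activated: 160 })); // "fail"
```

Returning a three-valued outcome keeps an ambiguous result from being read as a positive one.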
