What is a Hallucination Report?

A logged instance of an AI model producing false or fabricated output, such as an invented citation, a confident wrong number, or a claim the source never made.

What is the purpose of a Hallucination Report?

A hallucination report turns a one-off AI failure into data. A corpus of these reports builds institutional knowledge about a model's failure modes, providing the evidence that drives evals, prompt improvements, and guardrail design.

How do you use a Hallucination Report in product management?

Log every user-reported or automatically-detected hallucination. Categorise by type (factual error, fabrication, inconsistency). Link to the prompt version and model that produced it.

Where does the concept of a Hallucination Report come from?

The word 'hallucination' for confident, fabricated model output entered wide use with the rise of neural text generation and was cemented during the LLM boom of 2022–2023; the survey 'Survey of Hallucination in Natural Language Generation' (Ji et al., 2022) helped formalise the term. The practice of filing a structured report on a specific hallucination is an operational habit borrowed from bug and incident reporting; it has no single coiner.

What are common mistakes with a Hallucination Report?

Logging a hallucination without the exact prompt and context that produced it gives engineers a symptom they cannot reproduce or fix. Treating each report as an isolated bug, rather than clustering them, hides the systemic failure mode underneath. Reacting to every false output with a one-off prompt patch accumulates brittle rules instead of addressing the model or retrieval gap. And closing reports without verifying the fix against the original case lets the same fabrication quietly return.

👻

Hallucination Report

Q: What is an example of a Hallucination Report?

Fabricated citation: A report logs that the assistant cited a non-existent court case. It links the offending ai_trace, the prompt_version, and the model, and proposes a retrieval guardrail as the fix.

A documented instance where an AI model produced factually incorrect, fabricated, or misleading output.

AI & Machine LearningEngineering & Platformtype: 'hallucination_report'interface: BaseNode

View in Graph

▼On this page

Description Properties Lifecycle Relationships Graph Position Related Entities

Description

A hallucination report is a logged instance of a language model producing output that is false or fabricated: an invented citation, a confident wrong number, a claim the source never made. The report turns a one-off failure into data. A single hallucination is an anecdote; a corpus of reports is the evidence that drives evals and guardrails.

See moreSee less

Origin & evolution

The term entered NLP through image captioning and translation research, where models described objects that were not in the input. The framing that stuck came from Ji et al.'s Survey of Hallucination in Natural Language Generation (ACM Computing Surveys, 2022), which split the phenomenon in two. An intrinsic hallucination contradicts the source the model was given; an extrinsic one cannot be verified from the source at all, true or false. The distinction matters operationally, because the two demand different fixes: intrinsic errors point at how the model uses context, extrinsic ones at what it should refuse to assert.

Later work refined the axis into factuality versus faithfulness, separating disagreement with the real world from disagreement with the provided input. Recent surveys, including a 2025 taxonomy paper, formalise these definitions further. For a product team the value is practical: a report tagged by type routes to the right remedy and feeds a measurable eval rather than a vague sense that the model "makes things up".

How it works in practice

A support assistant tells a user a refund window is 60 days when the policy document loaded into context says 30. A reviewer files a hallucination report: type intrinsic, because it contradicts the supplied source; severity high, because it misstates policy to a customer. Twenty similar reports accumulate over a month. They become a regression set in the next eval_run, and the failure rate on that set drops from 8 percent to under 1 after a retrieval-grounding fix. The report closed the loop from a single bad answer to a guardrail and a tracked metric.

Hallucination report vs. its neighbours

AI guardrail is the control that prevents or catches bad output. The hallucination report is the evidence that a guardrail is missing or leaking; reports motivate guardrails, and guardrails are tested against the reports.
Eval run is a systematic measurement across a test set. A hallucination report is a single logged case; a stream of reports becomes the test set an eval run scores against.
AI model is the system that produced the output. The edge ai_model_flagged_by_hallucination_report attaches the failure to the specific model and version, so a regression after an upgrade is traceable to the change that caused it.

In the graph

In the Unified Product Graph, a hallucination report sits in the AI region and attaches to its source through ai_model_flagged_by_hallucination_report. Logging each instance as a node keeps a model's failure record durable and queryable, so reports can be counted by type, fed into an eval_run, and used to justify an ai_guardrail. The structure turns scattered "the model got this wrong" complaints into an auditable trail from observed failure to measured fix.

Preview

Presets

title

report_type

severity

Significant Has to change approach

user_facing

remediation

Hallucination Report

Builder agent fabricated record field in tool proposal

Report typefabricationSeveritySignificantUser facingtrue

RemediationAdded a post-generation validation step that cross-checks every proposed field against the workspace schema and the description tokens

Properties

Type-specific fields on BaseNode

report_typeenum

Classification

factuallogicalfabricationinconsistency

severityassessment

Impact severity (1 = trivial, 5 = dangerous misinformation)

Severity (5-point) scale →

Mild inconvenience

Notices but works around easily

Annoying

Frustrated but can continue

Significant

Has to change approach

Severe

Struggles to accomplish goal

Blocker

Cannot accomplish goal

user_facingboolean

Visible to end users

remediationstring

Remediation steps

Inherited from BaseNode (6 fields)

idstringrequired

Unique identifier (UUID)

typeNodeTyperequired

Discriminator for the entity type

titlestringrequired

Display name

descriptionstring

Optional detailed description

statusstring

Lifecycle status

tagsstring[]

Freeform tags for filtering

Lifecycle

6 phases, initial: open · template: INCIDENT

All lifecycles

Relationships

4 edge types connected to this entity.

Parents

Entities that can contain this type

AI Modelai_model_flagged_by_hallucination_report

Cross-References

Contextual links across the graph

AI Tracehallucination_report_traces_to_ai_trace

Root Causehallucination_report_caused_by_root_cause

Root Causehallucination_report_has_root_cause

Graph Position

1parent

👻Hallucination Report

3cross-ref

Definition

A hallucination report is a logged instance of an AI model producing fabricated or incorrect output, such as an invented citation or a confidently wrong number. It records the failure mode so it can inform guardrail and prompt changes.

Usage Guidance

Log every user-reported or automatically-detected hallucination.
Categorise by type (factual error, fabrication, inconsistency).
Link to the prompt version and model that produced it.

Anti-Patterns

Logging a hallucination without the exact prompt and context that produced it gives engineers a symptom they cannot reproduce or fix.
Treating each report as an isolated bug, rather than clustering them, hides the systemic failure mode underneath.
Reacting to every false output with a one-off prompt patch accumulates brittle rules instead of addressing the model or retrieval gap.
And closing reports without verifying the fix against the original case lets the same fabrication quietly return.

Examples

Fabricated citation

A report logs that the assistant cited a non-existent court case. It links the offending ai_trace, the prompt_version, and the model, and proposes a retrieval guardrail as the fix.

Invented price

A user is quoted a plan price the product never offered. The report captures the input, the wrong output, and the missing tool call that should have fetched the real figure.

Hallucination Report

A documented instance where an AI model produced factually incorrect, fabricated, or misleading output.

AI & Machine LearningEngineering & Platformtype: 'hallucination_report'interface: BaseNode

View in Graph

▼On this page

Description Properties Lifecycle Relationships Graph Position Related Entities

Description

See moreSee less

Origin & evolution

How it works in practice

Hallucination report vs. its neighbours

AI guardrail is the control that prevents or catches bad output. The hallucination report is the evidence that a guardrail is missing or leaking; reports motivate guardrails, and guardrails are tested against the reports.
Eval run is a systematic measurement across a test set. A hallucination report is a single logged case; a stream of reports becomes the test set an eval run scores against.
AI model is the system that produced the output. The edge ai_model_flagged_by_hallucination_report attaches the failure to the specific model and version, so a regression after an upgrade is traceable to the change that caused it.

In the graph

Preview

Presets

title

report_type

severity

Significant Has to change approach

user_facing

remediation

Hallucination Report

Builder agent fabricated record field in tool proposal

Report typefabricationSeveritySignificantUser facingtrue

RemediationAdded a post-generation validation step that cross-checks every proposed field against the workspace schema and the description tokens

Properties

Type-specific fields on BaseNode

report_typeenum

Classification

factuallogicalfabricationinconsistency

severityassessment

Impact severity (1 = trivial, 5 = dangerous misinformation)

Severity (5-point) scale →

Mild inconvenience

Notices but works around easily

Annoying

Frustrated but can continue

Significant

Has to change approach

Severe

Struggles to accomplish goal

Blocker

Cannot accomplish goal

user_facingboolean

Visible to end users

remediationstring

Remediation steps

Inherited from BaseNode (6 fields)

idstringrequired

Unique identifier (UUID)

typeNodeTyperequired

Discriminator for the entity type

titlestringrequired

Display name

descriptionstring

Optional detailed description

statusstring

Lifecycle status

tagsstring[]

Freeform tags for filtering

Lifecycle

6 phases, initial: open · template: INCIDENT

All lifecycles

Relationships

4 edge types connected to this entity.

Parents

Entities that can contain this type

AI Modelai_model_flagged_by_hallucination_report

Cross-References

Contextual links across the graph

AI Tracehallucination_report_traces_to_ai_trace

Root Causehallucination_report_caused_by_root_cause

Root Causehallucination_report_has_root_cause

Graph Position

1parent

👻Hallucination Report

3cross-ref

Definition

Usage Guidance

Log every user-reported or automatically-detected hallucination.
Categorise by type (factual error, fabrication, inconsistency).
Link to the prompt version and model that produced it.

Anti-Patterns

Logging a hallucination without the exact prompt and context that produced it gives engineers a symptom they cannot reproduce or fix.
Treating each report as an isolated bug, rather than clustering them, hides the systemic failure mode underneath.
Reacting to every false output with a one-off prompt patch accumulates brittle rules instead of addressing the model or retrieval gap.
And closing reports without verifying the fix against the original case lets the same fabrication quietly return.

Examples

Fabricated citation

A report logs that the assistant cited a non-existent court case. It links the offending ai_trace, the prompt_version, and the model, and proposes a retrieval guardrail as the fix.

Invented price

A user is quoted a plan price the product never offered. The report captures the input, the wrong output, and the missing tool call that should have fetched the real figure.