A curated, reusable data asset
A data product is a dataset treated with the same discipline as a shipped product: it has an owner, documented quality and freshness guarantees, and is discoverable by the people who needNeedUserA user need, pain, desire, or constraintView reference → it. The shift it asks for is cultural. The team that generates the data becomes accountable for serving it well to consumers it may never meet.
The term comes from Zhamak Dehghani, who defined data mesh in 2019 while a principal consultant at Thoughtworks and developed its principles through 2020. "Data as a product" is one of the four principles, alongside domain ownership, a self-serve platform, and federated computational governance. The argument is that applying product thinking to analytical data, with owners and consumers and quality bars, fixes the neglect that central data lakes tend to breed. She set it out in full in Data Mesh (O'Reilly, 2022). Practice since has tempered the more ambitious org-restructuring claims, while the data-product unit itself has stuck.
A logistics company's fulfilment domain owns a delivery_performance data product. It publishes a documented table updated hourly, with a stated freshness SLA and a named owner. A finance dashboardDashboardData & AnalyticsAn analytics dashboardView reference → and a customer-facing tracking service both consume it. When finance needs a new field, it files a request against the product's owner rather than reverse-engineering a raw warehouse table. The fulfilment team treats those consumers as customers: a breaking change goes through deprecation, not a surprise.
In the Unified Product Graph, a data product anchors part of the data and analytics region exactly as data mesh frames it. A domain produces it (Data DomainproducesData Producthierarchy), a pipeline feeds it (data_domain_produces_data_productData PipelinefeedsData Productcross-domain), it serves dashboards (data_pipeline_feeds_data_productData ProductservesDashboardcross-domain), and services consume it (data_product_serves_dashboardData Productconsumed byServicecross-domain). Those edges encode the contract Dehghani describes: every data product has a clear producer, a feeding pipeline, and named consumers, so an orphaned product with no consumer is visible at a glance.data_product_consumed_by_service
Type-specific fields on BaseNode
data_product_typestringClassification of the data product (e.g. "report", "dataset", "stream")
sla_freshnessstringFreshness SLA commitment (e.g. "< 1 hour")
idstringrequiredUnique identifier (UUID)
typeNodeTyperequiredDiscriminator for the entity type
titlestringrequiredDisplay name
descriptionstringOptional detailed description
statusstringLifecycle status
tagsstring[]Freeform tags for filtering
4 phases — initial: alpha · template: MATURITY
4 edge types connected to this entity.
data_domain_produces_data_productdata_product_serves_dashboarddata_pipeline_feeds_data_productdata_product_consumed_by_service