A record of data origin and transformations
Data lineage is the traceable path of data from where it originates, through every transformation, to where it gets used. Its value shows up in two moments that feel opposite: forward, when you need to know what a change will break, and backward, when a number looks wrong and you need to find where it went bad.
A revenue figure on an executive dashboard reads 12 percent low one morning. With table-level lineage, an analyst learns that five tables feed the dashboard and has to inspect each. With column-level lineage, the trace is sharper: the net_revenue field depends on fx_rate from one upstream table, and that table's last load failed silently overnight. The root cause is found in minutes instead of an afternoon. The same map run forwards answers the other question: if someone drops fx_rate, which downstream models and dashboards go dark?
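The two directions of that trace can be sketched as one graph walk run over the same edges, once as stored and once reversed. This is a minimal illustration, not how any particular lineage tool works: only net_revenue and fx_rate come from the example above, while the table names, the gross_amount column, and the helper functions are hypothetical.

```python
from collections import deque

# Hypothetical column-level lineage: each key feeds the columns in its list.
# Only fx_rate and net_revenue appear in the example; the rest is invented.
COLUMN_FEEDS = {
    "fx_rates.fx_rate": ["revenue_model.net_revenue"],
    "orders.gross_amount": ["revenue_model.net_revenue"],
    "revenue_model.net_revenue": ["exec_dashboard.revenue"],
}


def reverse(edges):
    """Flip every edge so the same walk can run upstream."""
    flipped = {}
    for src, targets in edges.items():
        for dst in targets:
            flipped.setdefault(dst, []).append(src)
    return flipped


def reachable(edges, start):
    """Breadth-first walk returning every column reachable from start."""
    seen, queue = set(), deque([start])
    while queue:
        node = queue.popleft()
        for nxt in edges.get(node, []):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return seen


# Backward: which columns could have corrupted the dashboard figure?
upstream = reachable(reverse(COLUMN_FEEDS), "exec_dashboard.revenue")

# Forward: what goes dark if fx_rate is dropped?
downstream = reachable(COLUMN_FEEDS, "fx_rates.fx_rate")
```

Storing edges in one direction and reversing them on demand keeps a single source of truth, so the root-cause view and the impact view cannot drift apart.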
In the Unified Product Graph, data lineage sits in the data and analytics region and ties tightly to Data Source. A source is traced via lineage (data_source_traced_via_data_lineage, a hierarchy edge), and lineage both draws from a source (data_lineage_sourced_from_data_source, cross-domain) and feeds into one (data_lineage_feeds_data_source, cross-domain). Those two directional edges are the point: lineage is inherently a path with an upstream and a downstream, so the graph models it as a directed connection between sources rather than a flat label, which is what lets you walk it for impact analysis.
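One way to picture that walk is to hop from a source to the lineage nodes that draw from it, then on to the sources those lineage nodes feed. The sketch below assumes each edge is stored as a (from_id, edge_type, to_id) triple whose from side is the lineage node; that storage shape, the node ids, and the downstream_sources helper are all hypothetical. Only the two edge type names come from the graph.

```python
from collections import namedtuple

# Assumed edge shape: the lineage node is always on the "from" side.
Edge = namedtuple("Edge", ["from_id", "edge_type", "to_id"])

SOURCED_FROM = "data_lineage_sourced_from_data_source"
FEEDS_EDGE = "data_lineage_feeds_data_source"

# Hypothetical nodes: two lineage records chaining three sources together.
EDGES = [
    Edge("lineage_fx_to_revenue", SOURCED_FROM, "src_fx_rates"),
    Edge("lineage_fx_to_revenue", FEEDS_EDGE, "src_revenue_model"),
    Edge("lineage_revenue_to_dash", SOURCED_FROM, "src_revenue_model"),
    Edge("lineage_revenue_to_dash", FEEDS_EDGE, "src_exec_dashboard"),
]


def downstream_sources(edges, source_id):
    """Return every data source reachable downstream of source_id."""
    impacted, frontier = set(), [source_id]
    while frontier:
        current = frontier.pop()
        # Lineage nodes that draw from the current source.
        lineages = {
            e.from_id
            for e in edges
            if e.edge_type == SOURCED_FROM and e.to_id == current
        }
        # Sources fed by those lineage nodes are the next hop downstream.
        for e in edges:
            if (
                e.edge_type == FEEDS_EDGE
                and e.from_id in lineages
                and e.to_id not in impacted
            ):
                impacted.add(e.to_id)
                frontier.append(e.to_id)
    return impacted


impacted = downstream_sources(EDGES, "src_fx_rates")
```

Swapping the roles of the two edge types in the same function would give the upstream walk used for root-cause tracing.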
Type-specific fields on BaseNode
transformation (string): Description of how the data is transformed
id (string, required): Unique identifier (UUID)
type (NodeType, required): Discriminator for the entity type
title (string, required): Display name
description (string): Optional detailed description
status (string): Lifecycle status
tags (string[]): Freeform tags for filtering
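As a rough illustration, the fields listed above could be mirrored in a small record type. The class name DataLineageNode, the "data_lineage" discriminator value, and the sample values are assumptions made for this sketch; the reference does not state how NodeType is encoded or what any real node contains.

```python
from dataclasses import dataclass, field
from typing import List, Optional
from uuid import uuid4


@dataclass
class DataLineageNode:
    # Required fields
    id: str     # unique identifier (UUID)
    type: str   # discriminator; the value used below is assumed
    title: str  # display name
    # Optional fields
    description: Optional[str] = None
    status: Optional[str] = None
    tags: List[str] = field(default_factory=list)
    # Type-specific field for data lineage
    transformation: Optional[str] = None


# Hypothetical sample node based on the fx_rate example.
node = DataLineageNode(
    id=str(uuid4()),
    type="data_lineage",
    title="fx_rate to net_revenue",
    transformation="Converts order amounts to the reporting currency using fx_rate",
    tags=["finance"],
)
```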
3 edge types connected to this entity.
data_source_traced_via_data_lineage
data_lineage_sourced_from_data_source
data_lineage_feeds_data_source