DataHub · Arazzo Workflow
DataHub Trace Dataset Lineage
Version 1.0.0
Confirm a dataset, query its downstream relationships, then batch fetch the related datasets' aspects.
View Spec
View on GitHub
Data CatalogData DiscoveryData GovernanceData LineageMetadataArazzoWorkflows
Provider
Workflows
trace-lineage
Walk downstream lineage from a dataset and hydrate the related entities.
Confirms a dataset exists, queries its DownstreamOf relationships, and batch fetches the aspects of the first downstream dataset discovered.
1
confirmDataset
getEntityLatestAspects
Retrieve the latest aspects for the dataset URN to confirm the entity exists before walking its lineage.
2
queryDownstream
getRelationships
Query the relationship graph for DownstreamOf edges from the dataset URN to discover datasets that depend on it.
3
hydrateRelated
batchGetEntities
Batch fetch the latest aspects for the first downstream dataset discovered to enrich the lineage view.