Semantic Substrate

Inquire

The .com substrate position for dataset lineage tracking.

A canonical coordinate for the discipline of recording where a dataset came from, how it was transformed, and what systems and models it flows into — the lineage graph applied specifically to data assets.

Coordinated sets this position belongs to — the coverage it extends. Counts are the live cluster size in the graph.

Primary home

Also appears in

Architectural context

Lineage · Cross-Vertical · 2 compound moats. Architectural surface: Lineage.

Layer position: Substrate (L1)

DataLineage

Why this is canonical

'Dataset lineage' is the precise term used in data engineering, MLOps, and AI governance conversations for tracing the lifecycle of a dataset from origin through transformation to consumption. It is distinct from model lineage and pipeline lineage, naming the asset itself.

Where it fits

A few directions this coordinate opens —

Data engineering / governance
Infrastructure for automated lineage tracking across data warehouses, lakes, and pipelines — who touched the data and when.
Data catalog vendors, data observability platforms, data governance tooling
AI/ML provenance
The substrate layer recording which datasets trained which models, enabling reproducibility, audit, and attribution at the asset level.
MLOps vendors, AI auditing platforms, foundation model labs

Illustrative, not exhaustive — held as a transferable canonical position, open to the buyer's own use.