Series: lakehouse-layer-spec · Part 0
Dev Productivity & Tools
Series: Lakehouse Layer Spec (7 layers)
Intro to a 7-post series specifying each Lakehouse layer: Raw, Staging, Curated, Analytics, Metadata & Governance, Query & BI, Consumption.
2026-03-171 min read
Series: lakehouse-layer-spec
This 7-post series describes each layer in a Lakehouse architecture (raw → consumption):
- Layer 1 — Raw: Source data, immutable, ingested from sources.
- Layer 2 — Staging: Cleansing, normalization, validation.
- Layer 3 — Curated: Business and domain enrichment, ready for BI/ML.
- Layer 4 — Analytics: Aggregations, pre-aggregation, KPIs.
- Layer 5 — Metadata & Governance: Catalog, lineage, ownership, policy.
- Layer 6 — Query & BI: SQL engine, views, BI integration.
- Layer 7 — Consumption: Dashboards, APIs, ML serving.
Reference stack: Airbyte, Kafka, Spark, dbt, Iceberg/Delta, MinIO, DataHub, Trino, Superset. Start with Layer 1 — Raw.
