MHP

Hands-On Exercise — Databricks Pipeline

1
Bronze Ingestion
Run 00_setup.py to create schemas
Ingest ADLS2 Parquet into Bronze
Verify ~3M rows in Unity Catalog
2
Silver Cleaning
Quality filters: ~5-15% row reduction
Derived columns & zone enrichment
Delta Lake time travel & history
3
Gold KPIs
12 business aggregation tables
Trips by hour, day, zone, borough
Dashboard-ready for Priya's KPIs