Build production data pipelines with Airflow, dbt, and Spark. Extract data from APIs and files, transform it, and load it into a data warehouse.
Fetch data from a weather API daily. Transform it. Load it into PostgreSQL. Send an email alert on failure.
Staging → intermediate → mart models for e-commerce sales data. All models tested.
Full schema.yml with descriptions + tests. Docs site generated and shared.
Process 1M rows with PySpark. Compare performance against pandas.
5 complex queries with joins + window functions on a large dataset.
+15 more projects available after enrollment
Build a real project in 4 weeks