Boot Camp: Community Edition

Join the free community boot camp that covers everything you need to know to go from junior data engineer to senior data engineer!

Zach Wilson

Taught by Zach Wilson

Founder at DataExpert.io

What you'll learn

Six weeks covering Spark, Flink, Data Modeling, and more!
Automatic Graded Homework
Get Certified

Learn directly from the experts

Zach Wilson

Zach Wilson

Founder at DataExpert.io

Course syllabus

41 lessons • 27+ hours of content • 8 assignments

Bootcamp Orientation
1
Bootcamp Kickoff
2
Boot Camp Database Setup
Dimensional Data Modeling
1
Dimensional Data Modeling Complex Data Type and Cumulation Day 1 Lecture
2
Dimensional Data Modeling Complex Data Type and Cumulation Day 1 Lab
3
Dimensional Data Modeling: Building Slowly Changing Dimensions Day 2 Lecture
4
Dimensional Data Modeling: Building Slowly Changing Dimensions Day 2 Lab
5
Dimensional Data Modeling: Graph Data Modeling Day 3 Lecture
6
Dimensional Data Modeling: Graph Data Modeling Day 3 Lab
Fact Data Modeling
1
Fact Data Modeling: Core Concepts, Deduplication Day 1 Lecture
2
Fact Data Modeling: Practical Insights into Data Modeling Day 1 Lab
3
Fact Data Modeling: Core Elements in Data Modeling Day 2 Lecture
4
Fact Data Modeling: Compact Tables for Efficient Data Representation Day 2 Lab
5
Fact Data Modeling: Minimizing Shuffle and Reducing Facts Day 3 Lecture
6
Fact Data Modeling: Practical Guide to Formatting and Aggregating Data Day 3 Lab
Apache Spark Fundamentals
1
Apache Spark: Architecture, Optimization, and Best Practices Day 1 Lecture
2
Apache Spark: Hands-On for Broadcast and Hash Joins Day 1 Lab
3
Apache Spark: Managing Spark Jobs and Notebooks Day 2 Lecture
4
Apache Spark: User-Defined Functions and Broadcast Join Day 2 Lab
5
Unit Testing Spark Jobs: Importance, Challenges, and Leadership Perspectives Lecture
6
Unit Testing Spark Jobs: Mastering Spark and PySpark Testing Lab
Applying Analytical Patterns
1
Applying Analytical Patterns: Exploring SQL, Scaling Projects and Aggregation Analysis Day 1 Lecture
2
Applying Analytical Patterns: Mastering Growth Accounting and Retention Analysis Day 1 Lab
3
Applying Analytical Patterns: Recursive CTEs and Window Functions Day 2 Lecture
4
Applying Analytical Patterns: Aggregations and Cardinality Reduction Day 2 Lab
Real-time pipelines with Flink and Kafka
1
Flink Lab Setup
2
Streaming Pipelines: Mastering Streaming and Real-time Pipelines Day 1 Lecture
3
Streaming Pipelines: Setting up Streaming Pipelines Day 1 Lab
4
Streaming Pipelines: Exploring Data Collection and Processing Day 2 Lecture
5
Streaming Pipelines: Kafka, Postgres, Spark Integrations and Parallelism Day 2 Lab
Data Visualization and Impact
1
Data Visualization and Impact: Mastering Data Engineering Day 1 Lecture
2
Data Visualization and Impact: Hands-On with the CSV files Day 1 Lab
3
Data Visualization and Impact: Insights and Best Practices Day 2 Lecture
4
Data Visualization and Impact: Exploring Data Visualization and Aggregation Techniques Day 2 Lab
Data Pipeline Maintenance
1
Data Pipeline Maintenance: Navigating the Complexities of Data Engineering Day 1 Lecture
2
Data Pipeline Maintenance: Strategies for Maintenance and Dock Building Day 2 Lecture
KPIs and Experimentation
1
KPIs and Experimentation: Decoding Business Success: Metrics, Growth Strategies and Collaborative Approaches Day 1 Lecture
2
KPIs and Experimentation: Setting up and Analysing Experiments Day 1 Lab
3
KPIs and Experimentation: Leading and Lagging Metrics Day 2 Lecture
Data Quality Patterns
1
Data Quality Patterns: MIDAS Process from Airbnb Day 1 Lecture
2
Data Quality Patterns: Spec-Building Document Day 1 Lab
3
Data Quality Patterns: WAP Patterns Day 2 Lecture

Before you join

Prerequisites

Basic SQL and Python skills
Exposure to Docker

Platform Access Included

DiscordDiscord
GitHubGitHub

Social proof

What students say