
DataExpert.io Community Academy
This boot camp will help you get the fundamentals needed to be a great data engineer!
What Our Students Say
Diana A
Student"Easy to follow, clear and direct to the point!"
Kabir Olawale Mohammed
Self employed at Skyfort"Its been an awesome experience taking the course"
Kabir Olawale Mohammed
Self employed at Skyfort"I recommend this course to all beginners in DE"

Payezhi Chegattil Abhishek
Professional Data and AI Engineer at Capgemini Engineering"Highly recommended to any data engineers who want to upgrade themself. Homework's are challenging, and its not like another certification where you can sleep walk some multiple choice questions."
Olamide Solihu
Student"Despite the tutors advance skills, this course was well curated to give a soft landing to beginners. I love the raw and the hands on/project format. Thanks Zach"
Nadia Talaat
Student"This bootcamp was truly outstanding. Zach has a wonderful gift for explaining complex concepts in ways that work for students at every level - whether you're a complete beginner or have some prior experience"
Explore Our Featured Programs
Boot Camp: Community Edition
Join the free community boot camp that covers everything you need to know to go from junior data engineer to senior data engineer!
- Six weeks covering Spark, Flink, Data Modeling, and more!
- Automatic Graded Homework
- Get Certified
Platform Access Included:
Discord
GitHub- Basic SQL and Python skills
- Exposure to Docker
- Basic SQL and Python skills
- Exposure to Docker
Main Content
Bootcamp Orientation0.7 hrs
Dimensional Data Modeling4.2 hrs
- Dimensional Data Modeling Complex Data Type and Cumulation Day 1 Lecture
- Dimensional Data Modeling Complex Data Type and Cumulation Day 1 Lab
- Dimensional Data Modeling: Building Slowly Changing Dimensions Day 2 Lecture
- Dimensional Data Modeling: Building Slowly Changing Dimensions Day 2 Lab
- Dimensional Data Modeling: Graph Data Modeling Day 3 Lecture
- Dimensional Data Modeling: Graph Data Modeling Day 3 Lab
Fact Data Modeling3.9 hrs
- Fact Data Modeling: Core Concepts, Deduplication Day 1 Lecture
- Fact Data Modeling: Practical Insights into Data Modeling Day 1 Lab
- Fact Data Modeling: Core Elements in Data Modeling Day 2 Lecture
- Fact Data Modeling: Compact Tables for Efficient Data Representation Day 2 Lab
- Fact Data Modeling: Minimizing Shuffle and Reducing Facts Day 3 Lecture
- Fact Data Modeling: Practical Guide to Formatting and Aggregating Data Day 3 Lab
Apache Spark Fundamentals3.6 hrs
- Apache Spark: Architecture, Optimization, and Best Practices Day 1 Lecture
- Apache Spark: Hands-On for Broadcast and Hash Joins Day 1 Lab
- Apache Spark: Managing Spark Jobs and Notebooks Day 2 Lecture
- Apache Spark: User-Defined Functions and Broadcast Join Day 2 Lab
- Unit Testing Spark Jobs: Importance, Challenges, and Leadership Perspectives Lecture
- Unit Testing Spark Jobs: Mastering Spark and PySpark Testing Lab
Applying Analytical Patterns2.8 hrs
- Applying Analytical Patterns: Exploring SQL, Scaling Projects and Aggregation Analysis Day 1 Lecture
- Applying Analytical Patterns: Mastering Growth Accounting and Retention Analysis Day 1 Lab
- Applying Analytical Patterns: Recursive CTEs and Window Functions Day 2 Lecture
- Applying Analytical Patterns: Aggregations and Cardinality Reduction Day 2 Lab
Real-time pipelines with Flink and Kafka2.8 hrs
- Flink Lab Setup
- Streaming Pipelines: Mastering Streaming and Real-time Pipelines Day 1 Lecture
- Streaming Pipelines: Setting up Streaming Pipelines Day 1 Lab
- Streaming Pipelines: Exploring Data Collection and Processing Day 2 Lecture
- Streaming Pipelines: Kafka, Postgres, Spark Integrations and Parallelism Day 2 Lab
Data Visualization and Impact1.8 hrs
- Data Visualization and Impact: Mastering Data Engineering Day 1 Lecture
- Data Visualization and Impact: Hands-On with the CSV files Day 1 Lab
- Data Visualization and Impact: Insights and Best Practices Day 2 Lecture
- Data Visualization and Impact: Exploring Data Visualization and Aggregation Techniques Day 2 Lab