Skip to content

Data Engineering Study Guide

This is a data engineering study guide that you can use to help prepare yourself for your interview. This was developed by people who have interviewed and gotten jobs at FAANGs and several other tech companies. We hope these help you get great jobs as well.

Author: Benjamin Rogojan Organization: Data, Automation And Analytics Consulting Source: LinkedIn Post

How to Use: Make a copy of this guide and follow along with the study plan. Keep track of your progress to know where you are and how you are doing.


  1. 262. Trips and Users - LeetCode
  2. 626. Exchange Seats - LeetCode
  3. Book Availability Update
  4. Hackerrank The Report
  5. Empty Neighborhoods
  6. Employee Salaries (ETL Error)
  7. 177. Nth Highest Salary - LeetCode
  8. Symmetric Pairs
  9. Occupations
  10. Placements
  11. Ollivander’s Inventory

  • SQL Interview Question Walk Through
  • IQ15: 6 SQL Query Interview Questions
  • Learning about ROW_NUMBER and Analytic Functions
  • Advanced Implementation Of Analytic Functions
  • Advanced Implementation Of Analytic Functions Part 2
  • Wise Owl SQL Videos

Practice these problems after watching the video tutorials:

  1. Binary Tree Nodes
  2. Weather Observation Station 18
  3. Print Prime Numbers
  4. Rolling Bank Transactions
  5. 595. Big Countries - LeetCode
  6. 626. Exchange Seats - LeetCode
  7. SQL Interview Questions: 3 Tech Screening Exercises (For Data Analysts)

  • Modern Data Infra Video
  • Designing A Traditional Relational Database Video
  • Data Warehouse Design Video
  • ETL Design Video
  • 38 ETL Interview Questions

Practice designing a Database/ETL and Data Warehouse for:

  1. Dating App
  2. Bicycle Rental Service
  3. Music Streaming App
  4. Job Search Website
  5. Udemy-like Website

Basic Projects:

  • Basic Project: Webscraping Data From G2/W Snowflake And Tableau
  • Basic Project Level 2: Stock Market Real-Time Data Analysis Using Kafka | End-To-End Data Engineering Project

Solve these problems before diving deep into algorithms:

  1. 985. Sum of Even Numbers After Queries - LeetCode
  2. 657. Robot Return to Origin - LeetCode
  3. 961. N-Repeated Element in Size 2N Array - LeetCode
  4. 110. Balanced Binary Tree - LeetCode
  5. 3. Longest Substring Without Repeating Characters - LeetCode
  6. 19. Remove Nth Node From End of List - LeetCode
  7. 23. Merge k Sorted Lists - LeetCode
  8. 31. Next Permutation - LeetCode

Essential video tutorials on data structures:

  • Data Structures & Algorithms #1 - What Are Data Structures?
  • Data Structures: Linked Lists
  • Data Structures: Trees
  • Data Structures: Heaps
  • Data Structures: Hash Tables
  • Data Structures: Stacks and Queues
  • Data Structures: Crash Course Computer Science #14
  • Data Structures: Tries

Essential algorithm tutorials:

  • Python Algorithms for Interviews
  • Algorithms: Graph Search, DFS and BFS
  • Algorithms: Binary Search
  • Algorithms: Recursion
  • Algorithms: Bubble Sort
  • Algorithms: Merge Sort
  • Algorithms: Quicksort
  • Introduction to Big O Notation and Time Complexity (Data Structures & Algorithms #7)

Watch these interview walkthroughs to understand the interview process:

  • Amazon Coding Interview Question - Recursive Staircase Problem
  • Google Coding Interview - Universal Value Tree Problem
  • Google Coding Interview Question and Answer #1: First Recurring Character

Advanced problems to solve after completing the study materials:

  1. Bigger Is Greater
  2. 6. ZigZag Conversion - LeetCode
  3. 7. Reverse Integer - LeetCode
  4. 40. Combination Sum II - LeetCode
  5. 43. Multiply Strings - LeetCode
  6. Larry’s Array
  7. Short Palindrome
  8. 65. Valid Number - LeetCode
  9. Nth Fibonacci

Practice these operational-style programming problems:

  1. Kangaroo Problem
  2. Breaking Records
  3. Find A String
  4. itertools.permutations()
  5. No Idea!
  6. Days of the Programmer
  7. Leaderboard
  8. Word Order
  9. Sherlock And Squares
  10. Equalize The Array
  11. Apples And Oranges
  12. More Operational Style Questions

Watch these system design videos to understand large-scale system architecture:

  • Tiny URL
  • Parking Lot System
  • WhatsApp
  • Uber Design
  • Instagram
  • Tinder Service

  • Architecture Overview & Use Cases (Added By Paul Russel)
  • Spark By Examples (Tutorial Documentation)
  • PySpark Syntax Cheat Sheet

  1. The Coding Interview Bootcamp: Algorithms + Data Structures
  2. Data Warehouse Concepts: Basic to Advanced concepts
  3. Data Structures and Algorithms Bootcamp
  4. Data Structures and Algorithms - The Complete Masterclass

  1. Cracking The Code

    • Essential for coding interview preparation
  2. The Algorithm Design Manual

    • Comprehensive guide to algorithm design and problem-solving
  3. The Data Warehouse Toolkit: The Definitive Guide to Dimensional Modeling, 3rd Edition

    • The industry standard for data warehouse design

  • What I Learned From 100+ Data Engineering Interviews - Interview Tips
  • How To Start Your Next Data Engineering Project
  • How To Go From Data Analyst To Data Engineer

Use this guide by making a copy and tracking your progress through each section:

  • Date Completed: Track when you finish each item
  • Notes: Add your personal notes and insights
  • Personal Difficulty (1-5): Rate each item’s difficulty for future reference

Learn More: Seattle Data Guy’s YouTube Channel