Data Engineering for Beginners: Learn SQL, Python & Spark
Master SQL, Python, and Apache Spark (PySpark) with Hands-On Projects using Databricks on Google Cloud
Rating: 4.45 (6558 reviews)
Students: 94,081
Content: 56 hours
Last update: Mar 2025
Regular price: $109.99
What you will learn
Set up an environment to learn SQL and Python essentials for Data Engineering
Database Essentials for Data Engineering using Postgres, such as creating tables and indexes, running SQL queries, using important predefined functions, etc.
Data Engineering Programming Essentials using Python, such as basic programming constructs, collections, Pandas, database programming, etc.
Data Engineering using Spark DataFrame APIs (PySpark) on Databricks. Learn all the important Spark DataFrame APIs such as select, filter, groupBy, orderBy, etc.
Data Engineering using Spark SQL (PySpark and Spark SQL). Learn how to write high-quality Spark SQL queries using SELECT, WHERE, GROUP BY, ORDER BY, etc.
Relevance of the Spark Metastore and the integration of DataFrames and Spark SQL
Ability to build Data Engineering Pipelines using Spark, with Python as the programming language
Use of different file formats such as Parquet, JSON, CSV, etc. in building Data Engineering Pipelines
Set up a Hadoop and Spark cluster on GCP using Dataproc
Understanding the complete Spark application development life cycle to build Spark applications using PySpark. Review the applications using the Spark UI.
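To give a feel for the query clauses named in the list above (SELECT, WHERE, GROUP BY, ORDER BY), here is a minimal sketch. It is not taken from the course: it uses Python's built-in sqlite3 module as a stand-in for Postgres or Spark SQL so that it runs without a database server or cluster, and the `orders` table, its columns, and its rows are hypothetical.

```python
import sqlite3

# Assumption: SQLite stands in for Postgres / Spark SQL; the table and data are made up.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (order_id INTEGER, status TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [
        (1, "COMPLETE", 100.0),
        (2, "COMPLETE", 50.0),
        (3, "PENDING", 75.0),
        (4, "CLOSED", 20.0),
        (5, "PENDING", 25.0),
    ],
)

# SELECT picks columns, WHERE filters rows, GROUP BY aggregates per status,
# and ORDER BY sorts the aggregated result.
query = """
    SELECT status, COUNT(*) AS order_count, SUM(amount) AS revenue
    FROM orders
    WHERE amount >= 25
    GROUP BY status
    ORDER BY revenue DESC
"""
rows = conn.execute(query).fetchall()
for row in rows:
    print(row)

conn.close()
```

The same four steps correspond to the select, filter, groupBy, and orderBy DataFrame methods mentioned in the list, although the exact code for a Spark cluster would differ from this sketch.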
Comidoc Review
Our Verdict
Data Engineering for Beginners: Learn SQL, Python & Spark provides a solid foundation in data engineering but may benefit from more practical exercises. Recommended for those willing to invest time into learning and seeking a comprehensive introduction to key technologies.
What We Liked
- Comprehensive coverage of data engineering topics, including SQL, Python, Spark SQL, and PySpark.
- Well-organized course with clear explanations and a variety of exercises and projects to consolidate concepts.
- Instructor goes beyond teaching syntax and explains underlying concepts, making the course accessible to beginners.
Potential Drawbacks
- Some may find the course overly long and tedious, with too few opportunities for practical application.
- The basic Python and SQL lectures may be unnecessary for some learners and can use up free cloud credits.
- The presenter's strong accent may make the lectures hard to follow for some non-native speakers.
Udemy ID: 3848374
Course created date: 14/02/2021
Course indexed date: 03/04/2021
Course submitted by: Bot