Data Engineering for Beginners: Learn SQL, Python & Spark

Master SQL, Python, and Apache Spark (PySpark) with Hands-On Projects using Databricks on Google Cloud
4.45 (6558 reviews)
Udemy
platform
English
language
Other
category
Data Engineering for Beginners: Learn SQL, Python & Spark
94 081
students
56 hours
content
Mar 2025
last update
$109.99
regular price

What you will learn

Setup Environment to learn SQL and Python essentials for Data Engineering

Database Essentials for Data Engineering using Postgres such as creating tables, indexes, running SQL Queries, using important pre-defined functions, etc.

Data Engineering Programming Essentials using Python such as basic programming constructs, collections, Pandas, Database Programming, etc.

Data Engineering using Spark Dataframe APIs (PySpark) using Databricks. Learn all important Spark Data Frame APIs such as select, filter, groupBy, orderBy, etc.

Data Engineering using Spark SQL (PySpark and Spark SQL). Learn how to write high quality Spark SQL queries using SELECT, WHERE, GROUP BY, ORDER BY, ETC.

Relevance of Spark Metastore and integration of Dataframes and Spark SQL

Ability to build Data Engineering Pipelines using Spark leveraging Python as Programming Language

Use of different file formats such as Parquet, JSON, CSV etc in building Data Engineering Pipelines

Setup Hadoop and Spark Cluster on GCP using Dataproc

Understanding Complete Spark Application Development Life Cycle to build Spark Applications using Pyspark. Review the applications using Spark UI.

Loading charts...

Comidoc Review

Our Verdict

Data Engineering for Beginners: Learn SQL, Python & Spark provides a solid foundation in data engineering but may benefit from more practical exercises. Recommended for those willing to invest time into learning and seeking a comprehensive introduction to key technologies.

What We Liked

  • Comprehensive coverage of data engineering topics, including SQL, Python, Spark SQL, and PySpark.
  • Well-organized course with clear explanations and a variety of exercises and projects to consolidate concepts.
  • Instructor goes beyond teaching syntax and explains underlying concepts, making the course accessible to beginners.

Potential Drawbacks

  • Some may find the course overly long and tedious, lacking opportunities for practical application.
  • Lectures on basic Python and SQL tutorials may be unnecessary and eat up free cloud credits.
  • Presentation has a strong accent which might make understanding difficult for some non-native speakers.

Related Topics

3848374
udemy ID
14/02/2021
course created date
03/04/2021
course indexed date
Bot
course submited by