Master RDDs, DataFrames, and Spark SQL. Build high-performance data pipelines and process large-scale datasets with ease.