Simple play icon Course
Skills Expanded

Machine Learning with Apache Spark

by Avdhesh Gaur

Learn the essentials of machine learning using Apache Spark. This course will teach you about the basic components of Spark ML Pipelines, how to extract, transform and select features, and build and train classification and regression models.

What you'll learn

Apache Spark simplifies the process with its robust ML Pipelines. However, understanding its components and effectively building and evaluating ML models can be challenging.

In this course, Machine Learning with Apache Spark, you’ll explore the basic components of a Spark ML Pipeline, enabling you to set up end-to-end workflows seamlessly.

First, you’ll discover how to extract, transform, and select features to prepare your data for machine learning tasks.

Then, you’ll learn how to build and train both classification and regression models, becoming familiar with the foundational techniques needed for predictive analytics.

When you’re finished with this course, you’ll have the skills and knowledge of machine learning in Apache Spark to tackle real-world data challenges and build effective ML solutions.

About the author

Avdhesh Gaur is a Senior Data Scientist, an instructor, an author, a storyteller and consultant as well when it comes to bringing insights out of data and presenting the story behind. His career spans more than 10 years with a focus on taking down various business requirements by building Data-driven BI models using Tableau, SQL, Hive, and numerous Statistical data analysis Techniques. ​ Avdhesh Gaur got one of his submissions to Tableau accepted on 7th January 2019. On average, only 1 out of ev... more

Ready to upskill? Get started