Contact sales

Learning Paths

Skills

Apache Spark on Databricks

7 courses
14 hours
Skill IQ

Apache Spark on Databricks is a unified analytics platform that combines the powerful data processing capabilities of Apache Spark with the collaborative and managed environment of Databricks, enabling scalable and efficient big data processing, real-time analytics, and machine learning applications in a cloud-native architecture. This learning path is intended to give learners foundational skills to start working with Apache Spark on Databricks for these purposes.

Courses in this path

Beginner

You will learn Spark transformations, actions, visualizations, and functions leveraging the Databricks API. You will also learn how to transform and aggregate batch data using Spark with built-in and user defined functions, and perform windowing and join operations on batch data.

Getting Started with Apache Spark on Databricks

by Janani Ravi
1h 52m 38s
4.6 (68)

Handling Batch Data with Apache Spark on Databricks

by Janani Ravi
2h 22m 21s
4.8 (28)

Intermediate

You will learn how to use Spark abstractions for streaming data and perform transformations on streaming data using the Spark streaming APIs on Databricks as well as how to leverage windowing, watermarking and join operations on streaming data in Spark for your specific use-cases.

Processing Streaming Data with Apache Spark on Databricks

by Janani Ravi
2h 1m 25s
4.8 (26)

Windowing and Join Operations on Streaming Data with Apache Spark on Databricks

by Janani Ravi
2h 2m 36s
4.5 (12)

Advanced

You will understand and implement important techniques for predictive analytics such as regression and classification using Apache Spark MLlib APIs on Databricks as well as learn how to implement graph algorithms such as Triangle Count and PageRank and visualize them using the GraphFrames API on Spark Databricks. You will also learn how to optimize the performance of Spark clusters by identifying and mitigating various performance issues such as data ingestion problems and leveraging the new features offered by Spark 3.

Predictive Analytics Using Apache Spark MLlib on Databricks

by Janani Ravi
1h 57m 38s
4.8 (13)

Executing Graph Algorithms with GraphFrames on Databricks

by Janani Ravi
1h 34m 48s

Optimizing Apache Spark on Databricks

by Janani Ravi
2h 32s
4.9 (27)

Try this learning path for free

Access this learning path and other top-rated tech content with a free trial.

Free individual trial Free team trial

Have questions?

Get them answered now.

Start a live chat

In Apache Spark on Databricks you will learn the in's and out's of of Apache Spark via Databricks. You will learn how to handle batch data, processing streaming data, windowing and joining operations, predictive analytics using MLib, executing graph algorithms and optimizing Apache Spark.

Experience

Intermediate programming experience in Python or Scala. Beginner experience with the DataFrame API.

Learn with the best

Janani Ravi

Janani has a Masters degree from Stanford and worked for 7+ years at Google. She was one of the original engineers on Google Docs and holds 4 patents for its real-time collaborative editing framework. After spending years working in tech in the Bay Area, New York, and Singapore at companies such as Microsoft, Google, and Flipkart, Janani finally decided to combine her love for technology with her passion for teaching. She is now the co-founder of Loonycorn, a content studio focused on providing ... more

Join our learners and upskill
in leading technologies

Ready to skill up
your entire team?

Subscriptions

Continue to checkout Continue to checkout

Cancel

With your Pluralsight plan, you can:

With your 30-day pilot, you can:

Access thousands of videos to develop critical skills
Give up to 50 users access to thousands of video courses
Practice and apply skills with interactive courses and projects
See skills, usage, and trend data for your teams
Prepare for certifications with industry-leading practice exams
Measure proficiency across skills and roles
Align learning to your goals with paths and channels

Ready to skill up
your entire team?

Subscriptions

Continue to checkout

Cancel

With your Pluralsight plan, you can:

With your 30-day pilot, you can:

Access thousands of videos to develop critical skills
Give up to 50 users access to thousands of video courses
Practice and apply skills with interactive courses and projects
See skills, usage, and trend data for your teams
Prepare for certifications with industry-leading practice exams
Measure proficiency across skills and roles
Align learning to your goals with paths and channels

Apache Spark on Databricks

Courses in this path

Beginner

Getting Started with Apache Spark on Databricks

Handling Batch Data with Apache Spark on Databricks

Intermediate

Processing Streaming Data with Apache Spark on Databricks

Windowing and Join Operations on Streaming Data with Apache Spark on Databricks

Advanced

Predictive Analytics Using Apache Spark MLlib on Databricks

Executing Graph Algorithms with GraphFrames on Databricks

Optimizing Apache Spark on Databricks

Learn with the best

Join our learners and upskill in leading technologies

Ready to skill upyour entire team?

With your Pluralsight plan, you can:

With your 30-day pilot, you can:

Ready to skill upyour entire team?

With your Pluralsight plan, you can:

With your 30-day pilot, you can:

Join our learners and upskill
in leading technologies

Ready to skill up
your entire team?

Ready to skill up
your entire team?