Handling Fast Data with Apache Spark SQL and Streaming

Apache Spark is a leader in enabling quick and efficient data processing. This course will teach you how to use Spark's SQL, Streaming, and even the newer Structured Streaming APIs to create applications able to handle data as it arrives.

by Justin Pihony

Get started Preview course

What you'll learn

Analyzing data used to be something you did once a night. Now you need to be able to process data on the fly so you can provide up to the minute insights. But, how do you accomplish in real time what used to take hours without a complicated code base? In this course, Handling Fast Data with Apache Spark SQL and Streaming, you'll learn to use Apache Spark Streaming and SQL libraries as a great way to handle this new world of real time, fast data processing. First, you'll dive into SparkSQL. Next, you'll explore how to catch potential fraud by analyzing streams with Spark Streaming. Finally, you'll discover the newer Structured Streaming API. By the end of this course, you'll have a deeper understanding of these APIs, along with a number of streaming concepts that have driven the API design.

Try this course for free

Access this course and other top-rated tech content with a free trial.

Free individual trial Free team trial

Have questions?

Get them answered now.

Start a live chat

Course Info

Rating

(39 reviews)

Level

Intermediate

Last updated

Feb 28, 2025

Duration

4h 34m 37s

Course Overview | 2m 4s

About the author

Justin Pihony

Justin is a software journeyman, continuously learning and honing his skills.

More Courses by Justin

Handling Fast Data with Apache Spark SQL and Streaming

What you'll learn

Table of contents

Course Overview 2m 4s

Introduction 21m 49s

Querying Data with the DataFrames (Part 1) 43m 10s

Querying Data with the DataFrames (Part 2) 41m 8s

Improving Type Safety with Datasets 41m 19s

Processing Data with the Streaming API 1h 7m 1s

Optimizing, Structured Streaming, and Spark 2.x 58m 3s

About the author