Real-time Data Processing for Data Engineers

In today's world, valuable data can often be produced in great quantity and at great speed. This course will teach you to process data in real time and keep up with high-volume data throughput.

by Daniel Stern

Get started Preview course

What you'll learn

Data is often the most valuable asset an organization has, but how do you process data in real time, at scale, when data can be coming in at huge volumes from disparate sources in varying formats?

In this course, Real-time Data Processing for Data Engineers, you’ll gain the ability to build and contribute to professional data pipelines capable of processing big data at scale.

First, you’ll explore the fundamentals of data processing at scale, including what data pipelines are, how they are implemented, and how they vary from batch data processing.

Next, you’ll discover key data pipeline tooling such as Apache Kafka and Spark, and take a look at their strengths and where they fit into data pipelines.

Finally, you’ll learn how to implement data pipelines, first by reviewing an example scenario, and then via a coding example where you will implement a working data pipeline in minutes.

When you’re finished with this course, you’ll have the skills and knowledge of real-time data processing needed to effectively process real-time data at scale.

Course Overview | 1m 26s

About the author

Daniel Stern

Daniel Stern is a freelance web developer from Toronto, Ontario who specializes in Angular, ES6, TypeScript and React. His work has been featured in CSS Weekly, JavaScript Weekly and at Full Stack Conf in England.

More Courses by Daniel

Real-time Data Processing for Data Engineers

What you'll learn

Table of contents

Course Overview 1m 26s

Understanding and Engineering Data Pipelines 18m 37s

Implementing Data Pipelines 14m 50s

About the author