Learning Path

Libraries: This path is only available in the libraries listed. To access this path, purchase a license for the corresponding library.

Cloud

Data

Data Engineering on Google Cloud

10 Courses

14 Hours

Skill IQ

This path provides participants a hands-on introduction to designing and building data processing systems on Google Cloud Platform. Through a combination of presentations, demos, and hand-on labs, participants will learn how to design data processing systems, build end-to-end data pipelines, analyze data and derive insights. The courses cover structured, unstructured, and streaming data.

Get started

Content in this path

Beginner

This section introduces participants to the Big Data and Machine Learning capabilities of Google Cloud Platform (GCP). It provides a quick overview of the Google Cloud Platform and a deeper dive of the data processing capabilities.

Course

Preparing for Your Professional Data Engineer Journey

by Google Cloud
48m
Dec 13, 2024

Intermediate

This section opens with the two key components of any data pipeline, which are data lakes and warehouses. The first course highlights use-cases for each type of storage and dives into the available data lake and warehouse solutions on Google Cloud Platform in technical detail. Also, the course describes the role of a data engineer, the benefits of a successful data pipeline to business operations, and examines why data engineering should be done in a cloud environment. Data pipelines typically fall under one of the Extra-Load, Extract-Load-Transform or Extract-Transform-Load paradigms. Hence, the second course in this section describes which paradigm should be used and when for batch data. Furthermore, this course covers several technologies on Google Cloud Platform for data transformation including BigQuery, executing Spark on Cloud Dataproc, pipeline graphs in Cloud Data Fusion and serverless data processing with Cloud Dataflow.

Course

Modernizing Data Lakes and Data Warehouses with Google Cloud

by Google Cloud
2h 58m
Jan 17, 2025

Course

Building Batch Data Pipelines on Google Cloud

by Google Cloud
2h 15m
Sep 23, 2024

Advanced

This section covers two things: (ii) Processing streaming data, which is becoming increasingly popular as streaming enables businesses to get real-time metrics on business operations, and (ii) Incorporating machine learning into data pipelines increases the ability of businesses to extract insights from their data. The first course covers how to build streaming data pipelines on Google Cloud Platform. Cloud Pub/Sub is described for handling incoming streaming data. The course also covers how to apply aggregations and transformations to streaming data using Cloud Dataflow, and how to store processed records to BigQuery or Cloud Bigtable for analysis. The second course covers several ways machine learning can be included in data pipelines on Google Cloud Platform depending on the level of customization required. For little to no customization, this course covers AutoML. For more tailored machine learning capabilities, this course introduces AI Platform Notebooks and BigQuery Machine Learning. Also, this course covers how to productionalize machine learning solutions using Kubeflow.

Course

Building Resilient Streaming Analytics Systems on Google Cloud

by Google Cloud
1h 45m
Jan 28, 2025

Course

Serverless Data Processing with Dataflow: Foundations

by Google Cloud
42m
Jan 13, 2025

Course

Serverless Data Processing with Dataflow: Develop Pipelines

by Google Cloud
1h 54m
Feb 27, 2025

Course

Serverless Data Processing with Dataflow: Operations

by Google Cloud
1h 51m
Jan 22, 2025

Course

Smart Analytics, Machine Learning, and AI on Google Cloud

by Google Cloud
1h 11m
Jan 23, 2025

Course

Boost Productivity with Gemini in BigQuery

by Google Cloud
25m
Oct 14, 2024

Course

Work with Gemini Models in BigQuery

by Google Cloud
29m
Aug 01, 2024

Access this learning path and other top-rated tech content with a free trial.

Free individual trial Free team trial

Have questions? Get them answered now.

Start a live chat

This path teaches the following skills
Design and build data processing systems on Google Cloud Platform
Lift and shift your existing Hadoop workloads to the Cloud using Cloud Dataproc.
Process batch and streaming data by implementing autoscaling data pipelines on Cloud Dataflow
Manage your data Pipelines with Data Fusion and Cloud Composer.
Derive business insights from extremely large datasets using Google BigQuery
Learn how to use pre-built ML APIs on unstructured data and build different kinds of ML models using BigQuery ML.
Enable instant insights from streaming data

Prerequisites

Participants should have experience with one or more of the following:
• A common query language such as SQL
• Extracting, Loading, Transforming, cleaning, and validating data
• Designing pipelines and architectures for data processing
• Integrating analytics and machine learning capabilities into data pipelines
• Querying datasets, visualizing query results and creating reports

BigQuery
Dataflow
Dataproc
ML APIs
Data Fusion
Bigtable

Not sure where to start?

With over 500 assessments to choose from, you can see where your skills stand and receive adaptive learning recommendations to fill knowledge gaps in as little as 10 minutes.

Learn more

Learn with the best

Google Cloud

Google Cloud can help solve your toughest problems and grow your business. With Google Cloud, their infrastructure is your infrastructure. Their tools are your tools. And their innovations are your innovations.

Data Engineering on Google Cloud

Content in this path

Beginner

Preparing for Your Professional Data Engineer Journey

Intermediate

Modernizing Data Lakes and Data Warehouses with Google Cloud

Building Batch Data Pipelines on Google Cloud

Advanced

Building Resilient Streaming Analytics Systems on Google Cloud

Serverless Data Processing with Dataflow: Foundations

Serverless Data Processing with Dataflow: Develop Pipelines

Serverless Data Processing with Dataflow: Operations

Smart Analytics, Machine Learning, and AI on Google Cloud

Boost Productivity with Gemini in BigQuery

Work with Gemini Models in BigQuery

Learn with the best

Join our learners and upskill in leading technologies

Join our learners and upskill
in leading technologies