- Lab
- A Cloud Guru
Implement a Partition Strategy for Azure Data Factory Pipelines
In this hands-on lab, you will learn how to provision an Azure Data Factory instance, create pipeline activities to migrate data from an Azure SQL Database into a data lake, and partition the data as you migrate it. Data engineers and data architects use these skills frequently. This lab also touches on critical skills found on the DP-203 exam.
Challenge
Prepare the Environment
- Create a Blob Storage account with a Customer Container.
- Create an Azure Data Factory instance in the West US region.
- Create an SQL Database instance and server in the West US region. It should utilize DTU-based basic tier compute. Make sure to choose sample data as the data source.
Challenge
Create the Data Sources
- Create a dataset that connects to the newly created SQL Database.
- Connect to the SalesLT.Customer table.
Challenge
Create a Data Flow
- Create a data flow with:
  - A source linked to the SalesLT.Customer table.
  - A sink linked to the newly created data lake (create the sink as a Parquet file).
    - This will require the creation of a sample container in the newly created Azure Storage account.
- Select and create five hash partitions based upon the most appropriate column.
- Publish all updates.
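To make the hash partitioning step concrete, here is a minimal Python sketch of the idea: each row is assigned to one of five partitions by hashing a key column and taking the result modulo the partition count. It is an illustration only, not what Data Factory runs internally. The choice of `CustomerID` as the key column is an assumption (the lab leaves the column choice to you), `zlib.crc32` is a stand-in hash function, and the sample rows and helper names are hypothetical.

```python
import zlib
from collections import defaultdict

NUM_PARTITIONS = 5  # the lab asks for five hash partitions


def partition_for(key, num_partitions=NUM_PARTITIONS):
    """Map a key to a partition index using a stable hash (crc32 as a stand-in)."""
    digest = zlib.crc32(str(key).encode("utf-8"))
    return digest % num_partitions


def hash_partition(rows, key_column, num_partitions=NUM_PARTITIONS):
    """Group rows by the partition index derived from key_column."""
    partitions = defaultdict(list)
    for row in rows:
        index = partition_for(row[key_column], num_partitions)
        partitions[index].append(row)
    return partitions


# Hypothetical sample rows standing in for SalesLT.Customer.
rows = [{"CustomerID": i, "LastName": f"Name{i}"} for i in range(1, 101)]

partitions = hash_partition(rows, "CustomerID")  # assumed key column
for index in sorted(partitions):
    print(f"partition {index}: {len(partitions[index])} rows")
```

A column with many distinct, evenly spread values tends to produce balanced partitions, while a low-cardinality column can leave some partitions nearly empty. That is the main consideration when picking the "most appropriate" column.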
Challenge
Create a Data Pipeline
- Create a pipeline with your newly created Data Flow activity as the only activity.
- Validate and Publish this pipeline.
- Run the pipeline using the Trigger Now option.
- Check that the partitions were created successfully.
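One way to think about the final check is to count the Parquet part files in the sink container. The sketch below works on a plain list of blob names, which you could copy from the storage browser. It assumes the sink writes one file per partition with Spark-style `part-…` names, which may differ depending on the sink's file name settings. The listing and the helper name are hypothetical.

```python
import re

# Assumed naming pattern for partition files, e.g. part-00000-<id>.snappy.parquet
PART_FILE = re.compile(r"part-\d+.*\.parquet")


def parquet_part_files(blob_names):
    """Return the blob names whose file name looks like a Parquet part file."""
    return [
        name
        for name in blob_names
        if PART_FILE.fullmatch(name.rsplit("/", 1)[-1])
    ]


# Hypothetical listing of the sink container after the pipeline run.
listing = [
    "customer/_SUCCESS",
    "customer/part-00000-abc.snappy.parquet",
    "customer/part-00001-abc.snappy.parquet",
    "customer/part-00002-abc.snappy.parquet",
    "customer/part-00003-abc.snappy.parquet",
    "customer/part-00004-abc.snappy.parquet",
]

part_files = parquet_part_files(listing)
print(f"{len(part_files)} part files found")
```

If the count matches the five partitions configured in the data flow, the partitioning step most likely ran as intended.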