Data Wrangling with Azure
In this lab, you’ll practice cleansing your data with Azure Data Factory and Azure DataFlow scripts. When you’re finished with this lab, you’ll be able to build and run a data wrangling pipeline in Azure.
Challenge
Create an Azure Data Factory Pipeline for Data Wrangling
You will create an Azure Data Factory Pipeline for Data Cleansing and configure a Debug session.
Challenge
Configure an Azure Blob as Pipeline Source
You will configure a file in an Azure Blob Storage container as the source for the Pipeline.
Challenge
Delete Duplicate Rows in Data flow
You will remove duplicate rows in the Data flow.
Challenge
Split Pipeline Data Based on Null Conditions
You will finish the cleansing Pipeline by checking for NULL values, so that only clean data is loaded into Azure Cosmos DB.
Challenge
Run the Azure Data Factory Pipeline and Validate in Cosmos DB
You will trigger the cleansing Pipeline and verify in Cosmos DB that the movies were inserted correctly.
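To make the two cleansing steps in the challenges above concrete, here is a minimal sketch in plain Python of the logic they describe: dropping duplicate rows, then splitting rows on NULL checks. It is not Azure Data Factory or Data flow code. The function names, the column names (title, year), and the sample rows are all hypothetical and only illustrate the idea; the lab's actual dataset and transformations may differ.

```python
from typing import Any, Dict, Iterable, List, Tuple

Row = Dict[str, Any]


def drop_duplicates(rows: Iterable[Row]) -> List[Row]:
    """Keep the first occurrence of each row, comparing all columns."""
    seen = set()
    unique: List[Row] = []
    for row in rows:
        # Build a hashable key from the row's column/value pairs.
        key = tuple(sorted(row.items(), key=lambda item: item[0]))
        if key not in seen:
            seen.add(key)
            unique.append(row)
    return unique


def split_on_nulls(rows: Iterable[Row], required: List[str]) -> Tuple[List[Row], List[Row]]:
    """Split rows into (clean, rejected) based on NULLs in required columns."""
    clean: List[Row] = []
    rejected: List[Row] = []
    for row in rows:
        if any(row.get(column) is None for column in required):
            rejected.append(row)
        else:
            clean.append(row)
    return clean, rejected


# Hypothetical sample data standing in for the movies file.
movies = [
    {"title": "Movie A", "year": 2001},
    {"title": "Movie A", "year": 2001},  # exact duplicate
    {"title": "Movie B", "year": None},  # NULL in a required column
    {"title": "Movie C", "year": 2010},
]

unique_movies = drop_duplicates(movies)
clean_movies, rejected_movies = split_on_nulls(unique_movies, ["title", "year"])
```

In this sketch, only clean_movies would be passed on to the sink, while rejected_movies could be logged or routed elsewhere for inspection.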
Provided environment for hands-on practice
We will provide the credentials and environment necessary for you to practice right within your browser.
Guided walkthrough
Follow along with the author’s guided walkthrough and build something new in your provided environment!
Did you know?
On average, you retain 75% more of your learning if you get time for practice.
Recommended prerequisites
- Azure Blob Storage
- Azure Data Factory
- Azure DataFlow (optional)