- Lab
- A Cloud Guru
Create a Basic Pipeline in Azure Machine Learning Studio
In this hands-on lab, you work as a Machine Learning Operations Engineer for Avendador. During the lab, you will create and execute a machine learning pipeline using sample Microsoft data sources (**Restaurant Ratings** and **Restaurant Feature Data**).
Path Info
Table of Contents
-
Challenge
Preparing the Environment
- Log into the Azure portal using the credentials provided on the lab page. Be sure to use an incognito or private browser window to ensure you're using the lab account rather than your own.
- In the search bar at the top, enter and select Machine Learning studio, and then launch the studio, which will open in a separate tab.
- Create a
Standard_D2_V2
compute instance and start it. - Create a "classic" Azure Machine Learning pipeline in Azure Machine Learning studio by selecting "Designer" from the left menu. Choose the "Classic Prebuilt" tab and then click the plus-sign to "Create a new pipeline using classic, pre-built components."
-
Challenge
Ingest Data and Select Key Columns
- Add the sample datasets, Restaurant Ratings and Restaurant Feature Data, to the pipeline canvas.
- Select
placeID
andrating
from the Restaurant Ratings data source. - Select
placeID
,alcohol
,dress_code
,price
, andRambience
from the Restaurant Feature Data source.
-
Challenge
Transform Data Sources
- Join the data sources using
placeID
as key. - Replace missing data in columns (
placeID
,rating
,alcohol
,dress_code
,price
,Rambience
) with 0.
- Join the data sources using
-
Challenge
Split Data into Training and Test Data
- Split data using a 60/40 split.
- 60% should go to a filter using Pearson correlation
- 40% should be used as test
- Create a Pearson correlation feature selection using
rating
as a target column (select columns to transform and apply transformation). - Create a Boosted Decision Tree Regression with the following settings:
- Create trainer mode: SingleParameter
- Maximum number of leaves per tree: 20
- Minimum number of leaves per tree: 10
- Learning rate: 0.2
- Total number of trees constructed: 100
- Create Train Model using
rating
as label column.
- Split data using a 60/40 split.
-
Challenge
Score and Evaluate
- Create Score Model activity.
- Create Evaluate Model activity.
- Submit Model.
- Evaluate Results.
What's a lab?
Hands-on Labs are real environments created by industry experts to help you learn. These environments help you gain knowledge and experience, practice without compromising your system, test without risk, destroy without fear, and let you learn from your mistakes. Hands-on Labs: practice your skills before delivering in the real world.
Provided environment for hands-on practice
We will provide the credentials and environment necessary for you to practice right within your browser.
Guided walkthrough
Follow along with the author’s guided walkthrough and build something new in your provided environment!
Did you know?
On average, you retain 75% more of your learning if you get time for practice.