Skip to content

Contact sales

By filling out this form and clicking submit, you acknowledge our privacy policy.
  • Labs icon Lab
  • A Cloud Guru
Google Cloud Platform icon
Labs

Perform Feature Engineering Using Amazon SageMaker

Imagine you are the data engineer, and you have been assigned the task of preprocessing the data and getting it ready for the machine learning engineers to create a highly predictable model. Your data contains both text and numerical data. The numerical data is of different ranges, and some text features require proper ordering. In this hands-on lab, you will learn how to encode, scale, and bin the data using scikit-learn.

Google Cloud Platform icon
Labs

Path Info

Level
Clock icon Intermediate
Duration
Clock icon 45m
Published
Clock icon Apr 25, 2024

Contact sales

By filling out this form and clicking submit, you acknowledge our privacy policy.

Table of Contents

  1. Challenge

    Launch SageMaker Notebook

    Log in to the AWS console and navigate to **AWS SageMaker **. From there, load the Jupyter Notebook that has been provided with this hands-on lab.

  2. Challenge

    Load Libraries and Prepare the Data

    1. Use the Pandas library and load the data from "Employee_encoding.csv".
    2. Display the top few rows and ensure the data is read successfully.
  3. Challenge

    Apply Encoding Techniques

    1. Use **OrdinalEncoder **and encode the title feature.
    2. Check the categories and ensure the encoder's categories follow the required ordering.
    3. Use OneHotEncoder and encode the gender feature.
    4. Use Labelencoder and encode the department feature.
  4. Challenge

    Apply Scaling Techniques

    1. Use MinMaxScaler and scale the salary feature to values between 0 and 1.
    2. Use the scaler's describe function and validate the values.
  5. Challenge

    Apply Binning Techniques

    1. Initialize KBinsDiscretizer and apply the equal-frequency strategy to the age feature.
    2. Use matplotlib and plot the binned data.

The Cloud Content team comprises subject matter experts hyper focused on services offered by the leading cloud vendors (AWS, GCP, and Azure), as well as cloud-related technologies such as Linux and DevOps. The team is thrilled to share their knowledge to help you build modern tech solutions from the ground up, secure and optimize your environments, and so much more!

What's a lab?

Hands-on Labs are real environments created by industry experts to help you learn. These environments help you gain knowledge and experience, practice without compromising your system, test without risk, destroy without fear, and let you learn from your mistakes. Hands-on Labs: practice your skills before delivering in the real world.

Provided environment for hands-on practice

We will provide the credentials and environment necessary for you to practice right within your browser.

Guided walkthrough

Follow along with the author’s guided walkthrough and build something new in your provided environment!

Did you know?

On average, you retain 75% more of your learning if you get time for practice.

Start learning by doing today

View Plans