Skip to content

Contact sales

By filling out this form and clicking submit, you acknowledge our privacy policy.
  • Labs icon Lab
  • A Cloud Guru
Google Cloud Platform icon
Labs

Training Reports Utilized in SageMaker Debugger to Improve Your Models

Imagine you are the data engineer at your company, and your company has just selected AWS as the preferred cloud provider. You have been given a dataset to predict if an individual makes more than $50K in salary. As part of the modeling process, you have been asked to generate a summary of the model training evaluation results, insights into the model performance, and interactive graphs. In this lab, you will fetch the census data and use that as the input dataset. Once the data is split, the data is uploaded to the S3 bucket. Then, the Sagemaker estimator is configured with the debugger hook and Sagemaker built-in rules to generate performance metric reports.

Google Cloud Platform icon
Labs

Path Info

Level
Clock icon Intermediate
Duration
Clock icon 1h 0m
Published
Clock icon Apr 12, 2024

Contact sales

By filling out this form and clicking submit, you acknowledge our privacy policy.

Table of Contents

  1. Challenge

    Launch SageMaker Notebook

    1. Log in to the AWS console and navigate to AWS SageMaker.
    2. Load the Jupyter Notebook that has been provided with this hands-on lab.
  2. Challenge

    Install dependencies and import the libraries

    1. Load required libraries.
    2. Fetch the census dataset from shap library.
  3. Challenge

    Download the data and upload them to S3 bucket

    1. Convert the downloaded data to a CSV format
    2. Use the upload_file function and upload the CSV files to S3 bucket
  4. Challenge

    Set up training and validation data

    1. Create the inputs for the fit() function with the training and validation location.
  5. Challenge

    Configure and run the estimator

    1. Fetch the xgboost image according to the region.
    2. Setup the estimator function with the debugger hook and debugger built-in rules.
    3. Initiate the training process.
  6. Challenge

    View the generated reports

    1. Fetch the training and profile reports from S3 bucket.
    2. Review the graphs generated by the training report.
    3. Understand the resource utilization statistics generated by the profiling report.

The Cloud Content team comprises subject matter experts hyper focused on services offered by the leading cloud vendors (AWS, GCP, and Azure), as well as cloud-related technologies such as Linux and DevOps. The team is thrilled to share their knowledge to help you build modern tech solutions from the ground up, secure and optimize your environments, and so much more!

What's a lab?

Hands-on Labs are real environments created by industry experts to help you learn. These environments help you gain knowledge and experience, practice without compromising your system, test without risk, destroy without fear, and let you learn from your mistakes. Hands-on Labs: practice your skills before delivering in the real world.

Provided environment for hands-on practice

We will provide the credentials and environment necessary for you to practice right within your browser.

Guided walkthrough

Follow along with the author’s guided walkthrough and build something new in your provided environment!

Did you know?

On average, you retain 75% more of your learning if you get time for practice.

Start learning by doing today

View Plans