- Lab
- A Cloud Guru
Preprocess Data with the scikit-learn Python Package
In this lab, we will load a dataset from a SQLite database into a pandas DataFrame. Once loaded, we will standardize the dataset using the `StandardScaler()` method and write it to a new table within the SQLite database. Basic Python programming skills will be required for this lab. If you need a refresher, check out the following course: - [Certified Associate in Python Programming Certification](https://acloud.guru/overview/8169e8e7-91a7-4d92-b278-4dd08c787dc6)
Path Info
Table of Contents
-
Challenge
Load the Data
Load the data from the provided SQLite database (
data.db
) into a pandas DataFrame object. -
Challenge
Scale the Data
Use the
StandardScaler()
method of the scikit-learn preprocessing package to scale the data such that the distribution is now centered around 0, with a standard deviation of 1. -
Challenge
Save the Data
Write the scaled dataset to a new table named
data-scaled
in the SQLite database.
What's a lab?
Hands-on Labs are real environments created by industry experts to help you learn. These environments help you gain knowledge and experience, practice without compromising your system, test without risk, destroy without fear, and let you learn from your mistakes. Hands-on Labs: practice your skills before delivering in the real world.
Provided environment for hands-on practice
We will provide the credentials and environment necessary for you to practice right within your browser.
Guided walkthrough
Follow along with the author’s guided walkthrough and build something new in your provided environment!
Did you know?
On average, you retain 75% more of your learning if you get time for practice.