Preparing Data for Modeling with scikit-learn
This course covers important data pre-processing steps, including standardization, normalization, novelty and outlier detection, and the pre-processing of image and text data, as well as explicit kernel approximations such as the RBF and Nystroem methods.
What you'll learn
Even as the number of machine learning frameworks and libraries grows by the day, scikit-learn retains its popularity with ease. Scikit-learn makes the common machine learning use-cases - clustering, classification, dimensionality reduction, and regression - incredibly easy. In this course, Preparing Data for Modeling with scikit-learn, you will gain the ability to appropriately pre-process data, identify outliers, and apply kernel approximations. First, you will learn how pre-processing techniques such as standardization and scaling help improve the efficacy of ML algorithms. Next, you will discover how novelty and outlier detection is implemented in scikit-learn. Then, you will understand the typical set of steps needed to work with both text and image data in scikit-learn. Finally, you will round out your knowledge by applying implicit and explicit kernel transformations to transform data into higher dimensions. When you’re finished with this course, you will have the skills and knowledge to identify the correct data pre-processing technique for your use-case and to detect outliers using theoretically robust techniques.
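To give a flavor of the first module's material, here is a minimal sketch of standardization and robust scaling with scikit-learn's StandardScaler and RobustScaler. The toy array and values below are illustrative assumptions rather than the course's own exercises; similar sketches for the outlier detection, text, image, and kernel approximation topics follow the table of contents.

```python
import numpy as np
from sklearn.preprocessing import StandardScaler, RobustScaler

# Toy feature matrix: two numeric columns on very different scales (illustrative).
X = np.array([[1.0, 200.0],
              [2.0, 300.0],
              [3.0, 250.0],
              [4.0, 9000.0]])  # the last row contains an extreme value in column 2

# StandardScaler: subtracts the column mean and divides by the standard deviation,
# giving each column zero mean and unit variance.
X_standard = StandardScaler().fit_transform(X)

# RobustScaler: centers on the median and scales by the interquartile range,
# so a single extreme value distorts the result far less.
X_robust = RobustScaler().fit_transform(X)

print(X_standard)
print(X_robust)
```

The module's normalization and quantile transformation clips presumably follow the same fit/transform pattern with sklearn.preprocessing.Normalizer and QuantileTransformer.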
Table of contents
- Version Check 0m
- Module Overview 1m
- Prerequisites and Course Outline 2m
- Scaling and Standardization 5m
- Normalization 3m
- Transforming Data to Gaussian Distributions 2m
- Calculating and Visualizing Summary Statistics 5m
- Using the Standard Scaler for Standardizing Numeric Features 6m
- Using the Robust Scaler to Scale Numeric Features 4m
- Normalization and Cosine Similarity 6m
- Transforming Bimodally Distributed Data to a Normal Distribution Using a Quantile Transformer 5m
- Reducing Dimensionality Using Factor Analysis 6m
- Module Summary 1m
- Module Overview 1m
- Outliers and Novelties 3m
- Detecting and Coping with Outlier Data 4m
- Local Outlier Factor 3m
- Elliptic Envelope 3m
- Isolation Forest 4m
- Outlier Detection Using Local Outlier Factor 7m
- Outlier Detection Using Isolation Forest 5m
- Outlier Detection Using Elliptic Envelope 3m
- Novelty Detection Using Local Outlier Factor 5m
- Using the Predict, Score Samples, and Decision Function 3m
- Outlier Detection Using the Head Brain Dataset 4m
- Module Summary 1m
- Module Overview 1m
- Representing Text Data in Numeric Form 5m
- Bag-of-words and Bag-of-n-grams Models 3m
- Vectorize Text Using the Bag-of-words Model 5m
- Vectorize Text Using the Bag-of-n-grams Model 3m
- Vectorize Text Using Tf-Idf Scores 3m
- Hashing for Dimensionality Reduction 3m
- Reducing Dimensions Using the Hashing Vectorizer 3m
- Performing Feature Extraction on a Python Dictionary 2m
- Module Summary 1m
- Module Overview 1m
- Representing Images as Matrices 3m
- Feature Extraction from Images 6m
- Extracting Patches from Image Data 4m
- Using Dictionary Learning to Denoise and Reconstruct Images 7m
- Clustering Image Data Using a Pixel Connectivity Graph 7m
- Clustering Images Using a Gradient Connectivity Graph 6m
- Module Summary 1m
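The outlier and novelty detection module works with Local Outlier Factor, Isolation Forest, and Elliptic Envelope. A minimal sketch of how these scikit-learn estimators are typically applied is shown below; the synthetic data and contamination values are assumptions made for illustration, not taken from the course demos.

```python
import numpy as np
from sklearn.neighbors import LocalOutlierFactor
from sklearn.ensemble import IsolationForest
from sklearn.covariance import EllipticEnvelope

rng = np.random.RandomState(0)
X = rng.normal(loc=0.0, scale=1.0, size=(200, 2))
X[:5] += 6.0  # shift a few points far from the bulk to act as outliers

# Each estimator labels inliers as +1 and outliers as -1.
lof_labels = LocalOutlierFactor(n_neighbors=20).fit_predict(X)
iso_labels = IsolationForest(contamination=0.05, random_state=0).fit_predict(X)
env_labels = EllipticEnvelope(contamination=0.05).fit_predict(X)

# Novelty detection: fit Local Outlier Factor on clean data only,
# then score previously unseen points.
lof_novelty = LocalOutlierFactor(n_neighbors=20, novelty=True).fit(X[5:])
print(lof_novelty.predict(X[:5]))            # -1 for points that look novel
print(lof_novelty.decision_function(X[:5]))  # negative values are more anomalous
print(lof_novelty.score_samples(X[:5]))      # raw anomaly scores
```

With novelty=True, Local Outlier Factor exposes predict, score_samples, and decision_function, which are presumably the methods covered in the "Using the Predict, Score Samples, and Decision Function" clip.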
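The text module covers bag-of-words, bag-of-n-grams, TF-IDF, hashing, and feature extraction from Python dictionaries. A minimal sketch with scikit-learn's vectorizers follows; the tiny corpus and parameter choices are illustrative assumptions.

```python
from sklearn.feature_extraction.text import (
    CountVectorizer, TfidfVectorizer, HashingVectorizer)
from sklearn.feature_extraction import DictVectorizer

corpus = ["the quick brown fox", "the lazy dog", "the quick dog"]

# Bag-of-words: one column per unique token, values are raw counts.
bow = CountVectorizer().fit_transform(corpus)

# Bag-of-n-grams: unigrams and bigrams become columns.
ngrams = CountVectorizer(ngram_range=(1, 2)).fit_transform(corpus)

# TF-IDF: down-weights tokens that appear in most documents.
tfidf = TfidfVectorizer().fit_transform(corpus)

# Hashing: maps tokens to a fixed number of columns to bound memory use.
hashed = HashingVectorizer(n_features=2 ** 8).fit_transform(corpus)

# Feature extraction from Python dictionaries: categorical keys are one-hot encoded.
records = [{"city": "Paris", "temp": 21.0}, {"city": "Oslo", "temp": 12.0}]
dict_features = DictVectorizer(sparse=False).fit_transform(records)

print(bow.shape, ngrams.shape, tfidf.shape, hashed.shape, dict_features.shape)
```

HashingVectorizer keeps the feature space at a fixed size at the cost of not being able to map columns back to the original tokens.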
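The image module represents images as matrices and extracts patches from them for tasks such as dictionary learning. The sketch below uses a sample image bundled with scikit-learn and extract_patches_2d; the patch size and patch counts are assumptions for illustration.

```python
from sklearn.datasets import load_sample_image
from sklearn.feature_extraction.image import (
    extract_patches_2d, reconstruct_from_patches_2d)

# A sample RGB image bundled with scikit-learn (loading it requires Pillow),
# stored as a (height, width, 3) array.
china = load_sample_image("china.jpg")
print(china.shape)

# Sample 500 random 16x16 patches, e.g. as input to dictionary learning.
patches = extract_patches_2d(china, (16, 16), max_patches=500, random_state=0)
print(patches.shape)  # (500, 16, 16, 3)

# When every overlapping patch is kept, the image can be rebuilt by averaging
# the overlapping regions.
crop = china[:64, :64]
all_patches = extract_patches_2d(crop, (16, 16))
rebuilt = reconstruct_from_patches_2d(all_patches, crop.shape)
print(rebuilt.shape)  # (64, 64, 3)
```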
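The course description also mentions explicit kernel approximations using the RBF and Nystroem methods. Below is a minimal sketch of scikit-learn's Nystroem and RBFSampler transformers feeding a linear model; the gamma, n_components, and synthetic data are illustrative assumptions.

```python
import numpy as np
from sklearn.kernel_approximation import Nystroem, RBFSampler
from sklearn.linear_model import SGDClassifier
from sklearn.pipeline import make_pipeline

rng = np.random.RandomState(0)
X = rng.normal(size=(300, 5))
y = (X[:, 0] * X[:, 1] > 0).astype(int)  # a target a plain linear model fits poorly

# Nystroem: approximates the RBF kernel using a subset of the training samples.
nystroem_features = Nystroem(kernel="rbf", gamma=0.5,
                             n_components=100, random_state=0).fit_transform(X)

# RBFSampler: approximates the RBF kernel with random Fourier features.
rff_features = RBFSampler(gamma=0.5, n_components=100,
                          random_state=0).fit_transform(X)

# Typical use: feed the explicit feature map into a linear model.
clf = make_pipeline(
    Nystroem(kernel="rbf", gamma=0.5, n_components=100, random_state=0),
    SGDClassifier(random_state=0))
clf.fit(X, y)
print(clf.score(X, y))
```

The explicit feature map makes data that is only separable with a kernel usable by fast linear learners, which is the trade-off the final module explores.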