Experimental Design for Data Analysis
This course covers the conceptual and practical aspects of building and evaluating machine learning models in a way that uses data judiciously, while accounting for ordering, relationships within the data, and other sources of bias.
What you'll learn
Providing crisp, clear, actionable points of view to senior executives is an increasingly important responsibility of data scientists and other data professionals. Such a point of view should represent a hypothesis, ideally backed by data. In this course, Experimental Design for Data Analysis, you will gain the ability to construct such hypotheses from data and use rigorous frameworks to test whether they hold true. First, you will learn how inferential statistics and hypothesis testing form the basis of data modeling and machine learning. Next, you will discover how building a machine learning model is akin to designing an experiment, and how training and validation techniques help rigorously evaluate the results of such experiments. Then, you will study various forms of cross-validation, including both singular and iterative techniques for independent, identically distributed data as well as grouped data. Finally, you will learn how to refine your models using these techniques together with hyperparameter tuning. When you’re finished with this course, you will have the skills and knowledge to build and evaluate models, including machine learning models, using rigorous cross-validation frameworks and hyperparameter tuning.
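As a taste of the evaluation workflow described above, here is a minimal sketch of K-fold cross-validation combined with hyperparameter tuning in scikit-learn. The dataset, estimator, and parameter grid are illustrative assumptions, not the examples used in the course.

```python
# A minimal sketch of cross-validation plus hyperparameter tuning in scikit-learn.
# The dataset, estimator, and parameter grid below are illustrative assumptions.
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV, KFold, cross_val_score

X, y = load_iris(return_X_y=True)

# Evaluate a single model with 5-fold cross-validation.
cv = KFold(n_splits=5, shuffle=True, random_state=42)
scores = cross_val_score(LogisticRegression(max_iter=1000), X, y, cv=cv)
print("Mean CV accuracy:", scores.mean())

# Refine the model by tuning the regularization strength C over the same folds.
grid = GridSearchCV(
    LogisticRegression(max_iter=1000),
    param_grid={"C": [0.01, 0.1, 1.0, 10.0]},
    cv=cv,
)
grid.fit(X, y)
print("Best C:", grid.best_params_["C"], "best score:", grid.best_score_)
```

Evaluating candidate hyperparameters on the same folds used for model assessment keeps the comparison consistent across configurations; the course goes into when and how to separate tuning from final evaluation.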
Table of contents
- Module Overview 1m
- Cross-validation in the ML Workflow 2m
- Singular Cross-validation 4m
- Cross-validation Using Azure ML Studio 6m
- K-fold Cross-validation and Variants 6m
- K-fold Cross-validation in scikit-learn 7m
- Repeated K-fold Cross-validation in scikit-learn 4m
- Stratified K-fold Cross-validation in scikit-learn 5m
- Group K-fold in scikit-learn 4m
- Summary 1m
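The K-fold modules above map onto scikit-learn's cross-validation iterators. The sketch below, using toy data and assumed group labels, shows how each splitter partitions the same samples; it is illustrative only and not taken from the course materials.

```python
# Illustrative use of the K-fold variants named in the modules above.
# The toy data and group labels are assumptions for the sake of the sketch.
import numpy as np
from sklearn.model_selection import GroupKFold, KFold, RepeatedKFold, StratifiedKFold

X = np.arange(24).reshape(12, 2)                          # 12 samples, 2 features
y = np.array([0, 0, 0, 0, 0, 0, 1, 1, 1, 1, 1, 1])        # class labels
groups = np.array([0, 0, 1, 1, 2, 2, 3, 3, 4, 4, 5, 5])   # e.g. one group per subject

splitters = {
    "KFold": KFold(n_splits=3, shuffle=True, random_state=0),
    "RepeatedKFold": RepeatedKFold(n_splits=3, n_repeats=2, random_state=0),
    "StratifiedKFold": StratifiedKFold(n_splits=3, shuffle=True, random_state=0),
    "GroupKFold": GroupKFold(n_splits=3),
}

for name, splitter in splitters.items():
    print(name)
    # StratifiedKFold uses y to preserve class balance in each fold;
    # GroupKFold uses groups to keep all samples from one group in the same fold.
    for train_idx, test_idx in splitter.split(X, y, groups=groups):
        print("  test:", test_idx)
```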