Building Statistical Models Using StatsModels
It is important to use statistical tools well to explore and analyze your data before using it in ML models. This course teaches how basic statistical models work and how you can use the StatsModel Python package for estimation and exploration.
What you'll learn
It is becoming ever-easier to build and use machine learning models, but it still is a challenge to use those models intelligently, and without committing any egregious modeling errors. In this course, Building Statistical Models Using StatsModels, you will learn to intuitively understand how to approach statistical techniques and apply them without getting bogged down in arcane mathematics. First, you will delve into tests of statistical significance by using the T-test to see whether the differences in two samples of a population are different and how to tell if the differences are statistically significant. Next, you will explore how to use the Analysis of Variance (ANOVA) techniques to compare several different population samples to see whether they differ on the basis of single or multiple factors. Then, you will discover a number of different regression models, such as generalized (or weighted) least squares regression, which are typically used with heteroscedastic data and robust linear models to cope with outliers. Finally, you will learn specialized statistical models that work with time-series data, including autoregressive and moving average models, and the ARMA family which combines both of these. By the end of this course, you will have developed an intuitive understanding of statistics and will be able to apply that intuition to your own specific use case using the StatsModel Python library.
Table of contents
- Version Check 0m
- Module Overview 2m
- Prerequisites and Course Outline 2m
- Mean, Standard Deviation, and the Normal Distribution 5m
- Hypothesis Testing 7m
- The T-test 6m
- Demo: Implementing the T-test 8m
- Introducing ANOVA 6m
- Introducing Linear Regression 3m
- Two-way ANOVA 3m
- Demo: Implementing ANOVA on Linear Models 6m
- Demo: Implementing Two-way ANOVA 4m
- Skewness and Kurtosis 4m
- Demo: Analyzing Data for Skewness and Kurtosis 5m
- Module Overview 1m
- Regression Assumptions 4m
- Working with Heteroscedastic Data 7m
- Demo: Performing Weighted Least Squares Regression 7m
- Generalized Linear Models 5m
- Demo: Applying a Generalized Linear Model on Binomial Data 6m
- Robust Linear Regression 1m
- Demo: Applying Robust Linear Models to Data Containing Outliers 4m