Splitting and Combining Data with R
If you’ve struggled to categorize dates, clean strings, or order bars in ggplot, this course is for you. Learn the basics of splitting and combining data, variable cleaning and creation, grouping and summarizing data, and creating visualizations.
What you'll learn
Summarizing statistics across groups is invaluable for comparing categories of observations. In this course, Splitting and Combining Data with R, you'll explore splitting data into groups based on some criteria, applying functions or calculations to each group independently, and combining the results into a data structure. To begin, you’ll learn how to create custom categorical variables for grouping, and custom numeric variables to which you can apply functions. Next, with the criteria for grouping created, you will split the data, apply functions, and combine the data into a data structure. Finally, with the raw data transformed, you’ll discover how a grouped dataframe can then be ungrouped with summary statistics maintained, or keep the grouped dataframe intact with plotting functions for visualizing variation between groups. By the end of this course, you’ll have a better understanding of how to use R to build data pipelines with dplyr, manipulate strings and dates for feature engineering, and create customized ggplot charts. .
Table of contents
- Course Introduction 1m
- Variable Creation Basics 5m
- Demo: Variable Creation Basics 8m
- Summary: Variable Creation Basics 1m
- Variable Creation: Tools for Strings 4m
- Tools for Strings: Pattern Based Functions 5m
- Tools for Strings: Splitting and Consolidating 2m
- Demo: Variable Creation with Strings 6m
- Summary: Tools for Strings 2m
- Working with Dates and Times 4m
- Variable Creation: Time Based Categories 3m
- Demo: Variable Creation with Dates and Times 6m
- Summary: Variable Creation with Dates and Times 1m
- Intro to Grouping and Summarizing 2m
- Demo: Basic Grouping and Summarizing 6m
- Sorting a Grouped Dataset 4m
- Demo: Utilizing Position Based Functions 4m
- More Functions for Aggregation 2m
- Demo: Repeated Grouping 5m
- Joining Datasets 1m
- Demo: Joining Grouped and Ungrouped Datasets 5m
- Summary: Grouping and Summarizing 2m