Handling and Analyzing Data with AWS Elastic MapReduce
In this course, you'll learn how to use popular big data tools and ML frameworks to process and analyze data within AWS pipelines.
What you'll learn
Many people hear about big data analysis, but how can you apply it to your own use cases? In this course, Handling and Analyzing Data with AWS Elastic MapReduce, you’ll gain the foundational knowledge needed to analyze data with AWS Elastic MapReduce (EMR). First, you’ll explore configuring AWS EMR and Hadoop. Next, you’ll discover how to process, move, and query data using big data frameworks. Finally, you’ll learn how to stream and analyze data using Apache products and MLlib. When you’re finished with this course, you’ll have the AWS EMR skills and knowledge needed to handle and analyze your own big data sets.
Table of contents
- Module Overview 1m
- Apache Hive and EMR Intro 1m
- Connecting EMR to DynamoDB Overview 2m
- DynamoDB Demo Intro 1m
- Exporting DynamoDB Data - Part 1 5m
- Exporting DynamoDB Data - Part 2 8m
- Looking at EMR and Redshift 2m
- Demo: Redshift and EMR - Part 1 7m
- Demo: Redshift and EMR - Part 2 8m
- HBase and EMR Overview 2m
- Configuring and Using HBase on EMR 5m
- Taking a Look at PrestoDB 2m
- Demo: Presto with EMR and S3 8m
- Module Summary 1m