- Lab
- A Cloud Guru
Write Data to Amazon Kinesis Data Streams Using a Studio Notebook
In this lab you will step into the shoes of a data engineer at Pleasantville Taxis. You have been given a big data set to go explore via Apache Zeppelin to understand what data has been collected and to load the data from S3 into a Kinesis Data Stream.
Path Info
Table of Contents
-
Challenge
Upload Data to an S3 bucket
Upload the
taxi-trips.csv
from the lab GitHub repo into the S3 bucket provided. -
Challenge
Run the Kinesis Data Analytics Notebook
From the Kinesis console, run the KDA-studio-1-Kinesislab notebook provided and launch in Apache Zeppelin.
-
Challenge
Import and Execute the Notebook Using Apache Zeppelin
- Download the Zeppelin notebook
Taxi Trips Data Loading from S3 to Kinesis-1.zpln
from the GitHub Repo. - Import the notebook as a note in Apache Zeppelin.
- Execute the various commands to load data from S3 to the Kinesis Data Stream.
- Download the Zeppelin notebook
-
Challenge
Create a CloudFormation Stack
Create a CloudFormation stack using the CFN Build.yaml file provided in the GitHub repo. Name the stack
kinesislab
. -
Challenge
Create an Amazon Kinesis Data Stream
Create a Kinesis Data Stream named
input-stream
.
What's a lab?
Hands-on Labs are real environments created by industry experts to help you learn. These environments help you gain knowledge and experience, practice without compromising your system, test without risk, destroy without fear, and let you learn from your mistakes. Hands-on Labs: practice your skills before delivering in the real world.
Provided environment for hands-on practice
We will provide the credentials and environment necessary for you to practice right within your browser.
Guided walkthrough
Follow along with the author’s guided walkthrough and build something new in your provided environment!
Did you know?
On average, you retain 75% more of your learning if you get time for practice.