Getting Started with HDFS
Learning to work with the Hadoop Distributed File System (HDFS) is a baseline skill for anyone administering or developing in the Hadoop ecosystem. In this course, you will learn how to work with HDFS, Hive, Pig, Sqoop, and HBase from the command line.
What you'll learn
Getting Started with Hadoop Distributed File System (HDFS) is designed to give you everything you need to use HDFS to read, store, and remove files. In addition to working with files in Hadoop, you will learn how to import data from relational databases into HDFS using Sqoop. Once the data is inside HDFS, you will learn how to use Pig and Hive to query it. Building on those HDFS skills, you will then look at how to use HBase for near real-time data processing. Whether you are a developer, administrator, or data analyst, the concepts in this course are essential to getting started with HDFS.
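To give a feel for what reading, storing, and removing files looks like in practice, here is a minimal sketch that assembles the corresponding `hdfs dfs` shell commands from Python. It is not taken from the course itself: the file name `stocks.csv`, the directory `/user/demo/stocks`, and the helper names `hdfs_cmd` and `run_all` are all hypothetical, and the script only prints the commands unless `dry_run` is turned off on a machine where the `hdfs` client is installed.

```python
import shlex
import subprocess


def hdfs_cmd(action, *args):
    """Build an `hdfs dfs` argument list, e.g. ['hdfs', 'dfs', '-rm', '/path']."""
    return ["hdfs", "dfs", f"-{action}", *args]


# Hypothetical paths, used for illustration only.
LOCAL_FILE = "stocks.csv"
HDFS_DIR = "/user/demo/stocks"

steps = [
    hdfs_cmd("mkdir", "-p", HDFS_DIR),            # create the target directory
    hdfs_cmd("put", LOCAL_FILE, HDFS_DIR),        # store a local file in HDFS
    hdfs_cmd("ls", HDFS_DIR),                     # list the directory contents
    hdfs_cmd("cat", f"{HDFS_DIR}/{LOCAL_FILE}"),  # read the file back
    hdfs_cmd("rm", f"{HDFS_DIR}/{LOCAL_FILE}"),   # remove the file
]


def run_all(commands, dry_run=True):
    """Print each command; execute it only when dry_run is False."""
    for cmd in commands:
        print(shlex.join(cmd))
        if not dry_run:
            subprocess.run(cmd, check=True)


if __name__ == "__main__":
    run_all(steps)  # dry run: prints the commands without touching a cluster
```

Keeping each command as an argument list rather than a single string avoids shell quoting problems if a path contains spaces, and the dry-run default makes it safe to try without a cluster.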
Table of contents
- Introduction 1m
- Actions in HDFS 2m
- Demo: Stock Data 5m
- HDFS Shell Interaction 3m
- Demo: Shell Interaction 2m
- HDFS Basic Commands 3m
- Demo: HDFS Basic Commands 7m
- Permissions in HDFS 4m
- Demo: Permissions in HDFS 5m
- HDFS Moving Data Shell Commands 3m
- Demo: HDFS Moving Data Shell Commands 4m
- HDFS Maintenance Shell Commands 2m
- Demo: HDFS Maintenance Shell Commands 3m
- Summary 1m