Getting Started with HBase: The Hadoop Database
As the data you store expands in size, traditional relational databases may no longer work. HBase has the ability to deal with billions of rows of data and each record can contains millions of fields. This course will help you get started with HBase.
What you'll learn
Billions of records with millions of fields of semi-structured, unformatted data is the reality of the kind of data we are storing today. Traditional databases are bound by strict data layout requirements and constraints that, unfortunately, do not scale to meet big data requirements. HBase reimagines how data can be stored in a distributed system. This course, Getting Started with HBase: The Hadoop Database, teaches you how to use HBase from the start to finish. First, you'll learn how to design and layout data in a columnar format in order to optimize disk seeks and reduce read latency. Next, you'll learn how to manipulate and access this data using the command line HBase shell as well as the HBase Java API. Finally, you'll learn to process this data by performing complex aggregation and grouping operations using the MapReduce programming model with HBase. By the end of this course, you'll be ready to start making your data much more manageable using HBase.
Table of contents
- MapReduce for Complex Operations on Hbase Tables 2m
- The Map and Reduce Phases 4m
- MapReduce on an Hbase Table 4m
- Map and Reduce Classes in Hadoop 3m
- Summarizing Census Data Using MapReduce 4m
- Specifying Dependencies on Hadoop and HBase Libraries 2m
- The Mapper Class 5m
- The Reducer Class 4m
- The Main Class 5m
- Submitting the MapReduce Job to Hadoop 7m