Architecting Big Data Solutions Using Google Bigtable
Google Bigtable is a sophisticated NoSQL offering on the Google Cloud Platform with extremely low latencies. By the end of this course, you'll understand why Bigtable is much more powerful offering than HBase, with linear scaling of your data.
What you'll learn
Bigtable is Google’s proprietary storage service that offers extremely fast read and write speeds. It uses a sophisticated internal architecture which learns access patterns and moves around your data to mitigate the issue of hot-spotting.
In this course, Architecting Big Data Solutions Using Google Bigtable, you’ll learn both the conceptual and practical aspects of working with Bigtable. You’ll learn how to best to design your schema to enable fast reads and write speeds and discover how data in Bigtable can be accessed using the command line as well as client libraries.
First, you’ll study the internal architecture of Bigtable and how data is stored within it using the 4-dimensional data model. You’ll also discover how Bigtable clusters, nodes, and instances work and how Bigtable works with Colossus - Google’s proprietary storage system behind the scenes.
Next, you’ll access Bigtable using both the HBase shell as well as cbt, Google’s command line utility. Later, you'll create and manage tables while practice exporting and importing data using sequence files.
Finally, you’ll study how manual fail-overs can be handled when we have single cluster routing enabled.
At the end of this course, you’ll be comfortable working with Bigtable using both the command line as well as client libraries.
Table of contents
- Module Overview 2m
- Prerequisites and Course Outline 3m
- Introducing Bigtable 7m
- Bigtable vs. Other GCP Services 6m
- Storage Model 7m
- Instances, Clusters, Nodes, and Tablets 3m
- Replication 6m
- Schema Design 6m
- Understanding Performance 3m
- Pricing 1m
- Enabling Bigtable APIs 2m
- Creating a Bigtable Instance Using the Web Console 4m
- Editing a Bigtable Instance 3m
- Creating a Bigtable Instance Using the Command Line 4m
- Module Overview 1m
- Connecting to Bigtable Using the HBase Shell 2m
- Creating Tables and Adding Rows 4m
- Using the cbt Tool to Work with Bigtable 5m
- Writing and Reading Records Using cbt 3m
- Using a Service Account to Authenticate to Bigtable 2m
- Exporting Bigtable Data as a Sequence File 5m
- Importing Data from a Sequence File 4m
- Creating Tables, Adding Rows to Bigtable Using Python 5m
- Reading Rows Using Python 4m
- Module Overview 1m
- Replication and Application Profiles for Routing 4m
- Specifying Application Profiles in Client Applications 3m
- Handling Manual Failover 4m
- Monitoring a Bigtable Instance Using Stackdriver 4m
- Visualizing Monitoring Metrics Using Matplotlib 3m
- Programmatically Scaling Clusters 4m
- Summary and Further Study 2m