Preparing a Production Hadoop Cluster with Cloudera: Databases
Big Data is a natural evolution of data analysis, scaling beyond the limits of conventional databases. However, they're still an important part of a Hadoop cluster. Learn how to setup databases for Cloudera CDH and install a production grade cluster.
What you'll learn
Big Data is a natural evolution of data analysis, scaling beyond the limits of conventional databases. However, this does not mean that databases are dead. On the contrary, they are still an important part of a Hadoop cluster and used to store all kinds of information by multiple services. In this course, Preparing a Production Hadoop Cluster with Cloudera: Databases, you'll learn how to setup databases for Cloudera CDH and install a production grade cluster using Cloudera's Installation Path B. First, you'll discover how to select, initialize, and install a supported database. Next, you'll explore how to configure a database with Cloudera's recommended settings, and how to create databases with CDH services. Finally, you'll learn how to complete a CDH deployment. By the end of this course, you'll be able to deploy a production grade cluster.
Table of contents
- Preparing a Production Hadoop Cluster with Cloudera: Databases 3m
- Roadmap to Understanding Databases in Hadoop with Cloudera 2m
- A Story of Databases in Big Data 2m
- Checking Cloudera's Documentation: DBs and OS for CDH 3m
- Course Objective: Path B Install for a Production Hadoop Cluster 1m
- Takeaway 1m
- Setting up a Production Database for Your Hadoop Cluster 1m
- Selecting the Right Database 3m
- What's the Difference Between MySQL and MariaDB? 3m
- Comparing Databases: Oracle, MariaDB, MySQL, PostgreSQL & SQLite 4m
- Our Pick for an External Database: MySQL 1m
- Understanding Repositories: Yum Package Manager and RPM 3m
- Demo: Downloading, Installing, and Starting MySQL on Linux (RHEL) 2m
- Demo: Using AWS Relational Database Services (RDS) 5m
- Demo: Picking an AWS MySQL AMI 2m
- Takeaway 1m
- Configuring Your Database and Deploying Cloudera Manager (Path B) 1m
- Configuring Database Options & Variables for Deploying CDH 3m
- Potential MySQL Misconfiguration: InnoDB vs. MyISAM Engine 1m
- Demo: Setting my.cnf and mysql_secure_installation in MySql 3m
- Demo: Installing JDBC Required Drivers 2m
- Takeaway 1m
- Preparing Your Database and Deploying CDH 1m
- A Few Prerequisites to Install Your Cluster Demo 2m
- Installing JDK, Cloudera Manager Server, and Daemons Demo 4m
- Avoiding a Common Cloudera Manager Installation Mistake 0m
- Preparing Cloudera Manager to Use an External Database 2m
- Demo: Script for a Production Cluster -> scm_prepare_database.sh 3m
- Demo: Creating Databases for CDH Services 2m
- Completing CDH Path B Installation for a Production Cluster 1m
- License, CDH Edition, Hosts, Versions, Credentials, & Agents Demo 4m
- Selecting CDH Services, Role Assignments, Databases, and First Run 4m
- Databases in Hadoop Featuring HUE's Db Query App Demo 3m
- Takeaway 1m
- Preparing Your Database for High Availability 1m
- Understanding High Availability: An Overview 2m
- Demo: Configuring Database Replication in MySQL for HA 2m
- Configuring the Replication Master Demo 2m
- Configuring the Replication Slave Demo 2m
- Restarting Services and Confirming Replication 2m
- The Importance of Backing up Databases 2m
- Takeaway 1m