Managing Cloud Resources Using Google Stackdriver
This course covers all the important aspects of Stackdriver Monitoring, which works across all Google Cloud Platform resources and makes it convenient to set up uptime checks, profiling, and integration with other cloud platforms and monitoring tools.
What you'll learn
Stackdriver Monitoring is a powerful and versatile cloud monitoring tool that is tightly integrated with virtually every service on the Google Cloud Platform. Mastering its nuances can significantly improve the performance and design of your architecture and simplify troubleshooting. In this course, Managing Cloud Resources Using Google Stackdriver, you will gain the ability to monitor your cloud resources, track both system and user-defined metrics, and respond to alerts using Stackdriver Monitoring.
First, you will learn Stackdriver concepts such as metrics, monitored resources, workspaces, and alerting policies. Along the way, you will learn how to install the Stackdriver monitoring agent, and when that agent is and is not required. Next, you will discover how to monitor third-party applications and work with custom metrics. You will create resources to monitor, along with metrics associated with those resources, and then use the Metrics Explorer to create dashboards that keep track of those metrics. You will also configure uptime checks and alerts that notify you when resource health is not satisfactory. Stackdriver supports uptime checks over HTTP, HTTPS, and TCP. The probes sent by these checks are subject to VPC firewall rules, so those rules must be set up correctly as well.
Finally, you will learn how to create checks for the absence of metrics, set variables in alerts, explore incidents and events, and integrate with third-party tools. Specifically, you will integrate Stackdriver Monitoring with OpsGenie, an alerting and incident management platform. You will round out the course by working programmatically with the Stackdriver Monitoring API from within Datalab Python notebooks. When you’re finished with this course, you will have the skills and knowledge of Stackdriver Monitoring needed to monitor, troubleshoot, and analyze the usage of your cloud resources.
Table of contents
- Module Overview 1m
- Prerequisites and Course Outline 3m
- Introducing the Stackdriver Suite of Tools 4m
- Stackdriver Monitoring 6m
- Pricing 1m
- Creating Resources to Monitor 6m
- Using the Metrics Explorer to Create Dashboards 8m
- Multiple Projects in the Same Workspace and Sharing Workspaces 4m
- Installing the Stackdriver Monitoring Agent 2m
- Uptime Checks and Alerts 7m
- Module Overview 1m
- Third-party Applications: Monitoring the Apache Webserver 5m
- Third-party Applications: Installing Elasticsearch 3m
- Third-party Applications: Monitoring Elasticsearch 2m
- Custom Metrics in Python Using Cloud Monitoring APIs 4m
- Writing Custom Metrics to Stackdriver 3m
- Creating and Monitoring Resource Groups 4m
- Uptime Checks and Alerts on Resource Groups 3m
- Configuring Alerts for Metric Absence 2m
- Documentation Variables in Alerts 2m
- Exploring Incidents and Events 4m
- Integrating Stackdriver with OpsGenie 6m