Maintaining, Monitoring and Troubleshooting Kubernetes
This course will teach you to maintain, monitor and troubleshoot production Kubernetes clusters.
What you'll learn
Troubleshooting production Kubernetes clusters can be difficult. In this course, Maintaining, Monitoring and Troubleshooting Kubernetes, you’ll learn to maintain, monitor, and troubleshoot Kubernetes clusters. First, you’ll learn core cluster maintenance tasks such as etcd backup and restore operations, upgrading an existing Kubernetes cluster, and facilitating worker node maintenance, along with high availability cluster topologies. Next, you’ll discover how to quickly find and analyze performance monitoring and logging data in your cluster. Finally, you’ll explore core tools and techniques for troubleshooting your cluster. When you’re finished with this course, you’ll have the maintenance, monitoring, and troubleshooting skills needed to operate and manage Kubernetes clusters. This course can also help you prepare for the Certified Kubernetes Administrator (CKA) certification.
Table of contents
- Introduction, Course and Module Overview 2m
- Introducing etcd and etcd Backup Concepts 3m
- Getting etcdctl and Backing up etcd with etcdctl 3m
- Restoring etcd with etcdctl 5m
- Demo: Investigating etcd Configuration 6m
- Demo: Backing up etcd with etcdctl 3m
- Demo: Restoring etcd to the Same Data Directory 3m
- Demo: Restoring etcd to a New Data Directory 3m
- Cluster Upgrade Process - Overview 2m
- Cluster Upgrade Process - Control Plane 3m
- Cluster Upgrade Process - Worker Nodes 2m
- Demo: Upgrading a Control Plane Node 5m
- Demo: Upgrading a Worker Node 2m
- Worker Node Maintenance 2m
- High Availability Cluster Overview and High Availability Cluster Topology - Stacked etcd 4m
- High Availability Cluster Topology - External etcd 2m
- Module Review and What's Next! 0m
- Introduction, Course and Module Overview 1m
- Kubernetes Logging Architecture and Logging in Pods and Containers 5m
- Logging Architecture - Nodes 1m
- Logging Architecture - Control Plane 1m
- Understanding and Accessing Cluster Events 1m
- Demo: Kubernetes Logging Architecture - Pods 7m
- Demo: Kubernetes Logging Architecture - Nodes 3m
- Demo: Kubernetes Logging Architecture - Control Plane Pods 3m
- Demo: Working with Kubernetes Events 5m
- Accessing Object Data with JSONPath 5m
- Filtering Objects with JSONPath 2m
- Demo: Accessing Object Data with JSONPath 5m
- Demo: Filtering and Sorting Object Data with JSONPath 4m
- Monitoring in Kubernetes and the Kubernetes Metrics Server 2m
- Demo: Deploying the Kubernetes Metrics Server 3m
- Demo: Using kubectl top to Analyze Resource Consumption on Pods and Nodes 4m
- Module Review and What's Next! 1m
- Introduction, Course and Module Overview 1m
- Troubleshooting Tools and Troubleshooting Nodes 3m
- Managing the kubelet with systemd 1m
- Demo: Troubleshooting Nodes - Stopped kubelet 3m
- Demo: Troubleshooting Nodes - Inaccessible kubelet config.yaml 2m
- Demo: Troubleshooting Nodes - Misconfigured systemd Unit File 3m
- Troubleshooting - Control Plane 2m
- Demo: Troubleshooting Control Plane - Inaccessible Static Pod Manifests 4m
- Demo: Troubleshooting Control Plane - Misconfigured Static Pod Manifests 4m
- Troubleshooting Workloads 0m
- Module Review and Thank You! 1m