This three-day instructor-led course teaches participants techniques for monitoring, troubleshooting, and improving infrastructure and application performance in Google Cloud. Guided by the principles of Site Reliability Engineering (SRE), and using a combination of presentations, demos, hands-on labs, and real-world case studies, attendees gain experience with full-stack monitoring, real-time log management and analysis, debugging code in production, tracing application performance bottlenecks, and profiling CPU and memory usage.
This class is intended for the following participants:
To get the most out of this course, participants should have:
This course teaches participants the following skills:
Module 1
Introduction to Google Cloud Monitoring Tools
Module 2
Avoiding Customer Pain
Module 3
Alerting Policies
Module 4
Monitoring Critical Systems
Module 5
Configuring Google Cloud Services for Observability
Module 6
Advanced Logging and Analysis
Module 7
Monitoring Network Security and Audit Logs
Module 8
Managing Incidents
Module 9
Investigating Application Performance Issues
Module 10
Optimizing the Costs of Monitoring