This three-day instructor-led course teaches participants techniques for monitoring, troubleshooting, and improving infrastructure and application performance in Google Cloud. Guided by the principles of Site Reliability Engineering (SRE), and using a combination of presentations, demos, hands-on labs, and real-world case studies, attendees gain experience with full-stack monitoring, real-time log management and analysis, debugging code in production, tracing application performance bottlenecks, and profiling CPU and memory usage.
This class is intended for the following participants:
To get the most out of this course, participants should have:
This course teaches participants the following skills:
	Module 1
	Introduction to Google Cloud Monitoring Tools
	Module 2
	Avoiding Customer Pain
	Module 3
	Alerting Policies
	Module 4
	Monitoring Critical Systems
	Module 5
	Configuring Google Cloud Services for Observability
	Module 6
	Advanced Logging and Analysis
	Module 7
	Monitoring Network Security and Audit Logs
	Module 8
	Managing Incidents
	Module 9
	Investigating Application Performance Issues
	Module 10
	Optimizing the Costs of Monitoring