In this course, develop valuable, in-demand skills that you can apply as a reliability engineer. Discover engineering strategies that cover chaos engineering practices, observability and monitoring techniques, disaster recovery exercises, reliability metrics, fast data-driven decision-making, and more. Along the way, learn how to automate operations to shorten time to detect and time to restore using modern cloud services, LLMs, and best-in-class tools. This course is an ideal fit for software engineers and development teams responsible for designing, deploying, or maintaining cloud-native applications.
This course was created by Pearson, Mariya Breyter, and Carlos Rojas. We are pleased to host this training in our library.
