Site Reliability Engineering Foundation® (SREF) Certification Training

Description

The Site Reliability Engineering Foundation® (SREF) Certification Training introduces the principles and practices necessary for organizations to effectively scale critical services with reliability and efficiency. Implementing a site reliability engineering approach entails organizational realignment, a heightened emphasis on engineering and automation, and embracing new operational paradigms.

This course explores the evolution of SRE and its future trajectory. Using real-life scenarios and case studies, it equips participants with practices, methodologies, and tools for engaging stakeholders across the organization on reliability and stability. On completing the course, participants will have practical skills they can apply immediately, such as understanding, establishing, and monitoring Service Level Objectives (SLOs).

Training Objectives

The history of SRE and its emergence at Google
The inter-relationship of SRE with DevOps and other popular frameworks
The underlying principles behind SRE
Service Level Objectives (SLOs) and their user focus
Service Level Indicators (SLIs) and the modern monitoring landscape
Error budgets and the associated error budget policies
Toil and its effect on an organisation’s productivity
Some practical steps that can help to eliminate toil
Observability as an indicator of the health of a service
SRE tools, automation techniques, and the importance of security
Anti-fragility, our approach to failure, and failure testing
The organisational impact that introducing SRE brings

Course Outline

Course Introduction
Course Goals
Course Agenda
Module 1: SRE Principles & Practices
What is Site Reliability Engineering?
SRE & DevOps: What is the Difference?
SRE Principles & Practices
Module 2: Service Level Objectives & Error Budgets
Service Level Objectives (SLOs)
Error Budgets
Error Budget Policies
Module 3: Reducing Toil
What is Toil?
Why is Toil Bad?
Doing Something About Toil
Module 4: Monitoring & Service Level Indicators
Service Level Indicators (SLIs)
Monitoring
Observability
Module 5: SRE Tools & Automation
Automation Defined
Automation Focus
Hierarchy of Automation Types
Secure Automation
Automation Tools
Module 6: Anti-Fragility & Learning from Failure
Why Learn from Failure
Benefits of Anti-Fragility
Shifting the Organisational Balance
Module 7: Organisational Impact of SRE
Why Organisations Embrace SRE
Patterns for SRE Adoption
On-Call Necessities
Blameless Post-Mortems
SRE & Scale
Module 8: SRE, Other Frameworks, The Future
SRE & Other Frameworks
The Future
Additional Sources of Information
Exam Preparations
Exam Requirements, Question Weighting, and Terminology List
Sample Exam Review
