Have a Question About This Course?





    Image

    Site Reliability Engineering Practitioner® (SREP) Certification Training

    Site Reliability Engineering Practitioner® (SREP) Certification Training
    The Site Reliability Engineering Practitioner® (SREP) Certification course introduces methods for scaling services reliably and cost-effectively within an organization. This training for SRE Practitioners delves into strategies aimed at enhancing agility, fostering cross-functional collaboration, and ensuring transparency regarding service health. The course emphasizes the principles of resilience through design, automation, and closed-loop remediation processes.

    Site Reliability Engineering Practitioner® (SREP) Certification Training Objectives

    • Successfully implement a flourishing SRE culture in your organisation.
    • Manage the organisational impact of introducing SRE.
    • Build security and resilience by design in a distributed
    • zero-trust environment.
    • Prepare for the DevOps Institute SRE Practitioner certification exam.
    • Participation in unique exercises designed to apply concepts.
    • Get sample documents
    • templates
    • tools
    • and techniques.
    • Access to additional value-added resources and communities.
    • Continue learning and face new challenges with after-course one-on-one instructor coaching.

    Need Assistance Finding the Right Training Solution

    Our Consultants are here to assist you

    Key Point of Training Pragrams

    We have different work process to go step by step for complete our working process in effective way.
    • Site Reliability Engineering Foundation® (SREF) Certification Training Prerequisites

      It is highly recommended that learners attend course Site Reliability Engineering Foundation® (SREF) Certification Training, before attending the SRE Practitioner course.
      An understanding and knowledge of common SRE terminology, concepts, principles, and related work experience are recommended.

    • Site Reliability Engineering Foundation® (SREF) Certification Training Delivery Methods

      In-Person

      Online

    • Site Reliability Engineering Foundation® (SREF) Certification Training Outline

      Module 1: SRE Anti-Patterns
      Rebranding Ops or DevOps or Dev as SRE
      Users notice an issue before you do
      Measuring until my Edge
      False positives are worse than no alerts
      Configuration management trap for snowflakes
      The Dogpile: Mob incident response
      Point fixing
      Production Readiness Gatekeeper
      Fail-Safe really?

      Module 2: SLO is a Proxy for Customer Happiness
      Define SLIs that meaningfully measure the reliability of a service from a user’s perspective
      Defining System boundaries in a distributed ecosystem for defining correct SLIs
      Use error budgets to help your team have better discussions and make better data-driven decisions
      Overall, reliability is only as good as the weakest link on your service graph
      Error thresholds when 3rd party services are used

      Module 3: Building Secure and Reliable Systems
      SRE and their role in Building Secure and Reliable systems
      Design for Changing Architecture
      Fault-tolerant Design
      Design for Security
      Design for Resiliency
      Design for Scalability
      Design for Performance
      Design for Reliability
      Ensuring Data Security and Privacy

      Module 4: Full-Stack Observability
      Modern Apps are Complex & Unpredictable
      Slow is the new down
      Pillars of Observability
      Implementing Synthetic and End-user monitoring
      Observability driven development
      Distributed Tracing
      What happens to monitoring?
      Instrumenting using Libraries and Agents

      Module 5: Platform Engineering and AIOPs
      Taking a Platform Centric View solves Organisational scalability challenges such as fragmentation, inconsistency, and unpredictability
      How do you use AIOps to improve resiliency?
      How can DataOps help you in the journey?
      A simple recipe to implement AIOps
      Indicative measurement of AIOps

      Module 6: SRE & Incident Response Management
      SRE Key Responsibilities towards incident response
      DevOps & SRE and ITIL
      OODA and SRE Incident Response
      Closed Loop Remediation and the Advantages
      Swarming – Food for Thought
      AI/ML for better incident management

      Module 7: Chaos Engineering
      Navigating Complexity
      Chaos Engineering Defined
      Quick Facts about Chaos Engineering
      Chaos Monkey Origin Story
      Who is adopting Chaos Engineering?
      Myths of Chaos
      Chaos Engineering Experiments
      GameDay Exercises
      Security Chaos Engineering
      Chaos Engineering Resources

      Module 8: SRE is the Purest form of DevOps
      Key Principles of SRE
      SREs help increase reliability across the product spectrum
      Metrics for Success
      Selection of Target areas
      SRE Execution Model
      Cultural and Behavioral Skills are key
      SRE Case study