Site Reliability Engineering Practitioner 16 hours

Site Reliability Engineering (SRE) Practitioner™ certification is to impart, test and validate knowledge of SRE vocabulary, principles and practices.

Module 1 : Implementing SRE

  • Implementing SRE principles

  • Implementing SRE practices

  • Implementing SRE Models

  • SRE Patterns and Anti -Patterns

  • Use Case: IT Automation

  • Video: What is Site Reliability

45min.

Module 2 : On-Call

  • What is on call?

  • On Call Challenges

  • On-Call Rotations

  • On-call rotation best practices

  • Benefits of on -call rotation

  • On call SRE Team Building

  • On-call Incentives

  • Use Case: On-Call Team Structure

  • Module Quiz

  • Tool Demo: PagerDuty for AIOps Solution

1hr. 10min.

Module 3 : Managing Incident Response

  • What is Incident Response?

  • Incident Command System (ICS)

  • Incident response Team

  • Metrics for effective Incident Response

  • Best Practices

  • Use Case: Automatic Security

  • Module Quiz

  • TOOL Demo – Major Incident Management

 2hrs. 27min.

Module 4 : Blameless Incident Postmortems

  • Incident postmortem

  • Why Incident Postmortems

  • Streamline the postmortem Process

  • Blameless postmortem

  • Incident Postmorten best Practices

  • Use Case: Network Outage Playbook

  • Module Quiz

2hrs 1min.

Module 5 : Data and CI/CD Pipeline

  • What is a data pipeline?

  • Data pipeline components

  • Data pipeline types and use cases

  • Implementation options for data pipelines

  • Continuous Delivery Pipeline

  • Data Pipelines On AWS, Microsotf Azure, GCP

  • Use Case: Traditional analytics

  • Use Case: Real-time analytics

  • Use Case: ML Pipeline

  • Module Quiz

  • AWS CodePipeline Demo: Best Practices for Data Pipelines Release Process 

1hr. 35min.

Module 6 : MLOps

  • Define MLOps

  • Evolution of the MLOps

  • ML/AI and MLOps capabilities

  • Implement AI/ML

  • AI/ML/MLOps Stack Canvas

  • Deploy ML/AI

  • MLOps Maturity Level

  • MLOps Infrastructure Stack

  • MLOps Principles

  • Use Case

  • Module Quiz

  • Tool Demo: Data Versioning and Data Lineage With Pachyderm

3hr. 15min.

Module 7 : Deployment Strategies

  • Deployment Strategies Defined

  • Deployment Best Practices

  • Deployment Use Cases

  • Monolith Vs SOA Vs Microservices

  • Tool Demo: Kubernetes Blue Green Deployment

  • Module Quiz

1hr. 35min.

Module 8 : Observability

  • What is Observability?

  • Primary Data Classes of Observability

  • Objectives of Observability

  • Difference between Monitoring and Observability

  • Synthetic Monitoring Vs Real User Monitoring

  • Why Architectures Require Observability

  • Building a Continuously Observable System

  • Use and benefits of observability for SREs

  • Enterprise Observability Strategy

  • Measuring Organizations Observability

  • Observability in containers and microservices

  • Criteria for good observability tools

  • Module Quiz

  • Tool Overview: Elastic Observability

1hr. 35min.

Gilbert Kapswara

Instructor

Duis egestas aliquet maecenas erat eros, fringilla et leo eget, viverpretium. Quisque sed augue tincidunt, posuere dui tempor, dapibus nisi. Donec vel lectus sapien. Pellentesque habitant morbi tristique senectus et netus et malesuada fames ac turpis egestas.

SRE Fundamentals

Duis egestas aliquet aliquet. Maecenas erat eros, fringilla et leo eget, viverra pretium nulla. Quisque sed augue tincidunt, posuere dui tempor.

SRE Leader

Duis egestas aliquet aliquet. Maecenas erat eros, fringilla et leo eget, viverra pretium nulla. Quisque sed augue tincidunt, posuere dui tempor.

Ready to get started?

Get in touch, or create an account