Site Reliability Engineering Practitioner (SREP)


Course Number: DVOP-139

Duration: 3 days (19.5 hours)

Format: Live, hands-on

Site Reliability Engineering Practitioner (SREP) Training Overview

This Site Reliability Practitioner (SRE)sm Practitioner certification training teaches attendees strategies to improve agility, cross-functional collaboration, and transparency of health of services. Attendees learn resiliency by design, automation, and closed-loop remediations. In addition, this course positions learners to successfully complete the DevOps Institute’s SRE Foundation certification exam.

Note: This course includes an exam voucher for each paid participant for the SRE Practitioner exam.

Location and Pricing

This course is taught as a private course in-person or online for teams of 3 or more. To receive a quote for online corporate training, please contact us.

Objectives

  • Understand how to successfully implement a flourishing SRE culture in your organization
  • Understand the underlying principles of SRE and what it is not in terms of anti-patterns, and how you become aware of them to avoid them
  • Understand the organizational impact of introducing SRE
  • Master the art of SLIs (Service Level Indicators) and SLOs (Service Level Objectives) in a distributed ecosystem
  • Extend the usage of Error Budgets beyond the norm to innovate and avoid risks
  • Build security and resilience by design in a distributed, zero-trust environment
  • Implement full-stack observability, distributed tracing, and bring about an observability-driven development culture
  • Curate data using AI to move from reactive to proactive and predictive incident management
  • Use DataOps to build clean data lineage
  • Understand why Platform Engineering is so important in building consistency and predictability ofSRE culture
  • Implement practical Chaos Engineering
  • Understand the major incident response responsibilities for SRE based on incident command framework and the anatomy of unmanaged incidents
  • Understand why SRE can be considered as the purest implementation of DevOps
  • Work with the SRE Execution model
  • Understand the SRE role and why reliability is everyone’s problem
  • Explore SRE success stories

Prerequisites

All SRE Practitioner certification training students should have taken Accelebrate's Site Reliability Engineering (SRE)sm Foundation certification course.

Outline

Expand All | Collapse All

SRE Anti-patterns
  • Rebranding Ops or DevOps or Dev as SRE
  • Users notice an issue before you do
  • Measuring until my edge
  • False positives are worse than no alerts
  • Configuration management trap for snowflakes
  • The Dogpile: mob incident response
  • Point fixing
  • Production readiness gatekeeper
  • Fail-Safe really?
SLO is a Proxy for Customer Happiness
  • Define SLIs
  • Defining system boundaries
  • Use error budgets
  • Reliability is only as good as the weakest link on your service graph
  • Error thresholds when 3rd party services are used
Building Secure and Reliable Systems
  • SRE and their role in building secure and reliable systems
  • Design for changing architecture
  • Fault-tolerant design
  • Design for security
  • Design for resiliency
  • Design for scalability
  • Design for performance
  • Design for reliability
  • Ensuring data security and privacy
Full-Stack Observability
  • Modern apps are complex and unpredictable
  • Slow is the new down
  • Pillars of observability
  • Synthetic and end-user monitoring
  • Observability driven development
  • Distributed tracing
  • What happens to monitoring?
  • Instrumenting using libraries and agents
Platform Engineering and AIOPs
  • Taking a platform-centric view
  • How do you use AIOps to improve resiliency?
  • How can DataOps help you in the journey?
  • A simple recipe to implement AIOps
  • Indicative measurement of AIOps
SRE & Incident Response Management
  • SRE key responsibilities towards incident response
  • DevOps & SRE and ITIL
  • OODA and SRE incident response
  • Closed loop remediation and the advantages
  • Swarming - food for thought
  • AI/ML for better incident management
Chaos Engineering
  • Navigating complexity
  • Chaos engineering defined
  • Quick facts about chaos engineering
  • Chaos monkey origin story
  • Who is adopting chaos engineering
  • Myths of chaos
  • Chaos engineering experiments
  • GameDay exercises
  • Security chaos engineering
  • Chaos engineering resources
SRE is the Purest form of DevOps
  • Key Principles of SRE
  • SREs to help increase reliability across the product spectrum
  • Metrics for success
  • Selection of target areas
  • SRE execution model
  • Culture and behavioral skills are key
  • SRE case study
Conclusion

Training Materials:

  • Learner Manual (excellent post-class reference)
  • Participation in exercises and discussions designed to apply concepts
  • Case stories
  • Access to additional sources of information and communities
  • An exam voucher for each attendee

Software Requirements:

Attendees will not need to install any software on their computer for this class. The class will be conducted in a remote environment that Accelebrate will provide; students will only need a local computer with a web browser and a stable Internet connection. Any recent version of Microsoft Edge, Mozilla Firefox, or Google Chrome will be fine.



Learn faster

Our live, instructor-led lectures are far more effective than pre-recorded classes

Satisfaction guarantee

If your team is not 100% satisfied with your training, we do what's necessary to make it right

Learn online from anywhere

Whether you are at home or in the office, we make learning interactive and engaging

Multiple Payment Options

We accept check, ACH/EFT, major credit cards, and most purchase orders



Recent Training Locations

Alabama

Birmingham

Huntsville

Montgomery

Alaska

Anchorage

Arizona

Phoenix

Tucson

Arkansas

Fayetteville

Little Rock

California

Los Angeles

Oakland

Orange County

Sacramento

San Diego

San Francisco

San Jose

Colorado

Boulder

Colorado Springs

Denver

Connecticut

Hartford

DC

Washington

Florida

Fort Lauderdale

Jacksonville

Miami

Orlando

Tampa

Georgia

Atlanta

Augusta

Savannah

Hawaii

Honolulu

Idaho

Boise

Illinois

Chicago

Indiana

Indianapolis

Iowa

Cedar Rapids

Des Moines

Kansas

Wichita

Kentucky

Lexington

Louisville

Louisiana

New Orleans

Maine

Portland

Maryland

Annapolis

Baltimore

Frederick

Hagerstown

Massachusetts

Boston

Cambridge

Springfield

Michigan

Ann Arbor

Detroit

Grand Rapids

Minnesota

Minneapolis

Saint Paul

Mississippi

Jackson

Missouri

Kansas City

St. Louis

Nebraska

Lincoln

Omaha

Nevada

Las Vegas

Reno

New Jersey

Princeton

New Mexico

Albuquerque

New York

Albany

Buffalo

New York City

White Plains

North Carolina

Charlotte

Durham

Raleigh

Ohio

Akron

Canton

Cincinnati

Cleveland

Columbus

Dayton

Oklahoma

Oklahoma City

Tulsa

Oregon

Portland

Pennsylvania

Philadelphia

Pittsburgh

Rhode Island

Providence

South Carolina

Charleston

Columbia

Greenville

Tennessee

Knoxville

Memphis

Nashville

Texas

Austin

Dallas

El Paso

Houston

San Antonio

Utah

Salt Lake City

Virginia

Alexandria

Arlington

Norfolk

Richmond

Washington

Seattle

Tacoma

West Virginia

Charleston

Wisconsin

Madison

Milwaukee

Alberta

Calgary

Edmonton

British Columbia

Vancouver

Manitoba

Winnipeg

Nova Scotia

Halifax

Ontario

Ottawa

Toronto

Quebec

Montreal

Puerto Rico

San Juan