DevOps Institute Blog

Leverage SRE to Build a Culture of Reliability, Resiliency and Risk Management
SRE propagates a culture of building and operating reliable, ...

The Practice of Chaos Engineering Observability
Today’s modern distributed systems are associated with many ...

Site Reliability Engineering Key Concepts: SLO, Error Budget, TOIL and Observability
By: Niladri Choudhuri
“What happens when a software engineer ...

SRE Is Fueling the Journey Towards Digital Reinvention: Are you Ready To Embrace it?
Learn how SRE is helping the day to day responsibilities of IT ...

Why You Should Bring Chaos Engineering to Your Legacy CI/CD Pipeline
In an increasingly distributed world we often ask ourselves if ...

Choosing the Right Service Level Indicators
A well-known quote from Google Site Reliability Engineering handbook ...

An Interview with a Value Stream Architect
We took the opportunity to ask Bryan Finster, Value Stream Architect ...

The Role of Bots in DevOps
DevOps has evolved from its infancy into a mainstream focus area for ...



