Site Reliability Engineering Audiobook By Betsy Beyer, Chris Jones, Jennifer Petoff, Niall Richard Murphy cover art

Site Reliability Engineering

How Google Runs Production Systems

Preview

Try for $0.00
Prime logo Prime members: New to Audible?
Get 2 free audiobooks during trial.
Pick 1 audiobook a month from our unmatched collection.
Listen all you want to thousands of included audiobooks, Originals, and podcasts.
Access exclusive sales and deals.
Premium Plus auto-renews for $14.95/mo after 30 days. Cancel anytime.

Site Reliability Engineering

By: Betsy Beyer, Chris Jones, Jennifer Petoff, Niall Richard Murphy
Narrated by: Liz Porter
Try for $0.00

$14.95/month after 30 days. Cancel anytime.

Buy for $30.09

Buy for $30.09

Confirm purchase
Pay using card ending in
By confirming your purchase, you agree to Audible's Conditions of Use and Amazon's Privacy Notice. Taxes where applicable.
Cancel

About this listen

Newly adapted for audiobook listeners.

The overwhelming majority of a software system’s lifespan is spent in use, not in design or implementation. So, why does conventional wisdom insist that software engineers focus primarily on the design and development of large scale computing systems?

In this collection of essays and articles, key members of Google’s Site Reliability Team explain how and why their commitment to the entire lifecycle has enabled the company to successfully build, deploy, monitor, and maintain some of the largest software systems in the world. You’ll learn the principles and practices that enable Google engineers to make systems more scalable, reliable, and efficient - lessons directly applicable to your organization.

This book is divided into four sections:

  • Introduction - Learn what site reliability engineering is and why it differs from conventional IT industry practices
  • Principles - Examine the patterns, behaviors, and areas of concern that influence the work of a site reliability engineer (SRE)
  • Practices - Understand the theory and practice of an SRE’s day to day work: building and operating large distributed computing systems
  • Management - Explore Google's best practices for training, communication, and meetings that your organization can use
Download the accompanying reference guide.©2016 Google, Inc. (P)2021 Upfront Books
Management Business Software Software Development Programming Thought-Provoking
activate_Holiday_promo_in_buybox_DT_T2

What listeners say about Site Reliability Engineering

Average customer ratings
Overall
  • 4.5 out of 5 stars
  • 5 Stars
    55
  • 4 Stars
    19
  • 3 Stars
    6
  • 2 Stars
    2
  • 1 Stars
    1
Performance
  • 4.5 out of 5 stars
  • 5 Stars
    44
  • 4 Stars
    13
  • 3 Stars
    2
  • 2 Stars
    0
  • 1 Stars
    6
Story
  • 4.5 out of 5 stars
  • 5 Stars
    43
  • 4 Stars
    13
  • 3 Stars
    5
  • 2 Stars
    1
  • 1 Stars
    1

Reviews - Please select the tabs below to change the source of reviews.

Sort by:
Filter by:
  • Overall
    5 out of 5 stars
  • Performance
    4 out of 5 stars
  • Story
    5 out of 5 stars

Excellent perspective and methodology overview for operating complex technical environments

Obviously, the book is a classic. It works well in audio format, although the narrator has a bit of an attitude in her inflection that is easier to take in small dose, such as a chapter or a drive at a time. All in all, great content.

Something went wrong. Please try again in a few minutes.

You voted on this review!

You reported this review!

  • Overall
    4 out of 5 stars
  • Performance
    5 out of 5 stars
  • Story
    4 out of 5 stars

A great resource and a decent book.

The important thing to note first is that this is a book about SRE practices at Google, not about SRE in general. As long as you approach it with the right expectations, you should be fine. It contains a lot of valuable insights into various aspects of building software products, such as toil, monitoring, deployments, releases, and more. However, the book is very uneven, and the usefulness and generality of the chapters differ a lot. Some chapters are very specific to Google and can only serve as inspiration, while others are general and can be easily applied to other companies. It can be helpful to anyone involved in building software products to develop an understanding of reliability, but don't expect to gain much directly applicable knowledge from it.

Something went wrong. Please try again in a few minutes.

You voted on this review!

You reported this review!

  • Overall
    5 out of 5 stars
  • Performance
    5 out of 5 stars
  • Story
    4 out of 5 stars

Treasure trove of knowledge but way too long

This book is a must for anyone running a SRE team. Google has done IT departments around the world a favor by sharing how its SRE teams are formed and run. However the book is very long and could easily be broken into three different books focusing on the SRE culture, technology and management. The beginning chapters of the book are the easiest to appreciate and apply at work, while the later parts like distributed computing will be different to comprehend unless you have worked with the technology.

Something went wrong. Please try again in a few minutes.

You voted on this review!

You reported this review!

  • Overall
    3 out of 5 stars
  • Performance
    1 out of 5 stars
  • Story
    5 out of 5 stars

Great book, bad narrator

The book content is really high quality but the narrator is absolutely nauseating. I would strongly recommend reading the text instead if at all possible.

Something went wrong. Please try again in a few minutes.

You voted on this review!

You reported this review!

  • Overall
    2 out of 5 stars
  • Performance
    1 out of 5 stars
  • Story
    2 out of 5 stars

Google propaganda

I didn't like it.
Google has a lot of special Google-sauce to make their mono-repo work for them. And sorta assumes everyone has the special Google-sauce.
Therefore, I don't consider most of the organizational advice applicable without modification.

An SRE is really just an Ops person that can program and is encouraged to solve their problems with code and automation.

They first make on-call seem daunting, then says it's a privilege new hires has to earn.

They spend a whole chapter on cron jobs. And make it seem like something magic Google invented....

The narrator is really robotic, maybe the voice of Google translate? It also sounds weirdly nervous at times.

Something went wrong. Please try again in a few minutes.

You voted on this review!

You reported this review!

6 people found this helpful