Hands-on Site Reliability Engineering: Build Capability to Design, Deploy, Monitor, and Sustain Enterprise Software Systems at Scale

  • Length: 236 pages
  • Edition: 1
  • Publisher:
  • Publication Date: 2021-07-06
  • ISBN-10: 9391030327
  • ISBN-13: 9789391030322
  • Sales Rank: #1311648
Description

A comprehensive guide with basic to advanced SRE practices and hands-on examples.

Key Features

  • Demonstrates how to practice site reliability engineering and explains its fundamental concepts.
  • Illustrates real-world examples and successful techniques to put SRE into production.
  • Introduces you to DevOps, advanced techniques of SRE, and popular tools in use.

Description

Hands-on Site Reliability Engineering (SRE) is a guide to learning and practicing the activities that keep enterprise systems running smoothly, from designing and deploying enterprise software to operating it efficiently and reliably at scale.

The book explores the fundamentals of SRE and the related terms, concepts, and techniques used by SRE teams and experts. It discusses the essential elements of an IT system, including microservices, application architectures, types of software deployment, and concepts such as load balancing. It explains the best techniques for delivering timely software releases using containerization and CI/CD pipelines. It also covers how to track and monitor application performance using Grafana, Prometheus, and Kibana, and how to extend monitoring by building full-stack observability into the system.

The book also covers chaos engineering, types of system failures, design for high availability, DevSecOps, and AIOps.

What you will learn

  • Learn the best techniques and practices for building and running reliable software.
  • Explore observability and popular methods for effective monitoring of applications.
  • Work with SLIs, SLOs, error budgets, and error budget policies to manage failures.

Who this book is for

This book caters to experienced IT professionals, application developers, software engineers, and all those who are looking to develop SRE capabilities at the individual or team level.

About the Authors

Shamayel M. Farooqui is a technology leader who specializes in driving digital transformation for organizations and is the author of ‘Enterprise DevOps Framework – Transforming IT Operations’.
He has expertise in implementing IT security, cloud migrations, and IT automation, and a proven track record of building teams of skilled site reliability engineers focused on delivering solutions for optimizing and running hybrid, multi-cloud environments.

Blog links: http://www.shamayelfarooqui.com, https://www.xfgeek.com/home

LinkedIn Profile: https://www.linkedin.com/in/shamayel/

Vishnu Vardhan Chikoti has diverse experience in application and database design and development, microservices and micro-frontends, DevOps, site reliability engineering, and machine learning.
With the ability to conduct deep analysis, strong execution skills, and an innovative mindset, he has successfully led R&D teams to build engineering solutions that improve the reliability of applications. He is also an expert in building high-volume transaction processing applications for the middle- and back-office functions of investment banks using a variety of architectures.

LinkedIn Profile: https://www.linkedin.com/in/vishnu-vardhan-chikoti-3763262/
