Book Content
chapters • 14h total length
1. SRE Job Role – Activities and Responsibilities
2. Fundamental Numbers – Reliability Statistics
3. Imperfect Habits – Duct Tape Architecture and Spaghetti Code
4. Essential Observability – Metrics, Events, Logs, and Traces (MELT)
5. Resolution Path – Master Troubleshooting
6. Operational Framework – Managing Infrastructure and Systems
7. Data Consumed – Observability Data Science
8. Reliable Architecture – Systems Strategy and Design
9. Valued Automation – Toil Discovery and Elimination
10. Exposing Pipelines – GitOps and Testing Essentials
11. Worker Bees – Orchestrations of Serverless, Containers, and Kubernetes
12. Final Exam – Tests and Capacity Planning
13. First Thing – Runbooks and Low Noise Outage Notifications
14. Rapid Response – Outage Management Techniques
15. Postmortem Candor – Long-Term Resolution
16. Chaos Injector – Advanced Systems Stability
17. Interview Advice – Hiring and Being Hired
18. Appendix A The Site Reliability Engineer Manifesto
19. Appendix B The 12-Factor App Questionnaire














