Mastering Site Reliability Engineering in Enterprise: A Complete Guide to Resilient Systems & Chaos Engineering
暫譯: 企業級網站可靠性工程精通:韌性系統與混沌工程完整指南
Hoeppner, Florian, Sbaraglia, Francesco
商品描述
- Understand the key terms and history of SRE and its guiding principles Get insights into the SRE role and its evolution Overcome the challenges in adopting SRE at any level of the organisation Identify site reliability building blocks maturity readiness to improve digital resilience
商品描述(中文翻譯)
透過採用網站可靠性工程(SRE)實踐來轉變企業IT,減少停機時間、建立韌性並推動商業價值。本書是一本全面的指南,旨在幫助網站可靠性工程師、DevOps 團隊和平台工程師在系統出現重大故障之前識別、解決和減輕系統弱點。
作者 Francesco Sbaraglia 和 Florian Hoeppner 強調了IT從成本中心轉變為核心業務功能的範式轉變,強調開發人員的核心角色以及對速度和可靠性的需求。他們詳細說明了轉型為SRE所面臨的挑戰,包括克服文化抵抗和舊有基礎設施的限制,同時突顯在系統和流程中建立韌性的重要性。具體的SRE能力,如混沌工程、可觀察性和繁瑣工作管理,將被探討,並提供成功實施的策略,包括建立卓越中心、選擇合適的工具以及培養協作和持續改進的文化。
展望未來,本書探討了新興趨勢,如代理式AI SRE代理、在SRE中使用生成式AI(GenAI)以及混沌工程的未來演變。您將學習如何將SRE實踐嵌入現有的企業技術運營模型中,並解鎖可衡量的商業成果:減少停機時間、提高韌性和穩定性方面的可量化增益。此外,還將發現GenAI如何支持SRE團隊規劃、執行和優化可靠性實驗,以及自動化繁瑣工作減少和持續改進的努力。
在本書結束時,您將知道如何應用核心SRE實踐來加強可靠性:建立由SRE主導的混沌工程實踐、舉辦以可靠性為重點的「遊戲日」、改善可觀察性、排除故障場景,並加強系統和團隊的數位韌性。
您將學到的內容:
- 理解SRE的關鍵術語和歷史及其指導原則
- 獲得對SRE角色及其演變的見解
- 克服在組織任何層級採用SRE的挑戰
- 確定網站可靠性構建塊的成熟度準備,以改善數位韌性
本書適合對象:
渴望設計、規劃和實施企業系統韌性的專業人士、架構師、工程師和實踐者,並使用經過驗證的SRE實踐。
作者簡介
Francesco Sbaraglia is a distinguished Site Reliability Engineer (SRE) and a recognised expert in the field of Chaos Engineering and DevOps. With an extensive career spanning over two decades, Francesco has garnered a wealth of hands-on experience as a practitioner and innovator, establishing a profound mastery of cutting-edge AIOPS technologies and methodologies.
In addition to his technical prowess, Francesco has distinguished himself as an accomplished author, contributing numerous insightful tech articles and authoritative books across a spectrum of subjects surrounding SRE, Chaos Engineering, operations, and DevOps. Francesco is also an author and public speaker, sharing his insights and best practices in SRE, observability, and chaos engineering at renowned industry conferences, such as SRECon21 and DevOpsCon. He is passionate about combining systems engineering principles with observability tools to ensure seamless operations and improve software engineering practices.
Florian Hoeppner is a seasoned professional technology strategist and advisor for tech operating models. He is an Enterprise Site Reliability Engineer subject-matter -expert and DevOps expert with a deep understanding of tech operating model transformations. Florian is passionate about tech strategy, combined build-run teams, and optimising tech operations, and he has spoken and published extensively on these topics. He created a professional global community in his organisation with more than 500 members, constantly sharing and evaluating the latest around these critical topics. He is also the creator of the EngineeringOps radar, a yearly publication showing tech engineering and operational capabilities. He holds a degree in Media Information Systems and a Master of Science in Digital Media. Florian currently lives in New York and has a blog that offers practical insights into SRE, Chaos Engineering, and DevOps practices and solutions on an enterprise level. He has published the book "Competition as Motivation" with AV Akademikerverlag.
作者簡介(中文翻譯)
Francesco Sbaraglia 是一位傑出的網站可靠性工程師(Site Reliability Engineer, SRE),並且在混沌工程(Chaos Engineering)和DevOps領域中被認可為專家。Francesco擁有超過二十年的豐富職業生涯,積累了大量的實務經驗,作為實踐者和創新者,他對尖端的AIOPS技術和方法論有著深厚的掌握。
除了他的技術專長,Francesco還以優秀的作者身份脫穎而出,為SRE、混沌工程、運營和DevOps等多個主題撰寫了大量深具見解的技術文章和權威書籍。Francesco也是一位作者和公共演講者,曾在知名的行業會議上分享他在SRE、可觀察性(observability)和混沌工程方面的見解和最佳實踐,如SRECon21和DevOpsCon。他熱衷於將系統工程原則與可觀察性工具相結合,以確保無縫的運營並改善軟體工程實踐。
Florian Hoeppner 是一位經驗豐富的專業技術策略師和技術運營模式顧問。他是一位企業網站可靠性工程師(Enterprise Site Reliability Engineer)主題專家和DevOps專家,對技術運營模式轉型有著深刻的理解。Florian熱衷於技術策略、組合建置與運行團隊,以及優化技術運營,並在這些主題上發表了大量演講和出版物。他在其組織中創建了一個擁有超過500名成員的專業全球社群,持續分享和評估這些關鍵主題的最新資訊。他還是EngineeringOps雷達的創建者,這是一份每年發布的刊物,展示技術工程和運營能力。他擁有媒體資訊系統學位和數位媒體碩士學位。Florian目前居住在紐約,並擁有一個博客,提供有關SRE、混沌工程和企業級DevOps實踐及解決方案的實用見解。他與AV Akademikerverlag出版了《Competition as Motivation》一書。