The Practice of Cloud System Administration: Designing and Operating Large Distributed Systems, Volume 2 (Paperback)

Thomas A. Limoncelli, Strata R. Chalup, Christina J. Hogan

  • Publisher: Addison Wesley
  • Publication date: 2014-09-03
  • List price: $1,980
  • VIP price: $1,881 (5% off)
  • Language: English
  • Pages: 560
  • Binding: Paperback
  • ISBN: 032194318X
  • ISBN-13: 9780321943187
  • Ships immediately (fewer than 3 in stock)



“There’s an incredible amount of depth and thinking in the practices described here, and it’s impressive to see it all in one place.”

–Win Treese, coauthor of Designing Systems for Internet Commerce


The Practice of Cloud System Administration, Volume 2, focuses on “distributed” or “cloud” computing and brings a DevOps/SRE sensibility to the practice of system administration. Unsatisfied with books that cover either design or operations in isolation, the authors created this authoritative reference centered on a comprehensive approach.


Case studies and examples from Google, Etsy, Twitter, Facebook, Netflix, Amazon, and other industry giants are explained in practical ways that are useful to all enterprises. The new companion to the best-selling first volume, The Practice of System and Network Administration, Second Edition, this guide offers expert coverage of the following and many other crucial topics:


Designing and building modern web and distributed systems

  • Fundamentals of large system design
  • Understand the new software engineering implications of cloud administration
  • Make systems that are resilient to failure and grow and scale dynamically
  • Implement DevOps principles and cultural changes
  • IaaS/PaaS/SaaS and virtual platform selection

Operating and running systems using the latest DevOps/SRE strategies

  • Upgrade production systems with zero down-time
  • What and how to automate; how to decide what not to automate
  • On-call best practices that improve uptime
  • Why distributed systems require fundamentally different system administration techniques
  • Identify and resolve resiliency problems before they surprise you

Assessing and evaluating your team’s operational effectiveness

  • Manage the scientific process of continuous improvement
  • A forty-page, pain-free assessment system you can start using today


