Software Design for Resilient Computer Systems
暫譯: 韌性電腦系統的軟體設計

Schagaev, Igor, Gutknecht, Jürg

  • 出版商: Springer
  • 出版日期: 2025-07-16
  • 售價: $4,000
  • 貴賓價: 9.5$3,800
  • 語言: 英文
  • 頁數: 405
  • 裝訂: Quality Paper - also called trade paper
  • ISBN: 3031551419
  • ISBN-13: 9783031551413
  • 相關分類: 軟體架構系統開發
  • 海外代購書籍(需單獨結帳)

相關主題

商品描述

This book addresses the question of how system software should be designed to account for faults, and which fault tolerance features should provide for highest reliability. With this third edition of Software Design for Resilient Computer Systems, the book is thoroughly updated to contain the newest advice regarding software resilience. With a new introductory chapter, the new edition is ideal for researchers and industry professionals.


In the book, the authors first show how system software interacts with the hardware to tolerate faults. They analyze and further develop the theory of fault tolerance to understand the diverse ways to increase the reliability of a system, with special attention on the role of system software in this process. They introduce the theory of redundancy and its use for construction of a subsystem through generalised algorithm of fault tolerance (GAFT) and apply it to distributed systems. The book's approach is applied to various hardware subsystems: different structures of RAM and processor cores and demonstrates exceptional performance reliability and energy efficiency. This third edition devotes substantial attention to system software for modern computers, including run time systems, supporting algorithms of recovery and their analysis, language aspects and ways to improve reconfigurable and parallel computing.


Due to the wide-reaching nature of the content, this book applies to a host of industries and research areas, including military, aviation, intensive health care, industrial control, and space exploration.

商品描述(中文翻譯)

這本書探討了系統軟體應如何設計以考慮故障,以及哪些容錯特性應提供最高的可靠性。隨著《韌性計算機系統的軟體設計》第三版的出版,本書已全面更新,包含有關軟體韌性的最新建議。新版本增加了一個介紹章節,非常適合研究人員和業界專業人士。

在書中,作者首先展示了系統軟體如何與硬體互動以容忍故障。他們分析並進一步發展容錯理論,以理解提高系統可靠性的多種方式,特別關注系統軟體在此過程中的角色。他們介紹了冗餘理論及其在通過一般化容錯演算法(GAFT)構建子系統中的應用,並將其應用於分散式系統。該書的方法應用於各種硬體子系統:不同結構的RAM和處理器核心,並展示了卓越的性能可靠性和能源效率。這第三版特別關注現代計算機的系統軟體,包括執行時系統、支援的恢復演算法及其分析、語言方面以及改善可重構和並行計算的方法。

由於內容的廣泛性,本書適用於多個行業和研究領域,包括軍事、航空、重症醫療、工業控制和太空探索。