Operating Systems for Supercomputers and High Performance Computing

Gerofi, Balazs; Ishikawa, Yutaka; Riesen, Rolf

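  • Publisher: Springer
  • Publication date: 2019-10-28
  • Price: $5,360
  • Member price: $5,092 (5% off)
  • Language: English
  • Pages: 400
  • Binding: Hardcover - also called cloth, retail trade, or trade
  • ISBN: 9811366233
  • ISBN-13: 9789811366239
  • Overseas special-order title (must be checked out separately)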

Product description

Few works are as timely and critical to the advancement of high performance computing as this new, up-to-date treatise on leading-edge directions in operating systems. It is a first-hand product of many of the leaders in this rapidly evolving field and possibly the most comprehensive.

This new and important book masterfully presents the major alternative concepts driving the future of operating system design for high performance computing. In particular, it describes the major advances of monolithic operating systems such as Linux and Unix that dominate the TOP500 list. It also presents the state of the art in lightweight kernels that exhibit high efficiency and scalability at the cost of generality. Finally, this work looks forward to possibly the most promising strategy: a hybrid structure combining full service functionality with lightweight kernel operation. With this, it is likely that this new work will find its way onto the shelves of almost everyone who is in any way engaged in the multi-discipline of high performance computing.

(From the foreword by Thomas Sterling)


About the authors

Dr. Balazs Gerofi is a research scientist at the RIKEN Center for Computational Science, where he is involved with system software research and development for high performance computing. He actively participates in the design and development of the Post-K supercomputer, Japan's next-generation flagship supercomputer after the K Computer. Balazs earned his M.Sc. and Ph.D. degrees in computer science from the Vrije Universiteit Amsterdam and The University of Tokyo, respectively. His research interests cover operating systems, high performance computing, cloud computing, and fault-tolerant computing. Balazs is a member of the IEEE Computer Society and the Association for Computing Machinery (ACM).

Dr. Yutaka Ishikawa is the leader of the Post-K computer development project at the RIKEN Center for Computational Science, Japan, which aims to deploy the next Japanese flagship supercomputer around 2021. Ishikawa received his Ph.D. degree in electrical engineering from Keio University. From 1987 to 2001, he was a member of AIST (the former Electrotechnical Laboratory). From 1993 to 2001, he was the chief of the Parallel and Distributed System Software Laboratory at the Real World Computing Partnership. He led the development of the cluster system software called SCore, which was used in several large PC cluster systems around 2004. He was an associate professor at The University of Tokyo from 2002 to 2006 and a professor there from 2006 to 2014. From 2006 to 2008, he was a co-leader of the project to design a commodity-based supercomputer called the T2K open supercomputer. As a result, three universities, Tsukuba, Tokyo, and Kyoto, obtained their respective supercomputers based on those specifications. From 2010 to 2014, he was also the director of the Information Technology Center at The University of Tokyo. He led the design and implementation of HPCI, the High Performance Computing Infrastructure in Japan, from 2010 to 2012.

Dr. Rolf Riesen is the lead software architect for the multi-operating system (mOS) project at Intel Corp. The mOS team is creating an OS for use in supercomputers and other high-end HPC systems. Rolf has 25 years of experience in researching, developing, and deploying software for massively parallel processors. His career began as a key member of the Sandia National Laboratory and University of New Mexico team that created the lightweight kernel and the Portals message passing interface that broke the teraflops barrier in 1997 with the Intel-powered ASCI Red supercomputer. Over the years, Rolf's code and research ideas have directly contributed to specific systems on the TOP500 list, stretching over a period of almost 20 years, from SUNMOS on an nCUBE 2 to the Catamount OS on the Cray/Sandia Red Storm system. After teaching for 2 years at the University of New Mexico, he joined IBM research in Dublin, Ireland, where he focused on simulation and fault tolerance for extreme scale systems. Now, at Intel, he is using his expertise to guide a team that combines a lightweight OS kernel with Linux. Rolf has over 50 peer-reviewed publications and is an active member of various program committees. He is also a subject area editor for the journal Parallel Computing.

Dr. Robert W. Wisniewski is an ACM Distinguished Scientist, the chief software architect for Extreme Scale Computing, and a senior principal engineer at the Intel Corporation. He is the lead architect for Intel's cohesive and comprehensive software stack that leverages OpenHPC and is responsible for the software for Aurora, the world's largest announced supercomputer. He has published over 74 papers in the area of high performance computing, computer systems, and system performance, filed over 56 patents, and given over 53 external invited presentations. Before coming to Intel, he was the chief software architect for Blue Gene Research and manager of the Blue Gene and Exascale Research Software Team at the IBM T.J. Watson Research Facility. There, he was an IBM master inventor and led the software effort on Blue Gene/Q, which was the fastest machine in the world on the June 2012 TOP500 list and occupied 4 of the top 10 positions.
