Operating Systems for Supercomputers and High Performance Computing

Gerofi, Balazs, Ishikawa, Yutaka, Riesen, Rolf

  • 出版商: Springer
  • 出版日期: 2020-11-27
  • 售價: $5,610
  • 貴賓價: 9.5$5,330
  • 語言: 英文
  • 頁數: 400
  • 裝訂: Quality Paper - also called trade paper
  • ISBN: 9811366268
  • ISBN-13: 9789811366260
  • 海外代購書籍(需單獨結帳)


Few works are as timely and critical to the advancement of high performance computing than is this new up-to-date treatise on leading-edge directions of operating systems. It is a first-hand product of many of the leaders in this rapidly evolving field and possibly the most comprehensive.

This new and important book masterfully presents the major alternative concepts driving the future of operating system design for high performance computing. In particular, it describes the major advances of monolithic operating systems such as Linux and Unix that dominate the TOP500 list. It also presents the state of the art in lightweight kernels that exhibit high efficiency and scalability at the loss of generality. Finally, this work looks forward to possibly the most promising strategy of a hybrid structure combining full service functionality with lightweight kernel operation. With this, it is likely that this new work will find its way on the shelves of almost everyone who is in any way engaged in the multi-discipline of high performance computing.

(From the foreword by Thomas Sterling)


Dr. Balazs Gerofi is a research scientist at the RIKEN Center for Computational Science, where he is involved with system software research and development for high performance computing. He actively participates in the design and development of the Post K supercomputer, Japan's next-generation flagship supercomputer after the K Computer. Balazs earned his M.Sc. degree and Ph.D. degree in computer science from the Vrije Universiteit Amsterdam and The University of Tokyo, respectively. His research interest covers operating systems, high performance computing, cloud computing, and fault-tolerant computing. Balazs is a member of the IEEE Computer Society and the Association for Computing Machinery (ACM).

Dr. Yutaka Ishikawa is the leader of the Post-K computer development project that aims at deploying the next Japanese flagship supercomputer around 2021, at the RIKEN Center for Computational Science, Japan. Ishikawa received his Ph.D. degree in electrical engineering from Keio University. From 1987 to 2001, he was a member of AIST (the former Electrotechnical Laboratory). From 1993 to 2001, he was the chief of the Parallel and Distributed System Software Laboratory at the Real World Computing Partnership. He led the development of the cluster system software called SCore, which was used in several large PC cluster systems around 2004. From 2002 to 2006 and from 2006 to 2014, he was an associate professor and a professor at The University Tokyo, respectively. From 2006 to 2008, he was a project co-leader to design a commodity-based supercomputer called T2K open supercomputer. As a result, three universities, Tsukuba, Tokyo, and Kyoto, obtained their respective supercomputers based on those specifications. From 2010 to 2014, he was also the director of the Information Technology Center at The University of Tokyo. He led the design and implementation of HPCI, High Performance Computing Infrastructure in Japan, from 2010 to 2012.

Dr. Rolf Riesen is the lead software architect for the multi-operating system (mOS) project at the Intel Corp. The mOS team is creating an OS for use in supercomputers and other high-end HPC systems. Rolf has 25 years of experience in researching, developing, and deploying software for massively parallel processors. His career began as a key member of the Sandia National Laboratory and University of New Mexico team that created the lightweight kernel and the Portals message passing interface that broke the teraflops barrier in 1997 with the Intel-powered ASCI Red supercomputer. Over the years, Rolf's code and research ideas have directly contributed to specific systems on the TOP500 list, stretching over a period of almost 20 years. It began with SUNMOS on an nCUBE 2 to the Catamount OS on the Cray/Sandia Red Storm system. After teaching for 2 years at the University of New Mexico, he joined IBM research in Dublin, Ireland, where he focused on simulation and fault tolerance for extreme scale systems Now, at Intel, he is using his expertise to guide a team that combines a lightweight OS kernel with Linux. Rolf has over 50 peer-reviewed publications and is an active member of various program committees. He is also a subject area editor for the journal Parallel Computing.

Dr. Robert W. Wisniewski is an ACM Distinguished Scientist and the chief software architect for Extreme Scale Computing and a senior principal engineer at the Intel Corporation. He is the lead architect for Intel's cohesive and comprehensive software stack that leverages OpenHPC and is responsible for the software for Aurora, the world's largest announced supercomputer. He has published over 74 papers in the area of high performance computing, computer systems, and system performance, filed over 56 patents, and given over 53 external invited presentations. Before coming to Intel, he was the chief software architect for Blue Gene Research and manager of the Blue Gene and Exascale Research Software Team at the IBM T.J. Watson Research Facility. There, he was an IBM master inventor and led the software effort on Blue Gene/Q, the fastest machine in the world on the June 2012 TOP500 list, and occupied 4 of the top 10 positions.