Supercomputers for Linux Sysadmins: Managing Modern HPC Clusters and Supercomputers from Software to Hardware

Zhumatiy, Sergey

  • Publisher: Apress
  • Publication date: 2025-10-24
  • List price: $1,870
  • VIP price: $1,777 (5% off)
  • Language: English
  • Pages: 470
  • Binding: Quality Paper - also called trade paper
  • ISBN: 9798868815997
  • ISBN-13: 9798868815997
  • Related categories: Linux
  • Overseas special-order book (requires separate checkout)

Product Description

Supercomputers and High Performance Computing (HPC) clusters are not as exotic these days as people imagine. They give companies computing power that no single server can provide. They make possible new drug and materials discoveries, universe modeling, AI training, crash simulations, and market research. Building or renting an HPC cluster is not difficult either, since cloud providers can supply the resources to build one that is cheap and performant enough for your own use. If you are, or want to become, an HPC cluster sysadmin or manager, this book is for you.

Supercomputers for Linux Sysadmins delves into the world of modern HPC cluster architecture, hardware, software, and resource management using a Linux/UNIX-based approach. The number of HPC clusters is growing, with an estimated 30 billion by 2030, but there are not enough sysadmins to run and manage them. This book aims to bridge that gap and help more sysadmins and managers transition into the exciting world of HPC.

This book helps those with strong foundational knowledge of Linux work with supercomputers and HPC clusters. We start with the basic principles of supercomputer management, the fundamentals of Linux and UNIX, shell scripting, and systemd, as well as other open source tools and frameworks, and take you through the security, monitoring, and hardware requirements for supercomputers and HPC clusters.

You Will Learn:

  • How to plan new supercomputers
  • The main principles and technologies used in supercomputers and HPC clusters
  • How to set up the software environments on new supercomputers
  • How to set up resource and job management for supercomputers and HPC clusters
  • How to manage accounts, resource sharing, and more

Who This Book Is For:

The main audience for this book is regular UNIX/Linux sysadmins and managers who need to work with HPC clusters on-premises or in the cloud, as well as anyone interested in supercomputers and HPC clusters and in how to use them in their projects and teams.

About the Author

Sergey Zhumatiy has been managing supercomputers since 1999, when he started building and managing HPC clusters at Moscow State University. He holds a PhD in computer science. Several supercomputers under his supervision, such as Chebyshev, Lomonosov, and Lomonosov-2, achieved top rankings in the TOP500 supercomputer list and dominated the Russian Top50 supercomputer list. He now works as an HPC Architect and SysAdmin at NVIDIA.
