Share
Fr. 69.00
Sergey Zhumatiy
Supercomputers for Linux SysAdmins - Managing Modern HPC Clusters and Supercomputers from Software to Hardware
English · Paperback / Softback
Will be released 10.11.2025
Description
Supercomputers and High Performance Computing (HPC) clusters are not so exotic as people imagine these days. They give companies the power of computation like no one server can give alone. They make new drugs and materials discoveries, universe modeling and AI training, crash simulations and market research possible all thanks to HPC clusters. Building or renting a HPC cluster is not so difficult either as cloud providers can give you resources to build one cheap and performative enough to use yourself, so If you are or want to become HPC cluster Sysadmin or manager, this book is for you.
Supercomputers for Linux SysAdmins delves into the world of modern HPC cluster architecture, hardware, software and resources management using a Linux/UNIX based approach. The number of HPC clusters is growing with an estimated 30 billion by 2030 but there are not enough sysadmins to run and manage them, this book serves to bridge this gap to help more Sysadmins and managers to transition into the exiting world of HPCs.
This book helps those with a strong foundational knowledge in Linux, to deal with supercomputers and HPC clusters. we start with the basic principles of supercomputer management, fundamentals of Linux and UNIX, Shell Scripting and systemd and well as other open source tools and frameworks, taking you thorough the security, monitoring and hardware requirements for supercomputers and HPC clusters.
You Will Learn:
- How to plan new supercomputers
- The main principles and technologies used in supercomputers and HPC clusters
- How to set up the software environments on new supercomputers
- To set up supercomputer and HPC cluster resources and jobs management
- To manage accounts, resource sharing and many more.
The main audience of this book are regular UNIX/Linux sysadmins and managers, who should deal with HPC clusters on-prem or in cloud and those who are interested in supercomputers and HPC clusters and how to utilize them in their projects and teams.
List of contents
1: Introduction.- 2: What is "Super" About a Supercomputer?.- 3: How to Build and Start a Supercomputer and HPC Cluster.- Chapter 4: Supercomputer Hardware.- Chapter 5: How a Supercomputer Works.- 6: UNIX and Linux - the Basics.- 7: UNIX and Linux - Working Techniques.- Chapter 8: Network File Systems.- Chapter 9: Remote Management.- Chapter 10: Users - Accounting and Management.- 11: Users - Quotas and Access Rights.- 12: Job Management Systems.- 13: Organizing Remote User Access.- 14: Cluster Status Monitoring Systems.- 15: Backup.- 16: Compilers and Environments- for Parallel Technologies.- 17: Parallel Computing Support Libraries.- 18: Booting and init.- 19: Node Setup, Software Installation.- 20: Out-of-the-Box Stacks and Deployment Systems.- 21: Cluster Management Systems - xCAT and Others.- 22: Communicating with Users.- 23: One-two-three instructions.- 24: Shell Scripts - basics and common mistakes.- 25: Systemd - A Short Course.- 26: Conclusion.
About the author
Sergey Zhumatiy has been managing supercomputers since 1999 starting out building and managing HPC clusters at Moscow State University and holds a PhD in computer science. Several supercomputers under his supervising, like Chebyshev, Lomonosov, Lomonosov-2, achieved top rankings in the top500 supercomputers list, and dominated the Russian top50 supercomputers list. Now he works as an HPC Architect and SysAdmin at NVIDIA.
Summary
Supercomputers and High Performance Computing (HPC) clusters are not so exotic as people imagine these days. They give companies the power of computation like no one server can give alone. They make new drugs and materials discoveries, universe modeling and AI training, crash simulations and market research possible – all thanks to HPC clusters. Building or renting a HPC cluster is not so difficult either as cloud providers can give you resources to build one cheap and performative enough to use yourself, so If you are or want to become HPC cluster Sysadmin or manager, this book is for you.
Supercomputers for Linux SysAdmins delves into the world of modern HPC cluster architecture, hardware, software and resources management using a Linux/UNIX based approach. The number of HPC clusters is growing with an estimated 30 billion by 2030 but there are not enough sysadmins to run and manage them, this book serves to bridge this gap to help more Sysadmins and managers to transition into the exiting world of HPCs.
This book helps those with a strong foundational knowledge in Linux, to deal with supercomputers and HPC clusters. We start with the basic principles of supercomputer management, fundamentals of Linux and UNIX, Shell Scripting and systemd and well as other open source tools and frameworks, taking you thorough the security, monitoring and hardware requirements for supercomputers and HPC clusters.
You Will Learn:
- How to plan new supercomputers
- The main principles and technologies used in supercomputers and HPC clusters
- How to set up the software environments on new supercomputers
- To set up supercomputer and HPC cluster resources and jobs management
- To manage accounts, resource sharing and many more.
The main audience of this book are regular UNIX/Linux sysadmins and managers, who should deal with HPC clusters on-prem or in cloud and those who are interested in supercomputers and HPC clusters and how to utilize them in their projects and teams.
Product details
Authors | Sergey Zhumatiy |
Publisher | Springer, Berlin |
Languages | English |
Product format | Paperback / Softback |
Release | 10.11.2025 |
EAN | 9798868815997 |
ISBN | 9798868815997 |
No. of pages | 450 |
Illustrations | Approx. 450 p. |
Subjects |
Natural sciences, medicine, IT, technology
> IT, data processing
> IT
UNIX, Linux, Hardware, Computerhardware, Open Source, Rechnerarchitektur und Logik-Entwurf, Cloud Computing, Kubernetes, systemd, Nagios, System Management, Computer Engineering and Networks, Processor Architectures, Supercomputers, High Performance Computing, HPC Clusters, SysAdmin, Open MPI, Zabbix |
Customer reviews
No reviews have been written for this item yet. Write the first review and be helpful to other users when they decide on a purchase.
Write a review
Thumbs up or thumbs down? Write your own review.