En savoir plus
Informationen zum Autor Dhabaleswar K. Panda is Professor and University Distinguished Scholar of Computer Science and Engineering at the Ohio State University. Xiaoyi Lu is an Assistant Professor in the Department of Computer Science and Engineering at the University of California, Merced. Dipti Shankar is currently working at SAP, Germany. Klappentext "This book explores how to achieve high performance and scalability for big data middleware and applications"-- Zusammenfassung An in-depth overview of an emerging field that brings together high-performance computing, big data processing, and deep lLearning. Over the last decade, the exponential explosion of data known as big data has changed the way we understand and harness the power of data. The emerging field of high-performance big data computing, which brings together high-performance computing (HPC), big data processing, and deep learning, aims to meet the challenges posed by large-scale data processing. This book offers an in-depth overview of high-performance big data computing and the associated technical issues, approaches, and solutions. The book covers basic concepts and necessary background knowledge, including data processing frameworks, storage systems, and hardware capabilities; offers a detailed discussion of technical issues in accelerating big data computing in terms of computation, communication, memory and storage, codesign, workload characterization and benchmarking, and system deployment and management; and surveys benchmarks and workloads for evaluating big data middleware systems. It presents a detailed discussion of big data computing systems and applications with high-performance networking, computing, and storage technologies, including state-of-the-art designs for data processing and storage systems. Finally, the book considers some advanced research topics in high-performance big data computing, including designing high-performance deep learning over big data (DLoBD) stacks and HPC cloud technologies. Inhaltsverzeichnis Acknowledgments viii 1 Introduction 1 2 Parallel Programming Models and Systems 13 3 Parallel and Distributed Storage Systems 37 4 HPC Architectures and Trends 61 5 Opportunities and Challenges in Accelerating Big Data Computing 93 6 Benchmarking Big Data Systems 107 7 Accelerations with RDMA 121 8 Accelerations with Multicore/Accelerator Technologies 145 9 Acceleration with High-Performance Storage Technologies 159 10 Deep Learning over Big Data 175 11 Designs with Cloud Technologies 195 12 Frontier Research on High-Performance Big Data Computing 215 References 227 Index 257...