Design and Evaluation of Efficient Collective Communications on Modern Interconnects and Multi-core Clusters

Author:
Publisher:
Total Pages:
Release: 2001
Genre:
ISBN:

Two driving forces behind high-performance clusters are the availability of modern interconnects and the advent of multi-core systems. As multi-core clusters become commonplace, where each core will run at least one process with multiple intra-node and inter-node connections to several other processes, there will be immense pressure on the interconnection network and its communication system software. Many parallel scientific applications use Message Passing Interface (MPI) collective communications intensively. Therefore, efficient and scalable implementation of MPI collective operations is critical to the performance of applications running on clusters. In this dissertation, I propose and evaluate a number of efficient collective communication algorithms that utilize the modern features of Quadrics and InfiniBand interconnects as well as the availability of multiple cores on emerging clusters. Using multiple independent networks, known as multi-rail networks, is a very promising way to overcome bandwidth limitations and to enhance fault tolerance. The Quadrics multi-rail QsNetII network is constructed using multiple network interface cards (NICs) per node, where each NIC is connected to a rail. I design and evaluate a number of Remote Direct Memory Access (RDMA) based multi-port collective operations on the multi-rail QsNetII network. I also extend the gather and allgather algorithms to be shared-memory aware for small to medium messages. The algorithms prove to be much more efficient than the native Quadrics MPI implementation. ConnectX is the newest generation of InfiniBand host channel adapters from Mellanox Technologies. I provide evidence that ConnectX achieves scalable performance for simultaneous communication over multiple connections. Utilizing this ability of ConnectX cards, I propose a number of RDMA based multi-connection and multi-core aware allgather algorithms at the MPI level. My algorithms are devised to target different message sizes, and the performance result.

Delivery and Adoption of Cloud Computing Services in Contemporary Organizations

Author: Chang, Victor
Publisher: IGI Global
Total Pages: 543
Release: 2015-03-31
Genre: Computers
ISBN: 1466682116

The ubiquity of technology has not only brought the need for computer knowledge to every aspect of the modern business world; it has also increased our need to safely store the data we are now creating at a rate never experienced before. Delivery and Adoption of Cloud Computing Services in Contemporary Organizations brings together the best practices for storing massive amounts of data. Highlighting ways cloud services can work effectively in production and in real time, this book is an essential reference source for professionals and academics of various disciplines, such as computer science, consulting, information technology, information and communication sciences, healthcare, and finance.

Encyclopedia of Parallel Computing

Author: David Padua
Publisher: Springer Science & Business Media
Total Pages: 2211
Release: 2011-09-08
Genre: Computers
ISBN: 0387097651

Containing over 300 entries in an A-Z format, the Encyclopedia of Parallel Computing provides easy, intuitive access to relevant information for professionals and researchers seeking access to any aspect within the broad field of parallel computing. Topics for this comprehensive reference were selected, written, and peer-reviewed by an international pool of distinguished researchers in the field. The Encyclopedia is broad in scope, covering machine organization, programming languages, algorithms, and applications. Within each area, concepts, designs, and specific implementations are presented. The highly structured essays in this work comprise synonyms, a definition and discussion of the topic, bibliographies, and links to related literature. Extensive cross-references to other entries within the Encyclopedia support efficient, user-friendly searches for immediate access to useful information. Key concepts presented in the Encyclopedia of Parallel Computing include: laws and metrics; specific numerical and non-numerical algorithms; asynchronous algorithms; libraries of subroutines; benchmark suites; applications; sequential consistency and cache coherency; machine classes such as clusters, shared-memory multiprocessors, special-purpose machines and dataflow machines; specific machines such as Cray supercomputers, IBM’s Cell processor and Intel’s multicore machines; race detection and auto parallelization; parallel programming languages, synchronization primitives, collective operations, message passing libraries, checkpointing, and operating systems. Topics covered: Speedup, Efficiency, Isoefficiency, Redundancy, Amdahl's law, Computer Architecture Concepts, Parallel Machine Designs, Benchmarks, Parallel Programming concepts & design, Algorithms, Parallel applications. This authoritative reference will be published in two formats: print and online. The online edition features hyperlinks to cross-references and to additional significant research.
Related Subjects: supercomputing, high-performance computing, distributed computing

Interconnection Networks

Author: Jose Duato
Publisher: Morgan Kaufmann
Total Pages: 626
Release: 2003
Genre: Computers
ISBN: 1558608524

Foreword -- Foreword to the First Printing -- Preface -- Chapter 1 -- Introduction -- Chapter 2 -- Message Switching Layer -- Chapter 3 -- Deadlock, Livelock, and Starvation -- Chapter 4 -- Routing Algorithms -- Chapter 5 -- Collective Communication Support -- Chapter 6 -- Fault-Tolerant Routing -- Chapter 7 -- Network Architectures -- Chapter 8 -- Messaging Layer Software -- Chapter 9 -- Performance Evaluation -- Appendix A -- Formal Definitions for Deadlock Avoidance -- Appendix B -- Acronyms -- References -- Index.

并行程序设计 (Parallel Program Design)

Author: Foster
Publisher:
Total Pages: 381
Release: 2002
Genre: Computer programming
ISBN: 9787115103475

Part of a series of outstanding information science and technology textbooks from renowned universities abroad.

The Road to Results

Author: Linda G. Morra-Imas
Publisher: World Bank Publications
Total Pages: 611
Release: 2009
Genre: Business & Economics
ISBN: 0821379119

'The Road to Results: Designing and Conducting Effective Development Evaluations' presents concepts and procedures for evaluation in a development context. It provides procedures and examples on how to set up a monitoring and evaluation system, how to conduct participatory evaluations and do social mapping, and how to construct a "rigorous" quasi-experimental design to answer an impact question. The text begins with the context of development evaluation and how it arrived where it is today. It then discusses current issues driving development evaluation, such as the Millennium Development Goals and the move from simple project evaluations to the broader understandings of complex evaluations. The topics of implementing 'Results-based Measurement and Evaluation' and constructing a 'Theory of Change' are emphasized throughout the text. Next, the authors take the reader down 'the road to results,' presenting procedures for evaluating projects, programs, and policies by using a 'Design Matrix' to help map the process. This road includes: determining the overall approach, formulating questions, selecting designs, developing data collection instruments, choosing a sampling strategy, and planning data analysis for qualitative, quantitative, and mixed method evaluations. The book also includes discussions on conducting complex evaluations, how to manage evaluations, how to present results, and ethical behavior, including principles, standards, and guidelines. The final chapter discusses the future of development evaluation. This comprehensive text is an essential tool for those involved in development evaluation.

High Performance Computing

Author: Julian M. Kunkel
Publisher: Springer
Total Pages: 506
Release: 2016-06-14
Genre: Computers
ISBN: 331941321X

This book constitutes the refereed proceedings of the 31st International Conference, ISC High Performance 2016 [formerly known as the International Supercomputing Conference] held in Frankfurt, Germany, in June 2016. The 25 revised full papers presented in this book were carefully reviewed and selected from 60 submissions. The papers cover the following topics: Autotuning and Thread Mapping; Data Locality and Decomposition; Scalable Applications; Machine Learning; Datacenters and Cloud; Communication Runtime; Intel Xeon Phi; Manycore Architectures; Extreme-scale Computations; and Resilience.

The Origin of Consciousness in the Breakdown of the Bicameral Mind

Author: Julian Jaynes
Publisher: Houghton Mifflin Harcourt
Total Pages: 580
Release: 2000-08-15
Genre: Psychology
ISBN: 0547527543

National Book Award Finalist: “This man’s ideas may be the most influential, not to say controversial, of the second half of the twentieth century.” (Columbus Dispatch)
At the heart of this classic, seminal book is Julian Jaynes's still-controversial thesis that human consciousness did not begin far back in animal evolution but instead is a learned process that came about only three thousand years ago and is still developing. The implications of this revolutionary scientific paradigm extend into virtually every aspect of our psychology, our history and culture, our religion, and indeed our future.
“Don’t be put off by the academic title of Julian Jaynes’s The Origin of Consciousness in the Breakdown of the Bicameral Mind. Its prose is always lucid and often lyrical…he unfolds his case with the utmost intellectual rigor.” (The New York Times)
“When Julian Jaynes . . . speculates that until late in the second millennium BC men had no consciousness but were automatically obeying the voices of the gods, we are astounded but compelled to follow this remarkable thesis.” (John Updike, The New Yorker)
“He is as startling as Freud was in The Interpretation of Dreams, and Jaynes is equally as adept at forcing a new view of known human behavior.” (American Journal of Psychiatry)

Network-on-Chip Architectures

Author: Chrysostomos Nicopoulos
Publisher: Springer Science & Business Media
Total Pages: 237
Release: 2009-09-18
Genre: Technology & Engineering
ISBN: 904813031X

[2]. The Cell Processor from Sony, Toshiba and IBM (STI) [3], and the Sun UltraSPARC T1 (formerly codenamed Niagara) [4] signal the growing popularity of such systems. Furthermore, Intel’s very recently announced 80-core TeraFLOP chip [5] exemplifies the irreversible march toward many-core systems with tens or even hundreds of processing elements.
1.2 The Dawn of the Communication-Centric Revolution
The multi-core thrust has ushered the gradual displacement of the computation-centric design model by a more communication-centric approach [6]. The large, sophisticated monolithic modules are giving way to several smaller, simpler processing elements working in tandem. This trend has led to a surge in the popularity of multi-core systems, which typically manifest themselves in two distinct incarnations: heterogeneous Multi-Processor Systems-on-Chip (MPSoC) and homogeneous Chip Multi-Processors (CMP). The SoC philosophy revolves around the technique of Platform-Based Design (PBD) [7], which advocates the reuse of Intellectual Property (IP) cores in flexible design templates that can be customized accordingly to satisfy the demands of particular implementations. The appeal of such a modular approach lies in the substantially reduced Time-To-Market (TTM) incubation period, which is a direct outcome of lower circuit complexity and reduced design effort. The whole system can now be viewed as a diverse collection of pre-existing IP components integrated on a single die.