Reliability of Computer Systems and Networks

Author: Martin L. Shooman
Publisher: John Wiley & Sons
Total Pages: 552
Release: 2003-04-08
Genre: Technology & Engineering
ISBN: 0471464066

With computers becoming embedded as controllers in everything from network servers to the routing of subway schedules to NASA missions, there is a critical need to ensure that systems continue to function even when a component fails. In this book, bestselling author Martin Shooman draws on his expertise in reliability engineering and software engineering to provide a complete and authoritative look at fault-tolerant computing. He clearly explains all fundamentals, including how to use redundant elements in system design to ensure the reliability of computer systems and networks. Market: Systems and Networking Engineers, Computer Programmers, IT Professionals.

Reliability in Computer System Design

Author: Balbir S. Dhillon
Publisher: Intellect (UK)
Total Pages: 304
Release: 1987
Genre: Computers
ISBN:

This volume covers a wide range of topics, such as life cycle costing, microcomputers, common-cause failures, and space computers. Every effort is made to present difficult material with the aid of an example along with its solution. The material covered is summarized at the end of each chapter. The information is written in a format that allows readers to learn and better understand the philosophy of reliability in computer system design. At the same time, it tests their comprehension through listed exercises.

Reliable Computer Systems

Author: Daniel Siewiorek
Publisher: Digital Press
Total Pages: 929
Release: 2014-06-28
Genre: Computers
ISBN: 1483297438

Enhance your hardware/software reliability. Enhancement of system reliability has been a major concern of computer users and designers, and this major revision of the 1982 classic meets users' continuing need for practical information on this pressing topic. Included are case studies of reliable systems from manufacturers such as Tandem, Stratus, IBM, and Digital, as well as coverage of special systems such as the Galileo Orbiter fault protection system and AT&T telephone switching processors.

Computer System Reliability

Author: B.S. Dhillon
Publisher: CRC Press
Total Pages: 247
Release: 2016-04-19
Genre: Computers
ISBN: 1466573139

Computer systems have become an important element of the world economy, with billions of dollars spent each year on development, manufacture, operation, and maintenance. Combining coverage of computer system reliability, safety, usability, and other related topics into a single volume, Computer System Reliability: Safety and Usability eliminates th

Design for Reliability

Author: Eric Bauer
Publisher: John Wiley & Sons
Total Pages: 349
Release: 2011-02-11
Genre: Computers
ISBN: 1118075080

System reliability, availability, and robustness are often not well understood by system architects, engineers, and developers. They often don't understand what drives customers' availability expectations, how to frame verifiable availability/robustness requirements, how to manage and budget availability/robustness, how to methodically architect and design systems that meet robustness requirements, and so on. The book takes a pragmatic approach, framing reliability and robustness as a functional aspect of a system so that architects, designers, developers, and testers can address it as a concrete, functional attribute of a system rather than an abstract, non-functional notion.

Principles of Computer System Design

Author: Jerome H. Saltzer
Publisher: Morgan Kaufmann
Total Pages: 561
Release: 2009-05-21
Genre: Computers
ISBN: 0080959423

Principles of Computer System Design is the first textbook to take a principles-based approach to computer system design. It identifies, examines, and illustrates fundamental concepts in computer system design that are common across operating systems, networks, database systems, distributed systems, programming languages, software engineering, security, fault tolerance, and architecture. Through carefully analyzed case studies from each of these disciplines, it demonstrates how to apply these concepts to tackle practical system design problems. To support the focus on design, the text identifies and explains abstractions that have proven successful in practice, such as remote procedure call, client/service organization, file systems, data integrity, consistency, and authenticated messages. Most computer systems are built using a handful of such abstractions. The text describes how these abstractions are implemented, demonstrates how they are used in different systems, and prepares the reader to apply them in future designs. The book is recommended for junior and senior undergraduate students in Operating Systems, Distributed Systems, Distributed Operating Systems, and/or Computer Systems Design courses, and for professional computer systems designers.

Concepts of computer system design guided by fundamental principles.
Cross-cutting approach that identifies abstractions common to networking, operating systems, transaction systems, distributed systems, architecture, and software engineering.
Case studies that make the abstractions real: naming (DNS and the URL); file systems (the UNIX file system); clients and services (NFS); virtualization (virtual machines); scheduling (disk arms); security (TLS).
Numerous pseudocode fragments that provide concrete examples of abstract concepts.
Extensive support: the authors and MIT OpenCourseWare provide online, free of charge, open educational resources, including additional chapters, course syllabi, board layouts and slides, lecture videos, and an archive of lecture schedules, class assignments, and design projects.

Reliability Growth

Author: Panel on Reliability Growth Methods for Defense Systems
Publisher: National Academy Press
Total Pages: 235
Release: 2015-03-01
Genre: Technology & Engineering
ISBN: 9780309314749

A high percentage of defense systems fail to meet their reliability requirements. This is a serious problem for the U.S. Department of Defense (DOD), as well as the nation. Those systems are not only less likely to successfully carry out their intended missions, but they also could endanger the lives of the operators. Furthermore, reliability failures discovered after deployment can result in costly and strategic delays and the need for expensive redesign, which often limits the tactical situations in which the system can be used. Finally, systems that fail to meet their reliability requirements are much more likely to need additional scheduled and unscheduled maintenance and to need more spare parts and possibly replacement systems, all of which can substantially increase the life-cycle costs of a system. Beginning in 2008, DOD undertook a concerted effort to raise the priority of reliability through greater use of design for reliability techniques, reliability growth testing, and formal reliability growth modeling, by both the contractors and DOD units. To this end, handbooks, guidances, and formal memoranda were revised or newly issued to reduce the frequency of reliability deficiencies for defense systems in operational testing and the effects of those deficiencies. "Reliability Growth" evaluates these recent changes and, more generally, assesses how current DOD principles and practices could be modified to increase the likelihood that defense systems will satisfy their reliability requirements. This report examines changes to the reliability requirements for proposed systems; defines modern design and testing for reliability; discusses the contractor's role in reliability testing; and summarizes the current state of formal reliability growth modeling. The recommendations of "Reliability Growth" will improve the reliability of defense systems and protect the health of the valuable personnel who operate them.

Reliable Computer Systems

Author: Daniel P. Siewiorek
Publisher: CRC Press
Total Pages: 908
Release: 1998-12-15
Genre: Computers
ISBN: 1439863962

This classic reference work is a comprehensive guide to the design, evaluation, and use of reliable computer systems. It includes case studies of reliable systems from manufacturers such as Tandem, Stratus, IBM, and Digital. It covers special systems such as the Galileo Orbiter fault protection system and AT&T telephone switching system processors.