Organizing for Reliability

Organizing for Reliability
Author: Ranga Ramanujam
Publisher: Stanford University Press
Total Pages: 390
Release: 2018-02-27
Genre: Business & Economics
ISBN: 1503604535

Increasingly, scholars view reliability—the ability to plan for and withstand disaster—as a social construction. However, there is a tendency to evoke this concept only in the face of catastrophes, such as the British Petroleum oil spill or the Space Shuttle Challenger explosion. This book frames reliability as a fundamental issue in the study of organizations—one that can also improve day-to-day operations. Bringing together a diverse cast of contributors, it considers how we can account for the ability of some organizations to maintain high reliability and what we can learn from them. The chapters distinguish reliability from related lines of inquiry; take stock of relevant research from different disciplinary perspectives; highlight implications for practice; and identify directions, questions, and priorities for future research. The first of its kind in over twenty years, this volume delivers a dynamic base of shared knowledge and an integrative research agenda at a time when organizational reliability has never been so important.

Building a High-Reliability Organization

Building a High-Reliability Organization
Author: Gary L. Sculli
Publisher: Hcpro, a Division of Simplify Compliance
Total Pages: 0
Release: 2015-08-28
Genre:
ISBN: 9781556452994

Building a High-Reliability Organization: A Toolkit for Success Gary Sculli, RN, MSN, ATP Douglas E. Paull, MD, FACS, FCCP, CHSE Building a High-Reliability Organization: A Toolkit for Success is a practical guide to becoming a high-reliability organization (HRO). HROs practice the highest standards of patient quality and prevent never events before they occur. In this first-of-its-kind book, written for real-world healthcare professionals on the front lines of patient safety, authors Gary L. Sculli, RN, MSN, ATP, and Douglas E. Paull, MD, FACS, FCCP, CHSE, take the concept of an HRO and break down what it means at the point of care. Through step-by-step instructions and a practical, straightforward approach, they demonstrate how your organization can ensure safe patient care, every day, for every patient. After reading this book, you will: Possess a clear understanding of what constitutes high-reliability healthcare Be able to promote evidence-based, reliable methods to improve safety, including team training, fatigue management systems, and investment in patient safety infrastructure and technology Understand which elements and behaviors must be included in an overall plan to achieve high reliability at the front lines of care Become a transformational leader in your healthcare organization Be able to apply the principles of a fair and just culture to promote the reporting, discussion, and disclosure of adverse events Table of Contents: Preface and Precepts Chapter 1: Situational Awareness Is Fundamental to High Reliability Chapter 2: Situational Awareness Countermeasures Chapter 3: Everyone on the Same Sheet of Music Chapter 4: Yes--You Need to Use the Checklist! Chapter 5: Preoccupation With Failure--It's an Attitude Chapter 6: Recognizing That the Expert Is Not Always the Person in Charge Chapter 7: Lab Coats and Scrubs, Meet Suits and Ties--Sensitivity to Frontline Operations Chapter 8: Just Response to Human Error: A Necessary Component of High-Reliability Organizations Chapter 9: Standardize Communication and Processes to Create Equivalent Actors Chapter 10: Ensuring Technical and Non-Technical Competence

High Reliability Management

High Reliability Management
Author: Emery Roe
Publisher: Stanford Business Books
Total Pages: 260
Release: 2008
Genre: Business & Economics
ISBN: 9780804759465

High Reliability Management is the first book to provide an in-depth and timely look at the people who manage for high reliability--professionals who run critical systems in electricity, water, and transportation. The book tells of the extraordinary challenges that these "reliability professionals" face in ensuring that society's basic systems operate continuously and safely, even in the wake of errors in policy and technical design.

Reliability and Risk

Reliability and Risk
Author: Paul Schulman
Publisher: Stanford University Press
Total Pages: 264
Release: 2016-04-13
Genre: Business & Economics
ISBN: 0804798621

The safe and continued functioning of critical infrastructures—such as electricity, natural gas, transportation, and water—is a social imperative. Yet the complex connections between these systems render them increasingly precarious. Furthermore, though we depend so heavily on interconnected infrastructures, we do not fully understand the risks involved in their failure. Emery Roe and Paul R. Schulman argue that designs, policies, and laws often overlook the knowledge and experiences of those who manage these systems on the ground—reliability professionals who have vital insights that would be invaluable to planning. To combat this major blind spot, the athors construct a new theoretical perspective that reveals how to make sense of complex interconnected networks and improve reliability through management, regulation, and political leadership. To illustrate their approach in action, they present a multi-year case study of one of the world's most important "infrastructure crossroads," the San Francisco Bay-Delta. Reliability and Risk advances our understanding of what it takes to ensure the dependability of the intricate—and sometimes hazardous—systems on which we rely every day.

Site Reliability Engineering

Site Reliability Engineering
Author: Niall Richard Murphy
Publisher: "O'Reilly Media, Inc."
Total Pages: 552
Release: 2016-03-23
Genre:
ISBN: 1491951176

The overwhelming majority of a software system’s lifespan is spent in use, not in design or implementation. So, why does conventional wisdom insist that software engineers focus primarily on the design and development of large-scale computing systems? In this collection of essays and articles, key members of Google’s Site Reliability Team explain how and why their commitment to the entire lifecycle has enabled the company to successfully build, deploy, monitor, and maintain some of the largest software systems in the world. You’ll learn the principles and practices that enable Google engineers to make systems more scalable, reliable, and efficient—lessons directly applicable to your organization. This book is divided into four sections: Introduction—Learn what site reliability engineering is and why it differs from conventional IT industry practices Principles—Examine the patterns, behaviors, and areas of concern that influence the work of a site reliability engineer (SRE) Practices—Understand the theory and practice of an SRE’s day-to-day work: building and operating large distributed computing systems Management—Explore Google's best practices for training, communication, and meetings that your organization can use

Maintenance and Operational Reliability

Maintenance and Operational Reliability
Author: Donald H. Nyman
Publisher: Industrial Press
Total Pages: 350
Release: 2021-09
Genre: Technology & Engineering
ISBN: 9780831136611

The quest for reliability is long overdue! In the case of many operations, realization of sustained reliability is still a work in progress. Very few organizations have completed the journey to world-class reliability. The vast majority still operate within a reactive culture, allowing response to repetitive failures to consume an excessive proportion of already limited maintenance resources, and leaving too few for performance of any proactive activities. In today's competitive international environment, enterprise survival is a battle of the fittest. To survive, organizations must achieve "world-class" stature, characterized by wellness, readiness, and application required for a company to successfully compete globally. That's why Maintenance and Operational Reliability is so important. This work is organized by the foundation and 5 Pillars of Maintenance/Reliability Excellence, plus 24 Building Blocks, as depicted throughout the book. This pillar graphic shows the functions, management techniques, systems, information sources and performance management vital to the maintenance and reliability process, and also serves as an important visual aid for the education of the entire organization. So, how is the ultimate, but challenging reliability goal to be achieved? Are you prepared to manage, support, process, and interpret the magnitude of information in real time, critical to making the right business decisions to achieve a competitive advantage? The authors, two veteran maintenance and reliability experts, have collected all the essentials leading to reliability here, in one practical resource, connecting and sequencing the integral pieces for world-class reliability. Features Guides readers through the journey from classic reactive repair upon failure to reliable, proactive maintenance, engineered to preclude failure and, ultimately, to sustain reliability. Clarifies roles and responsibilities of involved functions while explaining control tools to be deployed by each position. Provides the overriding business justification required to gain senior management commitment.

Methods for Reliability Improvement and Risk Reduction

Methods for Reliability Improvement and Risk Reduction
Author: Michael Todinov
Publisher: John Wiley & Sons
Total Pages: 286
Release: 2018-12-10
Genre: Technology & Engineering
ISBN: 1119477581

Reliability is one of the most important attributes for the products and processes of any company or organization. This important work provides a powerful framework of domain-independent reliability improvement and risk reducing methods which can greatly lower risk in any area of human activity. It reviews existing methods for risk reduction that can be classified as domain-independent and introduces the following new domain-independent reliability improvement and risk reduction methods: Separation Stochastic separation Introducing deliberate weaknesses Segmentation Self-reinforcement Inversion Reducing the rate of accumulation of damage Permutation Substitution Limiting the space and time exposure Comparative reliability models The domain-independent methods for reliability improvement and risk reduction do not depend on the availability of past failure data, domain-specific expertise or knowledge of the failure mechanisms underlying the failure modes. Through numerous examples and case studies, this invaluable guide shows that many of the new domain-independent methods improve reliability at no extra cost or at a low cost. Using the proven methods in this book, any company and organisation can greatly enhance the reliability of its products and operations.

Storage Systems

Storage Systems
Author: Alexander Thomasian
Publisher: Academic Press
Total Pages: 748
Release: 2021-10-13
Genre: Science
ISBN: 0323908098

Storage Systems: Organization, Performance, Coding, Reliability and Their Data Processing was motivated by the 1988 Redundant Array of Inexpensive/Independent Disks proposal to replace large form factor mainframe disks with an array of commodity disks. Disk loads are balanced by striping data into strips—with one strip per disk— and storage reliability is enhanced via replication or erasure coding, which at best dedicates k strips per stripe to tolerate k disk failures. Flash memories have resulted in a paradigm shift with Solid State Drives (SSDs) replacing Hard Disk Drives (HDDs) for high performance applications. RAID and Flash have resulted in the emergence of new storage companies, namely EMC, NetApp, SanDisk, and Purestorage, and a multibillion-dollar storage market. Key new conferences and publications are reviewed in this book.The goal of the book is to expose students, researchers, and IT professionals to the more important developments in storage systems, while covering the evolution of storage technologies, traditional and novel databases, and novel sources of data. We describe several prototypes: FAWN at CMU, RAMCloud at Stanford, and Lightstore at MIT; Oracle's Exadata, AWS' Aurora, Alibaba's PolarDB, Fungible Data Center; and author's paper designs for cloud storage, namely heterogeneous disk arrays and hierarchical RAID. - Surveys storage technologies and lists sources of data: measurements, text, audio, images, and video - Familiarizes with paradigms to improve performance: caching, prefetching, log-structured file systems, and merge-trees (LSMs) - Describes RAID organizations and analyzes their performance and reliability - Conserves storage via data compression, deduplication, compaction, and secures data via encryption - Specifies implications of storage technologies on performance and power consumption - Exemplifies database parallelism for big data, analytics, deep learning via multicore CPUs, GPUs, FPGAs, and ASICs, e.g., Google's Tensor Processing Units

Reliability Engineering

Reliability Engineering
Author: Kailash C. Kapur
Publisher: John Wiley & Sons
Total Pages: 528
Release: 2014-03-21
Genre: Technology & Engineering
ISBN: 1118841794

An Integrated Approach to Product Development Reliability Engineering presents an integrated approach to the design, engineering, and management of reliability activities throughout the life cycle of a product, including concept, research and development, design, manufacturing, assembly, sales, and service. Containing illustrative guides that include worked problems, numerical examples, homework problems, a solutions manual, and class-tested materials, it demonstrates to product development and manufacturing professionals how to distribute key reliability practices throughout an organization. The authors explain how to integrate reliability methods and techniques in the Six Sigma process and Design for Six Sigma (DFSS). They also discuss relationships between warranty and reliability, as well as legal and liability issues. Other topics covered include: Reliability engineering in the 21st Century Probability life distributions for reliability analysis Process control and process capability Failure modes, mechanisms, and effects analysis Health monitoring and prognostics Reliability tests and reliability estimation Reliability Engineering provides a comprehensive list of references on the topics covered in each chapter. It is an invaluable resource for those interested in gaining fundamental knowledge of the practical aspects of reliability in design, manufacturing, and testing. In addition, it is useful for implementation and management of reliability programs.