Volume 16: How to Detect and Handle Outliers

Author: Boris Iglewicz
Publisher: Quality Press
Total Pages: 99
Release: 1993-01-08
Genre: Business & Economics
ISBN: 0873892607

Outliers are the key focus of this book. The authors concentrate on the practical aspects of dealing with outliers in the forms of data that arise most often in applications: single and multiple samples, linear regression, and factorial experiments. Available only as an E-Book.

Identification of Outliers

Author: D. Hawkins
Publisher: Springer Science & Business Media
Total Pages: 194
Release: 2013-04-17
Genre: Science
ISBN: 9401539944

The problem of outliers is one of the oldest in statistics, and during the last century and a half interest in it has waxed and waned several times. Currently it is once again an active research area after some years of relative neglect, and recent work has solved a number of old problems in outlier theory, and identified new ones. The major results are, however, scattered amongst many journal articles, and for some time there has been a clear need to bring them together in one place. That was the original intention of this monograph: but during execution it became clear that the existing theory of outliers was deficient in several areas, and so the monograph also contains a number of new results and conjectures. In view of the enormous volume of literature on the outlier problem and its cousins, no attempt has been made to make the coverage exhaustive. The material is concerned almost entirely with the use of outlier tests that are known (or may reasonably be expected) to be optimal in some way. Such topics as robust estimation are largely ignored, being covered more adequately in other sources. The numerous ad hoc statistics proposed in the early work on the grounds of intuitive appeal or computational simplicity also are not discussed in any detail.

Pattern Recognition and Data Analysis with Applications

Author: Deepak Gupta
Publisher: Springer Nature
Total Pages: 816
Release: 2022-09-01
Genre: Technology & Engineering
ISBN: 9811915202

This book covers the latest advancements in machine learning, computer vision, pattern recognition, computational learning theory, big data analytics, network intelligence, and signal processing, along with their real-world applications. The machine learning topics include feature extraction, variants of the support vector machine (SVM), the extreme learning machine (ELM), and the artificial neural network (ANN), among other areas. The mathematical analysis of computer vision and pattern recognition involves the use of geometric techniques, scene understanding and modelling from video, 3D object recognition, localization and tracking, medical image analysis and so on. Computational learning theory involves different kinds of learning, such as incremental, online, reinforcement, manifold, multi-task, and semi-supervised learning. Further, the book covers the real-time challenges of big data analytics and stream processing with the integration of smart data computing services and interconnectivity. Additionally, it covers recent developments in network intelligence for analyzing network information and thereby adapting algorithms dynamically to improve efficiency. Finally, it covers progress in signal processing for normal and abnormal categories of real-world signals, for instance signals generated from IoT devices, smart systems, speech, and videos, including biomedical signal processing: electrocardiogram (ECG), electroencephalogram (EEG), magnetoencephalography (MEG) and electromyogram (EMG).

Probability, Statistics and Other Frightening Stuff

Author: Alan R. Jones
Publisher: Routledge
Total Pages: 472
Release: 2018-10-09
Genre: Business & Economics
ISBN: 1351661388

Probability, Statistics and Other Frightening Stuff (Volume II of the Working Guides to Estimating & Forecasting series) considers many of the commonly used Descriptive Statistics in the world of estimating and forecasting. It considers values that are representative of the ‘middle ground’ (Measures of Central Tendency), and the degree of data scatter (Measures of Dispersion and Shape) around the ‘middle ground’ values. A number of Probability Distributions and where they might be used are discussed, along with some fascinating and useful ‘rules of thumb’ or short-cut properties that estimators and forecasters can exploit in plying their trade. With the help of a ‘Correlation Chicken’, the concept of partial correlation is explained, including how the estimator or forecaster can exploit this in reflecting varying levels of independence and imperfect dependence between an output or predicted value (such as cost) and an input or predictor variable such as size. Under the guise of ‘Tails of the unexpected’ the book concludes with two chapters devoted to Hypothesis Testing (or knowing when to accept or reject the validity of an assumed estimating relationship), and a number of statistically-based tests to help the estimator to decide whether to include or exclude a data point as an ‘outlier’, one that appears not to be representative of that which the estimator is tasked to produce. This is a valuable resource for estimators, engineers, accountants, project risk specialists as well as students of cost engineering.

Control Charts and Machine Learning for Anomaly Detection in Manufacturing

Author: Kim Phuc Tran
Publisher: Springer Nature
Total Pages: 270
Release: 2021-08-29
Genre: Technology & Engineering
ISBN: 3030838196

This book introduces the latest research on advanced control charts and new machine learning approaches to detect abnormalities in the smart manufacturing process. By approaching anomaly detection using both statistics and machine learning, the book promotes interdisciplinary cooperation between the research communities, to jointly develop new anomaly detection approaches that are more suitable for the 4.0 Industrial Revolution. The book provides ready-to-use algorithms and parameter sheets, enabling readers to design advanced control charts and machine learning-based approaches for anomaly detection in manufacturing. Case studies are introduced in each chapter to help practitioners easily apply these tools to real-world manufacturing processes. The book is of interest to researchers, industrial experts, and postgraduate students in the fields of industrial engineering, automation, statistical learning, and manufacturing industries.

The Semantic Web – ISWC 2016

Author: Paul Groth
Publisher: Springer
Total Pages: 698
Release: 2016-10-05
Genre: Computers
ISBN: 3319465236

The two-volume set LNCS 9981 and 9982 constitutes the refereed proceedings of the 15th International Semantic Web Conference, ISWC 2016, which was held in Kobe, Japan, in October 2016. The 75 full papers presented in these proceedings were carefully reviewed and selected from 326 submissions. The International Semantic Web Conference is the premier forum for Semantic Web research, where cutting edge scientific results and technological innovations are presented, where problems and solutions are discussed, and where the future of this vision is being developed. It brings together specialists in fields such as artificial intelligence, databases, social networks, distributed computing, Web engineering, information systems, human-computer interaction, natural language processing, and the social sciences. The Research Track solicited novel and significant research contributions addressing theoretical, analytical, empirical, and practical aspects of the Semantic Web. The Applications Track solicited submissions exploring the benefits and challenges of applying semantic technologies in concrete, practical applications, in contexts ranging from industry to government and science. The newly introduced Resources Track sought submissions providing a concise and clear description of a resource and its (expected) usage. Traditional resources include ontologies, vocabularies, datasets, benchmarks and replication studies, services and software. Besides more established types of resources, the track solicited submissions of new types of resources such as ontology design patterns, crowdsourcing task designs, workflows, methodologies, and protocols and measures.

Performance Evaluation for Network Services, Systems and Protocols

Author: Stênio Fernandes
Publisher: Springer
Total Pages: 185
Release: 2017-03-21
Genre: Computers
ISBN: 3319545213

This book provides a comprehensive view of the methods and approaches for performance evaluation of computer networks. It offers a clear and logical introduction to the topic, covering both fundamental concepts and practical aspects. It enables the reader to answer a series of questions regarding performance evaluation in modern computer networking scenarios, such as ‘What, where, and when to measure?’, ‘Which time scale is more appropriate for a particular measurement and analysis?’, ‘Experimentation, simulation or emulation? Why?’, and ‘How do I best design a sound performance evaluation plan?’. The book includes concrete examples and applications in the important aspects of experimentation, simulation and emulation, and analytical modeling, with strong support from the scientific literature. It enables the identification of common shortcomings and highlights where students, researchers, and engineers should focus to conduct sound performance evaluation. This book is a useful guide for advanced undergraduates and graduate students, network engineers, and researchers who plan and design proper performance evaluation of computer networks and services. Previous knowledge of computer network concepts, mechanisms, and protocols is assumed. Although the book provides a quick review of applied statistics in computer networking, familiarity with basic statistics is an asset. It is suitable for advanced courses on computer networking as well as for more specific courses as a secondary textbook.

Digital Business and Intelligent Systems

Author: Mirjana Ivanovic
Publisher: Springer Nature
Total Pages: 273
Release: 2022-06-27
Genre: Computers
ISBN: 3031098501

This book constitutes the refereed proceedings of the 15th International Baltic Conference on Digital Business and Intelligent Systems, Baltic DB&IS 2022, held in Riga, Latvia, in July 2022. The 16 revised full papers and 1 short paper presented were carefully reviewed and selected from 42 submissions. The papers are centered around topics such as architectures and quality of information systems, artificial intelligence in information systems, data and knowledge engineering, enterprise and information systems engineering, and security of information systems.

Essential Statistical Concepts for the Quality Professional

Author: D. H. Stamatis
Publisher: CRC Press
Total Pages: 512
Release: 2012-05-02
Genre: Business & Economics
ISBN: 1439894574

The essence of any root cause analysis in our modern quality thinking is to go beyond the actual problem. This means that we not only have to fix the problem at hand but also have to identify why the failure occurred and what opportunity there was to apply the appropriate knowledge to avoid the problem in the future. Essential Statistical Concepts for the Quality Professional offers a new non-technical statistical approach to quality for effective improvement and productivity by focusing on very specific and fundamental methodologies and tools for the future. Written by an expert with more than 30 years of experience in management, quality training, and consulting, the book examines the fundamentals of statistical understanding, and by doing so demonstrates the importance of using statistics in the decision-making process. The author points out pitfalls to keep in mind when undertaking an experiment for improvement and explains how to use statistics in improvement endeavors. He discusses data interpretation, common tests and confidence intervals, and how to plan experiments for improvement. The book expands the notion of experimentation by dealing with mathematical models such as regression to optimize the improvement and understand the relationship between several factors. It emphasizes the need for sampling and introduces specific techniques to make sure the accuracy and precision of the data are appropriate and applicable for the study at hand. The author’s approach is somewhat new and unique; however, he details tools and methodologies that can be used to evaluate the system for prevention. These tools and methodologies focus on structured, repeatable processes that can be instrumental in finding real, fixable causes of the human errors and equipment failures that lead to quality issues.