Multiple Testing and False Discovery Rate Control

Multiple Testing and False Discovery Rate Control
Author: Shiyun Chen
Publisher:
Total Pages: 142
Release: 2019
Genre:
ISBN:

Multiple testing, a situation where multiple hypothesis tests are performed simultaneously, is a core research topic in statistics that arises in almost every scientific field. When more hypotheses are tested, more errors are bound to occur. Controlling the false discovery rate (FDR) [BH95], which is the expected proportion of falsely rejected null hypotheses among all rejections, is an important challenge for making meaningful inferences. Throughout the dissertation, we analyze the asymptotic performance of several FDR-controlling procedures under different multiple testing settings. In Chapter 1, we study the famous Benjamini-Hochberg (BH) method [BH95] which often serves as benchmark among FDR-controlling procedures, and show that it is asymptotic optimal in a stylized setting. We then prove that a distribution-free FDR control method of Barber and Candès [FBC15], which only requires the (unknown) null distribution to be symmetric, can achieve the same asymptotic performance as the BH method, thus is also optimal. Chapter 2 proposes an interval-type procedure which identifies the longest interval with the estimated FDR under a given level and rejects the corresponding hypotheses with P-values lying inside the interval. Unlike the threshold approaches, this procedure scans over all intervals with the left point not necessary being zero. We show that this scan procedure provides strong control of the asymptotic false discovery rate. In addition, we investigate its asymptotic false non-discovery rate (FNR), deriving conditions under which it outperforms the BH procedure. In Chapter 3, we consider an online multiple testing problem where the hypotheses arrive sequentially in a stream, and investigate two procedures proposed by Javanmard and Montanari [JM15] which control FDR in an online manner. We quantify their asymptotic performance in the same location models as in Chapter 1 and compare their power with the (static) BH method. In Chapter 4, we propose a new class of powerful online testing procedures which incorporates the available contextual information, and prove that any rule in this class controls the online FDR under some standard assumptions. We also derive a practical algorithm that can make more empirical discoveries in an online fashion, compared to the state-of-the-art procedures.

Resampling-Based Multiple Testing

Resampling-Based Multiple Testing
Author: Peter H. Westfall
Publisher: John Wiley & Sons
Total Pages: 382
Release: 1993-01-12
Genre: Mathematics
ISBN: 9780471557616

Combines recent developments in resampling technology (including the bootstrap) with new methods for multiple testing that are easy to use, convenient to report and widely applicable. Software from SAS Institute is available to execute many of the methods and programming is straightforward for other applications. Explains how to summarize results using adjusted p-values which do not necessitate cumbersome table look-ups. Demonstrates how to incorporate logical constraints among hypotheses, further improving power.

Multiple Comparison Procedures

Multiple Comparison Procedures
Author: Yosef Hochberg
Publisher:
Total Pages: 482
Release: 1987-10-05
Genre: Mathematics
ISBN:

Offering a balanced, up-to-date view of multiple comparison procedures, this book refutes the belief held by some statisticians that such procedures have no place in data analysis. With equal emphasis on theory and applications, it establishes the advantages of multiple comparison techniques in reducing error rates and in ensuring the validity of statistical inferences. Provides detailed descriptions of the derivation and implementation of a variety of procedures, paying particular attention to classical approaches and confidence estimation procedures. Also discusses the benefits and drawbacks of other methods. Numerous examples and tables for implementing procedures are included, making this work both practical and informative.

Testing Jumps Via False Discovery Rate Control

Testing Jumps Via False Discovery Rate Control
Author: Yu-Min Yen
Publisher:
Total Pages: 42
Release: 2019
Genre:
ISBN:

Many recently developed nonparametric jump tests can be viewed as multiple hypothesis testing problems. For such multiple hypothesis tests, it is well known that controlling type I error often makes a large proportion of erroneous rejections, and such situation becomes even worse when the jump occurrence is a rare event. To obtain more reliable results, we aim to control the false discovery rate (FDR), an e fficient compound error measure for erroneous rejections in multiple testing problems. We perform the test via the Barndor -Nielsen and Shephard (BNS) test statistic, and control the FDR with the Benjamini and Hochberg (BH) procedure. We provide asymptotic results for the FDR control. From simulations, we examine relevant theoretical results and demonstrate the advantages of controlling the FDR. The hybrid approach is then applied to empirical analysis on two benchmark stock indices with high frequency data.

Multiple Testing Problems in Pharmaceutical Statistics

Multiple Testing Problems in Pharmaceutical Statistics
Author: Alex Dmitrienko
Publisher: CRC Press
Total Pages: 323
Release: 2009-12-08
Genre: Mathematics
ISBN: 1584889853

Useful Statistical Approaches for Addressing Multiplicity IssuesIncludes practical examples from recent trials Bringing together leading statisticians, scientists, and clinicians from the pharmaceutical industry, academia, and regulatory agencies, Multiple Testing Problems in Pharmaceutical Statistics explores the rapidly growing area of multiple c

Large-Scale Inference

Large-Scale Inference
Author: Bradley Efron
Publisher: Cambridge University Press
Total Pages:
Release: 2012-11-29
Genre: Mathematics
ISBN: 1139492136

We live in a new age for statistical inference, where modern scientific technology such as microarrays and fMRI machines routinely produce thousands and sometimes millions of parallel data sets, each with its own estimation or testing problem. Doing thousands of problems at once is more than repeated application of classical methods. Taking an empirical Bayes approach, Bradley Efron, inventor of the bootstrap, shows how information accrues across problems in a way that combines Bayesian and frequentist ideas. Estimation, testing and prediction blend in this framework, producing opportunities for new methodologies of increased power. New difficulties also arise, easily leading to flawed inferences. This book takes a careful look at both the promise and pitfalls of large-scale statistical inference, with particular attention to false discovery rates, the most successful of the new statistical techniques. Emphasis is on the inferential ideas underlying technical developments, illustrated using a large number of real examples.

Statistics Done Wrong

Statistics Done Wrong
Author: Alex Reinhart
Publisher: No Starch Press
Total Pages: 177
Release: 2015-03-01
Genre: Mathematics
ISBN: 1593276206

Scientific progress depends on good research, and good research needs good statistics. But statistical analysis is tricky to get right, even for the best and brightest of us. You'd be surprised how many scientists are doing it wrong. Statistics Done Wrong is a pithy, essential guide to statistical blunders in modern science that will show you how to keep your research blunder-free. You'll examine embarrassing errors and omissions in recent research, learn about the misconceptions and scientific politics that allow these mistakes to happen, and begin your quest to reform the way you and your peers do statistics. You'll find advice on: –Asking the right question, designing the right experiment, choosing the right statistical analysis, and sticking to the plan –How to think about p values, significance, insignificance, confidence intervals, and regression –Choosing the right sample size and avoiding false positives –Reporting your analysis and publishing your data and source code –Procedures to follow, precautions to take, and analytical software that can help Scientists: Read this concise, powerful guide to help you produce statistically sound research. Statisticians: Give this book to everyone you know. The first step toward statistics done right is Statistics Done Wrong.

Multiple Testing Procedures with Applications to Genomics

Multiple Testing Procedures with Applications to Genomics
Author: Sandrine Dudoit
Publisher: Springer Science & Business Media
Total Pages: 611
Release: 2007-12-18
Genre: Science
ISBN: 0387493174

This book establishes the theoretical foundations of a general methodology for multiple hypothesis testing and discusses its software implementation in R and SAS. These are applied to a range of problems in biomedical and genomic research, including identification of differentially expressed and co-expressed genes in high-throughput gene expression experiments; tests of association between gene expression measures and biological annotation metadata; sequence analysis; and genetic mapping of complex traits using single nucleotide polymorphisms. The procedures are based on a test statistics joint null distribution and provide Type I error control in testing problems involving general data generating distributions, null hypotheses, and test statistics.