Statistics for High-Dimensional Data

Statistics for High-Dimensional Data
Author: Peter Bühlmann
Publisher: Springer Science & Business Media
Total Pages: 568
Release: 2011-06-08
Genre: Mathematics
ISBN: 364220192X

Modern statistics deals with large and complex data sets, and consequently with models containing a large number of parameters. This book presents a detailed account of recently developed approaches, including the Lasso and versions of it for various models, boosting methods, undirected graphical modeling, and procedures controlling false positive selections. A special characteristic of the book is that it contains comprehensive mathematical theory on high-dimensional statistics combined with methodology, algorithms and illustrations with real data examples. This in-depth approach highlights the methods’ great potential and practical applicability in a variety of settings. As such, it is a valuable resource for researchers, graduate students and experts in statistics, applied mathematics and computer science.

High-Dimensional Statistics

High-Dimensional Statistics
Author: Martin J. Wainwright
Publisher: Cambridge University Press
Total Pages: 571
Release: 2019-02-21
Genre: Business & Economics
ISBN: 1108498027

A coherent introductory text from a groundbreaking researcher, focusing on clarity and motivation to build intuition and understanding.

Statistical Analysis for High-Dimensional Data

Statistical Analysis for High-Dimensional Data
Author: Arnoldo Frigessi
Publisher: Springer
Total Pages: 313
Release: 2016-02-16
Genre: Mathematics
ISBN: 3319270990

This book features research contributions from The Abel Symposium on Statistical Analysis for High Dimensional Data, held in Nyvågar, Lofoten, Norway, in May 2014. The focus of the symposium was on statistical and machine learning methodologies specifically developed for inference in “big data” situations, with particular reference to genomic applications. The contributors, who are among the most prominent researchers on the theory of statistics for high dimensional inference, present new theories and methods, as well as challenging applications and computational solutions. Specific themes include, among others, variable selection and screening, penalised regression, sparsity, thresholding, low dimensional structures, computational challenges, non-convex situations, learning graphical models, sparse covariance and precision matrices, semi- and non-parametric formulations, multiple testing, classification, factor models, clustering, and preselection. Highlighting cutting-edge research and casting light on future research directions, the contributions will benefit graduate students and researchers in computational biology, statistics and the machine learning community.

Fundamentals of High-Dimensional Statistics

Fundamentals of High-Dimensional Statistics
Author: Johannes Lederer
Publisher: Springer Nature
Total Pages: 355
Release: 2021-11-16
Genre: Mathematics
ISBN: 3030737926

This textbook provides a step-by-step introduction to the tools and principles of high-dimensional statistics. Each chapter is complemented by numerous exercises, many of them with detailed solutions, and computer labs in R that convey valuable practical insights. The book covers the theory and practice of high-dimensional linear regression, graphical models, and inference, ensuring readers have a smooth start in the field. It also offers suggestions for further reading. Given its scope, the textbook is intended for beginning graduate and advanced undergraduate students in statistics, biostatistics, and bioinformatics, though it will be equally useful to a broader audience.

Statistical Inference from High Dimensional Data

Statistical Inference from High Dimensional Data
Author: Carlos Fernandez-Lozano
Publisher: MDPI
Total Pages: 314
Release: 2021-04-28
Genre: Science
ISBN: 3036509445

• Real-world problems can be high-dimensional, complex, and noisy • More data does not imply more information • Different approaches deal with the so-called curse of dimensionality to reduce irrelevant information • A process with multidimensional information is not necessarily easy to interpret nor process • In some real-world applications, the number of elements of a class is clearly lower than the other. The models tend to assume that the importance of the analysis belongs to the majority class and this is not usually the truth • The analysis of complex diseases such as cancer are focused on more-than-one dimensional omic data • The increasing amount of data thanks to the reduction of cost of the high-throughput experiments opens up a new era for integrative data-driven approaches • Entropy-based approaches are of interest to reduce the dimensionality of high-dimensional data

Statistical Inference Via Convex Optimization

Statistical Inference Via Convex Optimization
Author: Anatoli Juditsky
Publisher: Princeton University Press
Total Pages: 655
Release: 2020-04-07
Genre: Mathematics
ISBN: 0691197296

This authoritative book draws on the latest research to explore the interplay of high-dimensional statistics with optimization. Through an accessible analysis of fundamental problems of hypothesis testing and signal recovery, Anatoli Juditsky and Arkadi Nemirovski show how convex optimization theory can be used to devise and analyze near-optimal statistical inferences. Statistical Inference via Convex Optimization is an essential resource for optimization specialists who are new to statistics and its applications, and for data scientists who want to improve their optimization methods. Juditsky and Nemirovski provide the first systematic treatment of the statistical techniques that have arisen from advances in the theory of optimization. They focus on four well-known statistical problems—sparse recovery, hypothesis testing, and recovery from indirect observations of both signals and functions of signals—demonstrating how they can be solved more efficiently as convex optimization problems. The emphasis throughout is on achieving the best possible statistical performance. The construction of inference routines and the quantification of their statistical performance are given by efficient computation rather than by analytical derivation typical of more conventional statistical approaches. In addition to being computation-friendly, the methods described in this book enable practitioners to handle numerous situations too difficult for closed analytical form analysis, such as composite hypothesis testing and signal recovery in inverse problems. Statistical Inference via Convex Optimization features exercises with solutions along with extensive appendixes, making it ideal for use as a graduate text.

High-Dimensional Probability

High-Dimensional Probability
Author: Roman Vershynin
Publisher: Cambridge University Press
Total Pages: 299
Release: 2018-09-27
Genre: Business & Economics
ISBN: 1108415199

An integrated package of powerful probabilistic tools and key applications in modern mathematical data science.

Computer Age Statistical Inference, Student Edition

Computer Age Statistical Inference, Student Edition
Author: Bradley Efron
Publisher: Cambridge University Press
Total Pages: 514
Release: 2021-06-17
Genre: Mathematics
ISBN: 1108915876

The twenty-first century has seen a breathtaking expansion of statistical methodology, both in scope and influence. 'Data science' and 'machine learning' have become familiar terms in the news, as statistical methods are brought to bear upon the enormous data sets of modern science and commerce. How did we get here? And where are we going? How does it all fit together? Now in paperback and fortified with exercises, this book delivers a concentrated course in modern statistical thinking. Beginning with classical inferential theories - Bayesian, frequentist, Fisherian - individual chapters take up a series of influential topics: survival analysis, logistic regression, empirical Bayes, the jackknife and bootstrap, random forests, neural networks, Markov Chain Monte Carlo, inference after model selection, and dozens more. The distinctly modern approach integrates methodology and algorithms with statistical inference. Each chapter ends with class-tested exercises, and the book concludes with speculation on the future direction of statistics and data science.

Statistical Foundations of Data Science

Statistical Foundations of Data Science
Author: Jianqing Fan
Publisher: CRC Press
Total Pages: 942
Release: 2020-09-21
Genre: Mathematics
ISBN: 0429527616

Statistical Foundations of Data Science gives a thorough introduction to commonly used statistical models, contemporary statistical machine learning techniques and algorithms, along with their mathematical insights and statistical theories. It aims to serve as a graduate-level textbook and a research monograph on high-dimensional statistics, sparsity and covariance learning, machine learning, and statistical inference. It includes ample exercises that involve both theoretical studies as well as empirical applications. The book begins with an introduction to the stylized features of big data and their impacts on statistical analysis. It then introduces multiple linear regression and expands the techniques of model building via nonparametric regression and kernel tricks. It provides a comprehensive account on sparsity explorations and model selections for multiple regression, generalized linear models, quantile regression, robust regression, hazards regression, among others. High-dimensional inference is also thoroughly addressed and so is feature screening. The book also provides a comprehensive account on high-dimensional covariance estimation, learning latent factors and hidden structures, as well as their applications to statistical estimation, inference, prediction and machine learning problems. It also introduces thoroughly statistical machine learning theory and methods for classification, clustering, and prediction. These include CART, random forests, boosting, support vector machines, clustering algorithms, sparse PCA, and deep learning.

Analysis of Multivariate and High-Dimensional Data

Analysis of Multivariate and High-Dimensional Data
Author: Inge Koch
Publisher: Cambridge University Press
Total Pages: 531
Release: 2014
Genre: Business & Economics
ISBN: 0521887933

This modern approach integrates classical and contemporary methods, fusing theory and practice and bridging the gap to statistical learning.