Nonlinear Dimensionality Reduction

Nonlinear Dimensionality Reduction
Author: John A. Lee
Publisher: Springer Science & Business Media
Total Pages: 316
Release: 2007-10-31
Genre: Mathematics
ISBN: 038739351X

This book describes established and advanced methods for reducing the dimensionality of numerical databases. Each description starts from intuitive ideas, develops the necessary mathematical details, and ends by outlining the algorithmic implementation. The text provides a lucid summary of facts and concepts relating to well-known methods as well as recent developments in nonlinear dimensionality reduction. Methods are all described from a unifying point of view, which helps to highlight their respective strengths and shortcomings. The presentation will appeal to statisticians, computer scientists and data analysts, and other practitioners having a basic background in statistics or computational learning.

Sufficient Dimension Reduction

Sufficient Dimension Reduction
Author: Bing Li
Publisher: CRC Press
Total Pages: 362
Release: 2018-04-27
Genre: Mathematics
ISBN: 1351645730

Sufficient dimension reduction is a rapidly developing research field that has wide applications in regression diagnostics, data visualization, machine learning, genomics, image processing, pattern recognition, and medicine, because they are fields that produce large datasets with a large number of variables. Sufficient Dimension Reduction: Methods and Applications with R introduces the basic theories and the main methodologies, provides practical and easy-to-use algorithms and computer codes to implement these methodologies, and surveys the recent advances at the frontiers of this field. Features Provides comprehensive coverage of this emerging research field. Synthesizes a wide variety of dimension reduction methods under a few unifying principles such as projection in Hilbert spaces, kernel mapping, and von Mises expansion. Reflects most recent advances such as nonlinear sufficient dimension reduction, dimension folding for tensorial data, as well as sufficient dimension reduction for functional data. Includes a set of computer codes written in R that are easily implemented by the readers. Uses real data sets available online to illustrate the usage and power of the described methods. Sufficient dimension reduction has undergone momentous development in recent years, partly due to the increased demands for techniques to process high-dimensional data, a hallmark of our age of Big Data. This book will serve as the perfect entry into the field for the beginning researchers or a handy reference for the advanced ones. The author Bing Li obtained his Ph.D. from the University of Chicago. He is currently a Professor of Statistics at the Pennsylvania State University. His research interests cover sufficient dimension reduction, statistical graphical models, functional data analysis, machine learning, estimating equations and quasilikelihood, and robust statistics. He is a fellow of the Institute of Mathematical Statistics and the American Statistical Association. He is an Associate Editor for The Annals of Statistics and the Journal of the American Statistical Association.

Geometric Structure of High-Dimensional Data and Dimensionality Reduction

Geometric Structure of High-Dimensional Data and Dimensionality Reduction
Author: Jianzhong Wang
Publisher: Springer Science & Business Media
Total Pages: 363
Release: 2012-04-28
Genre: Computers
ISBN: 3642274978

"Geometric Structure of High-Dimensional Data and Dimensionality Reduction" adopts data geometry as a framework to address various methods of dimensionality reduction. In addition to the introduction to well-known linear methods, the book moreover stresses the recently developed nonlinear methods and introduces the applications of dimensionality reduction in many areas, such as face recognition, image segmentation, data classification, data visualization, and hyperspectral imagery data analysis. Numerous tables and graphs are included to illustrate the ideas, effects, and shortcomings of the methods. MATLAB code of all dimensionality reduction algorithms is provided to aid the readers with the implementations on computers. The book will be useful for mathematicians, statisticians, computer scientists, and data analysts. It is also a valuable handbook for other practitioners who have a basic background in mathematics, statistics and/or computer algorithms, like internet search engine designers, physicists, geologists, electronic engineers, and economists. Jianzhong Wang is a Professor of Mathematics at Sam Houston State University, U.S.A.

Dimension Reduction of Large-Scale Systems

Dimension Reduction of Large-Scale Systems
Author: Peter Benner
Publisher: Springer Science & Business Media
Total Pages: 397
Release: 2006-03-30
Genre: Technology & Engineering
ISBN: 3540279091

In the past decades, model reduction has become an ubiquitous tool in analysis and simulation of dynamical systems, control design, circuit simulation, structural dynamics, CFD, and many other disciplines dealing with complex physical models. The aim of this book is to survey some of the most successful model reduction methods in tutorial style articles and to present benchmark problems from several application areas for testing and comparing existing and new algorithms. As the discussed methods have often been developed in parallel in disconnected application areas, the intention of the mini-workshop in Oberwolfach and its proceedings is to make these ideas available to researchers and practitioners from all these different disciplines.

Dimension Reduction

Dimension Reduction
Author: Christopher J. C. Burges
Publisher: Now Publishers Inc
Total Pages: 104
Release: 2010
Genre: Computers
ISBN: 1601983786

We give a tutorial overview of several foundational methods for dimension reduction. We divide the methods into projective methods and methods that model the manifold on which the data lies. For projective methods, we review projection pursuit, principal component analysis (PCA), kernel PCA, probabilistic PCA, canonical correlation analysis (CCA), kernel CCA, Fisher discriminant analysis, oriented PCA, and several techniques for sufficient dimension reduction. For the manifold methods, we review multidimensional scaling (MDS), landmark MDS, Isomap, locally linear embedding, Laplacian eigenmaps, and spectral clustering. Although the review focuses on foundations, we also provide pointers to some more modern techniques. We also describe the correlation dimension as one method for estimating the intrinsic dimension, and we point out that the notion of dimension can be a scale-dependent quantity. The Nystr m method, which links several of the manifold algorithms, is also reviewed. We use a publicly available dataset to illustrate some of the methods. The goal is to provide a self-contained overview of key concepts underlying many of these algorithms, and to give pointers for further reading.

Machine Learning Techniques for Multimedia

Machine Learning Techniques for Multimedia
Author: Matthieu Cord
Publisher: Springer Science & Business Media
Total Pages: 297
Release: 2008-02-07
Genre: Computers
ISBN: 3540751718

Processing multimedia content has emerged as a key area for the application of machine learning techniques, where the objectives are to provide insight into the domain from which the data is drawn, and to organize that data and improve the performance of the processes manipulating it. Arising from the EU MUSCLE network, this multidisciplinary book provides a comprehensive coverage of the most important machine learning techniques used and their application in this domain.

Active Subspaces

Active Subspaces
Author: Paul G. Constantine
Publisher: SIAM
Total Pages: 105
Release: 2015-03-17
Genre: Computers
ISBN: 1611973864

Scientists and engineers use computer simulations to study relationships between a model's input parameters and its outputs. However, thorough parameter studies are challenging, if not impossible, when the simulation is expensive and the model has several inputs. To enable studies in these instances, the engineer may attempt to reduce the dimension of the model's input parameter space. Active subspaces are an emerging set of dimension reduction tools that identify important directions in the parameter space. This book describes techniques for discovering a model's active subspace and proposes methods for exploiting the reduced dimension to enable otherwise infeasible parameter studies. Readers will find new ideas for dimension reduction, easy-to-implement algorithms, and several examples of active subspaces in action.

Modern Dimension Reduction

Modern Dimension Reduction
Author: Philip D. Waggoner
Publisher: Cambridge University Press
Total Pages: 98
Release: 2021-08-05
Genre: Political Science
ISBN: 1108991645

Data are not only ubiquitous in society, but are increasingly complex both in size and dimensionality. Dimension reduction offers researchers and scholars the ability to make such complex, high dimensional data spaces simpler and more manageable. This Element offers readers a suite of modern unsupervised dimension reduction techniques along with hundreds of lines of R code, to efficiently represent the original high dimensional data space in a simplified, lower dimensional subspace. Launching from the earliest dimension reduction technique principal components analysis and using real social science data, I introduce and walk readers through application of the following techniques: locally linear embedding, t-distributed stochastic neighbor embedding (t-SNE), uniform manifold approximation and projection, self-organizing maps, and deep autoencoders. The result is a well-stocked toolbox of unsupervised algorithms for tackling the complexities of high dimensional data so common in modern society. All code is publicly accessible on Github.

Multidimensional Stationary Time Series

Multidimensional Stationary Time Series
Author: Marianna Bolla
Publisher: CRC Press
Total Pages: 318
Release: 2021-04-29
Genre: Mathematics
ISBN: 1000392392

This book gives a brief survey of the theory of multidimensional (multivariate), weakly stationary time series, with emphasis on dimension reduction and prediction. Understanding the covered material requires a certain mathematical maturity, a degree of knowledge in probability theory, linear algebra, and also in real, complex and functional analysis. For this, the cited literature and the Appendix contain all necessary material. The main tools of the book include harmonic analysis, some abstract algebra, and state space methods: linear time-invariant filters, factorization of rational spectral densities, and methods that reduce the rank of the spectral density matrix. Serves to find analogies between classical results (Cramer, Wold, Kolmogorov, Wiener, Kálmán, Rozanov) and up-to-date methods for dimension reduction in multidimensional time series Provides a unified treatment for time and frequency domain inferences by using machinery of complex and harmonic analysis, spectral and Smith--McMillan decompositions. Establishes analogies between the time and frequency domain notions and calculations Discusses the Wold's decomposition and the Kolmogorov's classification together, by distinguishing between different types of singularities. Understanding the remote past helps us to characterize the ideal situation where there is a regular part at present. Examples and constructions are also given Establishes a common outline structure for the state space models, prediction, and innovation algorithms with unified notions and principles, which is applicable to real-life high frequency time series It is an ideal companion for graduate students studying the theory of multivariate time series and researchers working in this field.

Data Analytics in Bioinformatics

Data Analytics in Bioinformatics
Author: Rabinarayan Satpathy
Publisher: John Wiley & Sons
Total Pages: 433
Release: 2021-01-20
Genre: Computers
ISBN: 111978560X

Machine learning techniques are increasingly being used to address problems in computational biology and bioinformatics. Novel machine learning computational techniques to analyze high throughput data in the form of sequences, gene and protein expressions, pathways, and images are becoming vital for understanding diseases and future drug discovery. Machine learning techniques such as Markov models, support vector machines, neural networks, and graphical models have been successful in analyzing life science data because of their capabilities in handling randomness and uncertainty of data noise and in generalization. Machine Learning in Bioinformatics compiles recent approaches in machine learning methods and their applications in addressing contemporary problems in bioinformatics approximating classification and prediction of disease, feature selection, dimensionality reduction, gene selection and classification of microarray data and many more.