Nonnegative Matrix Factorization

Nonnegative Matrix Factorization
Author: Nicolas Gillis
Publisher: SIAM
Total Pages: 376
Release: 2020-12-18
Genre: Mathematics
ISBN: 1611976413

Nonnegative matrix factorization (NMF) in its modern form has become a standard tool in the analysis of high-dimensional data sets. This book provides a comprehensive and up-to-date account of the most important aspects of the NMF problem and is the first to detail its theoretical aspects, including geometric interpretation, nonnegative rank, complexity, and uniqueness. It explains why understanding these theoretical insights is key to using this computational tool effectively and meaningfully. Nonnegative Matrix Factorization is accessible to a wide audience and is ideal for anyone interested in the workings of NMF. It discusses some new results on the nonnegative rank and the identifiability of NMF and makes available MATLAB codes for readers to run the numerical examples presented in the book. Graduate students starting to work on NMF and researchers interested in better understanding the NMF problem and how they can use it will find this book useful. It can be used in advanced undergraduate and graduate-level courses on numerical linear algebra and on advanced topics in numerical linear algebra and requires only a basic knowledge of linear algebra and optimization.

Nonnegative Matrix and Tensor Factorizations

Nonnegative Matrix and Tensor Factorizations
Author: Andrzej Cichocki
Publisher: John Wiley & Sons
Total Pages: 500
Release: 2009-07-10
Genre: Science
ISBN: 9780470747285

This book provides a broad survey of models and efficient algorithms for Nonnegative Matrix Factorization (NMF). This includes NMF’s various extensions and modifications, especially Nonnegative Tensor Factorizations (NTF) and Nonnegative Tucker Decompositions (NTD). NMF/NTF and their extensions are increasingly used as tools in signal and image processing, and data analysis, having garnered interest due to their capability to provide new insights and relevant information about the complex latent relationships in experimental data sets. It is suggested that NMF can provide meaningful components with physical interpretations; for example, in bioinformatics, NMF and its extensions have been successfully applied to gene expression, sequence analysis, the functional characterization of genes, clustering and text mining. As such, the authors focus on the algorithms that are most useful in practice, looking at the fastest, most robust, and suitable for large-scale models. Key features: Acts as a single source reference guide to NMF, collating information that is widely dispersed in current literature, including the authors’ own recently developed techniques in the subject area. Uses generalized cost functions such as Bregman, Alpha and Beta divergences, to present practical implementations of several types of robust algorithms, in particular Multiplicative, Alternating Least Squares, Projected Gradient and Quasi Newton algorithms. Provides a comparative analysis of the different methods in order to identify approximation error and complexity. Includes pseudo codes and optimized MATLAB source codes for almost all algorithms presented in the book. The increasing interest in nonnegative matrix and tensor factorizations, as well as decompositions and sparse representation of data, will ensure that this book is essential reading for engineers, scientists, researchers, industry practitioners and graduate students across signal and image processing; neuroscience; data mining and data analysis; computer science; bioinformatics; speech processing; biomedical engineering; and multimedia.

Machine Learning for Adaptive Many-Core Machines - A Practical Approach

Machine Learning for Adaptive Many-Core Machines - A Practical Approach
Author: Noel Lopes
Publisher: Springer
Total Pages: 251
Release: 2014-06-28
Genre: Technology & Engineering
ISBN: 3319069381

The overwhelming data produced everyday and the increasing performance and cost requirements of applications are transversal to a wide range of activities in society, from science to industry. In particular, the magnitude and complexity of the tasks that Machine Learning (ML) algorithms have to solve are driving the need to devise adaptive many-core machines that scale well with the volume of data, or in other words, can handle Big Data. This book gives a concise view on how to extend the applicability of well-known ML algorithms in Graphics Processing Unit (GPU) with data scalability in mind. It presents a series of new techniques to enhance, scale and distribute data in a Big Learning framework. It is not intended to be a comprehensive survey of the state of the art of the whole field of machine learning for Big Data. Its purpose is less ambitious and more practical: to explain and illustrate existing and novel GPU-based ML algorithms, not viewed as a universal solution for the Big Data challenges but rather as part of the answer, which may require the use of different strategies coupled together.

Non-negative Matrix Factorization Techniques

Non-negative Matrix Factorization Techniques
Author: Ganesh R. Naik
Publisher: Springer
Total Pages: 200
Release: 2015-09-25
Genre: Technology & Engineering
ISBN: 3662483319

This book collects new results, concepts and further developments of NMF. The open problems discussed include, e.g. in bioinformatics: NMF and its extensions applied to gene expression, sequence analysis, the functional characterization of genes, clustering and text mining etc. The research results previously scattered in different scientific journals and conference proceedings are methodically collected and presented in a unified form. While readers can read the book chapters sequentially, each chapter is also self-contained. This book can be a good reference work for researchers and engineers interested in NMF, and can also be used as a handbook for students and professionals seeking to gain a better understanding of the latest applications of NMF.

Independent Component Analysis and Signal Separation

Independent Component Analysis and Signal Separation
Author: Tulay Adali
Publisher: Springer
Total Pages: 803
Release: 2009-03-16
Genre: Computers
ISBN: 3642005993

This book constitutes the refereed proceedings of the 8th International Conference on Independent Component Analysis and Signal Separation, ICA 2009, held in Paraty, Brazil, in March 2009. The 97 revised papers presented were carefully reviewed and selected from 137 submissions. The papers are organized in topical sections on theory, algorithms and architectures, biomedical applications, image processing, speech and audio processing, other applications, as well as a special session on evaluation.

Proceedings of the Fifth SIAM International Conference on Data Mining

Proceedings of the Fifth SIAM International Conference on Data Mining
Author: Hillol Kargupta
Publisher: SIAM
Total Pages: 670
Release: 2005-04-01
Genre: Mathematics
ISBN: 9780898715934

The Fifth SIAM International Conference on Data Mining continues the tradition of providing an open forum for the presentation and discussion of innovative algorithms as well as novel applications of data mining. Advances in information technology and data collection methods have led to the availability of large data sets in commercial enterprises and in a wide variety of scientific and engineering disciplines. The field of data mining draws upon extensive work in areas such as statistics, machine learning, pattern recognition, databases, and high performance computing to discover interesting and previously unknown information in data. This conference results in data mining, including applications, algorithms, software, and systems.

Computational Genomics with R

Computational Genomics with R
Author: Altuna Akalin
Publisher: CRC Press
Total Pages: 463
Release: 2020-12-16
Genre: Mathematics
ISBN: 1498781861

Computational Genomics with R provides a starting point for beginners in genomic data analysis and also guides more advanced practitioners to sophisticated data analysis techniques in genomics. The book covers topics from R programming, to machine learning and statistics, to the latest genomic data analysis techniques. The text provides accessible information and explanations, always with the genomics context in the background. This also contains practical and well-documented examples in R so readers can analyze their data by simply reusing the code presented. As the field of computational genomics is interdisciplinary, it requires different starting points for people with different backgrounds. For example, a biologist might skip sections on basic genome biology and start with R programming, whereas a computer scientist might want to start with genome biology. After reading: You will have the basics of R and be able to dive right into specialized uses of R for computational genomics such as using Bioconductor packages. You will be familiar with statistics, supervised and unsupervised learning techniques that are important in data modeling, and exploratory analysis of high-dimensional data. You will understand genomic intervals and operations on them that are used for tasks such as aligned read counting and genomic feature annotation. You will know the basics of processing and quality checking high-throughput sequencing data. You will be able to do sequence analysis, such as calculating GC content for parts of a genome or finding transcription factor binding sites. You will know about visualization techniques used in genomics, such as heatmaps, meta-gene plots, and genomic track visualization. You will be familiar with analysis of different high-throughput sequencing data sets, such as RNA-seq, ChIP-seq, and BS-seq. You will know basic techniques for integrating and interpreting multi-omics datasets. Altuna Akalin is a group leader and head of the Bioinformatics and Omics Data Science Platform at the Berlin Institute of Medical Systems Biology, Max Delbrück Center, Berlin. He has been developing computational methods for analyzing and integrating large-scale genomics data sets since 2002. He has published an extensive body of work in this area. The framework for this book grew out of the yearly computational genomics courses he has been organizing and teaching since 2015.

Algorithmic Aspects of Machine Learning

Algorithmic Aspects of Machine Learning
Author: Ankur Moitra
Publisher: Cambridge University Press
Total Pages: 161
Release: 2018-09-27
Genre: Computers
ISBN: 1107184584

Introduces cutting-edge research on machine learning theory and practice, providing an accessible, modern algorithmic toolkit.

Partitional Clustering Algorithms

Partitional Clustering Algorithms
Author: M. Emre Celebi
Publisher: Springer
Total Pages: 420
Release: 2014-11-07
Genre: Technology & Engineering
ISBN: 3319092596

This book focuses on partitional clustering algorithms, which are commonly used in engineering and computer scientific applications. The goal of this volume is to summarize the state-of-the-art in partitional clustering. The book includes such topics as center-based clustering, competitive learning clustering and density-based clustering. Each chapter is contributed by a leading expert in the field.

Mathematics of Data Science: A Computational Approach to Clustering and Classification

Mathematics of Data Science: A Computational Approach to Clustering and Classification
Author: Daniela Calvetti
Publisher: SIAM
Total Pages: 199
Release: 2020-11-20
Genre: Mathematics
ISBN: 1611976375

This textbook provides a solid mathematical basis for understanding popular data science algorithms for clustering and classification and shows that an in-depth understanding of the mathematics powering these algorithms gives insight into the underlying data. It presents a step-by-step derivation of these algorithms, outlining their implementation from scratch in a computationally sound way. Mathematics of Data Science: A Computational Approach to Clustering and Classification proposes different ways of visualizing high-dimensional data to unveil hidden internal structures, and nearly every chapter includes graphical explanations and computed examples using publicly available data sets to highlight similarities and differences among the algorithms. This self-contained book is geared toward advanced undergraduate and beginning graduate students in the mathematical sciences, engineering, and computer science and can be used as the main text in a semester course. Researchers in any application area where data science methods are used will also find the book of interest. No advanced mathematical or statistical background is assumed.