The Top Ten Algorithms in Data Mining

Author: Xindong Wu
Publisher: CRC Press
Total Pages: 230
Release: 2009-04-09
Genre: Business & Economics
ISBN: 142008965X

Identifying some of the most influential algorithms that are widely used in the data mining community, The Top Ten Algorithms in Data Mining provides a description of each algorithm, discusses its impact, and reviews current and future research. Thoroughly evaluated by independent reviewers, each chapter focuses on a particular algorithm and is written…

Data Mining and Analysis

Author: Mohammed J. Zaki
Publisher: Cambridge University Press
Total Pages: 607
Release: 2014-05-12
Genre: Computers
ISBN: 0521766338

A comprehensive overview of data mining from an algorithmic perspective, integrating related concepts from machine learning and statistics.

Mining of Massive Datasets

Author: Jure Leskovec
Publisher: Cambridge University Press
Total Pages: 480
Release: 2014-11-13
Genre: Computers
ISBN: 1107077230

Now in its second edition, this book focuses on practical algorithms for mining data from even the largest datasets.

Contrast Data Mining

Author: Guozhu Dong
Publisher: CRC Press
Total Pages: 428
Release: 2016-04-19
Genre: Business & Economics
ISBN: 1439854335

A Fruitful Field for Researching Data Mining Methodology and for Solving Real-Life Problems. Contrast Data Mining: Concepts, Algorithms, and Applications collects recent results from this specialized area of data mining that have previously been scattered in the literature, making them more accessible to researchers and developers in data mining and…

Data Mining and Machine Learning

Author: Mohammed J. Zaki
Publisher: Cambridge University Press
Total Pages: 779
Release: 2020-01-30
Genre: Business & Economics
ISBN: 1108473989

New to the second edition of this advanced text are several chapters on regression, including neural networks and deep learning.

Algorithms from THE BOOK

Author: Kenneth Lange
Publisher: SIAM
Total Pages: 227
Release: 2020-05-04
Genre: Mathematics
ISBN: 1611976170

Algorithms are a dominant force in modern culture, and every indication is that they will become more pervasive, not less. The best algorithms are undergirded by beautiful mathematics. This text cuts across discipline boundaries to highlight some of the most famous and successful algorithms. Readers are exposed to the principles behind these examples and guided in assembling complex algorithms from simpler building blocks. Written in clear, instructive language within the constraints of mathematical rigor, Algorithms from THE BOOK includes a large number of classroom-tested exercises at the end of each chapter. The appendices cover background material often omitted from undergraduate courses. Most of the algorithm descriptions are accompanied by Julia code, an ideal language for scientific computing. This code is immediately available for experimentation. Algorithms from THE BOOK is aimed at first-year graduate and advanced undergraduate students. It will also serve as a convenient reference for professionals throughout the mathematical sciences, physical sciences, engineering, and the quantitative sectors of the biological and social sciences.

Principles of Data Mining

Author: David J. Hand
Publisher: MIT Press
Total Pages: 594
Release: 2001-08-17
Genre: Computers
ISBN: 9780262082907

The growing interest in data mining is motivated by a common problem across disciplines: how does one store, access, model, and ultimately describe and understand very large data sets? Historically, different aspects of data mining have been addressed independently by different disciplines. This is the first truly interdisciplinary text on data mining, blending the contributions of information science, computer science, and statistics. The book consists of three sections. The first, foundations, provides a tutorial overview of the principles underlying data mining algorithms and their application. The presentation emphasizes intuition rather than rigor. The second section, data mining algorithms, shows how algorithms are constructed to solve specific problems in a principled manner. The algorithms covered include trees and rules for classification and regression, association rules, belief networks, classical statistical models, nonlinear models such as neural networks, and local "memory-based" models. The third section shows how all of the preceding analysis fits together when applied to real-world data mining problems. Topics include the role of metadata, how to handle missing data, and data preprocessing.

Matrix Methods in Data Mining and Pattern Recognition

Author: Lars Eldén
Publisher: SIAM
Total Pages: 226
Release: 2007-07-12
Genre: Computers
ISBN: 0898716268

Several very powerful numerical linear algebra techniques are available for solving problems in data mining and pattern recognition. This application-oriented book describes how modern matrix methods can be used to solve these problems, gives an introduction to matrix theory and decompositions, and provides students with a set of tools that can be modified for a particular application.

Matrix Methods in Data Mining and Pattern Recognition is divided into three parts. Part I gives a short introduction to a few application areas before presenting linear algebra concepts and matrix decompositions that students can use in problem-solving environments such as MATLAB®. Some mathematical proofs that emphasize the existence and properties of the matrix decompositions are included. In Part II, linear algebra techniques are applied to data mining problems. Part III is a brief introduction to eigenvalue and singular value algorithms. The applications discussed by the author are: classification of handwritten digits, text mining, text summarization, PageRank computations related to the Google search engine, and face recognition. Exercises and computer assignments are available on a Web page that supplements the book.

Audience: The book is intended for undergraduate students who have previously taken an introductory scientific computing/numerical analysis course. Graduate students in various data mining and pattern recognition areas who need an introduction to linear algebra techniques will also find the book useful.

Contents: Preface; Part I: Linear Algebra Concepts and Matrix Decompositions. Chapter 1: Vectors and Matrices in Data Mining and Pattern Recognition; Chapter 2: Vectors and Matrices; Chapter 3: Linear Systems and Least Squares; Chapter 4: Orthogonality; Chapter 5: QR Decomposition; Chapter 6: Singular Value Decomposition; Chapter 7: Reduced-Rank Least Squares Models; Chapter 8: Tensor Decomposition; Chapter 9: Clustering and Nonnegative Matrix Factorization; Part II: Data Mining Applications. Chapter 10: Classification of Handwritten Digits; Chapter 11: Text Mining; Chapter 12: Page Ranking for a Web Search Engine; Chapter 13: Automatic Key Word and Key Sentence Extraction; Chapter 14: Face Recognition Using Tensor SVD; Part III: Computing the Matrix Decompositions. Chapter 15: Computing Eigenvalues and Singular Values; Bibliography; Index.

Principles of Data Mining

Author: Max Bramer
Publisher: Springer
Total Pages: 530
Release: 2016-11-09
Genre: Computers
ISBN: 1447173074

This book explains and explores the principal techniques of Data Mining, the automatic extraction of implicit and potentially useful information from data, which is increasingly used in commercial, scientific and other application areas. It focuses on classification, association rule mining and clustering. Each topic is clearly explained, with a focus on algorithms rather than mathematical formalism, and is illustrated by detailed worked examples. The book is written for readers without a strong background in mathematics or statistics, and any formulae used are explained in detail. It can be used as a textbook to support courses at undergraduate or postgraduate levels in a wide range of subjects including Computer Science, Business Studies, Marketing, Artificial Intelligence, Bioinformatics and Forensic Science. As an aid to self-study, this book aims to help general readers develop the necessary understanding of what is inside the 'black box' so they can use commercial data mining packages discriminatingly, as well as enabling advanced readers or academic researchers to understand or contribute to future technical advances in the field. Each chapter has practical exercises to enable readers to check their progress. A full glossary of technical terms used is included. This expanded third edition includes detailed descriptions of algorithms for classifying streaming data, both stationary data, where the underlying model is fixed, and time-dependent data, where the underlying model changes from time to time, a phenomenon known as concept drift.

Pattern Recognition Algorithms for Data Mining

Author: Sankar K. Pal
Publisher: CRC Press
Total Pages: 275
Release: 2004-05-27
Genre: Computers
ISBN: 1135436401

Pattern Recognition Algorithms for Data Mining addresses different pattern recognition (PR) tasks in a unified framework with both theoretical and experimental results. Tasks covered include data condensation, feature selection, case generation, clustering/classification, and rule generation and evaluation. This volume presents various theories, methodologies, and algorithms, using both classical approaches and hybrid paradigms. The authors emphasize large datasets with overlapping, intractable, or nonlinear boundary classes, and datasets that demonstrate granular computing in soft frameworks. Organized into eight chapters, the book begins with an introduction to PR, data mining, and knowledge discovery concepts. The authors analyze the tasks of multi-scale data condensation and dimensionality reduction, then explore the problem of learning with support vector machine (SVM). They conclude by highlighting the significance of granular computing for different mining tasks in a soft paradigm.