Mining Very Large Databases with Parallel Processing

Mining Very Large Databases with Parallel Processing
Author: Alex A. Freitas
Publisher: Springer Science & Business Media
Total Pages: 226
Release: 1997-11-30
Genre: Computers
ISBN: 0792380487

Mining Very Large Databases with Parallel Processing addresses the problem of large-scale data mining. It is an interdisciplinary text, describing advances in the integration of three computer science areas, namely `intelligent' (machine learning-based) data mining techniques, relational databases and parallel processing. The basic idea is to use concepts and techniques of the latter two areas - particularly parallel processing - to speed up and scale up data mining algorithms. The book is divided into three parts. The first part presents a comprehensive review of intelligent data mining techniques such as rule induction, instance-based learning, neural networks and genetic algorithms. Likewise, the second part presents a comprehensive review of parallel processing and parallel databases. Each of these parts includes an overview of commercially-available, state-of-the-art tools. The third part deals with the application of parallel processing to data mining. The emphasis is on finding generic, cost-effective solutions for realistic data volumes. Two parallel computational environments are discussed, the first excluding the use of commercial-strength DBMS, and the second using parallel DBMS servers. It is assumed that the reader has a knowledge roughly equivalent to a first degree (BSc) in accurate sciences, so that (s)he is reasonably familiar with basic concepts of statistics and computer science. The primary audience for Mining Very Large Databases with Parallel Processing is industry data miners and practitioners in general, who would like to apply intelligent data mining techniques to large amounts of data. The book will also be of interest to academic researchers and postgraduate students, particularly database researchers, interested in advanced, intelligent database applications, and artificial intelligence researchers interested in industrial, real-world applications of machine learning.

Mining Very Large Databases with Parallel Processing

Mining Very Large Databases with Parallel Processing
Author: Alex A. Freitas
Publisher: Springer Science & Business Media
Total Pages: 211
Release: 2012-12-06
Genre: Computers
ISBN: 1461555213

Mining Very Large Databases with Parallel Processing addresses the problem of large-scale data mining. It is an interdisciplinary text, describing advances in the integration of three computer science areas, namely `intelligent' (machine learning-based) data mining techniques, relational databases and parallel processing. The basic idea is to use concepts and techniques of the latter two areas - particularly parallel processing - to speed up and scale up data mining algorithms. The book is divided into three parts. The first part presents a comprehensive review of intelligent data mining techniques such as rule induction, instance-based learning, neural networks and genetic algorithms. Likewise, the second part presents a comprehensive review of parallel processing and parallel databases. Each of these parts includes an overview of commercially-available, state-of-the-art tools. The third part deals with the application of parallel processing to data mining. The emphasis is on finding generic, cost-effective solutions for realistic data volumes. Two parallel computational environments are discussed, the first excluding the use of commercial-strength DBMS, and the second using parallel DBMS servers. It is assumed that the reader has a knowledge roughly equivalent to a first degree (BSc) in accurate sciences, so that (s)he is reasonably familiar with basic concepts of statistics and computer science. The primary audience for Mining Very Large Databases with Parallel Processing is industry data miners and practitioners in general, who would like to apply intelligent data mining techniques to large amounts of data. The book will also be of interest to academic researchers and postgraduate students, particularly database researchers, interested in advanced, intelligent database applications, and artificial intelligence researchers interested in industrial, real-world applications of machine learning.

Advances in Spatial and Temporal Databases

Advances in Spatial and Temporal Databases
Author: Christian S. Jensen
Publisher: Springer Science & Business Media
Total Pages: 532
Release: 2001-07-02
Genre: Business & Economics
ISBN: 354042301X

This book constitutes the refereed proceedings of the 7th International Conference on Spatial and Temporal Databases, SSTD 2001, held in Redondo Beach, CA, USA, in July 2001. The 25 revised full papers and two industrial papers presented were carefully reviewed and selected from a total of 70 submissions. The book offers topical sections on modeling and querying, moving-object query processing, query processing: architectures and cost estimation, processing advanced queries, formal aspects, data representation, industrial session, data warehousing and mining, and indexing.

Encyclopedia of Database Technologies and Applications

Encyclopedia of Database Technologies and Applications
Author: Rivero, Laura C.
Publisher: IGI Global
Total Pages: 784
Release: 2005-06-30
Genre: Education
ISBN: 1591407958

"Addresses the evolution of database management, technologies and applications along with the progress and endeavors of new research areas."--P. xiii.

Effective Databases for Text & Document Management

Effective Databases for Text & Document Management
Author: Shirley A. Becker
Publisher: IGI Global
Total Pages: 390
Release: 2003-01-01
Genre: Computers
ISBN: 9781931777476

"Focused on the latest research on text and document management, this guide addresses the information management needs of organizations by providing the most recent findings. How the need for effective databases to house information is impacting organizations worldwide and how some organizations that possess a vast amount of data are not able to use the data in an economic and efficient manner is demonstrated. A taxonomy for object-oriented databases, metrics for controlling database complexity, and a guide to accommodating hierarchies in relational databases are provided. Also covered is how to apply Java-triggers for X-Link management and how to build signatures."

Advances in Database Technology - EDBT 2004

Advances in Database Technology - EDBT 2004
Author: Elisa Bertino
Publisher: Springer
Total Pages: 895
Release: 2004-02-12
Genre: Computers
ISBN: 3540247416

The 9th International Conference on Extending Database Technology, EDBT 2004, was held in Heraklion, Crete, Greece, during March 14–18, 2004. The EDBT series of conferences is an established and prestigious forum for the exchange of the latest research results in data management. Held every two years in an attractive European location, the conference provides unique opp- tunities for database researchers, practitioners, developers, and users to explore new ideas, techniques, and tools, and to exchange experiences. The previous events were held in Venice, Vienna, Cambridge, Avignon, Valencia, Konstanz, and Prague. EDBT 2004 had the theme “new challenges for database technology,” with the goal of encouraging researchers to take a greater interest in the current exciting technological and application advancements and to devise and address new research and development directions for database technology. From its early days, database technology has been challenged and advanced by new uses and applications, and it continues to evolve along with application requirements and hardware advances. Today’s DBMS technology faces yet several new challenges. Technological trends and new computation paradigms, and applications such as pervasive and ubiquitous computing, grid computing, bioinformatics, trust management, virtual communities, and digital asset management, to name just a few, require database technology to be deployed in a variety of environments and for a number of di?erent purposes. Such an extensive deployment will also require trustworthy, resilient database systems, as well as easy-to-manage and ?exible ones, to which we can entrust our data in whatever form they are.