Proceedings of the 9th International Conference on Computer Recognition Systems CORES 2015

Proceedings of the 9th International Conference on Computer Recognition Systems CORES 2015
Author: Robert Burduk
Publisher: Springer
Total Pages: 827
Release: 2016-03-05
Genre: Computers
ISBN: 3319262270

The computer recognition systems are nowadays one of the most promising directions in artificial intelligence. This book is the most comprehensive study of this field. It contains a collection of 79 carefully selected articles contributed by experts of pattern recognition. It reports on current research with respect to both methodology and applications. In particular, it includes the following sections: Features, learning, and classifiers Biometrics Data Stream Classification and Big Data Analytics Image processing and computer vision Medical applications Applications RGB-D perception: recent developments and applications This book is a great reference tool for scientists who deal with the problems of designing computer pattern recognition systems. Its target readers can be the as well researchers as students of computer science, artificial intelligence or robotics.

Academic Press Library in Signal Processing

Academic Press Library in Signal Processing
Author:
Publisher: Academic Press
Total Pages: 1131
Release: 2013-09-14
Genre: Technology & Engineering
ISBN: 0123972256

This fourth volume, edited and authored by world leading experts, gives a review of the principles, methods and techniques of important and emerging research topics and technologies in Image, Video Processing and Analysis, Hardware, Audio, Acoustic and Speech Processing. With this reference source you will: - Quickly grasp a new area of research - Understand the underlying principles of a topic and its application - Ascertain how a topic relates to other areas and learn of the research issues yet to be resolved - Quick tutorial reviews of important and emerging topics of research in Image, Video Processing and Analysis, Hardware, Audio, Acoustic and Speech Processing - Presents core principles and shows their application - Reference content on core principles, technologies, algorithms and applications - Comprehensive references to journal articles and other literature on which to build further, more specific and detailed knowledge - Edited by leading people in the field who, through their reputation, have been able to commission experts to write on a particular topic

Audio Source Separation and Speech Enhancement

Audio Source Separation and Speech Enhancement
Author: Emmanuel Vincent
Publisher: John Wiley & Sons
Total Pages: 517
Release: 2018-10-22
Genre: Technology & Engineering
ISBN: 1119279895

Learn the technology behind hearing aids, Siri, and Echo Audio source separation and speech enhancement aim to extract one or more source signals of interest from an audio recording involving several sound sources. These technologies are among the most studied in audio signal processing today and bear a critical role in the success of hearing aids, hands-free phones, voice command and other noise-robust audio analysis systems, and music post-production software. Research on this topic has followed three convergent paths, starting with sensor array processing, computational auditory scene analysis, and machine learning based approaches such as independent component analysis, respectively. This book is the first one to provide a comprehensive overview by presenting the common foundations and the differences between these techniques in a unified setting. Key features: Consolidated perspective on audio source separation and speech enhancement. Both historical perspective and latest advances in the field, e.g. deep neural networks. Diverse disciplines: array processing, machine learning, and statistical signal processing. Covers the most important techniques for both single-channel and multichannel processing. This book provides both introductory and advanced material suitable for people with basic knowledge of signal processing and machine learning. Thanks to its comprehensiveness, it will help students select a promising research track, researchers leverage the acquired cross-domain knowledge to design improved techniques, and engineers and developers choose the right technology for their target application scenario. It will also be useful for practitioners from other fields (e.g., acoustics, multimedia, phonetics, and musicology) willing to exploit audio source separation or speech enhancement as pre-processing tools for their own needs.

Encyclopedia of Information Science and Technology, Third Edition

Encyclopedia of Information Science and Technology, Third Edition
Author: Khosrow-Pour, Mehdi
Publisher: IGI Global
Total Pages: 7972
Release: 2014-07-31
Genre: Computers
ISBN: 1466658894

"This 10-volume compilation of authoritative, research-based articles contributed by thousands of researchers and experts from all over the world emphasized modern issues and the presentation of potential opportunities, prospective solutions, and future directions in the field of information science and technology"--Provided by publisher.

Modal Array Signal Processing: Principles and Applications of Acoustic Wavefield Decomposition

Modal Array Signal Processing: Principles and Applications of Acoustic Wavefield Decomposition
Author: Heinz Teutsch
Publisher: Springer
Total Pages: 267
Release: 2007-05-10
Genre: Technology & Engineering
ISBN: 3540408967

This book deals with the problem of detecting and localizing multiple simultaneously active wideband acoustic sources by applying the notion of wavefield decomposition using circular and spherical microphone arrays. A rigorous derivation of modal array signal processing algorithms for unambiguous source detection and localization, as well as performance evaluations by means of measurements using an actual real-time capable implementation, are discussed.

Computational Phonogram Archiving

Computational Phonogram Archiving
Author: Rolf Bader
Publisher: Springer
Total Pages: 354
Release: 2019-01-25
Genre: Music
ISBN: 3030026957

The future of music archiving and search engines lies in deep learning and big data. Music information retrieval algorithms automatically analyze musical features like timbre, melody, rhythm or musical form, and artificial intelligence then sorts and relates these features. At the first International Symposium on Computational Ethnomusicological Archiving held on November 9 to 11, 2017 at the Institute of Systematic Musicology in Hamburg, Germany, a new Computational Phonogram Archiving standard was discussed as an interdisciplinary approach. Ethnomusicologists, music and computer scientists, systematic musicologists as well as music archivists, composers and musicians presented tools, methods and platforms and shared fieldwork and archiving experiences in the fields of musical acoustics, informatics, music theory as well as on music storage, reproduction and metadata. The Computational Phonogram Archiving standard is also in high demand in the music market as a search engine for music consumers. This book offers a comprehensive overview of the field written by leading researchers around the globe.

Advances in Multimedia Information Processing - PCM 2016

Advances in Multimedia Information Processing - PCM 2016
Author: Enqing Chen
Publisher: Springer
Total Pages: 762
Release: 2016-11-26
Genre: Computers
ISBN: 3319488961

The two-volume proceedings LNCS 9916 and 9917, constitute the proceedings of the 17th Pacific-Rim Conference on Multimedia, PCM 2016, held in Xi`an, China, in September 2016. The total of 128 papers presented in these proceedings was carefully reviewed and selected from 202 submissions. The focus of the conference was as follows in multimedia content analysis, multimedia signal processing and communications, and multimedia applications and services.

Parametric Time-Frequency Domain Spatial Audio

Parametric Time-Frequency Domain Spatial Audio
Author: Ville Pulkki
Publisher: John Wiley & Sons
Total Pages: 412
Release: 2017-10-04
Genre: Technology & Engineering
ISBN: 111925258X

A comprehensive guide that addresses the theory and practice of spatial audio This book provides readers with the principles and best practices in spatial audio signal processing. It describes how sound fields and their perceptual attributes are captured and analyzed within the time-frequency domain, how essential representation parameters are coded, and how such signals are efficiently reproduced for practical applications. The book is split into four parts starting with an overview of the fundamentals. It then goes on to explain the reproduction of spatial sound before offering an examination of signal-dependent spatial filtering. The book finishes with coverage of both current and future applications and the direction that spatial audio research is heading in. Parametric Time-frequency Domain Spatial Audio focuses on applications in entertainment audio, including music, home cinema, and gaming—covering the capturing and reproduction of spatial sound as well as its generation, transduction, representation, transmission, and perception. This book will teach readers the tools needed for such processing, and provides an overview to existing research. It also shows recent up-to-date projects and commercial applications built on top of the systems. Provides an in-depth presentation of the principles, past developments, state-of-the-art methods, and future research directions of spatial audio technologies Includes contributions from leading researchers in the field Offers MATLAB codes with selected chapters An advanced book aimed at readers who are capable of digesting mathematical expressions about digital signal processing and sound field analysis, Parametric Time-frequency Domain Spatial Audio is best suited for researchers in academia and in the audio industry.

Intelligent Data analysis and its Applications, Volume II

Intelligent Data analysis and its Applications, Volume II
Author: Jeng-Shyang Pan
Publisher: Springer
Total Pages: 583
Release: 2014-06-05
Genre: Technology & Engineering
ISBN: 3319077732

This volume presents the proceedings of the First Euro-China Conference on Intelligent Data Analysis and Applications (ECC 2014), which was hosted by Shenzhen Graduate School of Harbin Institute of Technology and was held in Shenzhen City on June 13-15, 2014. ECC 2014 was technically co-sponsored by Shenzhen Municipal People’s Government, IEEE Signal Processing Society, Machine Intelligence Research Labs, VSB-Technical University of Ostrava (Czech Republic), National Kaohsiung University of Applied Sciences (Taiwan), and Secure E-commerce Transactions (Shenzhen) Engineering Laboratory of Shenzhen Institute of Standards and Technology.