Mathematical Modeling and Signal Processing in Speech and Hearing Sciences

Author: Jack Xin
Publisher: Springer Science & Business Media
Total Pages: 216
Release: 2014-04-14
Genre: Mathematics
ISBN: 3319030868

The aim of the book is to give an accessible introduction to mathematical models and signal processing methods in speech and hearing sciences for senior undergraduate and beginning graduate students with basic knowledge of linear algebra, differential equations, numerical analysis, and probability. Speech and hearing sciences are fundamental to numerous technological advances of the digital world in the past decade, from music compression in MP3 to digital hearing aids, from network-based voice-enabled services to speech interaction with mobile phones. Mathematics and computation are intimately related to these leaps and bounds. On the other hand, speech and hearing are strongly interdisciplinary areas where dissimilar scientific and engineering publications and approaches often coexist and make it difficult for newcomers to enter.

Modeling the Heart and the Circulatory System

Author: Alfio Quarteroni
Publisher: Springer
Total Pages: 248
Release: 2015-04-24
Genre: Mathematics
ISBN: 3319052306

The book comprises contributions by some of the most respected scientists in the field of mathematical modeling and numerical simulation of the human cardiocirculatory system. The contributions cover a wide range of topics, from the preprocessing of clinical data to the development of mathematical equations, their numerical solution, and both in-vivo and in-vitro validation. They discuss the flow in the systemic arterial tree and the complex electro-fluid-mechanical coupling in the human heart. Many examples of patient-specific simulations are presented. This book is addressed to all scientists interested in the mathematical modeling and numerical simulation of the human cardiocirculatory system.

Dynamic Speech Models

Author: Li Deng
Publisher: Springer Nature
Total Pages: 105
Release: 2022-05-31
Genre: Technology & Engineering
ISBN: 3031025555

Speech dynamics refer to the temporal characteristics in all stages of the human speech communication process. This speech “chain” starts with the formation of a linguistic message in a speaker's brain and ends with the arrival of the message in a listener's brain. Given the intricacy of the dynamic speech process and its fundamental importance in human communication, this monograph is intended to provide comprehensive material on mathematical models of speech dynamics and to address the following issues: How do we make sense of the complex speech process in terms of its functional role in speech communication? How do we quantify the special role of speech timing? How do the dynamics relate to the variability of speech that has often been said to seriously hamper automatic speech recognition? How do we put the dynamic process of speech into a quantitative form to enable detailed analyses? And finally, how can we incorporate the knowledge of speech dynamics into computerized speech analysis and recognition algorithms? The answers to all these questions require building and applying computational models for the dynamic speech process. What are the compelling reasons for carrying out dynamic speech modeling? We provide the answer in two related aspects. First, scientific inquiry into the human speech code has been relentlessly pursued for several decades. As an essential carrier of human intelligence and knowledge, speech is the most natural form of human communication. Embedded in the speech code are linguistic (as well as para-linguistic) messages, which are conveyed through four levels of the speech chain. Underlying the robust encoding and transmission of the linguistic messages are the speech dynamics at all the four levels. Mathematical modeling of speech dynamics provides an effective tool in the scientific methods of studying the speech chain.
Such scientific studies help us understand why humans speak as they do and how humans exploit redundancy and variability by way of multitiered dynamic processes to enhance the efficiency and effectiveness of human speech communication. Second, advancement of human language technology, especially in automatic recognition of natural-style human speech, is also expected to benefit from comprehensive computational modeling of speech dynamics. The limitations of current speech recognition technology are serious and well known. A commonly acknowledged and frequently discussed weakness of the statistical model underlying current speech recognition technology is the lack of adequate dynamic modeling schemes to provide correlation structure across the temporal speech observation sequence. Unfortunately, for a variety of reasons, the majority of current research activities in this area favor only incremental modifications and improvements to the existing HMM-based state of the art. For example, while dynamic and correlation modeling is known to be an important topic, most systems nevertheless employ only an ultra-weak form of speech dynamics, e.g., differential or delta parameters. Strong-form dynamic speech modeling, which is the focus of this monograph, may serve as an ultimate solution to this problem. After the introductory chapter, the main body of this monograph consists of four chapters. They cover various aspects of theory, algorithms, and applications of dynamic speech models, and provide a comprehensive survey of the research work in this area spanning the past 20 years. This monograph is intended as advanced material on speech and signal processing for graduate-level teaching, for professionals and engineering practitioners, as well as for seasoned researchers and engineers specialized in speech processing.

Multiscale Modeling of Pedestrian Dynamics

Author: Emiliano Cristiani
Publisher: Springer
Total Pages: 271
Release: 2014-09-12
Genre: Mathematics
ISBN: 331906620X

This book presents mathematical models and numerical simulations of crowd dynamics. The core topic is the development of a new multiscale paradigm, which bridges the microscopic and macroscopic scales, taking the most from each of them to capture the relevant clues to the complexity of crowds. The underlying idea is that most of the complex trends exhibited by crowds are due to an intrinsic interplay between individual and collective behaviors. The modeling approach promoted in this book actively pursues this intuition and draws on it to design general mathematical structures that can also be applied in fields other than the one that inspired them. The book also considers the two most traditional points of view: the microscopic one, in which pedestrians are tracked individually, and the macroscopic one, in which pedestrians are treated as a continuum. Selected existing models are critically analyzed. The work is addressed to researchers and graduate students.

The Mimetic Finite Difference Method for Elliptic Problems

Author: Lourenco Beirao da Veiga
Publisher: Springer
Total Pages: 399
Release: 2014-05-22
Genre: Mathematics
ISBN: 3319026631

This book describes the theoretical and computational aspects of the mimetic finite difference method for a wide class of multidimensional elliptic problems, which includes diffusion, advection-diffusion, Stokes, elasticity, magnetostatics and plate bending problems. The modern mimetic discretization technology developed in part by the authors allows one to solve these equations on unstructured polygonal, polyhedral and generalized polyhedral meshes. The book provides a practical guide for scientists and engineers who are interested in the computational properties of the mimetic finite difference method, such as accuracy, stability, robustness, and efficiency. Many examples are provided to help the reader understand and implement this method. This monograph also provides the essential background material and describes the basic mathematical tools required to further develop the mimetic discretization technology and to extend it to various applications.

Mathematical Foundations of Speech and Language Processing

Author: Mark Johnson
Publisher: Springer Science & Business Media
Total Pages: 292
Release: 2012-12-06
Genre: Technology & Engineering
ISBN: 1441990178

Speech and language technologies continue to grow in importance as they are used to create natural and efficient interfaces between people and machines, and to automatically transcribe, extract, analyze, and route information from high-volume streams of spoken and written information. The workshops on Mathematical Foundations of Speech Processing and Natural Language Modeling were held in the Fall of 2000 at the University of Minnesota's NSF-sponsored Institute for Mathematics and Its Applications, as part of a "Mathematics in Multimedia" year-long program. Each workshop brought together researchers in the respective technologies on the one hand, and mathematicians and statisticians on the other, for an intensive week of cross-fertilization. There is a long history of benefit from introducing mathematical techniques and ideas to speech and language technologies. Examples include the source-channel paradigm, hidden Markov models, decision trees, exponential models and formal language theory. It is likely that new mathematical techniques, or novel applications of existing techniques, will once again prove pivotal for moving the field forward. This volume consists of original contributions presented by participants during the two workshops. Topics include language modeling, prosody, acoustic-phonetic modeling, and statistical methodology.

Mathematical Models for Speech Technology

Author: Stephen Levinson
Publisher: John Wiley & Sons
Total Pages: 286
Release: 2005-03-04
Genre: Technology & Engineering
ISBN: 9780470844076

Mathematical Models of Spoken Language presents the motivations for, intuitions behind, and basic mathematical models of natural spoken language communication. A comprehensive overview is given of all aspects of the problem from the physics of speech production through the hierarchy of linguistic structure and ending with some observations on language and mind. The author comprehensively explores the argument that these modern technologies are actually the most extensive compilations of linguistic knowledge available. Throughout the book, the emphasis is on placing all the material in a mathematically coherent and computationally tractable framework that captures linguistic structure. It presents material that appears nowhere else and gives a unification of formalisms and perspectives used by linguists and engineers. Its unique features include: a coherent nomenclature that emphasizes the deep connections amongst the diverse mathematical models and explores the methods by means of which they capture linguistic structure, in contrast with some of the superficial similarities described in the existing literature; the historical background and origins of the theories and models; the connections to related disciplines, e.g. artificial intelligence, automata theory and information theory; an elucidation of the current debates and their intellectual origins; many important little-known results and some original proofs of fundamental results, e.g. a geometric interpretation of parameter estimation techniques for stochastic models; and finally the author's own unique perspectives on the future of this discipline. There is a vast literature on speech recognition and synthesis; however, this book is unlike any other in the field. Although it appears to be a rapidly advancing field, the fundamentals have not changed in decades. Most of the results are presented in journals, from which it is difficult to integrate and evaluate all of these recent ideas.
Some of the fundamentals have been collected into textbooks, which give detailed descriptions of the techniques but no motivation or perspective. The linguistic texts are mostly descriptive and pictorial, lacking the mathematical and computational aspects. This book strikes a useful balance by covering a wide range of ideas in a common framework. It provides all the basic algorithms and computational techniques and an analysis and perspective, which allows one to intelligently read the latest literature and understand state-of-the-art techniques as they evolve.

Speech Enhancement

Author: Jacob Benesty
Publisher: Elsevier
Total Pages: 143
Release: 2014-01-04
Genre: Technology & Engineering
ISBN: 0128002530

Speech enhancement is a classical problem in signal processing, yet still largely unsolved. Two of the conventional approaches for solving this problem are linear filtering, like the classical Wiener filter, and subspace methods. These approaches have traditionally been treated as different classes of methods and have been introduced in somewhat different contexts. Linear filtering methods originate in stochastic processes, while subspace methods have largely been based on developments in numerical linear algebra and matrix approximation theory. This book bridges the gap between these two classes of methods by showing how the ideas behind subspace methods can be incorporated into traditional linear filtering. In the context of subspace methods, the enhancement problem can then be seen as a classical linear filter design problem. This means that various solutions can more easily be compared and their performance bounded and assessed in terms of noise reduction and speech distortion. The book shows how various filter designs can be obtained in this framework, including the maximum SNR, Wiener, LCMV, and MVDR filters, and how these can be applied in various contexts, like in single-channel and multichannel speech enhancement, and in both the time and frequency domains.

First short book treating subspace approaches in a unified way for time and frequency domains, single-channel, multichannel, as well as binaural, speech enhancement.
Bridges the gap between optimal filtering methods and subspace approaches.
Includes original presentation of subspace methods from different perspectives.

Signal and Acoustic Modeling for Speech and Communication Disorders

Author: Hemant A. Patil
Publisher: Walter de Gruyter GmbH & Co KG
Total Pages: 286
Release: 2018-12-17
Genre: Technology & Engineering
ISBN: 1501502417

Signal and Acoustic Modeling for Speech and Communication Disorders demonstrates how speech signal processing and acoustic modeling can be instrumental in early detection of, and successful intervention in, speech deficits resulting from Parkinson’s disease, autism spectrum disorder, cleft palate, intellectual disabilities, and neuro-motor impairments. Utilizing some of the most advanced methods in signal and acoustic modeling, this eminent group of contributors shows how such technologies can benefit healthcare and society at large. Paradoxically, what most of us take for granted still remains a Sisyphean battle for those with speech and language disorders, who struggle every day to make themselves heard and understood. The purpose of this book is to stimulate a vibrant discussion among speech scientists, system designers, and practitioners on how best to marshal the latest advances in signal and acoustic modeling to address some of the most challenging speech and communication disorders affecting a wide variety of patient populations across the world.

Speech and Language Engineering

Author: Martin Rajman
Publisher: EPFL Press
Total Pages: 512
Release: 2007-04-20
Genre: Technology & Engineering
ISBN: 9780824722197

Efficient processing of speech and language is required at all levels in the design of human-computer interfaces. From this perspective, the book provides a global understanding of the required theoretical foundations, as well as practical examples of successful applications, in the area of human-language technology. The authors proceed from acoustic signal processing to pragmatics, covering all the important aspects of speech and language processing such as phonetics, morphology, syntax and semantics.