Электронная книга: Tuomas Virtanen «Audio Source Separation and Speech Enhancement»

Audio Source Separation and Speech Enhancement

Learn the technology behind hearing aids, Siri, and Echo Audio source separation and speech enhancement aim to extract one or more source signals of interest from an audio recording involving several sound sources. These technologies are among the most studied in audio signal processing today and bear a critical role in the success of hearing aids, hands-free phones, voice command and other noise-robust audio analysis systems, and music post-production software. Research on this topic has followed three convergent paths, starting with sensor array processing, computational auditory scene analysis, and machine learning based approaches such as independent component analysis, respectively. This book is the first one to provide a comprehensive overview by presenting the common foundations and the differences between these techniques in a unified setting. Key features: Consolidated perspective on audio source separation and speech enhancement. Both historical perspective and latest advances in the field, e.g. deep neural networks. Diverse disciplines: array processing, machine learning, and statistical signal processing. Covers the most important techniques for both single-channel and multichannel processing. This book provides both introductory and advanced material suitable for people with basic knowledge of signal processing and machine learning. Thanks to its comprehensiveness, it will help students select a promising research track, researchers leverage the acquired cross-domain knowledge to design improved techniques, and engineers and developers choose the right technology for their target application scenario. It will also be useful for practitioners from other fields (e.g., acoustics, multimedia, phonetics, and musicology) willing to exploit audio source separation or speech enhancement as pre-processing tools for their own needs.

Издательство: "John Wiley&Sons Limited (USD)"

ISBN: 9781119279884

электронная книга

Купить за 9105.53 руб и скачать на Litres

Другие книги автора:

КнигаОписаниеГодЦенаТип книги
Techniques for Noise Robustness in Automatic Speech RecognitionAutomatic speech recognition (ASR) systems are finding increasing use in everyday life. Many of the commonplace environments where the systems are used are noisy, for example users calling up a voice… — John Wiley&Sons Limited, электронная книга Подробнее...11196.4электронная книга

Look at other dictionaries:

  • Business and Industry Review — ▪ 1999 Introduction Overview        Annual Average Rates of Growth of Manufacturing Output, 1980 97, Table Pattern of Output, 1994 97, Table Index Numbers of Production, Employment, and Productivity in Manufacturing Industries, Table (For Annual… …   Universalium

  • education — /ej oo kay sheuhn/, n. 1. the act or process of imparting or acquiring general knowledge, developing the powers of reasoning and judgment, and generally of preparing oneself or others intellectually for mature life. 2. the act or process of… …   Universalium

  • cañada — /keuhn yah deuh, yad euh/, n. Chiefly Western U.S. 1. a dry riverbed. 2. a small, deep canyon. [1840 50; < Sp, equiv. to cañ(a) CANE + ada n. suffix] * * * Canada Introduction Canada Background: A land of vast distances and rich natural resources …   Universalium

  • Canada — /kan euh deuh/, n. a nation in N North America: a member of the Commonwealth of Nations. 29,123,194; 3,690,410 sq. mi. (9,558,160 sq. km). Cap.: Ottawa. * * * Canada Introduction Canada Background: A land of vast distances and rich natural… …   Universalium

  • performing arts — arts or skills that require public performance, as acting, singing, or dancing. [1945 50] * * * ▪ 2009 Introduction Music Classical.       The last vestiges of the Cold War seemed to thaw for a moment on Feb. 26, 2008, when the unfamiliar strains …   Universalium

  • Cocktail party effect — The cocktail party effect describes the ability to focus one s listening attention on a single talker among a mixture of conversations and background noises, ignoring other conversations.[1] The effect enables most people to talk in a noisy place …   Wikipedia

  • Marshall McLuhan — McLuhan redirects here. For the son of Marshall McLuhan, see Eric McLuhan. Marshall McLuhan Marshall McLuhan in the early 1970s Born July 21, 1911(1911 07 21) Edmonton, Alberta, Canada …   Wikipedia

  • Psychoacoustics — is the study of subjective human perception of sounds. Alternatively it can be described as the study of the psychological correlates of the physical parameters of acoustics. Background Hearing is not a purely mechanical phenomenon of wave… …   Wikipedia

  • Tissue engineering — Principle of tissue engineering Tissue engineering was once categorized as a sub field of bio materials, but having grown in scope and importance it can be considered as a field in its own right. It is the use of a combination of cells,… …   Wikipedia

  • Cepstrum — Cepstral redirects here. For the software company based in Pennsylvania, see Cepstral (company). A cepstrum /ˈkɛps …   Wikipedia

  • Microsoft Customer Care Framework — A possible implementation of CCF Agent Desktop Developer(s) Microsoft Stable release 2009 SP 1 / March 31, 2009 …   Wikipedia


We are using cookies for the best presentation of our site. Continuing to use this site, you agree with this.