Audio Source Separation and Speech Enhancement

  • Publisher: Wiley
  • Publication date: 2018-10-22
  • List price: $4,880
  • Sale price: $4,636 (5% off)
  • Language: English
  • Pages: 504
  • Binding: Hardcover
  • ISBN: 1119279895
  • ISBN-13: 9781119279891
  • Related categories: Machine Learning
  • Ships immediately (stock: 1)



Learn the technology behind hearing aids, Siri, and Echo 

Audio source separation and speech enhancement aim to extract one or more source signals of interest from an audio recording involving several sound sources. These technologies are among the most studied in audio signal processing today and play a critical role in the success of hearing aids, hands-free phones, voice command and other noise-robust audio analysis systems, and music post-production software.

Research on this topic has followed three convergent paths, originating in sensor array processing, computational auditory scene analysis, and machine-learning-based approaches such as independent component analysis. This book is the first to provide a comprehensive overview by presenting the common foundations of these techniques and the differences between them in a unified setting.

Key features:

  • Consolidated perspective on audio source separation and speech enhancement.
  • Both historical perspective and latest advances in the field, e.g. deep neural networks.
  • Diverse disciplines: array processing, machine learning, and statistical signal processing.
  • Covers the most important techniques for both single-channel and multichannel processing.

This book provides both introductory and advanced material suitable for readers with basic knowledge of signal processing and machine learning. Thanks to its comprehensiveness, it will help students select a promising research track, help researchers leverage cross-domain knowledge to design improved techniques, and help engineers and developers choose the right technology for their target application scenario. It will also be useful for practitioners from other fields (e.g., acoustics, multimedia, phonetics, and musicology) who wish to use audio source separation or speech enhancement as pre-processing tools for their own needs.





