Multimodal Signal Processing: Theory and applications for human-computer interaction (Hardcover)

Jean-Philippe Thiran, Ferran Marqu臃, Herv Bourlard

商品描述

  • Presents state-of-art methods for multimodal signal processing, analysis, and modeling
  • Contains numerous examples of systems with different modalities combined
  • Describes advanced applications in multimodal Human-Computer Interaction (HCI) as well as in computer-based analysis and modelling of multimodal human-human communication scenes.

Multimodal signal processing is an important research and development field that processes signals and combines information from a variety of modalities - speech, vision, language, text - which significantly enhance the understanding, modelling, and performance of human-computer interaction devices or systems enhancing human-human communication. The overarching theme of this book is the application of signal processing and statistical machine learning techniques to problems arising in this multi-disciplinary field. It describes the capabilities and limitations of current technologies, and discusses the technical challenges that must be overcome to develop efficient and user-friendly multimodal interactive systems.

With contributions from the leading experts in the field, the present book should serve as a reference in multimodal signal processing for signal processing researchers, graduate students, R&D engineers, and computer engineers who are interested in this emerging field.




  • Presents state-of-art methods for multimodal signal processing, analysis, and modeling

  • Contains numerous examples of systems with different modalities combined

  • Describes advanced applications in multimodal Human-Computer Interaction (HCI) as well as in computer-based analysis and modelling of multimodal human-human communication scenes.

商品描述(中文翻譯)

本書提供了多模態信號處理、分析和建模的最新方法。書中包含了許多結合不同模態的系統示例。同時,本書還描述了多模態人機交互(HCI)以及基於計算機的多模態人與人之間通信場景的高級應用。

多模態信號處理是一個重要的研究和開發領域,它處理信號並結合來自多種模態(語音、視覺、語言、文本)的信息,從而顯著提高了人機交互設備或系統的理解、建模和性能,增強了人與人之間的交流。本書的主題是將信號處理和統計機器學習技術應用於這個多學科領域中出現的問題。它描述了當前技術的能力和限制,並討論了開發高效且用戶友好的多模態交互系統所必須克服的技術挑戰。

本書由該領域的領先專家貢獻,將成為多模態信號處理的參考資料,適用於信號處理研究人員、研究生、研發工程師和計算機工程師等對這一新興領域感興趣的讀者。