Real-time Speech and Music Classification by Large Audio Feature Space Extraction (Springer Theses)
Florian Eyben
- Publisher: Springer
- Publication date: 2018-03-30
- Price: $6,170
- VIP price: 5% off, $5,862
- Language: English
- Pages: 336
- Binding: Paperback
- ISBN: 3319801112
- ISBN-13: 9783319801117
Overseas special-order book (requires separate checkout)
Product description
This book reports on an outstanding thesis that has significantly advanced the state of the art in the automated analysis and classification of speech and music. It defines several standard acoustic parameter sets and describes their implementation in a novel, open-source audio analysis framework called openSMILE, which has been accepted and intensively used worldwide. The book offers extensive descriptions of key methods for the automatic classification of speech and music signals in real-life conditions and reports on the evaluation of the framework developed and the acoustic parameter sets that were selected. It is intended not only as a manual for openSMILE users, but also, and primarily, as a guide and source of inspiration for students and scientists involved in the design of speech and music analysis methods that can robustly handle real-life conditions.