Speech Recognition Over Digital Channels: Robustness and Standards
            
暫譯: 數位通道上的語音辨識:穩健性與標準
        
        Antonio Peinado, Jose Segura
- 出版商: Wiley
- 出版日期: 2006-09-11
- 售價: $4,750
- 貴賓價: 9.5 折 $4,513
- 語言: 英文
- 頁數: 274
- 裝訂: Hardcover
- ISBN: 0470024003
- ISBN-13: 9780470024003
- 
    相關分類:
    
      語音辨識 Speech-recognition
 
海外代購書籍(需單獨結帳)
相關主題
商品描述
Description
Automatic speech recognition (ASR) is a very attractive means for human-machine interaction. The degree of maturity reached by speech recognition technologies during recent years allows the development of applications that use them. In particular, ASR shows an enormous potential in mobile environments, where devices such as mobile phones or PDAs are used, and for Internet Protocol (IP) applications.Speech Recognition Over Digital Channels is the first book of its kind to offer a complete system comprehension, addressing the topics of distributed and network-based speech recognition issues and standards, the concepts of speech processing and transmission, and system architectures and robustness.
Describes the different client/server architectures for remote speech recognition systems, by means of which the client transmits speech parameters through a digital channel to a remote recognition server
- Focuses on robustness against both adverse acoustic environments (in the front-end) and bit errors/packet loss
- Discusses four ETSI standards for distributed speech recognition; the understanding of the standards and the technologies behind them
- Provides the necessary background for the comprehension of remote speech recognition technologies
This book will appeal to a wide-ranging audience: engineers using speech recognition systems, researchers involved in ASR systems and those interested in processing and transmitting speech such as signal processing and communications communities. It will also be of interest to technical experts requiring an understanding of recognition over mobile and IP networks, and postgraduate students working on robust speech processing.
Table of Contents
Forward.Preface.
1 Introduction.
1.1 Introduction.
1.2 RSR over Digital Channels.
1.3 Organization of the Book.
2 Speech Recognition with HMMs.
2.1 Introduction.
2.2 Some General Issues.
2.3 Analysis of Speech Signals.
2.4 Vector Quantization.
2.5 Approaches to ASR.
2.6 Hidden Markov Models.
2.7 Application of HMMs to Speech Recognition.
2.8 Model Adaptation.
2.9 Dealing with Uncertainty.
3 Networks and Degradation.
3.1 Introduction.
3.2 Mobile and Wireless Networks.
3.3 IP Networks.
3.4 The Acoustic Environment.
4 Speech Compression and Architectures for RSR.
4.1 Introduction.
4.2 Speech Coding.
4.3 Recognition from Decoded Speech.
4.4 Recognition from Codec Parameters.
4.5 Distributed Speech Recognition.
4.6 Comparison between NSR and DSR.
5 Robustness Against Transmission Channel Errors.
5.1 Introduction.
5.2 Channel Coding Techniques.
5.3 Error Concealment (EC).
6 Front-end Processing for Robust Feature Extraction.
6.1 Introduction.
6.2 Noise Reduction Techniques.
6.3 Voice Activity Detection.
6.4 Feature Normalization.
7 Standards for Distributed Speech Recognition.
7.1 Introduction.
7.2 Signal Preprocessing.
7.3 Feature Extraction.
7.4 Feature Compression and Encoding.
7.5 Feature Decoding and Postprocessing.
A Alternative Representations of the LPC Coefficients.
B Basic Digital Modulation Concepts.
C Review of Channel Coding Techniques.
C.1 Media-independent FEC.
C.2 Interleaving.
Bibliography.
List of Acronyms.
Index.
商品描述(中文翻譯)
描述  
自動語音辨識(ASR)是一種非常吸引人的人機互動方式。近年來,語音辨識技術的成熟度使得開發使用這些技術的應用成為可能。特別是在移動環境中,ASR顯示出巨大的潛力,例如在手機或個人數位助理(PDA)等設備上,以及在網際協議(IP)應用中。  
《數位通道上的語音辨識》是首本提供完整系統理解的書籍,涵蓋分散式和基於網路的語音辨識問題與標準、語音處理與傳輸的概念,以及系統架構和穩健性。  
本書描述了遠端語音辨識系統的不同客戶端/伺服器架構,客戶端透過數位通道將語音參數傳輸到遠端辨識伺服器。  
- 專注於對抗不利的聲學環境(前端)以及位元錯誤/封包遺失的穩健性  
- 討論四項ETSI標準的分散式語音辨識;理解這些標準及其背後的技術  
- 提供理解遠端語音辨識技術所需的背景知識  
本書將吸引廣泛的讀者群:使用語音辨識系統的工程師、參與ASR系統的研究人員,以及對處理和傳輸語音感興趣的信號處理和通訊社群。它也將引起需要理解移動和IP網路上辨識的技術專家的興趣,以及從事穩健語音處理的研究生。
目錄  
前言  
序言  
1 介紹  
1.1 介紹  
1.2 數位通道上的RSR  
1.3 本書組織  
2 使用HMM的語音辨識  
2.1 介紹  
2.2 一些一般性問題  
2.3 語音信號分析  
2.4 向量量化  
2.5 ASR的方法  
2.6 隱藏馬可夫模型  
2.7 HMM在語音辨識中的應用  
2.8 模型適應  
2.9 處理不確定性  
3 網路與退化  
3.1 介紹  
3.2 移動和無線網路  
3.3 IP網路  
3.4 聲學環境  
4 語音壓縮與RSR架構  
4.1 介紹  
4.2 語音編碼  
4.3 從解碼語音中辨識  
4.4 從編解碼器參數中辨識  
4.5 分散式語音辨識  
4.6 NSR與DSR的比較  
5 對傳輸通道錯誤的穩健性  
5.1 介紹  
5.2 通道編碼技術  
5.3 錯誤隱藏(EC)  
6 用於穩健特徵提取的前端處理  
6.1 介紹  
6.2 噪音減少技術  
6.3 語音活動檢測  
6.4 特徵正規化  
7 分散式語音辨識的標準  
7.1 介紹  
7.2 信號預處理  
7.3 特徵提取  
7.4 特徵壓縮與編碼  
7.5 特徵解碼與後處理  
A LPC係數的替代表示法  
B 基本數位調變概念  
C 通道編碼技術回顧  
C.1 媒體獨立的FEC  
C.2 交錯  
參考文獻  
縮略語列表  
索引  

 
 
    
 
    
 
     
     
     
     
    
 
     
     
    
 
     
     
     
     
     
     
    
