Articulatory Speech Synthesis from the Fluid Dynamics of the Vocal Apparatus (Synthesis Lectures on Speech and Audio Processing)
暫譯: 從聲道流體動力學進行發音合成（語音與音頻處理合成講座）

Name: Articulatory Speech Synthesis from the Fluid Dynamics of the Vocal Apparatus (Synthesis Lectures on Speech and Audio Processing)
Price: 1482 TWD
Availability: OnlineOnly
Author: Stephen Levinson, Don Davis, Scot Slimon, Jun Huang
ISBN: 1598291785

Stephen Levinson, Don Davis, Scot Slimon, Jun Huang

出版商: Morgan & Claypool
出版日期: 2012-07-01
售價: $1,560
貴賓價: 9.5 折 $1,482
語言: 英文
頁數: 116
裝訂: Paperback
ISBN: 1598291785
ISBN-13: 9781598291780
相關分類: 語音辨識 Speech-recognition

海外代購書籍(需單獨結帳)

商品描述

This book addresses the problem of articulatory speech synthesis based on computed vocal tract geometries and the basic physics of sound production in it. Unlike conventional methods based on analysis/synthesis using the well-known source filter model, which assumes the independence of the excitation and filter, we treat the entire vocal apparatus as one mechanical system that produces sound by means of fluid dynamics. The vocal apparatus is represented as a three-dimensional time-varying mechanism and the sound propagation inside it is due to the non-planar propagation of acoustic waves through a viscous, compressible fluid described by the Navier-Stokes equations.

We propose a combined minimum energy and minimum jerk criterion to compute the dynamics of the vocal tract during articulation. Theoretical error bounds and experimental results show that this method obtains a close match to the phonetic target positions while avoiding abrupt changes in the articulatory trajectory. The vocal folds are set into aerodynamic oscillation by the flow of air from the lungs. The modulated air stream then excites the moving vocal tract. This method shows strong evidence for source-filter interaction.

Based on our results, we propose that the articulatory speech production model has the potential to synthesize speech and provide a compact parameterization of the speech signal that can be useful in a wide variety of speech signal processing problems.

Table of Contents: Introduction / Literature Review / Estimation of Dynamic Articulatory Parameters / Construction of Articulatory Model Based on MRI Data / Vocal Fold Excitation Models / Experimental Results of Articulatory Synthesis / Conclusion

商品描述(中文翻譯)

本書探討基於計算的聲道幾何形狀和聲音產生基本物理學的發音合成問題。與傳統基於分析/合成的已知源濾波模型的方法不同，該模型假設激勵和濾波器之間是獨立的，我們將整個發聲器官視為一個機械系統，通過流體動力學來產生聲音。發聲器官被表示為一個三維時間變化的機械裝置，聲音在其中的傳播是由於聲波在一個由Navier-Stokes方程描述的粘性可壓縮流體中非平面傳播所致。

我們提出了一個結合最小能量和最小抖動準則的方法，以計算發聲過程中的聲道動力學。理論誤差界限和實驗結果顯示，這種方法能夠與語音目標位置緊密匹配，同時避免發音軌跡中的突變。聲帶通過來自肺部的氣流進行空氣動力學振盪。調制的氣流隨後激發移動的聲道。這種方法顯示出源-濾波器互動的強有力證據。

根據我們的結果，我們提出發音語音產生模型有潛力合成語音，並提供語音信號的緊湊參數化，這在各種語音信號處理問題中都可能是有用的。

目錄：引言 / 文獻回顧 / 動態發音參數的估計 / 基於MRI數據的發音模型構建 / 聲帶激發模型 / 發音合成的實驗結果 / 結論