Title: Spectral and prosodic transformations of hearing-impaired Mandarin speech
Authors: Lee, CL
Chang, WW
Chiang, YC
Institute of Communications Engineering
Keywords: voice conversion; prosodic modification; spectral conversion; hearing-impaired speaker; sinusoidal model
Issue Date: 1-Feb-2006
Abstract: This paper studies the combined use of spectral and prosodic conversions to enhance hearing-impaired Mandarin speech. The analysis-synthesis system is based on a sinusoidal representation of the speech production mechanism. By taking advantage of the tone structure in Mandarin speech, pitch contours are orthogonally transformed and applied within the sinusoidal framework to perform pitch modification. Also proposed is a time-scale modification algorithm that finds accurate alignments between hearing-impaired and normal utterances. Using the alignments, spectral conversion is performed on subsyllabic acoustic units by a continuous probabilistic transform based on a Gaussian mixture model. Results of perceptual evaluation indicate that the proposed system greatly improves the intelligibility and the naturalness of hearing-impaired Mandarin speech. (c) 2005 Elsevier B.V. All rights reserved.
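Illustrative note: the abstract's "continuous probabilistic transform based on a Gaussian mixture model" can be pictured with the following minimal sketch. It uses the commonly cited GMM regression form, in which each mixture component contributes a linear map weighted by the posterior probability of that component given the source vector. This record does not confirm that the paper uses exactly this form. The function names (`gaussian_pdf`, `gmm_convert`) and all parameter names are hypothetical, and the mixture parameters are assumed to have been estimated beforehand from time-aligned source and target frames.

```python
import numpy as np


def gaussian_pdf(x, mean, cov):
    """Density of a multivariate normal distribution at one vector x."""
    d = x.shape[0]
    diff = x - mean
    # Solve cov * z = diff instead of forming the explicit inverse.
    solved = np.linalg.solve(cov, diff)
    norm = np.sqrt(((2.0 * np.pi) ** d) * np.linalg.det(cov))
    return float(np.exp(-0.5 * diff @ solved) / norm)


def gmm_convert(x, weights, mu_x, mu_y, cov_xx, cov_yx):
    """Map one source spectral vector x into the target space.

    Assumed (hypothetical) inputs, one entry per mixture component:
      weights : prior weight of the component
      mu_x    : mean of the source features
      mu_y    : mean of the target features
      cov_xx  : covariance of the source features
      cov_yx  : cross-covariance between target and source features
    """
    # Posterior probability of each component given the source vector.
    likes = np.array([
        w * gaussian_pdf(x, m, c)
        for w, m, c in zip(weights, mu_x, cov_xx)
    ])
    post = likes / likes.sum()

    # Posterior-weighted sum of per-component linear regressions.
    y = np.zeros_like(mu_y[0], dtype=float)
    for p, mx, my, cxx, cyx in zip(post, mu_x, mu_y, cov_xx, cov_yx):
        y += p * (my + cyx @ np.linalg.solve(cxx, x - mx))
    return y
```

With a single component the mapping reduces to one linear regression from source to target features; with several components the posterior weights blend the per-component regressions smoothly, which is what makes the transform continuous. A practical version would also guard against all likelihoods underflowing to zero for vectors far from every component.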
URI: http://dx.doi.org/10.1016/j.specom.2005.08.001
http://hdl.handle.net/11536/12655
ISSN: 0167-6393
DOI: 10.1016/j.specom.2005.08.001
Journal: SPEECH COMMUNICATION
Volume: 48
Issue: 2
Begin Page: 207
End Page: 219
Appears in Collections: Journal Articles


Files in This Item:

  1. 000234772500007.pdf
