Full metadata record
DC Field | Value | Language
dc.contributor.author | HWANG, SH | en_US
dc.contributor.author | CHEN, SH | en_US
dc.date.accessioned | 2014-12-08T15:03:40Z | -
dc.date.available | 2014-12-08T15:03:40Z | -
dc.date.issued | 1994-12-01 | en_US
dc.identifier.issn | 1350-245X | en_US
dc.identifier.uri | http://dx.doi.org/10.1049/ip-vis:19941421 | en_US
dc.identifier.uri | http://hdl.handle.net/11536/2200 | -
dc.description.abstract | A neural-network-based approach to synthesising F0 information for Mandarin text-to-speech is discussed. The basic idea is to use neural networks to model the relationship between linguistic features extracted from the input text and parameters representing the pitch contour of syllables. Two MLPs are used to separately synthesise the mean and shape of the pitch contour, using different linguistic features. A large set of utterances is employed to train these MLPs using the well-known back-propagation algorithm. Pronunciation rules for generating F0 information are automatically learned and implicitly memorised by the MLPs. During synthesis, parameters representing the mean and shape of the pitch contour of each syllable are generated using linguistic features extracted from the given input text. Simulation results confirmed that this is a promising approach for F0 synthesis. The resulting synthesised pitch contours of syllables match well with their original counterparts. Average root mean square errors of 0.94 ms/frame and 1.00 ms/frame were achieved. | en_US
dc.language.iso | en_US | en_US
dc.subject | MANDARIN SPEECH SYNTHESIZER | en_US
dc.subject | NEURAL NETWORKS | en_US
dc.title | NEURAL-NETWORK-BASED F0 TEXT-TO-SPEECH SYNTHESIZER FOR MANDARIN | en_US
dc.type | Article | en_US
dc.identifier.doi | 10.1049/ip-vis:19941421 | en_US
dc.identifier.journal | IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING | en_US
dc.citation.volume | 141 | en_US
dc.citation.issue | 6 | en_US
dc.citation.spage | 384 | en_US
dc.citation.epage | 390 | en_US
dc.contributor.department | 電信工程研究所 | zh_TW
dc.contributor.department | 電信研究中心 | zh_TW
dc.contributor.department | Institute of Communications Engineering | en_US
dc.contributor.department | Center for Telecommunications Research | en_US
dc.identifier.wosnumber | WOS:A1994QB09800005 | -
dc.citation.woscount | 7 | -
Appears in Collections: Articles


Files in This Item:

  1. A1994QB09800005.pdf

If the file is a zip archive, download and extract it, then open index.html in the extracted folder with a browser to view the full text.