ISOLATED MANDARIN SYLLABLE RECOGNITION USING SEGMENTAL FEATURES

doi:10.1049/ip-vis:19951648

Full metadata record

DC Field	Value	Language
dc.contributor.author	CHANG, S	en_US
dc.contributor.author	CHEN, SH	en_US
dc.date.accessioned	2014-12-08T15:03:32Z	-
dc.date.available	2014-12-08T15:03:32Z	-
dc.date.issued	1995-02-01	en_US
dc.identifier.issn	1350-245X	en_US
dc.identifier.uri	http://dx.doi.org/10.1049/ip-vis:19951648	en_US
dc.identifier.uri	http://hdl.handle.net/11536/2072	-
dc.description.abstract	A segment-based speech recognition scheme is proposed. The basic idea is to model explicitly the correlation among successive frames of speech signals by using features representing contours of spectral parameters. The speech signal of an utterance is regarded as a template formed by directly concatenating a sequence of acoustic segments. Each constituent acoustic segment is of variable length in nature and represented by a fixed dimensional feature vector formed by coefficients of discrete orthonormal polynomial expansions for approximating its spectral parameter contours. In the training, an automatic algorithm is proposed to generate several segment-based reference templates for each syllable class. In the testing, a frame-based dynamic programming procedure is employed to calculate the matching score of comparing the test utterance with each reference template. Performance of the proposed scheme was examined by simulations on multispeaker speech recognition for 408 highly confusing isolated Mandarin base-syllables. A recognition rate of 81.1% was achieved for the case using 5-segment, 8-reference template models with cepstral and delta-cepstral coefficients as recognition features. It is 4.5% higher than that of a well-modelled 12-state, 5-mixture CHMM method using cepstral, delta cepstral, and delta-delta cepstral coefficients.	en_US
dc.language.iso	en_US	en_US
dc.subject	SPEECH RECOGNITION	en_US
dc.subject	ACOUSTIC SEGMENTS	en_US
dc.subject	MANDARIN BASE SYLLABLES	en_US
dc.title	ISOLATED MANDARIN SYLLABLE RECOGNITION USING SEGMENTAL FEATURES	en_US
dc.type	Article	en_US
dc.identifier.doi	10.1049/ip-vis:19951648	en_US
dc.identifier.journal	IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING	en_US
dc.citation.volume	142	en_US
dc.citation.issue	1	en_US
dc.citation.spage	59	en_US
dc.citation.epage	64	en_US
dc.contributor.department	電信工程研究所	zh_TW
dc.contributor.department	電信研究中心	zh_TW
dc.contributor.department	Institute of Communications Engineering	en_US
dc.contributor.department	Center for Telecommunications Research	en_US
dc.identifier.wosnumber	WOS:A1995QL50500011	-
dc.citation.woscount	0	-
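
The abstract's central idea, representing a variable-length acoustic segment by a fixed-dimensional vector of orthonormal polynomial expansion coefficients, can be illustrated with a minimal sketch. This record does not give the paper's exact polynomial family, expansion order, or normalisation, so the Gram-Schmidt construction, the default `order=2`, and the function names `orthonormal_basis` and `segment_features` below are assumptions made for illustration only, not the authors' implementation.

```python
import math


def orthonormal_basis(length, order):
    """Return order+1 discrete polynomials sampled at `length` points.

    The polynomials are orthonormal under the inner product
    <f, g> = (1 / length) * sum(f[t] * g[t]).
    """
    if length < order + 1:
        raise ValueError("segment needs at least order + 1 frames")
    # Sample positions are normalised to [0, 1] so that segments of
    # different durations are described on a common time axis.
    xs = [t / (length - 1) if length > 1 else 0.0 for t in range(length)]
    basis = []
    for degree in range(order + 1):
        vec = [x ** degree for x in xs]
        # Gram-Schmidt step: remove components along lower-degree polynomials.
        for prev in basis:
            proj = sum(v * p for v, p in zip(vec, prev)) / length
            vec = [v - proj * p for v, p in zip(vec, prev)]
        norm = math.sqrt(sum(v * v for v in vec) / length)
        basis.append([v / norm for v in vec])
    return basis


def segment_features(frames, order=2):
    """Map a variable-length segment to a fixed-dimensional feature vector.

    `frames` is a list of per-frame spectral parameter vectors (for example
    cepstral coefficients), all of the same dimension. Each parameter's
    contour over time is projected onto the polynomial basis, giving
    (order + 1) coefficients per parameter regardless of segment length.
    """
    length = len(frames)
    basis = orthonormal_basis(length, order)
    dim = len(frames[0])
    features = []
    for d in range(dim):
        contour = [frame[d] for frame in frames]
        for poly in basis:
            features.append(sum(c * p for c, p in zip(contour, poly)) / length)
    return features


# Two segments of different durations yield feature vectors of equal size.
short_segment = [[1.0, 2.0 * t] for t in range(5)]
long_segment = [[1.0, 2.0 * t] for t in range(12)]
print(len(segment_features(short_segment)), len(segment_features(long_segment)))
```

With this normalisation the zeroth coefficient of each parameter is the mean of its contour, and the higher-order coefficients describe its slope and curvature, which is one way to capture the correlation among successive frames that the abstract refers to.
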
Appears in Collections:	Journal Articles

Files in This Item:

A1995QL50500011.pdf

If the file is a zip archive, download and extract it, then open index.html in the extracted folder with a browser to view the full text.