Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Liu, CH | en_US |
dc.contributor.author | Huang, CC | en_US |
dc.date.accessioned | 2014-12-08T15:45:20Z | - |
dc.date.available | 2014-12-08T15:45:20Z | - |
dc.date.issued | 2000-05-01 | en_US |
dc.identifier.issn | 0018-9545 | en_US |
dc.identifier.uri | http://dx.doi.org/10.1109/25.845095 | en_US |
dc.identifier.uri | http://hdl.handle.net/11536/30554 | - |
dc.description.abstract | In this paper, we present a median-rate speech coder, the controlled adaptive prediction delta modulation coder (CAPDM), which operates at 16 kb/s with good speech quality and low algorithm complexity [15]. The coder is dedicated to personal communication network (PCN) applications and transmits speech samples on the basis of packets. It combines the features of one-step looking forward decision, syllabic companding, instantaneous companding, and adaptive prediction. In addition to the use of a short-term prediction filter, CAPDM also exploits the pitch property to predict the speech waveform explicitly. With the aid of a pitch prediction filter, the performance of a CAPDM codec improves by about 3 dB in segmental signal-to-noise ratio (SEGSNR). The average SEGSNR of CAPDM.FF is about 21 dB, which is 7 dB higher than that of traditional CVSD at 16 kb/s. We also utilize an adaptive postfilter (APF) to enhance the perceptual quality of the decoded speech. The mean opinion score (MOS) listening test of CAPDM.FF with APF shows that its average score reaches 4.19, which is as good as G.728 16-kb/s LD-CELP and is comparable with CCITT G.721 32-kb/s ADPCM. The complexity of CAPDM.FF is evaluated to be 8 MIPS, which is much lower than that of LD-CELP and could be further reduced by adopting a smaller correlation window for pitch detection. To solve the problem of packet loss, we developed a packet-based waveform substitution method by reinitializing the codec parameters at the beginning of each packet. The simulation results show that CAPDM.FF could tolerate 5% packet loss and still maintain an SEGSNR of 10 dB and an MOS of about 3.0. | en_US |
dc.language.iso | en_US | en_US |
dc.subject | adaptive prediction | en_US |
dc.subject | instantaneous companding | en_US |
dc.subject | packet recovery | en_US |
dc.subject | pitch detection | en_US |
dc.subject | speech coder | en_US |
dc.subject | syllabic companding | en_US |
dc.subject | waveform substitution | en_US |
dc.title | A packet-based CAPDM speech coder for PCN applications | en_US |
dc.type | Article; Proceedings Paper | en_US |
dc.identifier.doi | 10.1109/25.845095 | en_US |
dc.identifier.journal | IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY | en_US |
dc.citation.volume | 49 | en_US |
dc.citation.issue | 3 | en_US |
dc.citation.spage | 753 | en_US |
dc.citation.epage | 765 | en_US |
dc.contributor.department | 電信工程研究所 | zh_TW |
dc.contributor.department | Institute of Communications Engineering | en_US |
dc.identifier.wosnumber | WOS:000087471700008 | - |
Appears in Collections: | Conference Papers |