Full metadata record
DC FieldValueLanguage
dc.contributor.authorLin, CTen_US
dc.contributor.authorWu, RCen_US
dc.contributor.authorWu, GDen_US
dc.date.accessioned2014-12-08T15:41:49Z-
dc.date.available2014-12-08T15:41:49Z-
dc.date.issued2002-11-01en_US
dc.identifier.issn0218-0014en_US
dc.identifier.urihttp://dx.doi.org/10.1142/S0218001402002076en_US
dc.identifier.urihttp://hdl.handle.net/11536/28438-
dc.description.abstractThis paper addresses the problem of speech segmentation and enhancement in the presence of noise. We first propose a new word boundary detection algorithm by using a neural fuzzy network (called ATF-based SONFIN algorithm) for identifying islands of word signals in fixed noise-level environment. We further propose a new RTF-based RSONFIN algorithm where the background noise level varies during the procedure of recording. The adaptive time-frequency (ATF) and refined time-frequency (RTF) parameters extend the TF parameter from single band to multiband spectrum analysis, and help to make the distinction of speech and noise signals clear. The ATF and RTF parameters can extract useful frequency information by adaptively choosing proper bands of the mel-scale frequency bank. Due to the self-learning ability of SONFIN and RSONFIN, the proposed algorithms avoid the need of empirically determining thresholds and ambiguous rules. The RTF-based RSONFIN algorithm can also find the variation of the background noise level and detect correct word boundaries in the condition of variable background noise level by processing the temporal relations. Our experimental results show that both in the fixed and variable noise-level environment, the algorithms that we proposed achieved higher recognition rate than several commonly used word boundary detection algorithms and reduced the recognition error rate due to endpoint detection.en_US
dc.language.isoen_USen_US
dc.subjectmel-scale frequencyen_US
dc.subjectmultibanden_US
dc.subjectspectrum analysisen_US
dc.subjectself-learning abilityen_US
dc.subjectneural fuzzy networken_US
dc.titleNoisy speech segmentation/enhancement with multiband analysis and neural fuzzy networksen_US
dc.typeArticleen_US
dc.identifier.doi10.1142/S0218001402002076en_US
dc.identifier.journalINTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCEen_US
dc.citation.volume16en_US
dc.citation.issue7en_US
dc.citation.spage927en_US
dc.citation.epage955en_US
dc.contributor.department電控工程研究所zh_TW
dc.contributor.departmentInstitute of Electrical and Control Engineeringen_US
dc.identifier.wosnumberWOS:000179800300011-
dc.citation.woscount0-
Appears in Collections:Articles


Files in This Item:

  1. 000179800300011.pdf

If it is a zip file, please download the file and unzip it, then open index.html in a browser to view the full text content.