標題: | A new approach for audio classification and segmentation using Gabor wavelets and Fisher Linear Discriminator |
作者: | Lin, RS Chen, LH 資訊工程學系 Department of Computer Science |
關鍵字: | audio classification and segmentation;spectrogram;audio content-based retrieval;Fisher Linear discriminator;Gabor wavelets |
公開日期: | 1-九月-2005 |
摘要: | Rapid increase in the amount of audio data demands an efficient method to automatically segment or classify audio stream based on its content. In this paper, based on the Gabor wavelet features, an audio classification and segmentation method is proposed. This method will first divide an audio stream into clips, each of which contains one-second audio information. Then, each clip is classified as one of two classes or five classes. Two classes contain speech and music; pure speech, pure music, song, speech with music background, and speech with environmental noise background are for five classes. Finally, a merge technique is provided to do segmentation. In order to make the proposed method robust for a variety of audio sources: we use Fisher Linear Discriminator to obtain features with the highest discriminative ability Experimental results show that the proposed method can achieve over 98% accuracy rate for speech and music discrimination, and more than 95% for a five-way discrimination. By checking the class types of adjacent clips, we can also identify more than 95% audio scene breaks in audio sequence. |
URI: | http://dx.doi.org/10.1142/S0218001405004289 http://hdl.handle.net/11536/13337 |
ISSN: | 0218-0014 |
DOI: | 10.1142/S0218001405004289 |
期刊: | INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE |
Volume: | 19 |
Issue: | 6 |
起始頁: | 807 |
結束頁: | 822 |
顯示於類別: | 期刊論文 |