Full metadata record
DC FieldValueLanguage
dc.contributor.authorLin, RSen_US
dc.contributor.authorChen, LHen_US
dc.date.accessioned2014-12-08T15:34:58Z-
dc.date.available2014-12-08T15:34:58Z-
dc.date.issued2005-02-01en_US
dc.identifier.issn0218-0014en_US
dc.identifier.urihttp://dx.doi.org/10.1142/S0218001405003958en_US
dc.identifier.urihttp://hdl.handle.net/11536/23777-
dc.description.abstractThe existing audio retrieval systems fall into one of two categories: single-domain systems that can accept data of only a single type (e.g. speech) or multiple-domain systems that offer content-based retrieval for multiple types of audio data. Since a single-domain system has limited applications, a multiple-domain system will be more useful. However, different types of audio data will have different properties, this will make a multiple-domain system harder to be developed. If we can classify audio information in advance, the above problems can be solved. In this paper, we will propose a real-time classification method to classify audio signals into several basic audio types such as pure speech, music, song, speech with music background, and speech with environmental noise background. In order to make the proposed method robust for a variety of audio sources, we use Bayesian decision function for multivariable Gaussian distribution instead of manually adjusting a threshold for each discriminator. The proposed approach can be applied to content-based audio/video retrieval. In the experiment, the efficiency and effectiveness of this method are shown by an accuracy rate of more than 96% for general audio data classification.en_US
dc.language.isoen_USen_US
dc.subjectaudio classificationen_US
dc.subjectspectrogramen_US
dc.subjectBayesian decision functionen_US
dc.subjectmultivariable Gaussian distributionen_US
dc.titleA new approach for classification of generic audio dataen_US
dc.typeArticleen_US
dc.identifier.doi10.1142/S0218001405003958en_US
dc.identifier.journalINTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCEen_US
dc.citation.volume19en_US
dc.citation.issue1en_US
dc.citation.spage63en_US
dc.citation.epage78en_US
dc.contributor.department資訊工程學系zh_TW
dc.contributor.departmentDepartment of Computer Scienceen_US
dc.identifier.wosnumberWOS:000228135600004-
dc.citation.woscount3-
Appears in Collections:Articles


Files in This Item:

  1. 000228135600004.pdf

If it is a zip file, please download the file and unzip it, then open index.html in a browser to view the full text content.