Title: | Bayesian classification for data from the same unknown class |
Authors: | Huang, HJ; Hsu, CN; Department of Computer Science |
Keywords: | classification;machine learning;naive Bayes classifier;speaker recognition |
Issue Date: | 1-Apr-2002 |
Abstract: | In this paper, we address the problem of how to classify a set of query vectors that belong to the same unknown class. Sets of data known to be sampled from the same class are naturally available in many application domains, such as speaker recognition. We refer to these sets as homologous sets. We show how to take advantage of homologous sets in classification to obtain improved accuracy over classifying each query vector individually. Our method, called homologous naive Bayes (HNB), is based on the naive Bayes classifier, a simple algorithm shown to be effective in many application domains. HNB uses a modified classification procedure that classifies multiple instances as a single unit. Compared with a voting method and several other variants of naive Bayes classification, HNB significantly outperforms these methods in a variety of test data sets, even when the number of query vectors in the homologous sets is small. We also report a successful application of HNB to speaker recognition. Experimental results show that HNB can achieve classification accuracy comparable to the Gaussian mixture model (GMM), the most widely used speaker recognition approach, while using less time for both training and classification. |
URI: | http://dx.doi.org/10.1109/3477.990870 http://hdl.handle.net/11536/28897 |
ISSN: | 1083-4419 |
DOI: | 10.1109/3477.990870 |
期刊: | IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS |
Volume: | 32 |
Issue: | 2 |
Start Page: | 137 |
End Page: | 145 |
Appears in Collections: | Journal Articles |
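
The abstract describes classifying a homologous set "as a single unit" and comparing that against a voting method. The sketch below is one plausible reading of that idea, not the authors' confirmed procedure: it scores each class by its log prior plus the summed naive Bayes log-likelihoods of every query vector in the set, and contrasts this with a majority vote over per-vector decisions. The Gaussian feature model, the variance floor, and all function names (`fit_gaussian_nb`, `classify_set`, `classify_by_voting`) are assumptions made for illustration only.

```python
import math
from collections import defaultdict


def fit_gaussian_nb(X, y, var_floor=1e-6):
    """Estimate a log prior, feature means, and feature variances per class.

    Assumption: features are continuous and modeled as independent Gaussians.
    """
    by_class = defaultdict(list)
    for row, label in zip(X, y):
        by_class[label].append(row)

    model = {}
    for label, rows in by_class.items():
        n = len(rows)
        columns = list(zip(*rows))
        means = [sum(col) / n for col in columns]
        # var_floor keeps the variance strictly positive for constant features.
        variances = [
            sum((v - m) ** 2 for v in col) / n + var_floor
            for col, m in zip(columns, means)
        ]
        model[label] = (math.log(n / len(X)), means, variances)
    return model


def log_likelihood(x, means, variances):
    """Naive Bayes log p(x | class): a sum of per-feature Gaussian log densities."""
    return sum(
        -0.5 * math.log(2 * math.pi * var) - (v - m) ** 2 / (2 * var)
        for v, m, var in zip(x, means, variances)
    )


def classify_set(model, queries):
    """Score the whole set jointly: log P(c) + sum over queries of log p(x | c)."""
    scores = {
        label: log_prior
        + sum(log_likelihood(x, means, variances) for x in queries)
        for label, (log_prior, means, variances) in model.items()
    }
    return max(scores, key=scores.get)


def classify_by_voting(model, queries):
    """Baseline: classify each query vector alone, then take the majority label."""
    votes = defaultdict(int)
    for x in queries:
        votes[classify_set(model, [x])] += 1
    return max(votes, key=votes.get)


if __name__ == "__main__":
    # Toy one-dimensional data, invented purely for illustration.
    X = [[0.0], [1.0], [2.0], [4.0], [5.0], [6.0]]
    y = ["a", "a", "a", "b", "b", "b"]
    model = fit_gaussian_nb(X, y)
    queries = [[2.9], [2.9], [3.3]]
    print("joint :", classify_set(model, queries))
    print("voting:", classify_by_voting(model, queries))
```

On borderline sets the two procedures can disagree, because joint scoring accumulates how strongly each vector favors a class while voting keeps only each vector's hard decision.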