標題: 利用貝氏方法透過聲音指紋辨識場所
Location Recognition via Audio Fingerprint by the Generalized Bayes Method
作者: 鄭翔之
Cheng, Hsiang-Chih
王秀瑛
曾煜棋
Wang, Hsiu-Ying
Tseng, Yu-Chee
統計學研究所
關鍵字: 聲音背景頻譜;MFCC;混合高斯分佈;最大期望演算法;貝氏方法;Acoustic background spectrum;MFCC;GMM;EM-algorithm;Generalized Bayes method

公開日期: 2013
摘要: 近年來,在手機或其他行動設備上,利用聲音的背景頻譜來辨識使用者的所在 位置已經越來越盛行。利用聲音來做辨識的原因是因為聲音具有完備的性質、簡單計 算、不易受到短暫臨時的聲音影響和聲音的獨特性。一般來說,聲音頻譜的分析是透 過Mel-frequency cepstral coeffcients (MFCCs) 將一維度的時域聲音資料轉換到高維度的 頻域資料,再用混合高斯分佈(Gaussian mixture model GMM) 作估計。估計混合高斯分 佈的參數,常見作法為最大期望演算法(EM-algorithm)。在這篇論文裡,改用貝氏方法 來估計混合高斯分配,並且把結果和最大期望演算法做比較。在模擬和分析實際數據 中,貝氏方法均表現得比最大期望演算法準確。
Location Recognition has attracted a lot of researchers' attention in the past few years. Recently, using acoustic background spectrum for indoor localization through mobile device, such as smartphone, had been discussed in the literature. The advantage of location recognition through audio is because it is compact, easy computed, robust to transient sounds, and distinctive. In general, acoustic spectrum analysis is based on Mel-frequency cepstral coefficients (MFCCs) to transform audio data to high dimensional data, and these high dimensional data can be fitted by Gaussian mixture models. Based on the Gaussian mixture models fitted by the training data, we can classify the audio fingerprint of a location to a Gaussian mixture model with the highest likelihood value. To estimate the parameters in the Gaussian mixture models, it is common to use the expectation-maximization algorithm (EM-algorithm) to estimate the parameter. In this thesis, we apply the generalized Bayes method to estimate the parameter in GMMs, and compare it with the EM-algorithm. In a simulation study and a real data example, the generalized Bayes method is shown to have better performance than the EM algorithm.

URI: http://140.113.39.130/cdrfb3/record/nctu/#GT070152620
http://hdl.handle.net/11536/74752
顯示於類別:畢業論文