標題: BIC-Based Speaker Segmentation Using Divide-and-Conquer Strategies With Application to Speaker Diarization
作者: Cheng, Shih-Sian
Wang, Hsin-Min
Fu, Hsin-Chia
資訊工程學系
Department of Computer Science
關鍵字: Bayesian information criterion (BIC);divide-and-conquer;speaker change detection;speaker diarization;speaker segmentation
公開日期: 1-一月-2010
摘要: In this paper, we propose three divide-and-conquer approaches for Bayesian information criterion (BIC)-based speaker segmentation. The approaches detect speaker changes by recursively partitioning a large analysis window into two sub-windows and recursively verifying the merging of two adjacent audio segments using Delta BIC, a widely-adopted distance measure of two audio segments. We compare our approaches to three popular distance-based approaches, namely, Chen and Gopalakrishnan's window-growing-based approach, Siegler et al.'s fixed-size sliding window approach, and Delacourt and Wellekens's DISTBIC approach, by performing computational cost analysis and conducting speaker change detection experiments on two broadcast news data sets. The results show that the proposed approaches are more efficient and achieve higher segmentation accuracy than the compared distance-based approaches. In addition, we apply the segmentation approaches discussed in this paper to the speaker diarization task. The experiment results show that a more effective segmentation approach leads to better diarization accuracy.
URI: http://dx.doi.org/10.1109/TASL.2009.2024730
http://hdl.handle.net/11536/6158
ISSN: 1558-7916
DOI: 10.1109/TASL.2009.2024730
期刊: IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING
Volume: 18
Issue: 1
起始頁: 141
結束頁: 157
顯示於類別:期刊論文


文件中的檔案:

  1. 000271020900003.pdf

若為 zip 檔案,請下載檔案解壓縮後,用瀏覽器開啟資料夾中的 index.html 瀏覽全文。