Frequency domain microphone array calibration and beamforming for automatic speech recognition

doi:10.1093/ietfec/e88-a.9.2401

標題:	Frequency domain microphone array calibration and beamforming for automatic speech recognition
作者:	Hu, JS Cheng, CC 電控工程研究所 Institute of Electrical and Control Engineering
關鍵字:	beamformer;microphone array;calibration;speech recognition;speech enhancement
公開日期:	1-Sep-2005
摘要:	This investigation proposed two array beamformers SPFDBB (Soft Penalty Frequency Domain Block Beamformer) and FDABB (Frequency Domain Adjustable Block Beamformer). Compared with the conventional beamformers, these frequency-domain methods can significantly reduce the computation power requirement in ASR (Automatic Speech Recognition) based applications. Like other reference signal based techniques, SPFDBB and FDABB minimize microphone's mismatch, desired signal cancellation caused by reflection effects and resolution due to the array's position. Additionally, these proposed methods are suitable for both near-field and far-field environments. Generally, the convolution relation between channel and speech source in time domain cannot be modeled accurately as a multiplication in the frequency domain with a finite window size, especially in ASR applications. SPFDBB and FDABB can approximate this multiplication by treating several frames as a block to achieve a better beamforming result. Moreover, FDABB adjusts the number of frames on-line to cope with the variation of characteristics in both speech and interference signals. A better performance was found to be achievable by combining these methods with an ASR mechanism.
URI:	http://dx.doi.org/10.1093/ietfec/e88-a.9.2401 http://hdl.handle.net/11536/13365
ISSN:	0916-8508
DOI:	10.1093/ietfec/e88-a.9.2401
期刊:	IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES
Volume:	E88A
Issue:	9
起始頁:	2401
結束頁:	2411
Appears in Collections:	Articles