Localization of Multiple Simultaneous Speakers by Combining the Information from Different Subbands

سال انتشار: 1392
نوع سند: مقاله کنفرانسی
زبان: انگلیسی
مشاهده: 980

فایل این مقاله در 6 صفحه با فرمت PDF قابل دریافت می باشد

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

شناسه ملی سند علمی:

ICEE21_379

تاریخ نمایه سازی: 27 مرداد 1392

چکیده مقاله:

Time Difference Of Arrival (TDOA)-based algorithms are the main methods for speech source localization. A category of these methods are based on Generalized Cross Correlation(GCC). These methods estimate the source location based on the calculated TDOA between microphones signals. Theaccuracy of these methods decreases as the amount of noise and reverberation increases. In this paper, we propose the utilization of subband processing for the localization of twosimultaneous speech sources. While the conventional methods consider the whole signal spectrum identically in thelocalization procedure, the proposed method takes advantage of the differences in the frequency bands of the mixed speech forthe localization of multiple speakers. Actually, the proposedmethod computes the GCC in the different frequency bands and then, combines the information from the subbands in a so-calledsmart manner. We have discussed several approaches for the combination of subband. The performance evaluations indifferent environmental conditions demonstrate the superiority of the proposed method compared to the fullband GCC method.The proposed method considerably increases the accuracy of simultaneous speaker localization.

کلیدواژه ها:

Multi Source Localization – Subband Processing – Generalized Cross Correlation – PHAT filter – DOA

نویسندگان

Ali Dehghan Firoozabadi

Speech Processing Research Lab (SPRL), Electrical and Computer Eng. Dept., Yazd University