Active speech source localization by a dual coarse-to-fine search
Title | Active speech source localization by a dual coarse-to-fine search |
Publication Type | Conference Papers |
Year of Publication | 2001 |
Authors | Duraiswami R, Zotkin DN, Davis LS |
Conference Name | 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01) |
Date Published | 2001/// |
Publisher | IEEE |
ISBN Number | 0-7803-7041-4 |
Keywords | active speech source localization, array signal processing, Computer interfaces, Delay effects, delay estimation, delay-based estimators, dual coarse-to-fine search, Frequency, Inverse problems, Laboratories, multiple speech sound sources, Position measurement, real-time implementation, Sensor arrays, Signal processing algorithms, space, speech, Speech processing, Teleconferencing, videoconferencing systems |
Abstract | Accurate and fast localization of multiple speech sound sources is a significant problem in videoconferencing systems. Based on the observation that the wavelengths of the sound from a speech source are comparable to the dimensions of the space being searched, and that the source is broadband, we develop an efficient search strategy that finds the source(s) in a given space. The search is made efficient by using coarse-to-fine strategies in both space and frequency. The algorithm is shown to be robust compared to typical delay-based estimators and fast enough for real-time implementation. Its performance can be further improved by using constraints from computer vision |
DOI | 10.1109/ICASSP.2001.940366 |