TY - GEN
T1 - Particle filtering for bearing-only audio-visual speaker detection and tracking
AU - Rae, Andrew
AU - Khamis, Alaa
AU - Basir, Otman
AU - Kamel, Mohamed
PY - 2009
Y1 - 2009
AB - We present a method for audio-visual speaker detection and tracking in a smart meeting room environment based on bearing measurements and particle filtering. Bearing measurements are determined using the Time Difference of Arrival (TDOA) of the acoustic signal reaching a pair of microphones, and by tracking facial regions in images from monocular cameras. A particle filter is used to sample the space of possible speaker locations within the meeting room, and to fuse the bearing measurements from auditory and visual sources. The proposed system was tested in a video messaging scenario, using a single participant seated in front of a screen to which a camera and microphone pair are attached. The experimental results show that the accuracy of speaker tracking using bearing measurements is related to the location of the speaker relative to the locations of the camera and microphones, which can be quantified using a parameter known as Dilution of Precision.
KW - Acoustic arrays
KW - Direction of arrival estimation
KW - Machine vision
KW - Monte Carlo methods
KW - Position measurement
UR - https://www.scopus.com/pages/publications/77951491840
U2 - 10.1109/ICSCS.2009.5412478
DO - 10.1109/ICSCS.2009.5412478
M3 - Conference contribution
AN - SCOPUS:77951491840
SN - 9781424443987
T3 - 3rd International Conference on Signals, Circuits and Systems, SCS 2009
BT - 3rd International Conference on Signals, Circuits and Systems, SCS 2009
T2 - 3rd International Conference on Signals, Circuits and Systems, SCS 2009
Y2 - 6 November 2009 through 8 November 2009
ER -