TY - GEN
T1 - Maximising audiovisual correlation with automatic lip tracking and vowel based segmentation
AU - Abel, Andrew
AU - Hussain, Amir
AU - Nguyen, Quoc Dinh
AU - Ringeval, Fabien
AU - Chetouani, Mohamed
AU - Milgram, Maurice
PY - 2009
Y1 - 2009
N2 - In recent years, the established link between the various human communication production domains has become more widely utilised in the field of speech processing. In this work, a state of the art Semi Adaptive Appearance Model (SAAM) approach developed by the authors is used for automatic lip tracking, and an adapted version of our vowel based speech segmentation system is employed to automatically segment speech. Canonical Correlation Analysis (CCA) on segmented and non segmented data in a range of noisy speech environments finds that segmented speech has a significantly better audiovisual correlation, demonstrating the feasibility of our techniques for further development as part of a proposed audiovisual speech enhancement system.
AB - In recent years, the established link between the various human communication production domains has become more widely utilised in the field of speech processing. In this work, a state of the art Semi Adaptive Appearance Model (SAAM) approach developed by the authors is used for automatic lip tracking, and an adapted version of our vowel based speech segmentation system is employed to automatically segment speech. Canonical Correlation Analysis (CCA) on segmented and non segmented data in a range of noisy speech environments finds that segmented speech has a significantly better audiovisual correlation, demonstrating the feasibility of our techniques for further development as part of a proposed audiovisual speech enhancement system.
UR - https://www.scopus.com/pages/publications/77952056171
U2 - 10.1007/978-3-642-04391-8_9
DO - 10.1007/978-3-642-04391-8_9
M3 - Conference contribution
AN - SCOPUS:77952056171
SN - 3642043909
SN - 9783642043901
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 65
EP - 72
BT - Biometric ID Management and Multimodal Communication - Joint COST 2101 and 2102 International Conference, BioID_MultiComm 2009, Proceedings
T2 - Joint COST 2101 and 2102 International Conference on Biometric ID Management and Multimodal Communication, BioID_MultiComm 2009
Y2 - 16 September 2009 through 18 September 2009
ER -