Combining Edge Detection and Region Segmentation for Lip Contour Extraction

Saeed, Usman; Dugelay, Jean-Luc

doi:10.1007/978-3-642-14061-7_2


Part of the book series: Lecture Notes in Computer Science (LNIP, volume 6169)

Included in the following conference series:

International Conference on Articulated Motion and Deformable Objects


Abstract

The automatic detection of the lip contour is a relatively difficult problem in computer vision because of variation among individuals and in environmental conditions. In this paper we improve upon classical methods by introducing fusion. Two separate methods, one based on edge detection and the other on region segmentation, are first applied to detect the outer lip contour; their results are then combined. We also present an empirical evaluation of the detection process on an image subset of the VALID database, which contains lighting, pose and speech variation, with promising results.
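
The abstract does not say how the two detectors are combined, so the following is only a minimal sketch of one possible fusion rule, not the authors' method. It assumes a simple intensity threshold as a stand-in for region segmentation, a central-difference gradient threshold as a stand-in for edge detection, and a fusion step that keeps only those region-boundary pixels that the edge detector also supports. All function names, thresholds and the synthetic 7x7 image are hypothetical.

```python
def region_mask(img, thresh):
    """Stand-in for region segmentation: 1 where intensity >= thresh."""
    return [[1 if v >= thresh else 0 for v in row] for row in img]


def edge_mask(img, thresh):
    """Stand-in for edge detection: central differences, clamped at borders."""
    h, w = len(img), len(img[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            gx = img[y][min(x + 1, w - 1)] - img[y][max(x - 1, 0)]
            gy = img[min(y + 1, h - 1)][x] - img[max(y - 1, 0)][x]
            if max(abs(gx), abs(gy)) >= thresh:
                out[y][x] = 1
    return out


def boundary(mask):
    """Mask pixels with at least one 4-neighbour outside the mask or image."""
    h, w = len(mask), len(mask[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            if not mask[y][x]:
                continue
            for dy, dx in ((-1, 0), (1, 0), (0, -1), (0, 1)):
                ny, nx = y + dy, x + dx
                if not (0 <= ny < h and 0 <= nx < w) or not mask[ny][nx]:
                    out[y][x] = 1
                    break
    return out


def fuse(region, edges):
    """Assumed fusion rule: region-boundary pixels confirmed by an edge."""
    b = boundary(region)
    return [[b[y][x] & edges[y][x] for x in range(len(b[0]))]
            for y in range(len(b))]


def demo_image():
    """Synthetic image: bright 3x3 blob plus one low-contrast outlier pixel."""
    img = [[10] * 7 for _ in range(7)]
    for y in range(2, 5):
        for x in range(2, 5):
            img[y][x] = 100
    img[0][6] = 60  # passes the region threshold but has weak gradient
    return img


img = demo_image()
region = region_mask(img, 50)
edges = edge_mask(img, 80)
contour = fuse(region, edges)
```

In this toy case the region mask wrongly includes the low-contrast outlier pixel, while the edge mask alone is two pixels thick around the blob; requiring agreement between the two leaves only the blob's outline. A real system would use proper detectors and a contour representation suited to lips.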



Author information

Authors and Affiliations

Eurecom, 2229 Routes des Cretes, 06560, Sophia Antipolis, France
Usman Saeed & Jean-Luc Dugelay


Editor information

Editors and Affiliations

Unitat de Gràfics i Visió per Ordinador Departament de Ciències Matemàtiques i Informàtica, Universitat de les Illes Balears Edifici Anselm Turmeda, Ctra. de Valldemossa km 7,5, 07122, Palma de Mallorca, Spain
Francisco J. Perales
School of Informatics, University of Edinburgh, James Clerk Maxwell Building, The King’s Buildings, Mayfield Road, EH9 3JZ, Edinburgh, UK
Robert B. Fisher


About this paper

Cite this paper

Saeed, U., Dugelay, JL. (2010). Combining Edge Detection and Region Segmentation for Lip Contour Extraction. In: Perales, F.J., Fisher, R.B. (eds) Articulated Motion and Deformable Objects. AMDO 2010. Lecture Notes in Computer Science, vol 6169. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-14061-7_2


DOI: https://doi.org/10.1007/978-3-642-14061-7_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-14060-0
Online ISBN: 978-3-642-14061-7
eBook Packages: Computer Science, Computer Science (R0)
