Abstract
Biometric systems, such as face tracking and recognition, are increasingly used as a means of security in many areas. The usability of these systems depends not only on how accurately they detect and recognize, but also on how well they withstand attacks. In this paper we develop a text-driven face-video signal from the XM2VTS database. The synthesized video can be used to mount a playback attack on face detection and recognition systems. We use a Hidden Markov Model to recognize the speech of a person, and use the resulting transcription file to reshuffle the image sequences according to the prompted text. The discontinuities in the new video are significantly reduced by a pyramid-based multi-resolution frame interpolation technique. The playback can also be used to test liveness detection systems that rely on lip-motion-to-speech synchronization and on the motion of the head while posing or speaking. Finally, we suggest possible approaches that enable biometric systems to withstand this kind of attack. Other uses of our results include web-based video communication for electronic commerce.
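The reshuffling step can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the transcription file uses an HTK-style label layout (`start end word`, times in 100 ns units), that the video runs at 25 frames per second, and that each word occurs once in the recording. The function names `parse_alignment` and `reshuffle` are hypothetical.

```python
from typing import Dict, List, Tuple

HTK_UNITS_PER_SECOND = 10_000_000  # HTK label times are in 100 ns units
FPS = 25  # assumed frame rate of the source video


def parse_alignment(label_text: str) -> Dict[str, Tuple[int, int]]:
    """Map each word to its (first_frame, end_frame_exclusive) range."""
    segments: Dict[str, Tuple[int, int]] = {}
    for line in label_text.strip().splitlines():
        start, end, word = line.split()[:3]
        first = int(start) * FPS // HTK_UNITS_PER_SECOND
        # Ceiling division so a partially covered last frame is kept.
        last = -((-int(end) * FPS) // HTK_UNITS_PER_SECOND)
        # Keep only the first occurrence of a word (assumption).
        segments.setdefault(word.lower(), (first, last))
    return segments


def reshuffle(
    segments: Dict[str, Tuple[int, int]], prompt: List[str]
) -> Tuple[List[int], List[int]]:
    """Return the source frame indices in prompted order, plus seam positions.

    A seam is the output position where two word segments meet; these are
    the places where frame interpolation would be needed to hide the jump.
    """
    frames: List[int] = []
    seams: List[int] = []
    for word in prompt:
        first, last = segments[word.lower()]
        if frames:
            seams.append(len(frames))
        frames.extend(range(first, last))
    return frames, seams


if __name__ == "__main__":
    labels = "0 4000000 zero\n4000000 8000000 one\n8000000 12000000 two"
    order, joins = reshuffle(parse_alignment(labels), ["two", "zero"])
    print(order, joins)
```

The returned seam positions mark the discontinuities that an interpolation step, such as the pyramid-based technique described above, would then smooth.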
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Teferi, D., Bigun, J. (2007). Pyramid Based Interpolation for Face-Video Playback in Audio Visual Recognition. In: Lee, SW., Li, S.Z. (eds) Advances in Biometrics. ICB 2007. Lecture Notes in Computer Science, vol 4642. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74549-5_91
DOI: https://doi.org/10.1007/978-3-540-74549-5_91
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74548-8
Online ISBN: 978-3-540-74549-5
eBook Packages: Computer Science, Computer Science (R0)