Abstract
The tracking and recognition of human motion is a challenging problem with diverse applications in virtual reality, medicine, teleoperations, animation, and human-computer interaction to name a few. The study of human motion has a long history with the use of images for analyzing animate motion beginning with the improvements in photography and the development of motion-pictures in the late nineteenth century. Scientists and artists such as Marey [12] and Muybridge [26] were early explorers of human and animal motion in images and image sequences. Today, commercial motion-capture systems can be used to accurately record the 3D movements of an instrumented person, but the motion analysis and motion recognition of an arbitrary person in a video sequence remains an unsolved problem. In this chapter we describe the representation and recognition of human motion using parameterized models of optical flow. A person’s limbs, face, and facial features are represented as patches whose motion In a image sequence can be modeled by low-order polynomials. A robust optical flow estimation technique is used to recover the motion of these patches and the recovered motion parameters provide a rich, yet concise, description of the human motion which can be used to recognize human activities, gestures, and facial expressions.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
M. Allmen and C.R. Dyer. Cyclic motion detection using spatiotemporal surfaces and curves. In ICPR, pages 365–370, 1990.
A. Azarbayejani, T. Starner, B. Horowitz, and A. Pentland. Visually controlled graphics. IEEE Transactions on Pattern Analysis and Machine Intelligence, 15 (6): 602–604, June 1993.
A. Baumberg and D. Hogg. Learning flexible models from image sequences. In J. Eklundh, editor, European Conf. on Computer Vision, ECCV-94, volume 800 of LNCS-Series, pages 299–308, Stockholm, Sweden, 1994. Springer-Verlag.
A. G. Bharatkumar, K. E. Daigle, M. G. Pandy, and J. K. Aggarwal. Lower limb kinematics of human walking with the medial axis tranfromation. In Proceedings of the Workshop on Motion of Non-rigid and Articulated Objects, pages 70–76, Austin, Texas, November 1994.
M. J. Black and P. Anandan. The robust estimation of multiple motions: Parametric and piecewise-smooth flow fields. Computer Vision and Image Understanding,63(1):75–104,. January 1996.
M. J. Black and A. D. Jepson. Eigentracking: Robust matching and tracking of articulated objects using a view-based representation. In B. Buxton and R. Cipolla, editors, European Conf. on Computer Vision, ECCV-96, volume 1064 of LNCSSeries, pages 329–342, Cambridge, UK, 1996. Springer-Verlag.
M. J. Black and Y. Yacoob. Tracking and recognizing rigid and non-rigid facial motions using local parametric models of image motions. In Proceedings of the International Conference on Computer Vision, pages 374–381, Boston, Mass., June 1995.
A. Blake and M. Isard. 3D position, attitude and shape input using video tracking of hands and lips. In Proceedings of SIGGRAPH 94, pages 185–192, 1994.
L.W. Campbell and A.F. Bobick. Recognition of human body motion using phase space constraints. In Proceedings of the International Conference on Computer Vision, pages 624–630, Boston, Mass., June 1995.
C. Cédras and M. Shah. Motion-based recognition: A survey. Image and Vision Computing, 13 (2): 129–155, March 1995.
R. Cipolla and A. Blake. Surface orientation and time to contact from image divergence and deformation. In G. Sandini, editor, Proc. of Second European Conference on Computer Vision, ECCV-92, volume 588 of LNCS-Series, pages 187–202. Springer-Verlag, May 1992.
François Dagognet. Etienne-Jules Marey: A Passion for the Trace. Zone Books, New York, 1992.
I. Essa, T. Darrell, and A. Pentland. Tracking facial motion. In Proceedings of the Workshop on Motion of Non-rigid and Articulated Objects, pages 36–42, Austin, Texas, November 1994.
I. A. Essa and A. Pentland. A vision system for observing and extracting facial action parameters. In Proc. Computer Vision and Pattern Recognition, CVPR-94, pages 76–83, Seattle, WA, June 1994.
D. Gavrila and L.S. Davis. 3d model-based tracking of humans in action: A multi-view approach. In Proc. Computer Vision and Pattern Recognition, CVPR-96, San Francisco, CA, June 1996.
L. Goncalves, E. Di Bernardo, E. Ursella, and P. Perona. Monocular tracking of the human arm in 3D. In Proceedings of the International Conference on Computer Vision, pages 744–770, Boston, Mass., June 1995.
F. R. Hampel, E. M. Ronchetti, P. J. Rousseeuw, and W. A. Stahel. Robust Statistics: The Approach Based on Influence Functions. John Wiley and Sons, New York, NY, 1986.
S. X. Ju, M. J. Black, and Y. Yacoob. Cardboard people: A parameterized model of articulated motion. In International Conference on Automatic Face and Gesture Recognition, pages 38–44, Killington, Vermont, 1996.
M. Kass, A. Witkin, and D. Terzopoulos. Snakes: Active contour models. In Proc. First International Conference on Computer Vision, pages 259–268, June 1987.
J. J. Koenderink and A. J. van Doorn. Invariant properties of the motion parallax field due to the movement of rigid bodies relative to an observer. Optica Acta, 22 (9): 773–791, 1975.
H. Li, P. Roivainen, and R. Forcheimer. 3-D motion estimation in model-based facial image coding. IEEE Transactions on Pattern Analysis and Machine Intelligence, 15 (6): 545–555, June 1993.
N. Li, S. Dettmer, and M. Shah. Lipreading using eigensequences. In International Workshop on Automatic Face and Gesture Recognition, pages 30–34, Zurich, 1995.
J. Little and J. Boyd. Describing motion for recognition. In International Symposium on Computer Vision, pages 235–240, Miami, FL, November 1995.
K. Mase. Recognition of facial expression from optical flow. IEICE Transactions, E 74: 3474–3483, 1991.
C. Morimoto, Y. Yacoob, and L.S. Davis. Recognition of head gestures using Hidden Markov Models. In Proceedings of the International Conference on Pattern Recognition, Vienna, Austria, 1996.
Eadweard Muybridge. The Human Figure in Motion. Dover Publications, Inc., New York, 1955.
S. A. Niyogi and E. H. Adelson. Analyzing and recognizing walking figures in xyt. In Proc. Computer Vision and Pattern Recognition, CVPR-94, pages 469–474, Seattle, WA, June 1994.
A. Pentland and B. Horowitz. Recovery of nonrigid motion and structure. IEEE Transactions on Pattern Analysis and Machine Intelligence, 13 (7): 730–742, July 1991.
K. Rangraajan, W. Allen, and M.A. Shah. Matching motion trajectories using scale-space. Pattern Recognition, 26 (4): 595–609, 1993.
J. Rehg and T. Kanade. Model-based tracking of self-occluding articulated objects. In Proceedings of the International Conference on Computer Vision, pages 612–617, Boston, Mass., June 1995.
K. Rohr. Towards model-based recognition of human movements in image sequences. CVGIP: Image Processing, 59: 94–115, 1994.
M. Rosenblum, Y. Yacoob, and L.S. Davis. Human emotion recognition from motion using a radial basis function network architecture. In Proceedings of the Workshop on Motion of Non-rigid and Articulated Objects, Austin, Texas, November 1994.
H. S. Sawhney. 3D geometry from planar parallax. In Computer Vision and Pattern Recognition, CVPR-94, pages 929–934, Seattle, WA, 1994.
S.M. Seitz and C.R. Dyer. Affine invariant detection of periodic motion. In Proc. Computer Vision and Pattern Recognition, CVPR-94, pages 970–975, Seattle, WA, June 1994.
T. Starner and A. Pentland. Visual recognition of American Sign Language using Hidden Markov Models. In International Workshop on Automatic Face and Gesture Recognition, pages 189–194, Zurich, 1995.
D. Terzopoulos and K. Waters. Analysis and synthesis of facial image sequences using physical and anatomical models. IEEE Transactions on Pattern Analysis and Machine Intelligence, 15 (6): 569–579, June 1993.
S. Toelg and T. Pogio. Towards an example-based image compression architecture for video-conferencing. Technical Report CAR-TR-723, Center for Automation Research, U. of Maryland, 1994.
J. Wang, G. Lorette, and P. Bouthemy. Analysis of human motion: A model-based approach. In 7th Scandinavian Conf. Image Analysis, Aalborg, Denmark, 1991.
Y. Yacoob and L.S. Davis. Computing spatio-temporal representations of human faces. In Proc. Computer Vision and Pattern Recognition, CVPR-94, pages 70–75, Seattle, WA, June 1994.
A. L. Yuille, D. S. Cohen, and P. W. Hallinan. Feature extraction from faces using deformable templates. In Proc. Computer Vision and Pattern Recognition, CVPR89, pages 104–109, June 1989.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1997 Springer Science+Business Media Dordrecht
About this chapter
Cite this chapter
Black, M.J., Yacoob, Y., Ju, S.X. (1997). Recognizing Human Motion Using Parameterized Models of Optical Flow. In: Shah, M., Jain, R. (eds) Motion-Based Recognition. Computational Imaging and Vision, vol 9. Springer, Dordrecht. https://doi.org/10.1007/978-94-015-8935-2_11
Download citation
DOI: https://doi.org/10.1007/978-94-015-8935-2_11
Publisher Name: Springer, Dordrecht
Print ISBN: 978-90-481-4870-7
Online ISBN: 978-94-015-8935-2
eBook Packages: Springer Book Archive