ScienceDirect® Home Skip Main Navigation Links
You have guest access to ScienceDirect. Find out more.
 
Home
Browse
My Settings
Alerts
Help
 Quick Search
 Search tips (Opens new window)
    Clear all fields    
advertisementadvertisement
Computer Vision and Image Understanding
Volume 102, Issue 3, June 2006, Pages 260-316
 
Font Size: Decrease Font Size  Increase Font Size
 Abstract - selected
Article
Purchase PDF (1973 K)

  E-mail Article   
  Add to my Quick Links   
Bookmark and share in 2collab (opens in new window)
Request permission to reuse this article
  Cited By in Scopus (0)
 
 
 
Related Articles in ScienceDirect
View More Related Articles
 
View Record in Scopus
 
doi:10.1016/j.cviu.2004.10.012    How to Cite or Link Using DOI (Opens New Window)
Copyright © 2006 Elsevier Inc. All rights reserved.

Erratum

A multi-Kalman filtering approach for video tracking of human-delineated objects in cluttered environments

Jean GaoCorresponding Author Contact Information, E-mail The Corresponding Author, Akio KosakaE-mail The Corresponding Author and Avinash C. KakE-mail The Corresponding Author

Robot Vision Lab, School of Electrical and Computer Engineering, Purdue University, West Lafayette, IN 47907, USA

Received 12 November 2002; 
accepted 27 October 2004. 
Available online 4 April 2006.


Refers to:Erratum to “A multi-Kalman filtering approach for video tracking of human-delineated objects in cluttered environments” [Comput. Vision Image Understanding 99 (2005) 1–57]
Computer Vision and Image UnderstandingVolume 102, Issue 3June 2006, Page 259
PDF (48 K)
Referred to by:Erratum to “A multi-Kalman filtering approach for video tracking of human-delineated objects in cluttered environments” [Comput. Vision Image Understanding 99 (2005) 1–57]
Computer Vision and Image UnderstandingVolume 102, Issue 3June 2006, Page 259
PDF (48 K)
Purchase the full-text article



References and further reading may be available for this article. To view references and further reading you must purchase this article.

Abstract

In this paper, we propose a new approach that uses a motion–estimation based framework for video tracking of objects in cluttered environments. Our approach is semi-automatic, in the sense that a human is called upon to delineate the boundary of the object to be tracked in the first frame of the image sequence. The approach presented requires no camera calibration; therefore it is not necessary that the camera be stationary. The heart of the approach lies in extracting features and estimating motion through multiple applications of Kalman filtering. The estimated motion is used to place constraints on where to seek feature correspondences; successful correspondences are subsequently used for Kalman-based recursive updating of the motion parameters. Associated with each feature is the frame number in which the feature makes its first appearance in an image sequence. All features that make first-time appearances in the same frame are grouped together for Kalman-based updating of motion parameters. Finally, in order to make the tracked object look visually familiar to the human observer, the system also makes its best attempt at extracting the boundary contour of the object—a difficult problem in its own right since self-occlusion created by any rotational motion of the tracked object would cause large sections of the boundary contour in the previous frame to disappear in the current frame. Boundary contour is estimated by projecting the previous-frame contour into the current frame for the purpose of creating neighborhoods in which to search for the true boundary in the current frame. Our approach has been tested on a wide variety of video sequences, some of which are shown in this paper.

Keywords: Tracking; Kalman filtering; Object tracking; Normalized cross-correlation; Perspective; Segmentation; Motion estimation; Recursive motion estimation; Feature extraction; Correspondence problem; Extended Kalman filtering; Boundary extraction; Region growing; Semi-automatic segmentation; Human-in-the-loop segmentation; Video surveillance; Video tracking

Article Outline

1. Introduction
2. The motion tracking framework—an overview
3. Extraction of feature points, their representations, and uncertainty modeling
3.1. Automatic selection of feature points for tracking and for boundary description
3.2. Object representation for tracking and uncertainty modeling
3.3. Feature prediction and finding correspondences
3.3.1. Motion uncertainty nUk prediction
3.3.2. Projecting predicted motion uncertainty into image space
3.3.3. Feature extraction using predicted uncertainty
4. Two-frame motion estimation
4.1. Updating motion uncertainty from initial feature correspondences
4.2. A second update of motion uncertainty
4.3. Seeking new matches for invalidated feature pairings
5. Multi-frame based motion estimation
5.1. Motion vector estimation
5.1.1. Feature representation
5.1.2. Genesis frame based grouping of features in the current frame
5.1.3. Final motion estimation
5.2. Shape vector estimation
6. Object boundary updating by region-growing
6.1. Boundary prediction and uncertainty field definition
6.2. Region-growing for boundary point detection
6.2.1. Recursive partitioning of the eroded boundary
6.2.2. Growing boundary segments
6.3. Selecting new features for tracking
7. Experimental results
7.1. Experiments with synthetic data
7.2. Experiments with real video sequences
7.2.1. Video 1
7.2.2. Video 2
7.2.3. Video 3
7.2.4. Video 4
8. Concluding remarks
Appendix A. Motion transform prediction from two transforms
Appendix B. Jacobian matrix of perspective motion transform
Appendix C. Motion estimation from two transforms
Appendix D. Motion estimation from inverse transforms
Appendix E. Jacobian matrices used in depth updating
References