ABSTRACT
In this paper we propose an approach to indoor surveillance and, in particular, to monitoring people's behaviour in a home automation context. The reference application is the silent, automatic monitoring of people living alone at home, conceived especially for people with limited autonomy (e.g., elderly or disabled people). The aim is to detect dangerous events (such as a person falling down) and to react to them by establishing a remote connection with low-performance clients, such as PDAs (Personal Digital Assistants). To this end, we propose an integrated server architecture, typically connected over an intranet to network cameras, that segments and tracks objects of interest; for objects classified as people, the system also evaluates the person's posture and infers possibly dangerous situations. Finally, the system is equipped with a specifically designed transcoding server that adapts the video content to the PDA's constraints (display area and bandwidth) and to the user's requests. The key elements of the proposal are a reliable real-time object detection and tracking module, a simple but effective posture classifier improved by a supervised learning phase, and a high-performance transcoding scheme inspired by the MPEG-4 object-level standard and tailored to PDAs. Results on different video sequences and a performance analysis are discussed.
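The detection-then-posture flow described above can be illustrated with a minimal sketch. It is not the method of the paper: the abstract does not specify the detector or the classifier, so the sketch assumes plain background differencing and a bounding-box aspect-ratio rule in place of the learned posture classifier. All function names and thresholds (`foreground_mask`, `classify_posture`, `threshold`, `lying_ratio`) are hypothetical.

```python
from typing import List, Optional, Tuple

Frame = List[List[int]]            # grayscale image: rows of 0-255 intensities
Box = Tuple[int, int, int, int]    # (top, left, bottom, right), inclusive


def foreground_mask(frame: Frame, background: Frame, threshold: int = 30) -> List[List[bool]]:
    """Mark pixels that differ from the background by more than `threshold`.

    Assumption: a static reference background; the real system would need
    an adaptive model that also handles shadows and lighting changes.
    """
    return [
        [abs(p - b) > threshold for p, b in zip(frame_row, bg_row)]
        for frame_row, bg_row in zip(frame, background)
    ]


def bounding_box(mask: List[List[bool]]) -> Optional[Box]:
    """Return the box enclosing all foreground pixels, or None if there are none."""
    rows = [r for r, row in enumerate(mask) if any(row)]
    cols = [c for row in mask for c, is_fg in enumerate(row) if is_fg]
    if not rows:
        return None
    return (min(rows), min(cols), max(rows), max(cols))


def classify_posture(box: Box, lying_ratio: float = 1.2) -> str:
    """Hypothetical rule: a blob much wider than tall is treated as 'lying'."""
    top, left, bottom, right = box
    height = bottom - top + 1
    width = right - left + 1
    return "lying" if width / height > lying_ratio else "standing"


def check_frame(frame: Frame, background: Frame) -> str:
    """Run detection and posture classification on a single frame."""
    box = bounding_box(foreground_mask(frame, background))
    if box is None:
        return "empty"
    return classify_posture(box)


if __name__ == "__main__":
    background = [[0] * 6 for _ in range(6)]
    fallen = [row[:] for row in background]
    for c in range(1, 5):
        fallen[4][c] = 200         # a wide, flat blob near the floor
    print(check_frame(fallen, background))
```

In a deployment of this kind, a "lying" result that persists over several frames would be the sort of event that triggers the remote connection to the PDA client; a single-frame decision like the one above would be too noisy on its own.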
Index Terms
- Computer vision techniques for PDA accessibility of in-house video surveillance