doi:10.1016/S0167-8655(02)00218-0
Copyright © 2002 Published by Elsevier Science B.V.
Video segmentation based on 2D image analysis
a NPDI/DCC/UFMG, Caixa Postal 702, 30161-970, Belo Horizonte, MG, Brazil
b A
2SI/ESIEE––Cité Descartes, BP 99, 93162, Noisy le Grand, France
c IC/UNICAMP, Caixa Postal 6176, 13083-970, Campinas, SP, Brazil
Available online 12 October 2002.
References and further reading may be available for this article. To view references and further reading you must
purchase this article.
Abstract
The video segmentation problem consists in the identification of the boundary between consecutive shots. The common approach to solve this problem is based on dissimilarity measures between frames. In this work, the video segmentation problem is transformed into a problem of pattern detection, where each video event is transformed into a different pattern on a 2D image, called visual rhythm, obtained by a specific transformation. In our analysis we use topological and morphological tools to detect cuts. Also, we use discrete line analysis and max tree analysis to detect fade transitions and flashes, respectively. We present a comparative analysis of our method for cut detection with respect to some other methods, which shows the better results of our method.
Author Keywords: Video segmentation; Visual rhythm; Mathematical morphology; Image segmentation
Fig. 1. Video transformation: (a) simplification of the video content by transformation of each frame into a column on R; (b) a real example of the principal diagonal sub-sampling.
Fig. 2. Visual rhythm by sub-sampling (a) and by histogram (b) computed from the same video.
Fig. 3. Cut detection from a visual rhythm by sub-sampling (a)–(f) and by histogram (g)–(l): visual rhythms (a) and (g); thinning (b) and (h); maximum points (c) and (i); maxima filtering (d) and (j); normalized number of maximum points in the range [0,255] (e) and (k); detected cuts (white bars) superimposed on the visual rhythms (f) and (l).
Fig. 4. Fade detection process: (a) visual rhythm by histogram, (b) thresholding, (c) gradient, (d) line filtering and (e) result superimposed on the visual rhythm by histogram (a).
Fig. 5. Flash video detection: (a) some frames of a sequence with the flash presence; (b) visual rhythm by sub-sampling; (c) detected flash.
Fig. 6. Robustness (μ) measure.
Fig. 7. Experimental results.
Fig. 8. Experimental results for flash detection.
Table 1. Features of the videos which were selected for the experiments

Table 2. Quality measures

Cut detection (a) μ(0.10,0.30),
Em(0.03),
Rf(0.01) and γ; flash detection (b) μ(0.40,0.30),
Em(0.05),
Rf(0.01) and γ; fade detection (c). The best values are highlighted.