short-paper

Digital video scenes identification using audiovisual features

Authors:
Danilo Barbosa Coimbra

Universidade de São Paulo, São Carlos, SP -- Brasil

Universidade de São Paulo, São Carlos, SP -- Brasil
View Profile

,
Rudinei Goularte

Universidade de São Paulo, São Carlos, SP -- Brasil

Universidade de São Paulo, São Carlos, SP -- Brasil
View Profile

WebMedia '09: Proceedings of the XV Brazilian Symposium on Multimedia and the WebOctober 2009Article No.: 43Pages 1–4https://doi.org/10.1145/1858477.1858520

Published:05 October 2009Publication History

WebMedia '09: Proceedings of the XV Brazilian Symposium on Multimedia and the Web

Pages 1–4

ABSTRACT

This paper proposes a new technique to segment digital video into semantic units called scenes. This technique is derived from the combined use of color histograms and silence detection, visual and aural features, respectively. The category of digital video used in this work is television news. The results show that the technique can identify more than 80% of the video scenes.

References

}}A. Barla, F. Odone, and A. Verri. Histogram intersection kernel for image classification. In Image Processing, 2003. ICIP 2003. Proceedings. 2003 International Conference on, volume 3, pages III-513--16 vol. 2, 2003.Google ScholarCross Ref
}}J. Calic and E. Izquierdo. Temporal segmentation of mpeg video streams. EURASIP J. Appl. Signal Process., 2002(1):561--565, 2002. Google ScholarDigital Library
}}J.-R. Cao. Algorithm of scene segmentation based on svm for scenery documentary. icnc, 3:95--98, 2007. Google ScholarDigital Library
}}A. Dong and H. Li. Semantic segmentation of documentary video using music breaks. icme, 0:1825--1828, 2006.Google Scholar
}}J. Han and K.-K. Ma. Fuzzy color histogram and its use in color image retrieval. Image Processing, IEEE Transactions on, 11(8):944--952, 2002. Google ScholarDigital Library
}}A. Hanjalic. Content-Based Analysis of Digital Video. Kluwer Academic Publishers, 2004. 193 pags.Google ScholarDigital Library
}}H. Jiang, T. Lin, and H.-J. Zhang. Video segmentation with the assistance of audio content analysis. In Proc. IEEE International Conference on Multimedia and Expo ICME 2000, volume 3, pages 1507--1510 vol. 3, 2000.Google ScholarCross Ref
}}W. Lee, H. Kim, H. Kang, J. Lee, Y. Kim, and S. Jeon. Video cataloging system for real-time scene change detection of news video. pages 705--715. 2005.Google Scholar
}}Y. Li, S. Narayanan, and C. Kuo. Content-based movie analysis and indexing based on audiovisual cues. 14(8):1073--1085, 2004. Google ScholarDigital Library
}}Y. Li and D. Zhang, Tong anf Tretter. An overview of video abstraction techniques. Technical report, HP Image Systems Laboratory, Palo Alto, CA, 2001.Google Scholar
}}L. Lu, H. Jiang, and H. Zhang. A robust audio classification and segmentation method. In MULTIMEDIA '01: Proceedings of the ninth ACM international conference on Multimedia, pages 203--211, 2001. Google ScholarDigital Library
}}X. Shao, C. Xu, N. C. Maddage, Q. Tian, M. S. Kankanhalli, and J. S. Jin. Automatic summarization of music videos. ACM Trans. Multimedia Comput. Commun. Appl., 2(2):127--148, 2006. Google ScholarDigital Library
}}T. Zhang and C.-C. Kuo. Hierarchical classification of audio data for archiving and retrieving. In Acoustics, Speech, and Signal Processing, 1999. ICASSP '99. Proceedings., 1999 IEEE International Conference on, volume 6, pages 3001--3004 vol. 6, Mar 1999. Google ScholarDigital Library
}}L. Zhao, S.-Q. Yang, and B. Feng. Video scene detection using slide windows method based on temporal constrain shot similarity. In Proc. IEEE International Conference on Multimedia and Expo ICME 2001, pages 1171--1174, 2001.Google ScholarCross Ref
}}S. Zhu and Y. Liu. A novel scheme for video scenes segmentation and semantic representation. In Proc. IEEE International Conference on Multimedia and Expo, pages 1289--1292, 2008.Google Scholar

Index Terms

Digital video scenes identification using audiovisual features
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision problems
        Image segmentation
        Video segmentation
      2. Computer vision tasks
        Scene understanding
        Video summarization

Recommendations

Salient objects detection in dynamic scenes using color and texture features

Visual saliency is an important research topic in the field of computer vision due to its numerous possible applications. It helps to focus on regions of interest instead of processing the whole image or video data. Detecting visual saliency in still ...
Read More
Using the eyes to encode and recognize social scenes
ETRA '06: Proceedings of the 2006 symposium on Eye tracking research & applications

In a previous study, we found that observers look mostly at the eyes when viewing natural scenes containing one or more people (Birmingham et al. submitted). This prioritization of eye regions occurred regardless of the type of scene being viewed (e.g. ...
Read More
Robust TV news story identification via visual characteristics of anchorperson scenes
PSIVT'06: Proceedings of the First Pacific Rim conference on Advances in Image and Video Technology

In this paper, a new scheme for TV news segmentation via exploring the efficient visual features is proposed especially for TV news which contains lots of changeful background of anchorperson shots. The proposed scheme can be divided into two parts: ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
WebMedia '09: Proceedings of the XV Brazilian Symposium on Multimedia and the Web
October 2009
382 pages
ISBN:9781605588803
DOI:10.1145/1858477
Conference Chairs:
Fernando Antonio Mota Trinta
Unifor
,
Pedro Porfírio
Unifor
,
Program Chairs:
Rudinei Goularte,
Renata Pontin de Mattos Fortes
Copyright © 2009 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 5 October 2009
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Qualifiers
- short-paper
Conference

Acceptance Rates
Overall Acceptance Rate270of873submissions,31%
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 2
  Total Citations
  View Citations
- 72
  Total Downloads
- Downloads (Last 12 months)2
- Downloads (Last 6 weeks)2
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Digital video scenes identification using audiovisual features

WebMedia '09: Proceedings of the XV Brazilian Symposium on Multimedia and the Web

ABSTRACT

References

Cited By

Index Terms

Recommendations

Salient objects detection in dynamic scenes using color and texture features

Using the eyes to encode and recognize social scenes

Robust TV news story identification via visual characteristics of anchorperson scenes

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Qualifiers

Conference

Acceptance Rates

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Digital video scenes identification using audiovisual features

WebMedia '09: Proceedings of the XV Brazilian Symposium on Multimedia and the Web

ABSTRACT

References

Cited By

Index Terms

Recommendations

Salient objects detection in dynamic scenes using color and texture features

Using the eyes to encode and recognize social scenes

Robust TV news story identification via visual characteristics of anchorperson scenes

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Qualifiers

Conference

Acceptance Rates

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media