Abstract
This paper focuses on the problem of news event timeline summary in Multi-Document Summarization, which aims to summarize multi-news regarding the same event in timeline. The majority of the traditional solutions to this problem consider the text surface features and topic-related features, such as the length of each sentence, the position of the sentence in the document, the number of topic words, etc. Traditional methods ignored that every event has its life circle including birth, growth, maturity and death. In this paper, a novel approach is presented for summarizing multi-news regarding the same topic in consideration of both the traditional features and the life circle feature of each event. The proposed approach consists of four steps. First, sentences and their publishing date are extracted from each news article. Second, the extracted sentences are pretreated to reduce the influence of noises like synonyms. Third, life circle features and other four categories of features which are common used in this field are collected. Finally, SVM model is used to train these features to recognize the summary sentence of the news document. This approach have been tested on the public datasets, DUC-2002 and TAC-2010, and the results show that our approach is more effective in summarizing multi-news in timeline than existing methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Yan, R., Kong, L., Hunag, C., Wan, X., Li, X., Zhang, Y.: Timeline generation through evolutionary trans-temporal summarization. In: EMNLP (2011)
Ramanujam, N.: Centroid based summarization of multiple documents implemented using timestamps. In: ICETET (2008)
Tran, B.: Structured summarization for news events. In: WWW (2013)
Tran, B., Alrifai, M., Nguyen, D.Q.: Predicting relevant news events for timeline summaries. In: WWW (2013)
Chen, K.-Y., Luesukprasert, L., Chou, S.-C.T.: Hot topic extraction based on timeline analysis and multidimensional sentence modeling. Knowledge and Data Engineering, 1016–1025 (2007)
Russell Swan, James Allan: Automatic generation of overview timelines. In: ACM, pp. 49-56 (2000)
Chawla, N.V., Lazarevic, A., Hall, L.O., Bowyer, K.W.: SMOTEBoost: Improving prediction of the minority class in boosting. In: Lavrač, N., Gamberger, D., Todorovski, L., Blockeel, H. (eds.) PKDD 2003. LNCS (LNAI), vol. 2838, pp. 107–119. Springer, Heidelberg (2003)
Chawla, N.V., Bowyer, K.W., Hall, L.O., Kegelmeyer, W.P.: SMOTE: synthetic minority over-sampling technique. Journal of Artificial Intelligence Research, 321–357 (2002)
Lin, C.-Y.: Rouge: A package for automatic evaluation of summaries. In: Text Summarization Branches Out: Proceedings of the ACL 2004 Workshop, pp. 74–81 (2004)
Deerwester, S., Dumais, S.T., Harshman, R.: Indexing by latent semantic analysis. JASIS (1990)
Chen, C.C., Chen, Y.-T., Sun, Y., Chen, M.C.: Life cycle modeling of news events using aging theory. In: Lawry, J., G. Shanahan, J., L. Ralescu, A. (eds.) ECML 2003 (LNAI), vol. 2873, pp. 47–59. Springer, Heidelberg (2003)
Barzilay, R., McKeown, K.R., Elhadad, M.: Information fusion in the context of multi-document summarization. In: Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics on Computational Linguistics, pp. 550–557 (1999)
Kevin, K., Daniel, M.: Summarization beyond sentence extraction: A probabilistic approach to sentence compression. Artificial Intelligence, 91–107 (2002)
McKeown, K.R., Klavans, J.L., Hatzivassiloglou, V., Barzilay, R., Eskin, E.: Towards multidocument summarization by reformulation: Progress and prospects. In: AAAI/IAAI, pp. 453–460 (1999)
Radev, D.R., Jing, H., Stys, M., Tam, D.: Centroid-based summarization of multiple documents. Information Processing & Management, 919–938 (2004)
Wan, X.: Manifold-ranking based topic-focused multi-document summarization. In: IJCAI, pp. 2903–2908 (2007)
Wan, X., Yang, J., Xiao, J.: Multi-document summarization using cluster-based link analysis. In: 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 299–306 (2008)
McKeown, K., Barzilay, R., Chen, J., Elson, D., Evans, D., Klavans, J., Nenkova, A., Schiffman, B., Sigelman, S.: Columbia’s newsblaster: new features and future directions. In: The 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology: Demonstrations, pp. 15–16 (2003)
Allan, J., Gupta, R., Khandelwal, V.: Temporal summaries of new topics. In: 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 10–18 (2001)
Wong, K.-F., Wu, M., Li, W.: Extractive summarization using supervised and semi-supervised learning. In: 22nd International Conference on Computational Linguistics, pp. 985–992 (2008)
Yan, R., Wan, X., Otterbacher, J., Kong, L., Li, X., Zhang, Y.: Evolutionary timeline summarization: a balanced optimization framework via iterative substitution. In: Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 745–754 (2011)
Tran, G.B., Tran, T.A., Tran, N.-K.: Leveraging learning To rank in an optimization framework for timeline Summarization. In: SIGIR (2013)
Zhao, W.X., Guo, Y., Yan, R., He, Y., Li, X.: Timeline generation with social attention. In: 36th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 1061–1064 (2013)
Georgescu, M., Pham, D.D., Kanhabua, N., Zerr, S., Siersdorfer, S., Nejdl, W.: Temporal summarization of event-related updates in wikipedia. In: Proceedings of the 22nd International Conference on World Wide Web Companion, pp. 281–284 (2013)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Chen, J., Niu, Z., Fu, H. (2015). A Multi-news Timeline Summarization Algorithm Based on Aging Theory. In: Cheng, R., Cui, B., Zhang, Z., Cai, R., Xu, J. (eds) Web Technologies and Applications. APWeb 2015. Lecture Notes in Computer Science(), vol 9313. Springer, Cham. https://doi.org/10.1007/978-3-319-25255-1_37
Download citation
DOI: https://doi.org/10.1007/978-3-319-25255-1_37
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-25254-4
Online ISBN: 978-3-319-25255-1
eBook Packages: Computer ScienceComputer Science (R0)