Abstract
Nowadays, any person, company or public institution uses and exploits different channels to share private or public information with other people (friends, customers, relatives, etc.) or institutions. This context has changed the journalism, thus, the major newspapers report news not just on its own web site, but also on several social media such as Twitter or YouTube. The use of multiple communication media stimulates the need for integration and analysis of the content published globally and not just at the level of a single medium. An analysis to achieve a comprehensive overview of the information that reaches the end users and how they consume the information is needed. This analysis should identify the main topics in the news flow and reveal the mechanisms of publication of news on different media (e.g. news timeline). Currently, most of the work on this area is still focused on a single medium. So, an analysis across different media (channels) should improve the result of topic detection. This paper shows the application of a graph analytical approach, called Keygraph, to a set of very heterogeneous documents such as the news published on various media. A preliminary evaluation on the news published in a 5 days period was able to identify the main topics within the publications of a single newspaper, and also within the publications of 20 newspapers on several on-line channels.
The research presented in this paper was partially funded by Keystone Action COST IC1302.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
ISTAT http://tinyurl.com/jc5sfc8.
- 2.
The code of the version 2.2 of March 2014 is available on-line at http://keygraph.codeplex.com/.
- 3.
- 4.
- 5.
The average circulations of each newspaper refer to February 2015 as reported by the Italian Federation of Newspaper Publishers (Federazione Italiana Editori Giornali available at http://www.fieg.it).
- 6.
As the content available on web sites and Facebook news is greater than the content on Twitter news, the number of shared keywords is different according to the channel: 5 if news is published on a Website or on Facebook, 3 if news is published on Twitter.
References
Allan, J. (ed.): Topic Detection and Tracking: Event-based Information Organization. Kluwer Academic Publishers, Norwell (2002)
Atefeh, F., Khreich, W.: A survey of techniques for event detection in Twitter. Comput. Intell. 31(1), 132–164 (2015)
Bergamaschi, S., Beneventano, D., Po, L., Sorrentino, S.: Automatic normalization and annotation for discovering semantic mappings. In: Ceri, S., Brambilla, M. (eds.) Search Computing. LNCS, vol. 6585, pp. 85–100. Springer, Heidelberg (2011). doi:10.1007/978-3-642-19668-3_8
Bergamaschi, S., Po, L., Sorrentino, S.: Comparing topic models for a movie recommendation system. WEBIST 2, 172–183 (2014)
Fiscus, J.G., Doddington, G.R.: Topic detection and tracking. In: Allan, J. (ed.) Topic Detection and Tracking Evaluation Overview, pp. 17–31. Kluwer Academic Publishers, Norwell (2002)
Garrido, A.L., Buey, M.G., Escudero, S., Ilarri, S., Mena, E., Silveira, S.B.: TM-gen: a topic map generator from text documents. In: 25th IEEE International Conference on Tools with Artificial Intelligence, Washington (USA). IEEE Computer Society, November 2013
Rajaraman, A., Ullman, J.D.: Mining of Massive Datasets. Cambridge University Press, New York (2011)
Sayyadi, H., Hurst, M., Maykov, A.: Event detection and tracking in social streams. In: Proceedings of the International Conference on Weblogs and Social Media (ICWSM 2009). AAAI (2009)
Sayyadi, H., Raschid, L.: A graph analytical approach for topic detection. ACM Trans. Internet Technol. 13(2), 4:1–4:23 (2013)
Trillo, R., Po, L., Ilarri, S., Bergamaschi, S., Mena, E.: Using semantic techniques to access web data. Inf. Syst. 36(2), 117–133 (2011)
Veglis, A.: Cross-media publishing by US newspapers. J. Electron. Publ. 10(2), 131–150 (2007)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Po, L., Rollo, F., Trillo Lado, R. (2017). Topic Detection in Multichannel Italian Newspapers. In: Calì, A., Gorgan, D., Ugarte, M. (eds) Semantic Keyword-Based Search on Structured Data Sources. IKC 2016. Lecture Notes in Computer Science(), vol 10151. Springer, Cham. https://doi.org/10.1007/978-3-319-53640-8_6
Download citation
DOI: https://doi.org/10.1007/978-3-319-53640-8_6
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-53639-2
Online ISBN: 978-3-319-53640-8
eBook Packages: Computer ScienceComputer Science (R0)