ABSTRACT
The past decades have seen an increase in academic research and public debates on online news and journalism in general, with an emphasis on fake news and low-quality reporting.
This paper presents TARO: a model and a software framework for the collection and analysis of online news sources.
The novel aspects of the TARO model and framework are: the distinction between abstract pieces of news and concrete news items, news comparison techniques based on similarity on embedded spaces, and the management of rolling news via so-called snapshot extensions. One advantage of TARO is the ability to perform comparative analysis of international news sources in various languages and across time zones.
To prove the applicability and soundness of TARO, two quantitative cases studies related to the concept of churnalism are also presented in this paper. The two case studies provide quantitative insights on two tendencies of news outlets: news commonality (publishing the same news) and news churn (quickly removing recent news to make space for even more recent news).
- Yoel Cohen. 2017. Diffusion Theories: News Diffusion. John Wiley & Sons, Ltd, Hoboken, USA, 1--11. https://doi.org/10.1002/9781118783764.wbieme0060 arXiv:https://onlinelibrary.wiley.com/doi/pdf/10.1002/9781118783764.wbieme0060Google Scholar
- Tony Harcup. 2021. Journalism: principles and practice. Sage Publications Ltd, Thousand Oaks, USA. 1--100 pages.Google Scholar
- Neha Heda Hemlata Shelar, Gagandeep Kaur and Poorva Agrawal. 2020. Named Entity Recognition Approaches and Their Comparison for Custom NER Model. Science & Technology Libraries 39(3) (2020), 324--337. https://doi.org/10.1080/0194262X.2020.1759479Google Scholar
- Alfirna Rizqi Lahitani, Adhistya Erna Permanasari, and Noor Akhmad Setiawan. 2016. Cosine similarity to determine similarity measure: Study case in online essay assessment. In 2016 4th International Conference on Cyber and IT Service Management. IEEE, Bandung, Indonesia, 1--6. https://doi.org/10.1109/CITSM.2016.7577578Google ScholarCross Ref
- Christopher D. Manning, Prabhakar Raghavan, and Hinrich Schütze. 2008. Introduction to information retrieval. Cambridge University Press, Cambridge, UK. https://doi.org/10.1017/CBO9780511809071Google Scholar
- Enrico Motta, Enrico Daga, Andreas L. Opdahl, and Bjørnar Tessem. 2020. Analysis and Design of Computational News Angles. IEEE Access 8 (2020), 120613--120626. https://doi.org/10.1109/ACCESS.2020.3005513Google ScholarCross Ref
- Andreas Opdahl and Bjørnar Tessem. 2021. Ontologies for finding journalistic angles. Software and Systems Modeling 20 (02 2021). https://doi.org/10.1007/s10270-020-00801-wGoogle ScholarDigital Library
- Gregory P. Perreault. 2022. Digital Journalism and the Facilitation of Hate (1st ed.). Routledge, London, UK. https://doi.org/10.4324/9781003284567Google Scholar
- Aurora Pons-Porrata, Rafael Berlanga-Llavori, and José Ruiz-Shulcloper. 2002. Temporal-Semantic Clustering of Newspaper Articles for Event Detection. In Pattern Recognition in Information Systems. 2nd International Workshop on Pattern Recognition in Information Systems, PRIS 2002, Ciudad Real, Spain, 104--113.Google Scholar
- David Tewksbury and Scott Althaus. 2000. Differences in Knowledge Acquisition among Readers of the Paper and Online Versions of a National Newspaper. Journalism & Mass Communication Quarterly 77 (09 2000), 457--479. https://doi.org/10.1177/107769900007700301Google Scholar
- Vikas Thada and Vivek Jaglan. 2013. Comparison of Jaccard, Dice, Cosine Similarity Coefficient To Find Best Fitness Value for Web Retrieved Documents Using Genetic Algorithm. International Journal of Innovations in Engineering and Technology 2 (08 2013), 202--205.Google Scholar
- Andreas Widholm. 2017. Online Methodology: Analysing News Flows of Online Journalism. Westminster Papers in Communication and Culture 5(2) (2017), 81--97. https://doi.org/10.16997/wpcc.69Google Scholar
Index Terms
- Comparison of news commonality and churn in international news outlets with TARO
Recommendations
Ranking scholarly outlets for information technology
SIGITE '09: Proceedings of the 10th ACM conference on SIG-information technology educationThe purpose of this paper is to establish a ranking for scholarly outlets for those publishing in the field of information technology. Many well-established disciplines have a number of outlets for scholarly work, including archival journals, conference ...
The impact of misconduct on the published medical and non-medical literature, and the news media
Better understanding of research and publishing misconduct can improve strategies to mitigate their occurrence. In this study, we examine various trends among 2,375 articles retracted due to misconduct in all scholarly fields. Proportions of articles ...
News stories as evidence for research? BBC citations from articles, Books, and Wikipedia
Although news stories target the general public and are sometimes inaccurate, they can serve as sources of real-world information for researchers. This article investigates the extent to which academics exploit journalism using content and citation ...
Comments