ABSTRACT
There is an increasing amount of false claims in news, social media, and other web sources. While prior work on truth discovery has focused on the case of checking factual statements, this paper addresses the novel task of assessing the credibility of arbitrary claims made in natural-language text - in an open-domain setting without any assumptions about the structure of the claim, or the community where it is made. Our solution is based on automatically finding sources in news and social media, and feeding these into a distantly supervised classifier for assessing the credibility of a claim (i.e., true or fake). For inference, our method leverages the joint interaction between the language of articles about the claim and the reliability of the underlying web sources. Experiments with claims from the popular website snopes.com and from reported cases of Wikipedia hoaxes demonstrate the viability of our methods and their superior accuracy over various baselines.
- C. Castillo, M. Mendoza, and B. Poblete. Information credibility on twitter. In WWW 2011. Google ScholarDigital Library
- X. L. Dong, L. Berti-Equille, and D. Srivastava. Integrating conflicting data: The role of source dependence. Proc. VLDB Endow., 2(1):550--561, 2009. Google ScholarDigital Library
- A. Galland, S. Abiteboul, A. Marian, and P. Senellart. Corroborating information from disagreeing views. In WSDM 2010. Google ScholarDigital Library
- A. Gupta, H. Lamba, P. Kumaraguru, and A. Joshi. Faking sandy: characterizing and identifying fake images on twitter during hurricane sandy. In WWW Companion 2013. Google ScholarDigital Library
- S. Kumar, R. West, and J. Leskovec. Disinformation on the web: Impact, characteristics, and detection of wikipedia hoaxes. In WWW 2016. Google ScholarDigital Library
- S. Kwon, M. Cha, K. Jung, W. Chen, and Y. Wang. Prominent features of rumor propagation in online social media. In ICDM 2013.Google ScholarCross Ref
- X. Li, X. L. Dong, K. Lyons, W. Meng, and D. Srivastava. Truth finding on the deep web: Is the problem solved? PVLDB, 6(2):97--108, 2012. Google ScholarDigital Library
- X. Li, W. Meng, and C. Yu. T-verifier: Verifying truthfulness of fact statements. In ICDE 2011. Google ScholarDigital Library
- Y. Li, J. Gao, C. Meng, Q. Li, L. Su, B. Zhao, W. Fan, and J. Han. A survey on truth discovery. SIGKDD Explorations, 17(2):1--16, 2015. Google ScholarDigital Library
- Y. Li, Q. Li, J. Gao, L. Su, B. Zhao, W. Fan, and J. Han. On the discovery of evolving truth. In KDD 2015. Google ScholarDigital Library
- S. Mukherjee and G. Weikum. Leveraging joint interactions for credibility analysis in news communities. In CIKM 2015. Google ScholarDigital Library
- S. Mukherjee, G. Weikum, and C. Danescu-Niculescu-Mizil. People on drugs: Credibility of user statements in health communities. In KDD 2014. Google ScholarDigital Library
- N. Nakashole and T. M. Mitchell. Language-aware truth assessment of fact candidates. In ACL 2014.Google ScholarCross Ref
- J. Pasternack and D. Roth. Knowing what to believe (when you already know something). In COLING 2010. Google ScholarDigital Library
- D. Pogue. At snopes.com, rumors are held up to the light. http://www.nytimes.com/2010/07/15/technology/personaltech/15pogue-email.html, July 15, 2010. {Online; accessed 26-Apr-2016}.Google Scholar
- V. Qazvinian, E. Rosengren, D. R. Radev, and Q. Mei. Rumor has it: Identifying misinformation in microblogs. In EMNLP 2011. Google ScholarDigital Library
- M. Recasens, C. Danescu-Niculescu-Mizil, and D. Jurafsky. Linguistic models for analyzing and detecting biased language. In ACL 2013.Google Scholar
- Q. Xu and H. Zhao. Using deep linguistic features for finding deceptive opinion spam. In COLING 2012.Google Scholar
- X. Yin, J. Han, and P. S. Yu. Truth discovery with multiple conflicting information providers on the web. IEEE Trans. on Knowl. and Data Eng., 20(6):796--808, June 2008. Google ScholarDigital Library
Index Terms
Credibility Assessment of Textual Claims on the Web
Recommendations
Assessing the Credibility of Claims on the Web
WWW '17 Companion: Proceedings of the 26th International Conference on World Wide Web CompanionIn my doctoral research, I plan to address the problem of assessing the credibility of arbitrary claims made in natural-language text - in an open-domain setting. Automatic credibility assessment is a complex task depending upon many factors. To start ...
Where the Truth Lies: Explaining the Credibility of Emerging Claims on the Web and Social Media
WWW '17 Companion: Proceedings of the 26th International Conference on World Wide Web CompanionThe web is a huge source of valuable information. However, in recent times, there is an increasing trend towards false claims in social media, other web-sources, and even in news. Thus, factchecking websites have become increasingly popular to identify ...
#CycloneGaja-Rank based Credibility Analysis System in social media during the crisis
AbstractOnline Social Networking (OSN) is a platform, which allows users to impart information to other people. Twitter plays an instrumental role in public and political conversations related to the particular crisis events. This research work aims to ...
Comments