Abstract
Current knowledge bases suffer from either low coverage or low accuracy. The underlying hypothesis of this work is that user feedback can greatly improve the quality of automatically extracted knowledge bases. The feedback could help quantify the uncertainty associated with the stored statements and would enable mechanisms for searching, ranking and reasoning at the entity-relationship level. Most importantly, a principled model for exploiting user feedback to learn the truth values of statements in the knowledge base would be a major step forward in addressing the issue of knowledge base curation.
We present a family of probabilistic graphical models that builds on user feedback and logical inference rules derived from the popular Semantic-Web formalism of RDFS [1]. Through internal inference and belief propagation, these models can learn both the truth values of the statements in the knowledge base and the reliabilities of the users who give feedback. We demonstrate the viability of our approach in extensive experiments on real-world datasets, with feedback collected from Amazon Mechanical Turk.
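To make the idea of jointly learning statement truth values and user reliabilities concrete, the following is a minimal sketch only. It is not the graphical model or the belief-propagation scheme of the paper: it swaps in a much simpler expectation-maximization loop over a one-parameter-per-user noise model, and it ignores the RDFS inference rules entirely. The function name `corroborate`, the initial reliability of 0.7, the uniform prior, and the add-one smoothing are all assumptions made for illustration.

```python
import math
from collections import defaultdict


def corroborate(feedback, prior=0.5, iterations=20):
    """Illustrative EM sketch (not the paper's model).

    feedback: list of (user, statement, vote) triples, where vote is True
    if the user says the statement holds and False otherwise.
    Returns (belief, reliability): the estimated probability that each
    statement is true, and the estimated probability that each user's
    vote matches the truth.
    """
    users = {u for u, _, _ in feedback}
    statements = {s for _, s, _ in feedback}
    reliability = {u: 0.7 for u in users}   # assumed starting value
    belief = {s: prior for s in statements}

    for _ in range(iterations):
        # E-step: posterior truth probability of each statement,
        # given the current user reliabilities.
        log_true = {s: math.log(prior) for s in statements}
        log_false = {s: math.log(1.0 - prior) for s in statements}
        for u, s, vote in feedback:
            r = reliability[u]
            log_true[s] += math.log(r if vote else 1.0 - r)
            log_false[s] += math.log(1.0 - r if vote else r)
        for s in statements:
            # Clamp the log-odds gap so math.exp cannot overflow.
            gap = max(-700.0, min(700.0, log_false[s] - log_true[s]))
            belief[s] = 1.0 / (1.0 + math.exp(gap))

        # M-step: expected fraction of votes that agree with the truth,
        # with add-one smoothing to keep reliabilities inside (0, 1).
        agree = defaultdict(float)
        total = defaultdict(float)
        for u, s, vote in feedback:
            agree[u] += belief[s] if vote else 1.0 - belief[s]
            total[u] += 1.0
        for u in users:
            reliability[u] = (agree[u] + 1.0) / (total[u] + 2.0)

    return belief, reliability


if __name__ == "__main__":
    # Hypothetical feedback: users "a" and "b" agree, user "c" contradicts.
    votes = [(u, s, v) for s, v in [("s1", True), ("s2", False)]
             for u in ("a", "b")]
    votes += [("c", "s1", False), ("c", "s2", True)]
    beliefs, reliabilities = corroborate(votes)
    print(beliefs)
    print(reliabilities)
```

Even in this reduced form, the two quantities reinforce each other: statements backed by users who usually agree with the consensus gain belief, and users who vote against well-supported statements lose reliability.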
References
W3C RDF: Vocabulary Description Language 1.0: RDF Schema, http://www.w3.org/TR/rdf-schema/
W3C: OWL Web Ontology Language, http://www.w3.org/TR/owl-features/
Minsky, M.: A Framework for Representing Knowledge. MIT-AI Laboratory Memo 306 (1974), http://web.media.mit.edu/~minsky/papers/Frames/frames.html
Brachman, R.J., Schmolze, J.: An Overview of the KL-ONE Knowledge Representation System. Cognitive Science 9(2) (1985)
Baader, F., Calvanese, D., McGuinness, D.L., Nardi, D., Patel-Schneider, P.F.: The Description Logic Handbook. Cambridge University Press, Cambridge (2003)
W3C SweoIG: The Linking Open Data Community Project, http://esw.w3.org/topic/SweoIG/TaskForces/CommunityProjects/LinkingOpenData
Wolfram Alpha: A Computational Knowledge Engine, http://www.wolframalpha.com/
EntityCube, http://entitycube.research.microsoft.com/
True Knowledge, http://www.trueknowledge.com/
Infer.NET, http://research.microsoft.com/en-us/um/cambridge/projects/infernet/
Bizer, C., Lehmann, J., Kobilarov, G., Auer, S., Becker, C., Cyganiak, R., Hellmann, S.: DBpedia: A Nucleus for a Web of Open Data. In: Aberer, K., Choi, K.-S., Noy, N., Allemang, D., Lee, K.-I., Nixon, L.J.B., Golbeck, J., Mika, P., Maynard, D., Mizoguchi, R., Schreiber, G., Cudré-Mauroux, P. (eds.) ASWC 2007 and ISWC 2007. LNCS, vol. 4825, pp. 722–735. Springer, Heidelberg (2007)
Lehmann, J., Schüppel, J., Auer, S.: Discovering Unknown Connections - The DBpedia Relationship Finder. In: 1st Conference on Social Semantic Web (CSSW 2007) pp. 99–110. GI (2007)
Suchanek, F.M., Sozio, M., Weikum, G.: SOFIE: Self-Organizing Flexible Information Extraction. In: 18th International World Wide Web conference (WWW 2009), pp. 631–640. ACM Press, New York (2009)
Suchanek, F.M., Kasneci, G., Weikum, G.: Yago: A Core of Semantic Knowledge. In: 16th International World Wide Web Conference (WWW 2007), pp. 697–706. ACM Press, New York (2007)
Kasneci, G., Suchanek, F.M., Ifrim, G., Ramanath, M., Weikum, G.: NAGA: Searching and Ranking Knowledge. In: 24th International Conference on Data Engineering (ICDE 2008), pp. 953–962. IEEE, Los Alamitos (2008)
Kasneci, G., Ramanath, M., Sozio, M., Suchanek, F.M., Weikum, G.: STAR: Steiner-Tree Approximation in Relationship Graphs. In: 25th International Conference on Data Engineering (ICDE 2009), pp. 868–879. IEEE, Los Alamitos (2009)
Kasneci, G., Shady, E., Weikum, G.: MING: Mining Informative Entity Relationship Subgraphs. In: 18th ACM Conference on Information and Knowledge Management (CIKM 2009), pp. 1653–1656. ACM Press, New York (2009)
Preda, N., Kasneci, G., Suchanek, F.M., Yuan, W., Neumann, T., Weikum, G.: Active Knowledge: Dynamically Enriching RDF Knowledge Bases by Web Services. In: 30th ACM International Conference on Management Of Data (SIGMOD 2010). ACM Press, New York (2010)
Wu, F., Weld, D.S.: Autonomously Semantifying Wikipedia. In: 16th ACM Conference on Information and Knowledge Management (CIKM 2007), pp. 41–50. ACM Press, New York (2007)
Weld, D.S., Wu, F., Adar, E., Amershi, S., Fogarty, J., Hoffmann, R., Patel, K., Skinner, M.: Intelligence in Wikipedia. In: 23rd AAAI Conference on Artificial Intelligence (AAAI 2008), pp. 1609–1614. AAAI Press, Menlo Park (2008)
Minka, T.P.: A Family of Algorithms for Approximate Bayesian Inference. Massachusetts Institute of Technology (2001)
Poole, D.: First-Order Probabilistic Inference. In: 18th International Joint Conference on Artificial Intelligence (IJCAI 2003), pp. 985–991. Morgan Kaufmann, San Francisco (2003)
Domingos, P., Singla, P.: Lifted First-Order Belief Propagation. In: 23rd AAAI Conference on Artificial Intelligence (AAAI 2008), pp. 1094–1099. AAAI Press, Menlo Park (2008)
Domingos, P., Richardson, M.: Markov Logic Networks. Machine Learning 62(1-2), 107–136 (2006)
Jaimovich, A., Meshi, O., Friedman, N.: Template Based Inference in Symmetric Relational Markov Random Fields. In: 23rd Conference on Uncertainty in Artificial Intelligence (UAI 2007), pp. 191–199. AUAI Press (2007)
Sen, P., Deshpande, A., Getoor, L.: PrDB: Managing and Exploiting Rich Correlations in Probabilistic Databases. Journal of Very Large Databases 18(5), 1065–1090 (2009)
Friedman, N., Getoor, L., Koller, D., Pfeffer, A.: Learning Probabilistic Relational Models. In: 16th International Joint Conference on Artificial Intelligence (IJCAI 1999), pp. 1300–1309. Morgan Kaufmann, San Francisco (1999)
Getoor, L.: Tutorial on Statistical Relational Learning. In: Kramer, S., Pfahringer, B. (eds.) ILP 2005. LNCS (LNAI), vol. 3625, pp. 415–415. Springer, Heidelberg (2005)
Da Costa, P.C.G., Ladeira, M., Carvalho, R.N., Laskey, K.B., Santos, L.L., Matsumoto, S.: A First-Order Bayesian Tool for Probabilistic Ontologies. In: 21st International Florida Artificial Intelligence Research Society Conference (FLAIRS 2008), pp. 631–636. AAAI Press, Menlo Park (2008)
Frey, B.J., Mackay, D.J.C.: A Revolution: Belief Propagation in Graphs with Cycles. In: Advances in Neural Information Processing Systems, vol. 10, pp. 479–485. MIT Press, Cambridge (1997)
Antova, L., Koch, C., Olteanu, D.: 10^(10^6) Worlds and Beyond: Efficient Representation and Processing of Incomplete Information. In: 23rd International Conference on Data Engineering (ICDE 2007), pp. 606–615. IEEE, Los Alamitos (2007)
Dalvi, N.N., Ré, C., Suciu, D.: Probabilistic Databases: Diamonds in the Dirt. Communications of ACM (CACM 2009) 52(7), 86–94 (2009)
Agrawal, P., Benjelloun, O., Sarma, A.D., Hayworth, C., Nabar, S.U., Sugihara, T., Widom, J.: Trio: A System for Data, Uncertainty, and Lineage. In: 32nd International Conference on Very Large Data Bases (VLDB 2006), pp. 1151–1154. ACM Press, New York (2006)
Osherson, D., Vardi, M.Y.: Aggregating Disparate Estimates of Chance. Games and Economic Behavior 56(1), 148–173 (2006)
Jøsang, A., Marsh, S., Pope, S.: Exploring Different Types of Trust Propagation. In: Stølen, K., Winsborough, W.H., Martinelli, F., Massacci, F. (eds.) iTrust 2006. LNCS, vol. 3986, pp. 179–192. Springer, Heidelberg (2006)
Kelly, D., Teevan, J.: Implicit Feedback for Inferring User Preference: A Bibliography. SIGIR Forum 37(2), 18–28 (2003)
Horst, H.J.T.: Completeness, Decidability and Complexity of Entailment for RDF Schema and a Semantic Extension Involving the OWL Vocabulary. Journal of Web Semantics: Science, Services and Agents on the World Wide Web 3(2-3), 79–115 (2005)
Galland, A., Abiteboul, S., Marian, A., Senellart, P.: Corroborating Information from Disagreeing Views. In: 3rd ACM International Conference on Web Search and Data Mining (WSDM 2010), pp. 1041–1064. ACM Press, New York (2010)
Raykar, V.C., Yu, S., Zhao, L.H., Valadez, G.H., Florin, C., Bogoni, L., Moy, L.: Learning From Crowds. Journal of Machine Learning Research 11, 1297–1322 (2010)
Pearl, J.: Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. Morgan Kaufmann, San Francisco (1997)
© 2010 Springer-Verlag Berlin Heidelberg
Cite this paper
Kasneci, G., Van Gael, J., Herbrich, R., Graepel, T. (2010). Bayesian Knowledge Corroboration with Logical Rules and User Feedback. In: Balcázar, J.L., Bonchi, F., Gionis, A., Sebag, M. (eds) Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2010. Lecture Notes in Computer Science, vol 6322. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15883-4_1
DOI: https://doi.org/10.1007/978-3-642-15883-4_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15882-7
Online ISBN: 978-3-642-15883-4
eBook Packages: Computer Science, Computer Science (R0)