Distribution-free bounds for relational classification

Dhurandhar, Amit; Dobra, Alin

doi:10.1007/s10115-011-0406-4

Distribution-free bounds for relational classification

Regular Paper
Published: 08 May 2011

Volume 31, pages 55–78, (2012)
Cite this article

Knowledge and Information Systems Aims and scope Submit manuscript

Amit Dhurandhar¹ &
Alin Dobra²

138 Accesses
5 Citations
Explore all metrics

Abstract

Statistical relational learning (SRL) is a subarea in machine learning which addresses the problem of performing statistical inference on data that is correlated and not independently and identically distributed (i.i.d.)—as is generally assumed. For the traditional i.i.d. setting, distribution-free bounds exist, such as the Hoeffding bound, which are used to provide confidence bounds on the generalization error of a classification algorithm given its hold-out error on a sample size of N. Bounds of this form are currently not present for the type of interactions that are considered in the data by relational classification algorithms. In this paper, we extend the Hoeffding bounds to the relational setting. In particular, we derive distribution-free bounds for certain classes of data generation models that do not produce i.i.d. data and are based on the type of interactions that are considered by relational classification algorithms that have been developed in SRL. We conduct empirical studies on synthetic and real data which show that these data generation models are indeed realistic and the derived bounds are tight enough for practical use.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Arias M, Feigelson A, Khardon R, Servedio R (2006) Polynomial certificates for propositional classes. Inf Comput 204(5): 816–834
Article MathSciNet MATH Google Scholar
Arias M, Khardon R (2002) Learning closed horn expressions. Inf Comput 178(1): 214–240
MathSciNet MATH Google Scholar
Bakir G, Hofmann T, Schölkopf B, Smola A, Taskar B, Vishwanathan SVN (2007) Predicting structured data. The MIT Press, Cambridge
Google Scholar
Bartlett P, Bousquet O, Mendelson S (2002) Local rademacher complexities. Ann Stat 33: 44–58
Google Scholar
Bennett G (1962) Probability inequalities for the sums of independent random variables. JASA 57: 33–45
MATH Google Scholar
Blum A, Kalai A, Langford J (1999) Beating the hold-out: bounds for k-fold and progressive cross-validation. Comput Learn Theory 203–208
Blumer A, Ehrenfueucht A, Haussler D, Warmuth M (1987) Occam’s razor. Inf Process Lett 24: 377–380
Article MATH Google Scholar
Chernoff H (1952) A measure of asymptotic efficiency for tests of a hypothesis based on the sum of observations. Ann Math Stat 23: 493–507
Article MathSciNet MATH Google Scholar
Cohen W (1995) Polynomial learnability and inductive logic programming: methods and results. New Gener Comput 13: 369–409
Article Google Scholar
Devroye L, Györfi L, Lugosi G (1996) A Probabilistic theory of pattern recognition. Springer, New York
MATH Google Scholar
Floyd S, Warmuth M (1995) Sample compression, learnability and the vapnik-chervonenkis dimension. Mach Learn 21: 269–304
Google Scholar
Friedman N, Getoor L, Koller D, Pfeffer A (1999) Learning probabilistic relational models. IJCAI 1300–1309
Getoor L, Taskar B (2007) Introduction to statistical relational learning. MIT Press, Cambridge
MATH Google Scholar
Godwin H (1955) On generalization of tchebyshev’s inequality. JASA 50: 923–945
MathSciNet MATH Google Scholar
Grimmett G, Stirzaker D (2001) Probability and random processes, 3rd edn. Oxford University Press, Oxford
Google Scholar
Hoeffding W (1963) Probability inequalities for sums of bounded random variables. JASA 58(301): 13–30
MathSciNet MATH Google Scholar
Hulten G, Domingos P, Abe Y (2003) Mining massive relational databases
Jensen D, Neville J (2002) Linkage and autocorrelation cause feature selection bias in relational learning
Jensen J (1906) Sur les fonctions convexes et les ingalits entre les valeurs moyennes. Acta Math 30: 175–193
Article MathSciNet MATH Google Scholar
Jia Y, Zhang J, Huan J (2011) An efficient graph-mining method for complicated and noisy data with real-world applications. Knowl Inf Syst
Kok S, Singla P, Richardson M, Domingos P (2005) The alchemy system for statistical relational ai. Technical report, department of computer science and engineering, UW, http://www.cs.washington.edu/ai/alchemy/
Langford J (2005) Tutorial on practical prediction theory for classification. J Mach Learn Res 6: 273–306
MathSciNet MATH Google Scholar
Mcallester D (1999) Pac-bayesian model averaging. In: Proceedings of the twelfth annual conference on computational learning theory. ACM Press, pp 164–170
Neville J (2006) Statistical models and analysis techniques for learning in relational data. Ph.D. Thesis, University of Massachusetts Amhers
Neville J, Gallagher B, Eliassi-Rad T, Wang T (2011) Correcting evaluation bias of relational classifiers with network cross validation. Knowl Inf Syst
Neville J, Jensen D (2005) Leveraging relational autocorrelation with latent group models. In: MRDM ’05: Proceedings of the 4th international workshop on Multi-relational mining. ACM, New York, NY, USA, pp 49–55
Neville J, Jensen D (2007) Relational dependency networks. J Mach Learn Res 8: 653–692
MATH Google Scholar
Neville J, Jensen D, Gallagher B (2003) Simple estimators for relational bayesian classifiers
Okamoto M (1958) Some inequalities relating to the partial sum of binomial probabilites. Ann Inst Stat Math 10: 29–35
Article MATH Google Scholar
Papoulis A (1991) Probability, random variables and stochastic processes. 3. McGraw-Hill, New York
Google Scholar
Preisach C, Schmidt-Thieme L (2008) Ensembles of relational classifiers. Knowl Inf Syst 14(2): 249–272
Article MATH Google Scholar
Raedt L (1994) First order jk-clausal theories are pac-learnable. Artif Intell 70: 375–392
Article MATH Google Scholar
Reddy C, Park J (2010) Multi-resolution boosting for classification and regression problems. Knowl Inf Syst
Richardson M, Domingos P (2006) Markov logic networks. Mach Learn 62(1–2): 107–136
Article Google Scholar
Rusu F, Dobra A (2007) Pseudo-random number generation for sketch-based estimations. ACM Trans Database Syst 32(2): 11
Article Google Scholar
Savage I (1961) Probability inequalities of the tchebyshev type. J Res Natl Bur Stand 65B: 211–222
MathSciNet Google Scholar
Schmidt J, Siegel A, Srinivasan A (1995) Chernoff-hoeffding bounds for applications with limited independence. SIAM J Discret Math 8: 223–250
Article MathSciNet MATH Google Scholar
Taskar B, Abbeel P, Koller D (2002) Discriminative probabilistic models for relational data. In: Proceedings 18th conference on uncertainty in AI, pp 485–492
Vapnik V (1998) Statistical learning theory. Wiley, New York
MATH Google Scholar

Download references

Author information

Authors and Affiliations

IBM T. J. Watson, Yorktown Heights, NY, USA
Amit Dhurandhar
University of Florida, Gainesville, FL, USA
Alin Dobra

Authors

Amit Dhurandhar
View author publications
You can also search for this author in PubMed Google Scholar
Alin Dobra
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Amit Dhurandhar.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Dhurandhar, A., Dobra, A. Distribution-free bounds for relational classification. Knowl Inf Syst 31, 55–78 (2012). https://doi.org/10.1007/s10115-011-0406-4

Download citation

Received: 22 March 2010
Revised: 29 March 2011
Accepted: 23 April 2011
Published: 08 May 2011
Issue Date: April 2012
DOI: https://doi.org/10.1007/s10115-011-0406-4

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Distribution-free bounds for relational classification

Abstract

Access this article

Similar content being viewed by others

Statistical Relational Learning

What Kinds of Relational Features Are Useful for Statistical Learning?

Toward New Evaluation Metrics for Relational Learning

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Distribution-free bounds for relational classification

Abstract

Access this article

Similar content being viewed by others

Statistical Relational Learning

What Kinds of Relational Features Are Useful for Statistical Learning?

Toward New Evaluation Metrics for Relational Learning

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation