Improving Progressive Sampling via Meta-learning on Learning Curves

Leite, Rui; Brazdil, Pavel

doi:10.1007/978-3-540-30115-8_25

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 3201)

Included in the following conference series:

European Conference on Machine Learning

Abstract

This paper describes a method that can be seen as an improvement on standard progressive sampling. The standard method uses data samples of increasing size until the accuracy of the learned concept can no longer be improved. The issue addressed here is how to avoid using some of the samples in this progression. The paper presents a meta-learning approach for predicting the stopping point. The method requires just four iterations of progressive sampling. The information gathered is used to identify the nearest learning curves among those for which the sampling procedure was carried out in full. These curves in turn make it possible to predict the stopping point. Experimental evaluation shows that the method can lead to significant savings of time without significant losses of accuracy.
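
The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not state the stopping rule, the distance measure, the number of neighbours, or how their stopping points are combined, so the tolerance `tol`, the Euclidean distance, `k`, the low median, and both function names are assumptions chosen for illustration. Each learning curve is taken to be a list of accuracies measured on samples of increasing size.

```python
from statistics import median_low


def progressive_sampling_stop(curve, tol=0.005):
    """Index of the first sample after which accuracy stops improving.

    Assumed stopping rule: stop at sample i when moving to sample i + 1
    raises accuracy by no more than `tol`.
    """
    for i in range(len(curve) - 1):
        if curve[i + 1] - curve[i] <= tol:
            return i
    return len(curve) - 1


def predict_stop(partial, stored_curves, k=3, tol=0.005):
    """Predict a stopping index from the first few points of a new curve.

    `partial` holds the accuracies of the iterations run so far (four in
    the abstract). `stored_curves` are complete curves from earlier datasets.
    """
    n = len(partial)

    def distance(curve):
        # Assumed measure: Euclidean distance over the first n points.
        return sum((a - b) ** 2 for a, b in zip(partial, curve[:n])) ** 0.5

    nearest = sorted(stored_curves, key=distance)[:k]
    stops = [progressive_sampling_stop(c, tol) for c in nearest]
    # Never predict a stop earlier than the iterations already carried out.
    return max(n - 1, median_low(stops))


# Hypothetical accuracies, invented purely to exercise the sketch.
stored = [
    [0.60, 0.70, 0.76, 0.80, 0.82, 0.83, 0.83, 0.83],
    [0.55, 0.62, 0.66, 0.68, 0.68, 0.68, 0.68, 0.68],
    [0.61, 0.71, 0.77, 0.81, 0.84, 0.86, 0.87, 0.87],
    [0.90, 0.91, 0.91, 0.91, 0.91, 0.91, 0.91, 0.91],
]
partial = [0.60, 0.71, 0.76, 0.80]
predicted = predict_stop(partial, stored, k=3)
print(predicted)
```

With these invented curves, the three nearest neighbours stop at indices 5, 6 and 3, so the sketch predicts index 5 and the learner could jump straight to that sample size instead of training on every intermediate one.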

Author information

Authors and Affiliations

LIACC/FEP, University of Porto, Rua do Campo Alegre, 823, 4150-180, Porto
Rui Leite & Pavel Brazdil

Editor information

Editors and Affiliations

INSA-Lyon, LIRIS CNRS UMR5205, F-69621, Villeurbanne, France
Jean-François Boulicaut
Dipartimento di Informatica, Università degli Studi di Bari,
Floriana Esposito
Pisa KDD Laboratory, ISTI - CNR, Area della Ricerca di Pisa, Via Giuseppe Moruzzi 1, Pisa, Italy
Fosca Giannotti
Dipartimento di Informatica, Via F. Buonarroti 2, 56127, Pisa, Italy
Dino Pedreschi

Cite this paper

Leite, R., Brazdil, P. (2004). Improving Progressive Sampling via Meta-learning on Learning Curves. In: Boulicaut, J.-F., Esposito, F., Giannotti, F., Pedreschi, D. (eds) Machine Learning: ECML 2004. Lecture Notes in Computer Science, vol 3201. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30115-8_25

DOI: https://doi.org/10.1007/978-3-540-30115-8_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23105-9
Online ISBN: 978-3-540-30115-8
eBook Packages: Springer Book Archive
