Skip to main content

An Efficient Active Learning Method Based on Random Sampling and Backward Deletion

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 7751))

Abstract

Active learning aims to select data samples which would be the most informative to improve classification performance so that their class labels are obtained from an expert. Recently, an active learning method based on locally linear reconstruction(LLR) has been proposed and the performance of LLR was demonstrated well in the experiments comparing with other active learning methods. However, the time complexity of LLR is very high due to matrix operations required repeatedly for data selection. In this paper, we propose an efficient active learning method based on random sampling and backward deletion. We select a small subset of data samples by random sampling from the total data set, and a process of deleting the most redundant points in the subset is performed iteratively by searching for a pair of data samples having the smallest distance. The distance measure using a graph-based shortest path distance is utilized in order to consider the underlying data distribution. Experimental results demonstrate that the proposed method has very low time complexity, but the prediction power of data samples selected by our method outperforms that by LLR.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Settles, B.: Active learning literature survey: Computer sciences technical report 1648, University of Wisconsin-Madison (2009)

    Google Scholar 

  2. Freund, Y., Seung, H.S., Shamir, E., Tishby, N.: Selective sampling using the query by committee algorithm. Machine Learning 28(2-3), 138–168 (1997)

    Google Scholar 

  3. Tong, S., Keller, D.: Support vector machine active learning with applications to text classifications. Journal of Machine Learning Research 2, 45–66 (2002)

    MATH  Google Scholar 

  4. Cohn, D.A., Ghahramani, Z., Jordan, M.I.: Active learning with statistical models. Journal of Artificial Intelligence Research 4, 129–145 (1996)

    MATH  Google Scholar 

  5. Zhu, X., Ghahramani, Z., Lafferty, J.: Semi-supervised learning using Gaussian Fields and Harmonic functions. In: Proceedings of ICML, pp. 912–919 (2003)

    Google Scholar 

  6. Atkinson, A., Donev, A., Tobias, R.: Optimum Experimental Designs, with SAS. Oxford Univ. Press (2007)

    Google Scholar 

  7. Asprey, S.P., Macchietto, S.: Designing robust optimal dynamic experimetns. J. Process Control 12(4), 545–556 (2002)

    Article  Google Scholar 

  8. Hardin, R.H., Sloane, N.J.A.: A new approach to the construction of optimal designs: J. Statistical Planning and Inference 37(3), 339–369 (1993)

    Article  MathSciNet  MATH  Google Scholar 

  9. Zhang, L., Chen, C., Bu, J., Cai, D., He, X., Huang, T.S.: Active Learning Based on Locally Linear Reconstruction. IEEE Trans. Pattern Anal. Machine Int. 33(10), 2026–2038 (2011)

    Article  Google Scholar 

  10. Groves, R.M., Fowler Jr., F.J.: Survey methodology. Wiley (2010)

    Google Scholar 

  11. Tan, P., Steinbach, M., Kumar, V.: Introduction to Data Mining. Addison Wesley, Boston (2006)

    Google Scholar 

  12. UCI Machine Learning Repository, http://archive.ics.uci.edu/ml/

  13. A Library for SVM, http://www.csie.ntu.edu.tw/~cjlin/libsvm/

  14. Machine Learning Group at University of Waikato, http://www.cs.waikato.ac.nz/ml/weka/

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Woo, H., Park, C.H. (2013). An Efficient Active Learning Method Based on Random Sampling and Backward Deletion. In: Yang, J., Fang, F., Sun, C. (eds) Intelligent Science and Intelligent Data Engineering. IScIDE 2012. Lecture Notes in Computer Science, vol 7751. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-36669-7_83

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-36669-7_83

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-36668-0

  • Online ISBN: 978-3-642-36669-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics