skip to main content
10.1145/1553374.1553386acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicmlConference Proceedingsconference-collections
research-article

Active learning for directed exploration of complex systems

Published:14 June 2009Publication History

ABSTRACT

Physics-based simulation codes are widely used in science and engineering to model complex systems that would be infeasible to study otherwise. Such codes provide the highest-fidelity representation of system behavior, but are often so slow to run that insight into the system is limited. For example, conducting an exhaustive sweep over a d-dimensional input parameter space with k-steps along each dimension requires kd simulation trials (translating into kd CPU-days for one of our current simulations). An alternative is directed exploration in which the next simulation trials are cleverly chosen at each step. Given the results of previous trials, supervised learning techniques (SVM, KDE, GP) are applied to build up simplified predictive models of system behavior. These models are then used within an active learning framework to identify the most valuable trials to run next. Several active learning strategies are examined including a recently-proposed information-theoretic approach. Performance is evaluated on a set of thirteen synthetic oracles, which serve as surrogates for the more expensive simulations and enable the experiments to be replicated by other researchers.

References

  1. Baram, Y., El-Yaniv, R., & Luz, K. (2004). Online choice of active learning algorithms. J. Machine Learning Research, 5, 255--291. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Burl, M. C., DeCoste, D., Enke, B. L., Mazzoni, D., Merline, W. J., & Scharenbroich, L. (2006). Automated knowledge discovery from simulators. SIAM Int. Conf. on Data Mining (pp. 82--93).Google ScholarGoogle ScholarCross RefCross Ref
  3. Chang, C.-C., & Lin, C.-J. (2001). LIBSVM: A library for support vector machines. http://www.csie.ntu.edu.tw/cjlin/libsvm.Google ScholarGoogle Scholar
  4. Cohn, D., Atlas, L., & Ladner, R. (1994). Improving generalization with active learning. Machine Learning, 15, 201--221. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Durda, D. D., Bottke, W. F., Enke, B. L., Merline, W. J., Asphaug, E., Richardson, D. C., & Leinhardt, Z. M. (2004). The formation of asteroid satellites in large impacts: results from numerical simulations. Icarus, 170, 243--257.Google ScholarGoogle ScholarCross RefCross Ref
  6. Gramacy, R. B., Lee, H. K., & Macready, W. G. (2004). Parameter space exploration with gaussian process trees. Int. Conf. on Machine Learning (pp. 353--360). Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Guestrin, C., Krause, A., & Singh, A. P. (2005). Near-optimal sensor placements in gaussian processes. Int. Conf. on Machine Learning (pp. 265--272). Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Holub, A., Perona, P., & Burl, M. C. (2008). Entropy-based active learning for object recognition. Proc. Online Learning for Classification (pp. 1--8).Google ScholarGoogle ScholarCross RefCross Ref
  9. Knuth, K., Erner, P., & Frasso, S. (2007). Designing intelligent instruments. MaxEnt, Amer. Inst. of Physics, Conf. Proc. 954 (pp. 203--211).Google ScholarGoogle Scholar
  10. Kwakernaak, H., & Sivan, R. (1972). Linear optimal control systems. Wiley-Interscience. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Lindenbaum, M., Markovitch, S., & Rusakov, D. (2004). Selective sampling for nearest neighbor classifiers. Machine Learning, 54, 125--152. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. MacKay, D. (1992). Information-Based Objective Functions for Active Data Selection. Neural Computation, 4, 590--604. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Olivetti, E. (2008). Sampling strategies for expensive data. Doctoral dissertation, U. of Trento.Google ScholarGoogle Scholar
  14. Pfingsten, T. (2006). Bayesian active learning for sensitivity analysis. Lec. Notes in C.S., 4212, 353--364. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Platt, J. (1999). Probabilistic outputs for support vector machines and comparison to regularized likelihood methods. Adv. in Large Margin Classifiers (pp. 61--74). MIT Press.Google ScholarGoogle Scholar
  16. Sacks, J., Welch, W., Mitchell, T., & Wynn, H. (1989). Design and Analysis of Computer Experiments. Statistical Science, 4, 409--423.Google ScholarGoogle ScholarCross RefCross Ref
  17. Tong, S., & Koller, D. (2001). Support vector machine active learning with applications to text classification. Machine Learning Research, 2, 45--66. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Vapnik, V. (1995). The nature of statistical learning theory. Springer, New York. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Veeramachaneni, S., Olivetti, E., & Avesani, P. (2006). Active sampling for detecting irrelevant features. Int. Conf. on Machine Learning (pp. 961--968). Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Zadrozny, B. (2004). Learning and evaluating classifiers under sample selection bias. Int. Conf. on Machine Learning (p. 114). Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Active learning for directed exploration of complex systems

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Other conferences
            ICML '09: Proceedings of the 26th Annual International Conference on Machine Learning
            June 2009
            1331 pages
            ISBN:9781605585161
            DOI:10.1145/1553374

            Copyright © 2009 Copyright 2009 by the author(s)/owner(s).

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 14 June 2009

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • research-article

            Acceptance Rates

            Overall Acceptance Rate140of548submissions,26%

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader