skip to main content
10.1145/1943403.1943423acmconferencesArticle/Chapter ViewAbstractPublication PagesiuiConference Proceedingsconference-collections
research-article

End-user feature labeling: a locally-weighted regression approach

Published:13 February 2011Publication History

ABSTRACT

When intelligent interfaces, such as intelligent desktop assistants, email classifiers, and recommender systems, customize themselves to a particular end user, such customizations can decrease productivity and increase frustration due to inaccurate predictions - especially in early stages, when training data is limited. The end user can improve the learning algorithm by tediously labeling a substantial amount of additional training data, but this takes time and is too ad hoc to target a particular area of inaccuracy. To solve this problem, we propose a new learning algorithm based on locally weighted regression for feature labeling by end users, enabling them to point out which features are important for a class, rather than provide new training instances. In our user study, the first allowing ordinary end users to freely choose features to label directly from text documents, our algorithm was both more effective than others at leveraging end users' feature labels to improve the learning algorithm, and more robust to real users' noisy feature labels. These results strongly suggest that allowing users to freely choose features to label is a promising method for allowing end users to improve learning algorithms effectively.

References

  1. Attenberg, J., Melville, P., and Provost, F. A unified approach to active dual supervision for labeling features and examples, in Proc. European Conf. Machine Learn-ing (2010). Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Cleveland, W., and Devlin, S. Locally-weighted regres-sion: An approach to regression analysis by local fitting. J. American Statistical Assn. 83, 403 (1988), 596--610.Google ScholarGoogle ScholarCross RefCross Ref
  3. Cohn D. A., Ghahramani, Z., and Jordan, M. I. Active learning with statistical models. J. Artificial Intelligence Research, 4 (1996), 129--145. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Deng, K. Omega: On-line Memory-Based General Pur-pose System Classifier (PhD Dissertation). Carnegie Mellon University, Pittsburgh, PA, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Druck, G., Mann, G., and McCallum, A. Learning from labeled features using generalized expectation criteria, in Proc. SIGIR (2008), ACM, 595--602. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Hastie, T., Tibshirani, R., and Friedman, J. H. The Elements of Statistical Learning. Springer, 2003.Google ScholarGoogle Scholar
  7. Kulesza, T., Wong, W.-K., Stumpf, S., Perona, S., White, S., Burnett, M., Oberst, I. Ko, A. Fixing the program my computer learned: Barriers for end users, challenges for the machine, in Proc. IUI (2009), ACM, 187--196. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Kulesza, T., Stumpf, S., Burnett, M., Wong, W.-K., Riche, Y., Moore, T., Oberst, I., Shinsel, A., McIntosh, K., Explanatory debugging: supporting end-user debugging of machine-learned programs, in Proc. IEEE Symp. Visual Languages and Human-Centric Computing (2010), IEEE, 41--48. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Lang, K. Newsweeder: Learning to filter netnews, in Proc. ICML (1995), 331--339.Google ScholarGoogle Scholar
  10. Lewis, D. Reuters-21578. Available at http://www.daviddlewis.com/resoursce/testcollections/reuters21578.Google ScholarGoogle Scholar
  11. 1Lewis, D. D., Yang, Y., Rose, T., Li, F. RCV1: A new benchmark collection for text categorization research. JMLR, 5 (2004), 361--397. http://www.jmlr.org/papers/volume5/lewis04a/lewis04a.pdf. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Liu, H., and Singh, P. ConceptNet-a practical com-monsense reasoning tool-kit. BT Technology Journal 22, 4 (2004), 211--226. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. McCallum, A., Mann, G., and Druck, G. Generalized Expectation Criteria (Technical Report UM-CS-2007-60). University of Massachusetts, Amherst, 2007.Google ScholarGoogle Scholar
  14. Nocedal, J. Updating quasi-Newton matrices with limited storage. Mathematics of Computation, 35 (1980), 773--782.Google ScholarGoogle ScholarCross RefCross Ref
  15. Raghavan, H., Madani, O., and Jones, R. Active Learning with Feedback on Both Features and Instances. JMLR 7 (2006), 1655--1686. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Raghavan, H. and Allan, J. An interactive algorithm for asking and incorporating feature feedback into support vector machines, in Proc. SIGIR (2007), ACM, 79--86. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Roth, D. and Small, K. Interactive feature space construction using semantic information, in Proc. CoNLL (2009), 66--74. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Sindhwani, V., Melville, P., and Lawrence, R. Uncertainty sampling and transductive experimental design for active dual supervision. Int. Conf. Machine Learning (2009), 953--960. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Speer, R., Havasi, C., and Lieberman, H. AnalogySpace: Reducing the dimensionality of common sense knowledge, in Proc. AAAI (2008). Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Stumpf, S. Rajaram, V., Li, L., Wong, W.-K., Burnett, M., Dietterich, T., Sullivan, E., and Herlocker J. Interacting meaningfully with machine learning systems: Three ex-periments. Int. J. Human-Computer Studies 67, 8 (2009), 639--662.. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. End-user feature labeling: a locally-weighted regression approach

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        IUI '11: Proceedings of the 16th international conference on Intelligent user interfaces
        February 2011
        504 pages
        ISBN:9781450304191
        DOI:10.1145/1943403

        Copyright © 2011 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 13 February 2011

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        Overall Acceptance Rate746of2,811submissions,27%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader