skip to main content
10.1145/3334480.3382945acmconferencesArticle/Chapter ViewAbstractPublication PageschiConference Proceedingsconference-collections
abstract

Things Change: Comparing Results Using Historical Data and User Testing for Evaluating a Recommendation Task

Published:25 April 2020Publication History

ABSTRACT

We address a recommendation task for next likely flight destination to customers of a major international airline company. We compare performance using historical flight data and an actual user evaluation. Using two years of historical flight data consisting of tens of millions of flights, an ensemble and a collaborative filtering approach obtained an accuracy of 47% and 20% using a test set of 100,000 customers, respectively, highlighting the challenge of the domain. We then evaluated our recommendations on 10,000 actual customers, with a 45-45-10 split among ensemble, collaborative filtering, and control group. The overall predictive power employed with real users was 23%, with the ensemble method having a predictive power of 19% and 30% for collaborative filtering. Results indicate that, in complex and shifting domains such as this one, one cannot rely solely on historical data for evaluating the impact of user recommendations. We discuss implications for recommendation systems and future research in this and related domains.

References

  1. J. Beel, M. Genzmehr, S. Langer, A. Nürnberger, and B. Gipp, "A comparative analysis of offline and online evaluations and discussion of research paper recommender system evaluation," in Proceedings of the international workshop on reproducibility and replication in recommender systems evaluation, 2013, pp. 7--14.Google ScholarGoogle Scholar
  2. S. Chen, W. Huang, M. Chen, J. Zhong, and J. Cheng, "Airlines Content Recommendations Based on Passengers' Choice Using Bayesian Belief Networks," in Bayesian Inference, J. P. Tejedor, Ed., ed: IntechOpen, 2017.Google ScholarGoogle Scholar
  3. H. Fani, E. Jiang, E. Bagheri, F. Al-Obeidat, W. Du, and M. Kargar, "User community detection via embedding of social network structure and temporal content," Information Processing & Management, vol. 57, p. 102056, 2020.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. S. Fitchett and A. Cockburn, "AccessRank: predicting what users will do next," presented at the Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, Austin, Texas, USA, 2012.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. J. L. Herlocker, J. A. Konstan, L. G. Terveen, and J. T. Riedl, "Evaluating collaborative filtering recommender systems," ACM Transactions on Information Systems, vol. 22, pp. 5--53, 2004.Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. C. Hueglin and F. Vannotti, "Data mining techniques to improve forecast accuracy in airline business," presented at the Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining, San Francisco, CA, 2001.Google ScholarGoogle Scholar
  7. B. P. Knijnenburg and M. C. Willemsen, "Evaluating recommender systems with user experiments," in Recommender Systems Handbook, ed Boston, MA: Springer, 2015, pp. 309--352.Google ScholarGoogle Scholar
  8. R. D. Lawrence, S. J. Hong, and J. Cherrier, "Passenger-based predictive modeling of airline no-show rates," presented at the Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining, Washington, D.C., 2003.Google ScholarGoogle Scholar
  9. D. Lian, V. W. Zheng, and X. Xie, "Collaborative filtering meets next check-in location prediction," presented at the Proceedings of the 22nd International Conference on World Wide Web, Rio de Janeiro, Brazil, 2013.Google ScholarGoogle Scholar
  10. X. Ling, W. Deng, C. Gu, H. Zhou, C. Li, and F. Sun, "Model Ensemble for Click Prediction in Bing Search Ads," presented at the Proceedings of the 26th International Conference on World Wide Web Companion, Perth, Australia, 2017.Google ScholarGoogle Scholar
  11. P. Pu, L. Chen, and R. Hu, "A user-centric evaluation framework for recommender systems," in Proceedings of the fifth ACM conference on Recommender Systems, 2011, pp. 157--164.Google ScholarGoogle Scholar
  12. S. J. Racine and J. P. Curtin, "Developing an airline freight management system: meeting airline and end-user challenges," presented at the CHI '03 Extended Abstracts on Human Factors in Computing Systems, Ft. Lauderdale, Florida, USA, 2003.Google ScholarGoogle Scholar
  13. S. Renjith, A. Sreekumar, and M. Jathavedan, "An extensive study on the evolution of context-aware personalized travel recommender systems," Information Processing & Management, vol. 57, p. 102078, 2020.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. P. Sánchez and A. Bellogíns, "Building user profiles based on sequences for content and collaborative filtering," Information Processing & Management, vol. 56, pp. 192--211, 2019.Google ScholarGoogle ScholarCross RefCross Ref
  15. O. S. Shalom, N. Koenigstein, U. Paquet, and H. P. Vanchinathan, "Beyond Collaborative Filtering: The List Recommendation Problem," presented at the Proceedings of the 25th International Conference on World Wide Web, Montreal, Quebec, Canada, 2016.Google ScholarGoogle Scholar
  16. Y. Wang, C. Breitinger, B. Sommer, F. Schreiber, and H. Reiterer, "Comparing Sequential and Temporal Patterns from Human Mobility Data for Next-Place Prediction," presented at the Adjunct Publication of the 26th Conference on User Modeling, Adaptation and Personalization, Singapore, Singapore, 2018.Google ScholarGoogle Scholar

Index Terms

  1. Things Change: Comparing Results Using Historical Data and User Testing for Evaluating a Recommendation Task

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      CHI EA '20: Extended Abstracts of the 2020 CHI Conference on Human Factors in Computing Systems
      April 2020
      4474 pages
      ISBN:9781450368193
      DOI:10.1145/3334480

      Copyright © 2020 Owner/Author

      Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 25 April 2020

      Check for updates

      Qualifiers

      • abstract

      Acceptance Rates

      Overall Acceptance Rate6,164of23,696submissions,26%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format .

    View HTML Format