abstract

Things Change: Comparing Results Using Historical Data and User Testing for Evaluating a Recommendation Task

Authors:
Soon-Gyo Jung

Hamad Bin Khalifa University, Doha, Qatar

Hamad Bin Khalifa University, Doha, Qatar
View Profile

,
Joni Salminen

Hamad Bin Khalifa University & University of Turku, Doha, Qatar

Hamad Bin Khalifa University & University of Turku, Doha, Qatar
View Profile

,
Shammur A. Chowdhury

Hamad Bin Khalifa University, Doha, Qatar

Hamad Bin Khalifa University, Doha, Qatar
View Profile

,
Dianne Ramirez Robillos

University of the Philippines, Quezon City, Diliman, Philippines

University of the Philippines, Quezon City, Diliman, Philippines
View Profile

,
Bernard J. Jansen

Hamad Bin Khalifa University, Doha, Qatar

Hamad Bin Khalifa University, Doha, Qatar
View Profile

CHI EA '20: Extended Abstracts of the 2020 CHI Conference on Human Factors in Computing SystemsApril 2020Pages 1–7https://doi.org/10.1145/3334480.3382945

Published:25 April 2020Publication History

CHI EA '20: Extended Abstracts of the 2020 CHI Conference on Human Factors in Computing Systems

Pages 1–7

ABSTRACT

We address a recommendation task for next likely flight destination to customers of a major international airline company. We compare performance using historical flight data and an actual user evaluation. Using two years of historical flight data consisting of tens of millions of flights, an ensemble and a collaborative filtering approach obtained an accuracy of 47% and 20% using a test set of 100,000 customers, respectively, highlighting the challenge of the domain. We then evaluated our recommendations on 10,000 actual customers, with a 45-45-10 split among ensemble, collaborative filtering, and control group. The overall predictive power employed with real users was 23%, with the ensemble method having a predictive power of 19% and 30% for collaborative filtering. Results indicate that, in complex and shifting domains such as this one, one cannot rely solely on historical data for evaluating the impact of user recommendations. We discuss implications for recommendation systems and future research in this and related domains.

References

J. Beel, M. Genzmehr, S. Langer, A. Nürnberger, and B. Gipp, "A comparative analysis of offline and online evaluations and discussion of research paper recommender system evaluation," in Proceedings of the international workshop on reproducibility and replication in recommender systems evaluation, 2013, pp. 7--14.Google Scholar
S. Chen, W. Huang, M. Chen, J. Zhong, and J. Cheng, "Airlines Content Recommendations Based on Passengers' Choice Using Bayesian Belief Networks," in Bayesian Inference, J. P. Tejedor, Ed., ed: IntechOpen, 2017.Google Scholar
H. Fani, E. Jiang, E. Bagheri, F. Al-Obeidat, W. Du, and M. Kargar, "User community detection via embedding of social network structure and temporal content," Information Processing & Management, vol. 57, p. 102056, 2020.Google ScholarDigital Library
S. Fitchett and A. Cockburn, "AccessRank: predicting what users will do next," presented at the Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, Austin, Texas, USA, 2012.Google ScholarDigital Library
J. L. Herlocker, J. A. Konstan, L. G. Terveen, and J. T. Riedl, "Evaluating collaborative filtering recommender systems," ACM Transactions on Information Systems, vol. 22, pp. 5--53, 2004.Google ScholarDigital Library
C. Hueglin and F. Vannotti, "Data mining techniques to improve forecast accuracy in airline business," presented at the Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining, San Francisco, CA, 2001.Google Scholar
B. P. Knijnenburg and M. C. Willemsen, "Evaluating recommender systems with user experiments," in Recommender Systems Handbook, ed Boston, MA: Springer, 2015, pp. 309--352.Google Scholar
R. D. Lawrence, S. J. Hong, and J. Cherrier, "Passenger-based predictive modeling of airline no-show rates," presented at the Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining, Washington, D.C., 2003.Google Scholar
D. Lian, V. W. Zheng, and X. Xie, "Collaborative filtering meets next check-in location prediction," presented at the Proceedings of the 22nd International Conference on World Wide Web, Rio de Janeiro, Brazil, 2013.Google Scholar
X. Ling, W. Deng, C. Gu, H. Zhou, C. Li, and F. Sun, "Model Ensemble for Click Prediction in Bing Search Ads," presented at the Proceedings of the 26th International Conference on World Wide Web Companion, Perth, Australia, 2017.Google Scholar
P. Pu, L. Chen, and R. Hu, "A user-centric evaluation framework for recommender systems," in Proceedings of the fifth ACM conference on Recommender Systems, 2011, pp. 157--164.Google Scholar
S. J. Racine and J. P. Curtin, "Developing an airline freight management system: meeting airline and end-user challenges," presented at the CHI '03 Extended Abstracts on Human Factors in Computing Systems, Ft. Lauderdale, Florida, USA, 2003.Google Scholar
S. Renjith, A. Sreekumar, and M. Jathavedan, "An extensive study on the evolution of context-aware personalized travel recommender systems," Information Processing & Management, vol. 57, p. 102078, 2020.Google ScholarDigital Library
P. Sánchez and A. Bellogíns, "Building user profiles based on sequences for content and collaborative filtering," Information Processing & Management, vol. 56, pp. 192--211, 2019.Google ScholarCross Ref
O. S. Shalom, N. Koenigstein, U. Paquet, and H. P. Vanchinathan, "Beyond Collaborative Filtering: The List Recommendation Problem," presented at the Proceedings of the 25th International Conference on World Wide Web, Montreal, Quebec, Canada, 2016.Google Scholar
Y. Wang, C. Breitinger, B. Sommer, F. Schreiber, and H. Reiterer, "Comparing Sequential and Temporal Patterns from Human Mobility Data for Next-Place Prediction," presented at the Adjunct Publication of the 26th Conference on User Modeling, Adaptation and Personalization, Singapore, Singapore, 2018.Google Scholar

Index Terms

Things Change: Comparing Results Using Historical Data and User Testing for Evaluating a Recommendation Task
1. Human-centered computing
  1. Human computer interaction (HCI)

Recommendations

Transparent, Scrutable and Explainable User Models for Personalized Recommendation
SIGIR'19: Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval

Most recommender systems base their recommendations on implicit or explicit item-level feedback provided by users. These item ratings are combined into a complex user model, which then predicts the suitability of other items. While effective, such ...
Read More
User Experience and The Role of Personalization in Critiquing-Based Conversational Recommendation
Critiquing — where users propose directional preferences to attribute values — has historically been a highly popular method for conversational recommendation. However, with the growing size of catalogs and item attributes, it becomes increasingly ...
Read More
Implicit Recommendation with Interest Change and User Influence
ICSCA '19: Proceedings of the 2019 8th International Conference on Software and Computer Applications

Aiming at the problem of rich websites in campus without targeted recommendation, which makes it difficult for users to find the information resources of high interest and high quality, this paper proposes an implicit feedback recommendation algorithm ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
CHI EA '20: Extended Abstracts of the 2020 CHI Conference on Human Factors in Computing Systems
April 2020
4474 pages
ISBN:9781450368193
DOI:10.1145/3334480
General Chairs:
Regina Bernhaupt
Eindhoven University of Technology, Netherlands
,
Florian 'Floyd' Mueller
Monash University, Australia
,
David Verweij
Newcastle University, UK
,
Josh Andres
RMIT, Australia
,
Program Chairs:
Joanna McGrenere
University of British Columbia, Canada
,
Andy Cockburn
University of Canterbury, New Zealand
,
Ignacio Avellino
University of Maryland Baltimore County, USA
,
Alix Goguey
Grenoble Alpes University, France
,
Pernille Bjørn
University of Copenhagen, Denmark
,
Shengdong (Shen) Zhao
National University of Singapore, Singapore
,
Briane Paul Samson
Future University Hakodate, Japan & De La Salle University, Philippines
,
Rafal Kocielnik
University of Washington, USA
Copyright © 2020 Owner/Author
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 25 April 2020
Check for updates
Author Tags
algorithmic trade-off
prediction
recommendations
user study
Qualifiers
- abstract
Conference

Acceptance Rates
Overall Acceptance Rate6,164of23,696submissions,26%
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 3
  Total Citations
  View Citations
- 121
  Total Downloads
- Downloads (Last 12 months)9
- Downloads (Last 6 weeks)2
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format .

View HTML Format

Things Change: Comparing Results Using Historical Data and User Testing for Evaluating a Recommendation Task

CHI EA '20: Extended Abstracts of the 2020 CHI Conference on Human Factors in Computing Systems

ABSTRACT

References

Cited By

Index Terms

Recommendations

Transparent, Scrutable and Explainable User Models for Personalized Recommendation

User Experience and The Role of Personalization in Critiquing-Based Conversational Recommendation

Implicit Recommendation with Interest Change and User Influence