research-article

Calibration in Collaborative Filtering Recommender Systems: a User-Centered Analysis

Authors:
Kun Lin

DePaul University, Chicago, IL, USA

DePaul University, Chicago, IL, USA
View Profile

,
Nasim Sonboli

University of Colorado Boulder, Boulder, CO, USA

University of Colorado Boulder, Boulder, CO, USA
View Profile

,
Bamshad Mobasher

DePaul University, Chicago, IL, USA

DePaul University, Chicago, IL, USA
View Profile

,
Robin Burke

University of Colorado Boulder, Boulder, CO, USA

University of Colorado Boulder, Boulder, CO, USA
View Profile

HT '20: Proceedings of the 31st ACM Conference on Hypertext and Social MediaJuly 2020Pages 197–206https://doi.org/10.1145/3372923.3404793

Published:13 July 2020Publication History

HT '20: Proceedings of the 31st ACM Conference on Hypertext and Social Media

Pages 197–206

ABSTRACT

Recommender systems learn from past user preferences in order to predict future user interests and provide users with personalized suggestions. Previous research has demonstrated that biases in user profiles in the aggregate can influence the recommendations to users who do not share the majority preference. One consequence of this bias propagation effect is miscalibration, a mismatch between the types or categories of items that a user prefers and the items provided in recommendations. In this paper, we conduct a systematic analysis aimed at identifying key characteristics in user profiles that might lead to miscalibrated recommendations. We consider several categories of profile characteristics, including similarity to the average user, propensity towards popularity, profile diversity, and preference intensity. We develop predictive models of miscalibration and use these models to identify the most important features correlated with miscalibration, given different algorithms and dataset characteristics. Our analysis is intended to help system designers predict miscalibration effects and to develop recommendation algorithms with improved calibration properties.

References

Himan Abdollahpouri, Robin Burke, and Bamshad Mobasher. 2017. Controlling popularity bias in learning-to-rank recommendation. In Proceedings of the Eleventh ACM Conference on Recommender Systems. ACM, 42--46.Google ScholarDigital Library
Himan Abdollahpouri, Masoud Mansoury, Robin Burke, and Bamshad Mobasher. 2019. The impact of popularity bias on fairness and calibration in recommendation. arXiv preprint arXiv:1910.05755 (2019).Google Scholar
Alex Beutel, Jilin Chen, Tulsee Doshi, Hai Qian, Li Wei, Yi Wu, Lukasz Heldt, Zhe Zhao, Lichan Hong, Ed H Chi, et almbox. 2019. Fairness in recommendation ranking through pairwise comparisons. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining . 2212--2220.Google ScholarDigital Library
Erik Brynjolfsson, Yu Jeffrey Hu, and Michael D Smith. 2006. From niches to riches: Anatomy of the long tail. Sloan Management Review, Vol. 47, 4 (2006), 67--71.Google Scholar
Robin Douglas Burke, Masoud Mansoury, and Nasim Sonboli. 2020. Experimentation with Fairness-Aware Recommendation Using Librec-Auto: Hands-on Tutorial. In Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency (Barcelona, Spain) (FAT* '20). Association for Computing Machinery, New York, NY, USA, 700. https://doi.org/10.1145/3351095.3375670Google ScholarDigital Library
Toon Calders and Sicco Verwer. 2010. Three naive Bayes approaches for discrimination-free classification. Data Mining and Knowledge Discovery, Vol. 21, 2 (2010), 277--292.Google ScholarDigital Library
Òscar Celma and Pedro Cano. 2008. From hits to niches?: or how popular artists can bias music recommendation and discovery. In Proceedings of the 2nd KDD Workshop on Large-Scale Recommender Systems and the Netflix Prize Competition. ACM, 5.Google ScholarDigital Library
Cynthia S Crowson, Elizabeth J Atkinson, and Terry M Therneau. 2016. Assessing calibration of prognostic risk scores. Statistical methods in medical research, Vol. 25, 4 (2016), 1692--1706.Google Scholar
Michael D Ekstrand, Mucun Tian, Ion Madrazo Azpiazu, Jennifer D Ekstrand, Oghenemaro Anuyah, David McNeill, and Maria Soledad Pera. 2018a. All the cool kids, how do they fit in?: Popularity and demographic biases in recommender evaluation and effectiveness. In Conference on Fairness, Accountability and Transparency. 172--186.Google Scholar
Michael D Ekstrand, Mucun Tian, Mohammed R Imran Kazi, Hoda Mehrpouyan, and Daniel Kluver. 2018b. Exploring author gender in book rating and recommendation. In Proceedings of the 12th ACM Conference on Recommender Systems. ACM, 242--250.Google ScholarDigital Library
Farzad Eskandanian, Bamshad Mobasher, and Robin Burke. 2017. A clustering approach for personalizing diversity in collaborative recommender systems. In Proceedings of the 25th Conference on User Modeling, Adaptation and Personalization. ACM, 280--284.Google ScholarDigital Library
Farzad Eskandanian, Bamshad Mobasher, and Robin D Burke. 2016. User Segmentation for Controlling Recommendation Diversity.. In RecSys Posters.Google Scholar
Farzad Eskandanian, Nasim Sonboli, and Bamshad Mobasher. 2019. Power of the Few: Analyzing the Impact of Influential Users in Collaborative Recommender Systems. In Proceedings of the 27th ACM Conference on User Modeling, Adaptation and Personalization. 225--233.Google ScholarDigital Library
Jerome Friedman, Trevor Hastie, and Robert Tibshirani. 2001. The elements of statistical learning. Vol. 1. Springer series in statistics New York.Google Scholar
Guibing Guo, Jie Zhang, Zhu Sun, and Neil Yorke-Smith. 2015. LibRec: A Java Library for Recommender Systems.. In UMAP Workshops, Vol. 4.Google Scholar
Moritz Hardt, Eric Price, Nati Srebro, et almbox. 2016. Equality of opportunity in supervised learning. In Advances in neural information processing systems. 3315--3323.Google ScholarDigital Library
Yifan Hu, Yehuda Koren, and Chris Volinsky. 2008. Collaborative Filtering for Implicit Feedback Datasets.. In ICDM, Vol. 8. Citeseer, 263--272.Google ScholarDigital Library
Dietmar Jannach, Lukas Lerche, Fatih Gedikli, and Geoffray Bonnin. 2013. What recommenders recommend--an analysis of accuracy, popularity, and sales diversity effects. In International conference on user modeling, adaptation, and personalization. Springer, 25--37.Google ScholarCross Ref
Dietmar Jannach, Lukas Lerche, Iman Kamehkhosh, and Michael Jugovac. 2015. What recommenders recommend: an analysis of recommendation biases and possible countermeasures. User Modeling and User-Adapted Interaction, Vol. 25, 5 (2015), 427--491.Google ScholarDigital Library
Toshihiro Kamishima, Shotaro Akaho, Hideki Asoh, and Jun Sakuma. 2014. Correcting Popularity Bias by Enhancing Recommendation Neutrality.. In RecSys Posters.Google Scholar
Caitlin Kuhlman, MaryAnn VanValkenburg, and Elke Rundensteiner. 2019. FARE: Diagnostics for Fair Ranking using Pairwise Error Metrics. In The World Wide Web Conference. 2936--2942.Google ScholarDigital Library
Solomon Kullback. 1997. Information theory and statistics. Courier Corporation.Google ScholarDigital Library
Matevvz Kunaver and Tomavz Povz rl. 2017. Diversity in recommender systems--A survey. Knowledge-Based Systems, Vol. 123 (2017), 154--162.Google ScholarDigital Library
Kun Lin, Nasim Sonboli, Bamshad Mobasher, and Robin Burke. 2019. Crank up the volume: preference bias amplification in collaborative recommendation. arXiv preprint arXiv:1909.06362 (2019).Google Scholar
Masoud Mansoury, Himan Abdollahpouri, Jessie Smith, Arman Dehpanah, Mykola Pechenizkiy, and Bamshad Mobasher. 2020. Investigating Potential Factors Associated with Gender Discrimination in Collaborative Recommender Systems. arXiv preprint arXiv:2002.07786 (2020).Google Scholar
Masoud Mansoury, Bamshad Mobasher, Robin Burke, and Mykola Pechenizkiy. 2019. Bias disparity in collaborative recommendation: Algorithmic evaluation and comparison. arXiv preprint arXiv:1908.00831 (2019).Google Scholar
Tien T Nguyen, Pik-Mai Hui, F Maxwell Harper, Loren Terveen, and Joseph A Konstan. 2014. Exploring the filter bubble: the effect of using recommender systems on content diversity. In Proceedings of the 23rd international conference on World wide web. 677--686.Google ScholarDigital Library
Eli Pariser. 2011. The filter bubble: How the new personalized web is changing what we read and how we think. Penguin.Google ScholarDigital Library
Geoff Pleiss, Manish Raghavan, Felix Wu, Jon Kleinberg, and Kilian Q Weinberger. 2017. On fairness and calibration. In Advances in Neural Information Processing Systems. 5680--5689.Google Scholar
Steffen Rendle, Christoph Freudenthaler, Zeno Gantner, and Lars Schmidt-Thieme. 2009. BPR: Bayesian personalized ranking from implicit feedback. In Proceedings of the twenty-fifth conference on uncertainty in artificial intelligence. AUAI Press, 452--461.Google ScholarDigital Library
Claude E Shannon. 1951. Prediction and entropy of printed English. Bell system technical journal, Vol. 30, 1 (1951), 50--64.Google Scholar
Harald Steck. 2018. Calibrated recommendations. In Proceedings of the 12th ACM conference on recommender systems. ACM, 154--162.Google ScholarDigital Library
Virginia Tsintzou, Evaggelia Pitoura, and Panayiotis Tsaparas. 2018. Bias Disparity in Recommendation Systems. arXiv preprint arXiv:1811.01461 (2018).Google Scholar
Sirui Yao and Bert Huang. 2017. Beyond parity: Fairness objectives for collaborative filtering. In Advances in Neural Information Processing Systems. 2921--2930.Google Scholar
Hongzhi Yin, Bin Cui, Jing Li, Junjie Yao, and Chen Chen. 2012. Challenging the long tail recommendation. arXiv preprint arXiv:1205.6700 (2012).Google Scholar
Cai-Nicolas Ziegler, Sean M McNee, Joseph A Konstan, and Georg Lausen. 2005. Improving recommendation lists through topic diversification. In Proceedings of the 14th international conference on World Wide Web. 22--32.Google ScholarDigital Library

Index Terms

Calibration in Collaborative Filtering Recommender Systems: a User-Centered Analysis
1. Information systems
  1. Information retrieval
    1. Retrieval tasks and goals
      1. Recommender systems
  2. World Wide Web
    1. Web searching and information discovery
      1. Collaborative filtering
      2. Personalization
2. Social and professional topics
  1. User characteristics

Recommendations

The Connection Between Popularity Bias, Calibration, and Fairness in Recommendation
RecSys '20: Proceedings of the 14th ACM Conference on Recommender Systems

Recently there has been a growing interest in fairness-aware recommender systems including fairness in providing consistent performance across different users or groups of users. A recommender system could be considered unfair if the recommendations do ...
Read More
User-centered Evaluation of Popularity Bias in Recommender Systems
UMAP '21: Proceedings of the 29th ACM Conference on User Modeling, Adaptation and Personalization

Recommendation and ranking systems are known to suffer from popularity bias; the tendency of the algorithm to favor a few popular items while under-representing the majority of other items. Prior research has examined various approaches for mitigating ...
Read More
Bias disparity in graph-based collaborative filtering recommenders
SAC '22: Proceedings of the 37th ACM/SIGAPP Symposium on Applied Computing

In collaborative filtering (CF) recommendations, user preferences may be reinforced resulting in the under-representation of specific categories of items in the recommendation lists received by users belonging to specific groups. We call this phenomenon ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
HT '20: Proceedings of the 31st ACM Conference on Hypertext and Social Media
July 2020
327 pages
ISBN:9781450370981
DOI:10.1145/3372923
General Chairs:
Anastasia Salter
University of Central Florida, USA
,
John F. Barber
Washington State University Vancouver, USA
,
Program Chair:
Ujwal Gadiraju
Delft University of Technology, The Netherlands
Copyright © 2020 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 13 July 2020
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
algorithmic bias
bias amplification
calibration
fairness
recommendation algorithms
Qualifiers
- research-article
Conference

Acceptance Rates
Overall Acceptance Rate378of1,158submissions,33%
Upcoming Conference
HT '24

Sponsor:

sigweb

35th ACM Conference on Hypertext and Social Media

September 10 - 13, 2024

Poznan , Poland
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 11
  Total Citations
  View Citations
- 351
  Total Downloads
- Downloads (Last 12 months)41
- Downloads (Last 6 weeks)4
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Calibration in Collaborative Filtering Recommender Systems: a User-Centered Analysis

HT '20: Proceedings of the 31st ACM Conference on Hypertext and Social Media

ABSTRACT

References

Cited By

Index Terms

Recommendations

The Connection Between Popularity Bias, Calibration, and Fairness in Recommendation

User-centered Evaluation of Popularity Bias in Recommender Systems

Bias disparity in graph-based collaborative filtering recommenders