tutorial

Uncertainty Quantification for Text Classification

Authors:
Dell Zhang

Thomson Reuters Labs, London, United Kingdom

Thomson Reuters Labs, London, United Kingdom

0000-0002-8774-3725
View Profile

,
Murat Sensoy

Amazon Alexa AI, London, United Kingdom

Amazon Alexa AI, London, United Kingdom

0000-0001-8806-4508
View Profile

,
Masoud Makrehchi

Thomson Reuters Labs, Toronto, ON, Canada

Thomson Reuters Labs, Toronto, ON, Canada

0000-0003-0287-9568
View Profile

,
Bilyana Taneva-Popova

Thomson Reuters Labs, Zug, Switzerland

Thomson Reuters Labs, Zug, Switzerland

0009-0001-3914-6707
View Profile

,
Lin Gui

King's College London, London, United Kingdom

King's College London, London, United Kingdom

0000-0002-8054-9524
View Profile

,
Yulan He

King's College London & The Alan Turing Institute, London, United Kingdom

King's College London & The Alan Turing Institute, London, United Kingdom

0000-0003-3948-5845
View Profile

SIGIR '23: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information RetrievalJuly 2023Pages 3426–3429https://doi.org/10.1145/3539618.3594243

Published:18 July 2023Publication History

SIGIR '23: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval

Pages 3426–3429

ABSTRACT

This full-day tutorial introduces modern techniques for practical uncertainty quantification specifically in the context of multi-class and multi-label text classification. First, we explain the usefulness of estimating aleatoric uncertainty and epistemic uncertainty for text classification models. Then, we describe several state-of-the-art approaches to uncertainty quantification and analyze their scalability to big text data: Virtual Ensemble in GBDT, Bayesian Deep Learning (including Deep Ensemble, Monte-Carlo Dropout, Bayes by Backprop, and their generalization Epistemic Neural Networks), Evidential Deep Learning (including Prior Networks and Posterior Networks), as well as Distance Awareness (including Spectral-normalized Neural Gaussian Process and Deep Deterministic Uncertainty). Next, we talk about the latest advances in uncertainty quantification for pre-trained language models (including asking language models to express their uncertainty, interpreting uncertainties of text classifiers built on large-scale language models, uncertainty estimation in text generation, calibration of language models, and calibration for in-context learning). After that, we discuss typical application scenarios of uncertainty quantification in text classification (including in-domain calibration, cross-domain robustness, and novel class detection). Finally, we list popular performance metrics for the evaluation of uncertainty quantification effectiveness in text classification. Practical hands-on examples/exercises are provided to the attendees for them to experiment with different uncertainty quantification methods on a few real-world text classification datasets such as CLINC150.

References

Moloud Abdar, Farhad Pourpanah, Sadiq Hussain, Dana Rezazadegan, Li Liu, Mohammad Ghavamzadeh, Paul Fieguth, Xiaochun Cao, Abbas Khosravi, U. Rajendra Acharya, Vladimir Makarenkov, and Saeid Nahavandi. 2021. A Review of Uncertainty Quantification in Deep Learning: Techniques, Applications and Challenges. Information Fusion, Vol. 76 (Dec. 2021), 243--297. https://doi.org/10.1016/j.inffus.2021.05.008Google ScholarDigital Library
Charles Blundell, Julien Cornebise, Koray Kavukcuoglu, and Daan Wierstra. 2015. Weight Uncertainty in Neural Network. In Proceedings of the 32nd International Conference on Machine Learning. PMLR, 1613--1622.Google Scholar
Bertrand Charpentier, Daniel Zügner, and Stephan Günnemann. 2020. Posterior Network: Uncertainty Estimation without OOD Samples via Density-Based Pseudo-Counts. In Advances in Neural Information Processing Systems, Vol. 33. Curran Associates, Inc., 1356--1367.Google Scholar
Yarin Gal and Zoubin Ghahramani. 2016. Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning. In Proceedings of The 33rd International Conference on Machine Learning. PMLR, 1050--1059.Google Scholar
Jakob Gawlikowski, Cedrique Rovile Njieutcheu Tassi, Mohsin Ali, Jongseok Lee, Matthias Humt, Jianxiang Feng, Anna Kruspe, Rudolph Triebel, Peter Jung, Ribana Roscher, Muhammad Shahzad, Wen Yang, Richard Bamler, and Xiao Xiang Zhu. 2022. A Survey of Uncertainty in Deep Neural Networks. https://doi.org/10.48550/arXiv.2107.03342 arxiv: 2107.03342 [cs, stat]Google ScholarCross Ref
Chuan Guo, Geoff Pleiss, Yu Sun, and Kilian Q. Weinberger. 2017. On Calibration of Modern Neural Networks. In Proceedings of the 34th International Conference on Machine Learning - Volume 70 (ICML '17). JMLR.org, Sydney, NSW, Australia, 1321--1330.Google Scholar
Zhen Guo, Zelin Wan, Qisheng Zhang, Xujiang Zhao, Feng Chen, Jin-Hee Cho, Qi Zhang, Lance M. Kaplan, Dong H. Jeong, and Audun Jøsang. 2022. A Survey on Uncertainty Reasoning and Quantification for Decision Making: Belief Theory Meets Deep Learning. https://doi.org/10.48550/arXiv.2206.05675 arxiv: 2206.05675 [cs, math]Google ScholarCross Ref
Reihaneh H. Hariri, Erik M. Fredericks, and Kate M. Bowers. 2019. Uncertainty in Big Data Analytics: Survey, Opportunities, and Challenges. Journal of Big Data, Vol. 6, 1 (June 2019), 44. https://doi.org/10.1186/s40537-019-0206-3Google ScholarCross Ref
Dan Hendrycks, Steven Basart, Mantas Mazeika, Andy Zou, Joseph Kwon, Mohammadreza Mostajabi, Jacob Steinhardt, and Dawn Song. 2022. Scaling Out-of-Distribution Detection for Real-World Settings. In Proceedings of the 39th International Conference on Machine Learning. PMLR, 8759--8773.Google Scholar
Dan Hendrycks and Kevin Gimpel. 2017. A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks. In International Conference on Learning Representations.Google Scholar
Dan Hendrycks, Mantas Mazeika, and Thomas G. Dietterich. 2019. Deep Anomaly Detection with Outlier Exposure. In 7th International Conference on Learning Representations, ICLR 2019, New Orleans, LA, USA, May 6-9, 2019. OpenReview.net.Google Scholar
Neil Houlsby, Ferenc Huszár, Zoubin Ghahramani, and Máté Lengyel. 2011. Bayesian Active Learning for Classification and Preference Learning. https://doi.org/10.48550/arXiv.1112.5745 arxiv: 1112.5745 [cs, stat]Google ScholarCross Ref
Yibo Hu and Latifur Khan. 2021. Uncertainty-Aware Reliable Text Classification. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining (KDD '21). Association for Computing Machinery, New York, NY, USA, 628--636. https://doi.org/10.1145/3447548.3467382Google ScholarDigital Library
Eyke Hüllermeier and Willem Waegeman. 2021. Aleatoric and Epistemic Uncertainty in Machine Learning: An Introduction to Concepts and Methods. Machine Learning, Vol. 110, 3 (March 2021), 457--506. https://doi.org/10.1007/s10994-021-05946-3Google ScholarCross Ref
Zhengbao Jiang, Jun Araki, Haibo Ding, and Graham Neubig. 2021. How Can We Know When Language Models Know? On the Calibration of Language Models for Question Answering. Transactions of the Association for Computational Linguistics, Vol. 9 (Sept. 2021), 962--977. https://doi.org/10.1162/tacl_a_00407Google ScholarCross Ref
H. M. Dipu Kabir, Abbas Khosravi, Mohammad Anwar Hosen, and Saeid Nahavandi. 2018. Neural Network-Based Uncertainty Quantification: A Survey of Methodologies and Applications. IEEE Access, Vol. 6 (2018), 36218--36234. https://doi.org/10.1109/ACCESS.2018.2836917Google ScholarCross Ref
Saurav Kadavath, Tom Conerly, Amanda Askell, Tom Henighan, Dawn Drain, Ethan Perez, Nicholas Schiefer, Zac Hatfield-Dodds, Nova DasSarma, Eli Tran-Johnson, Scott Johnston, Sheer El-Showk, Andy Jones, Nelson Elhage, Tristan Hume, Anna Chen, Yuntao Bai, Sam Bowman, Stanislav Fort, Deep Ganguli, Danny Hernandez, Josh Jacobson, Jackson Kernion, Shauna Kravec, Liane Lovitt, Kamal Ndousse, Catherine Olsson, Sam Ringer, Dario Amodei, Tom Brown, Jack Clark, Nicholas Joseph, Ben Mann, Sam McCandlish, Chris Olah, and Jared Kaplan. 2022. Language Models (Mostly ) Know What They Know. https://doi.org/10.48550/arXiv.2207.05221 arxiv: 2207.05221 [cs]Google ScholarCross Ref
Alex Kendall and Yarin Gal. 2017. What Uncertainties Do We Need in Bayesian Deep Learning for Computer Vision?. In Advances in Neural Information Processing Systems, Vol. 30. Curran Associates, Inc.Google Scholar
Lorenz Kuhn, Yarin Gal, and Sebastian Farquhar. 2023. Semantic Uncertainty: Linguistic Invariances for Uncertainty Estimation in Natural Language Generation. In The Eleventh International Conference on Learning Representations.Google Scholar
Balaji Lakshminarayanan, Alexander Pritzel, and Charles Blundell. 2017. Simple and Scalable Predictive Uncertainty Estimation Using Deep Ensembles. In Advances in Neural Information Processing Systems, Vol. 30. Curran Associates, Inc.Google Scholar
Stefan Larson, Anish Mahendran, Joseph J. Peper, Christopher Clarke, Andrew Lee, Parker Hill, Jonathan K. Kummerfeld, Kevin Leach, Michael A. Laurenzano, Lingjia Tang, and Jason Mars. 2019. An Evaluation Dataset for Intent Classification and Out-of-Scope Prediction. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP ). Association for Computational Linguistics, Hong Kong, China, 1311--1316. https://doi.org/10.18653/v1/D19-1131Google ScholarCross Ref
Christian Leibig, Vaneeda Allken, Murat Sec ckin Ayhan, Philipp Berens, and Siegfried Wahl. 2017. Leveraging Uncertainty Information from Deep Neural Networks for Disease Detection. Scientific Reports, Vol. 7, 1 (Dec. 2017), 17816. https://doi.org/10.1038/s41598-017-17876-zGoogle ScholarCross Ref
Hongjing Li, Hanqi Yan, Yanran Li, Li Qian, Yulan He, and Lin Gui. 2023. Distinguishability Calibration to In-Context Learning. https://doi.org/10.48550/arXiv.2302.06198 arxiv: 2302.06198 [cs]Google ScholarCross Ref
Stephanie Lin, Jacob Hilton, and Owain Evans. 2022. Teaching Models to Express Their Uncertainty in Words. https://doi.org/10.48550/arXiv.2205.14334 arxiv: 2205.14334 [cs]Google ScholarCross Ref
Jeremiah Zhe Liu, Shreyas Padhy, Jie Ren, Zi Lin, Yeming Wen, Ghassen Jerfel, Zachary Nado, Jasper Snoek, Dustin Tran, and Balaji Lakshminarayanan. 2023. A Simple Approach to Improve Single-Model Deep Uncertainty via Distance-Awareness. Journal of Machine Learning Research, Vol. 24, 42 (2023), 1--63.Google Scholar
Andrey Malinin and Mark Gales. 2018. Predictive Uncertainty Estimation via Prior Networks. In Advances in Neural Information Processing Systems, Vol. 31. Curran Associates, Inc.Google Scholar
Andrey Malinin, Liudmila Prokhorenkova, and Aleksei Ustimenko. 2021. Uncertainty in Gradient Boosting via Ensembles. In International Conference on Learning Representations.Google Scholar
José Mena, Oriol Pujol, and Jordi Vitrià. 2021. A Survey on Uncertainty Estimation in Deep Learning Classification Systems from a Bayesian Perspective. Comput. Surveys, Vol. 54, 9 (Oct. 2021), 193:1--193:35. https://doi.org/10.1145/3477140Google ScholarDigital Library
Jishnu Mukhoti, Andreas Kirsch, Joost van Amersfoort, Philip H. S. Torr, and Yarin Gal. 2022. Deep Deterministic Uncertainty: A Simple Baseline. https://doi.org/10.48550/arXiv.2102.11582 arxiv: 2102.11582 [cs, stat]Google ScholarCross Ref
Ian Osband, Zheng Wen, Seyed Mohammad Asghari, Vikranth Dwaracherla, Morteza Ibrahimi, Xiuyuan Lu, and Benjamin Van Roy. 2023. Epistemic Neural Networks. https://doi.org/10.48550/arXiv.2107.08924 arxiv: 2107.08924 [cs, stat]Google ScholarCross Ref
Apostolos F. Psaros, Xuhui Meng, Zongren Zou, Ling Guo, and George Em Karniadakis. 2022. Uncertainty Quantification in Scientific Machine Learning: Methods, Metrics, and Comparisons. https://doi.org/10.48550/arXiv.2201.07766 arxiv: 2201.07766 [cs]Google ScholarCross Ref
Murat Sensoy, Lance Kaplan, and Melih Kandemir. 2018. Evidential Deep Learning to Quantify Classification Uncertainty. In Advances in Neural Information Processing Systems, Vol. 31. Curran Associates, Inc.Google Scholar
Aditya Siddhant and Zachary C. Lipton. 2018. Deep Bayesian Active Learning for Natural Language Processing: Results of a Large-Scale Empirical Study. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, Brussels, Belgium, 2904--2909. https://doi.org/10.18653/v1/D18-1318Google ScholarCross Ref
Dennis Ulmer, Christian Hardmeier, and Jes Frellsen. 2023. Prior and Posterior Networks: A Survey on Evidential Deep Learning Methods for Uncertainty Estimation. https://doi.org/10.48550/arXiv.2110.03051 arxiv: 2110.03051 [cs, stat]Google ScholarCross Ref
Jordy Van Landeghem, Matthew Blaschko, Bertrand Anckaert, and Marie-Francine Moens. 2022. Benchmarking Scalable Predictive Uncertainty in Text Classification. IEEE Access, Vol. 10 (2022), 43703--43737. https://doi.org/10.1109/ACCESS.2022.3168734Google ScholarCross Ref
Haoran Wang, Weitang Liu, Alex Bocchieri, and Yixuan Li. 2021. Can Multi-Label Classification Networks Know What They Don't Know?. In Advances in Neural Information Processing Systems, Vol. 34. Curran Associates, Inc., 29074--29087.Google Scholar
David Widmann, Fredrik Lindsten, and Dave Zachariah. 2019. Calibration Tests in Multi-class Classification: A Unifying Framework. In Advances in Neural Information Processing Systems, Vol. 32. Curran Associates, Inc.Google Scholar
David Widmann, Fredrik Lindsten, and Dave Zachariah. 2021. Calibration Tests Beyond Classification. In International Conference on Learning Representations.Google Scholar
Dell Zhang, Murat Sensoy, Masoud Makrehchi, and Bilyana Taneva-Popova. 2023. Uncertainty Quantification for Text Classification. In Advances in Information Retrieval - 45th European Conference on Information Retrieval, ECIR 2023, Dublin, Ireland, April 2-6, 2023, Proceedings, Part III (Lecture Notes in Computer Science ), Jaap Kamps, Lorraine Goeuriot, Fabio Crestani, Maria Maistro, Hideo Joho, Brian Davis, Cathal Gurrin, Udo Kruschwitz, and Annalina Caputo (Eds.). Springer Nature Switzerland, Cham, 362--369. https://doi.org/10.1007/978-3-031-28241-6_38Google ScholarDigital Library

Index Terms

Uncertainty Quantification for Text Classification
1. Computing methodologies
  1. Machine learning
    1. Learning paradigms
      1. Supervised learning
        Supervised learning by classification
  2. Modeling and simulation
    1. Model development and analysis
      1. Uncertainty quantification
2. Information systems
  1. Information retrieval
    1. Retrieval models and ranking
      1. Language models
    2. Retrieval tasks and goals
      1. Clustering and classification

Recommendations

Uncertainty Quantification for Text Classification
Advances in Information Retrieval
Abstract
This half-day tutorial introduces modern techniques for practical uncertainty quantification specifically in the context of multi-class and multi-label text classification. First, we explain the usefulness of estimating aleatoric uncertainty and ...
Read More
Numerical approach for quantification of epistemic uncertainty

In the field of uncertainty quantification, uncertainty in the governing equations may assume two forms: aleatory uncertainty and epistemic uncertainty. Aleatory uncertainty can be characterised by known probability distributions whilst epistemic ...
Read More
Uncertainty quantification methods for evolutionary optimization under uncertainty
GECCO '20: Proceedings of the 2020 Genetic and Evolutionary Computation Conference Companion

In this paper, we discuss the role of uncertainty quantification (UQ) in assisting optimization under uncertainty. UQ plays a significant role in quantifying the robustness of solutions so as to help the optimizer in achieving robust optimum solutions. ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
SIGIR '23: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval
July 2023
3567 pages
ISBN:9781450394086
DOI:10.1145/3539618
General Chairs:
Hsin-Hsi Chen
National Taiwan University
,
Wei-Jou (Edward) Duh
National Taiwan University
,
Hen-Hsen Huang
Academia Sinica
,
Program Chairs:
Makoto P. Kato
Spotify
,
Josiane Mothe
Universite de Toulouse
,
Barbara Poblete
University of Chile and Amazon Visiting Academic
Copyright © 2023 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 18 July 2023
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
language models
text classification
uncertainty quantification
Qualifiers
- tutorial
Conference

Acceptance Rates
Overall Acceptance Rate792of3,983submissions,20%
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 1
  Total Citations
  View Citations
- 330
  Total Downloads
- Downloads (Last 12 months)330
- Downloads (Last 6 weeks)31
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Uncertainty Quantification for Text Classification

SIGIR '23: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval

ABSTRACT

References

Cited By

Index Terms

Recommendations

Uncertainty Quantification for Text Classification

Numerical approach for quantification of epistemic uncertainty

Uncertainty quantification methods for evolutionary optimization under uncertainty