ABSTRACT
NLU models power several user-facing experiences, such as conversational agents and chatbots. Building an NLU model typically consists of three stages: (a) selecting or building a large-scale pre-trained language model, (b) distilling or fine-tuning the pre-trained model into a task-specific model, and (c) deploying the task-specific model to production. In this presentation, we will identify fairness considerations that can be incorporated into each of these three stages of the NLU model-building life cycle. We will present select metrics that can be used to quantify fairness in NLU models, as well as fairness-enhancement techniques that can be applied at each stage. Finally, we will share recommendations for successfully implementing fairness considerations when building an industrial-scale NLU system.
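As one concrete illustration of the kind of metric the abstract refers to, the sketch below computes a simple counterfactual-substitution sensitivity score for an intent classifier: each utterance is perturbed by swapping paired demographic terms, and the mean change in model score is reported. This is a minimal sketch under stated assumptions, not the metric suite from the talk; `score_intent`, the term pairs, and the sample utterances are all hypothetical placeholders.

```python
# Minimal, illustrative sketch of one fairness metric for an NLU
# classifier: counterfactual prediction sensitivity. All names here
# (score_intent, TERM_PAIRS, the sample utterances) are hypothetical
# placeholders, not the system or metrics described in the talk.

# Paired demographic terms used to generate counterfactual utterances.
TERM_PAIRS = [("he", "she"), ("his", "her"), ("man", "woman"),
              ("him", "her")]

def counterfactual(utterance: str) -> str:
    """Swap each demographic term in the utterance for its counterpart."""
    swap = {a: b for a, b in TERM_PAIRS}
    swap.update({b: a for a, b in TERM_PAIRS})
    return " ".join(swap.get(t, t) for t in utterance.lower().split())

def score_intent(utterance: str) -> float:
    """Placeholder for a task-specific model's confidence in a
    'play_music' intent. Deliberately biased for illustration: it
    scores utterances containing 'him' slightly higher. In practice
    this would call the deployed NLU model."""
    base = 0.9 if "play" in utterance else 0.1
    return min(1.0, base + (0.05 if "him" in utterance.split() else 0.0))

def prediction_sensitivity(utterances) -> float:
    """Mean absolute score change under counterfactual substitution.
    A value near 0 suggests predictions are insensitive to the swapped
    demographic terms; larger values flag potential bias."""
    gaps = [abs(score_intent(u) - score_intent(counterfactual(u)))
            for u in utterances]
    return sum(gaps) / len(gaps)

if __name__ == "__main__":
    sample = ["play music for him", "she wants the weather"]
    print(f"sensitivity: {prediction_sensitivity(sample):.3f}")
```

A score near zero indicates that the classifier's predictions are largely invariant to the swapped terms, while larger values flag behavior worth auditing. A production variant would swap terms across many demographic axes and score against the real model endpoint rather than a toy function.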