research-article

Policy-driven Knowledge Selection and Response Generation for Document-grounded Dialogue

Authors:
Longxuan Ma

Research Center for Social Computing and Information Retrieval, Harbin Institute of Technology, China

Research Center for Social Computing and Information Retrieval, Harbin Institute of Technology, China

0000-0003-1431-8485
View Profile

,
Jiapeng Li

Research Center for Social Computing and Information Retrieval, Harbin Institute of Technology, China

Research Center for Social Computing and Information Retrieval, Harbin Institute of Technology, China

0009-0004-1521-7787
View Profile

,
Mingda Li

Research Center for Social Computing and Information Retrieval, Harbin Institute of Technology, China

Research Center for Social Computing and Information Retrieval, Harbin Institute of Technology, China

0009-0003-3693-8412
View Profile

,
Wei-Nan Zhang

Research Center for Social Computing and Information Retrieval, Harbin Institute of Technology, China

Research Center for Social Computing and Information Retrieval, Harbin Institute of Technology, China

0000-0001-5981-4752
View Profile

,
Ting Liu

Research Center for Social Computing and Information Retrieval, Harbin Institute of Technology, China

Research Center for Social Computing and Information Retrieval, Harbin Institute of Technology, China

0009-0007-1999-2600
View Profile

Authors Info & Claims

ACM Transactions on Information Systems Volume 42 Issue 2Article No.: 49pp 1–29https://doi.org/10.1145/3617829

Published:08 November 2023Publication History

ACM Transactions on Information Systems

Abstract

Document-grounded dialogue (DGD) uses documents as external knowledge for dialogue generation. Correctly understanding the dialogue context is crucial for selecting knowledge from the document and generating proper responses. In this article, we propose using a dialogue policy to help the dialogue understanding in DGD. Our dialogue policy consists of two kinds of guiding signals: utterance function and topic transfer intent. The utterance function reflects the purpose and style of an utterance, and the topic transfer intent reflects the topic and content of an utterance. We propose a novel framework exploiting our dialogue policy for two core tasks in DGD, namely, knowledge selection (KS) and response generation (RG). The framework consists of two modules: the policy planner leverages policy-aware dialogue representation to select knowledge and predict the policy of the response; the generator uses policy/knowledge-aware dialogue representation for response generation. Our policy-driven model gets state-of-the-art performance on three public benchmarks, and we provide a detailed analysis of the experimental results. Our code/data will be released on GitHub.

REFERENCES

[1] Anderson Anne H., Bader Miles, Bard Ellen Gurman, Boyle Elizabeth, Doherty Gwyneth, Garrod Simon, Isard Stephen, Kowtko Jacqueline, McAllister Jan, Miller Jim, Sotillo Catherine, Thompson Henry, and Weinert Regina. 1991. The HCRC map task corpus. Lang. Speech 34, 4 (1991), 351–366.Google ScholarCross Ref
[2] Baheti Ashutosh, Ritter Alan, Li Jiwei, and Dolan Bill. 2018. Generating more interesting responses in neural conversation models with distributional constraints. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, Riloff Ellen, Chiang David, Hockenmaier Julia, and Tsujii Jun’ichi (Eds.). Association for Computational Linguistics, 3970–3980. DOI:Google ScholarCross Ref
[3] Bunt Harry, Alexandersson Jan, Carletta Jean, Choe Jae-Woong, Fang Alex Chengyu, Hasida Kôiti, Lee Kiyong, Petukhova Volha, Popescu-Belis Andrei, Romary Laurent, Soria Claudia, and Traum David R.. 2010. Towards an ISO standard for dialogue act annotation. In Proceedings of the International Conference on Language Resources and Evaluation, Calzolari Nicoletta, Choukri Khalid, Maegaard Bente, Mariani Joseph, Odijk Jan, Piperidis Stelios, Rosner Mike, and Tapias Daniel (Eds.). European Language Resources Association. Retrieved from http://www.lrec-conf.org/proceedings/lrec2010/summaries/560.htmlGoogle Scholar
[4] Bunt Harry, Petukhova Volha, Gilmartin Emer, Pelachaud Catherine, Fang Alex Chengyu, Keizer Simon, and Prévot Laurent. 2020. The ISO standard for dialogue act annotation, second edition. In Proceedings of the 12th Language Resources and Evaluation Conference, Calzolari Nicoletta, Béchet Frédéric, Blache Philippe, Choukri Khalid, Cieri Christopher, Declerck Thierry, Goggi Sara, Isahara Hitoshi, Maegaard Bente, Mariani Joseph, Mazo Hélène, Moreno Asunción, Odijk Jan, and Piperidis Stelios (Eds.). European Language Resources Association, 549–558. Retrieved from https://aclanthology.org/2020.lrec-1.69/Google Scholar
[5] Carletta Jean, Ashby Simone, Bourban Sebastien, Flynn Mike, Guillemot Mael, Hain Thomas, Kadlec Jaroslav, Karaiskos Vasilis, Kraaij Wessel, Kronenthal Melissa, Lincoln Guillaume Lathoud Mike, Lisowska Agnes, McCowan Iain, Post Wilfried, Reidsma Dennis, and Wellner Pierre. 2005. The AMI meeting corpus: A pre-announcement. In Proceedings of the International Workshop on Machine Learning for Multimodal Interaction. Springer, 28–39.Google Scholar
[6] Chen Wenhu, Chen Jianshu, Qin Pengda, Yan Xifeng, and Wang William Yang. 2019. Semantically conditioned dialog response generation via hierarchical disentangled self-attention. In Proceedings of the 57th Conference of the Association for Computational Linguistics, Korhonen Anna, Traum David R., and Màrquez Lluís (Eds.). Association for Computational Linguistics, 3696–3709. DOI:Google ScholarCross Ref
[7] Devlin Jacob, Chang Ming-Wei, Lee Kenton, and Toutanova Kristina. 2019. BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Burstein Jill, Doran Christy, and Solorio Thamar (Eds.). Association for Computational Linguistics, 4171–4186.Google Scholar
[8] Dinan Emily, Roller Stephen, Shuster Kurt, Fan Angela, Auli Michael, and Weston Jason. 2019. Wizard of Wikipedia: Knowledge-powered conversational agents. In Proceedings of the 7th International Conference on Learning Representations. OpenReview.net. Retrieved from https://openreview.net/forum?id=r1l73iRqKmGoogle Scholar
[9] Fang Hao, Cheng Hao, Sap Maarten, Clark Elizabeth, Holtzman Ari, Choi Yejin, Smith Noah A., and Ostendorf Mari. 2018. Sounding Board: A user-centric and content-driven social chatbot. In Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics, Liu Yang, Paek Tim, and Patwardhan Manasi S. (Eds.). Association for Computational Linguistics, 96–100. DOI:Google ScholarCross Ref
[10] Feng Song, Wan Hui, Gunasekara R. Chulaka, Patel Siva Sankalp, Joshi Sachindra, and Lastras Luis A.. 2020. doc2dial: A goal-oriented document-grounded dialogue dataset. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, Webber Bonnie, Cohn Trevor, He Yulan, and Liu Yang (Eds.). Association for Computational Linguistics, 8118–8128. DOI:Google ScholarCross Ref
[11] Fleiss Joseph L.. 1971. Measuring nominal scale agreement among many raters. Psychol. Bull. 76, 5 (1971), 378–382.Google ScholarCross Ref
[12] Ghazarian Sarik, Liu Zixi, Chakrabarty Tuhin, Ma Xuezhe, Galstyan Aram, and Peng Nanyun. 2021. DiSCoL: Toward engaging dialogue systems through conversational line guided response generation. In Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Demonstrations, Sil Avi and Lin Xi Victoria (Eds.). Association for Computational Linguistics, 26–34. DOI:Google ScholarCross Ref
[13] Ghazvininejad Marjan, Brockett Chris, Chang Ming-Wei, Dolan Bill, Gao Jianfeng, Yih Wen-tau, and Galley Michel. 2018. A knowledge-grounded neural conversation model. In Proceedings of the 32nd AAAI Conference on Artificial Intelligence (AAAI’18), the 30th Innovative Applications of Artificial Intelligence (IAAI’18), and the 8th AAAI Symposium on Educational Advances in Artificial Intelligence (EAAI’18), McIlraith Sheila A. and Weinberger Kilian Q. (Eds.). AAAI Press, 5110–5117. Retrieved from https://www.aaai.org/ocs/index.php/AAAI/AAAI18/paper/view/16710Google ScholarCross Ref
[14] Godfrey John J., Holliman Edward, and McDaniel Jane. 1992. SWITCHBOARD: Telephone speech corpus for research and development. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing. IEEE Computer Society, 517–520. DOI:Google ScholarCross Ref
[15] Gopalakrishnan Karthik, Hedayatnia Behnam, Chen Qinglang, Gottardi Anna, Kwatra Sanjeev, Venkatesh Anu, Gabriel Raefer, and Hakkani-Tür Dilek. 2019. Topical-Chat: Towards knowledge-grounded open-domain conversations. In Proceedings of the Interspeech Conference, Kubin Gernot and Kacic Zdravko (Eds.). ISCA, 1891–1895. DOI:Google ScholarCross Ref
[16] Hazarika Devamanyu, Namazifar Mahdi, and Hakkani-Tür Dilek. 2022. Attention biasing and context augmentation for zero-shot control of encoder-decoder transformers for natural language generation. In Proceedings of the 36th AAAI Conference on Artificial Intelligence (AAAI’22), 34th Conference on Innovative Applications of Artificial Intelligence (IAAI 2022) 12th Symposium on Educational Advances in Artificial Intelligence (EAAI’22). AAAI Press, 10738–10748. Retrieved from https://ojs.aaai.org/index.php/AAAI/article/view/21319Google ScholarCross Ref
[17] He Wanwei, Dai Yinpei, Hui Binyuan, Yang Min, Cao Zheng, Dong Jianbo, Huang Fei, Si Luo, and Li Yongbin. 2022. SPACE-2: Tree-structured semi-supervised contrastive pre-training for task-oriented dialog understanding. In Proceedings of the 29th International Conference on Computational Linguistics, Calzolari Nicoletta, Huang Chu-Ren, Kim Hansaem, Pustejovsky James, Wanner Leo, Choi Key-Sun, Ryu Pum-Mo, Chen Hsin-Hsi, Donatelli Lucia, Ji Heng, Kurohashi Sadao, Paggio Patrizia, Xue Nianwen, Kim Seokhwan, Hahm Younggyun, He Zhong, Lee Tony Kyungil, Santus Enrico, Bond Francis, and Na Seung-Hoon (Eds.). International Committee on Computational Linguistics, 553–569. Retrieved from https://aclanthology.org/2022.coling-1.46Google Scholar
[18] He Wanwei, Dai Yinpei, Yang Min, Sun Jian, Huang Fei, Si Luo, and Li Yongbin. 2022. Unified dialog model pre-training for task-oriented dialog understanding and generation. In Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Amigó Enrique, Castells Pablo, Gonzalo Julio, Carterette Ben, Culpepper J. Shane, and Kazai Gabriella (Eds.). ACM, 187–200. DOI:Google ScholarDigital Library
[19] He Wanwei, Dai Yinpei, Zheng Yinhe, Wu Yuchuan, Cao Zheng, Liu Dermot, Jiang Peng, Yang Min, Huang Fei, Si Luo, Sun Jian, and Li Yongbin. 2022. GALAXY: A generative pre-trained model for task-oriented dialog with semi-supervised learning and explicit policy injection. In Proceedings of the 36th AAAI Conference on Artificial Intelligence (AAAI’22), 34th Conference on Innovative Applications of Artificial Intelligence (IAAI’22), 12th Symposium on Educational Advances in Artificial Intelligence (EAAI’22). AAAI Press, 10749–10757. Retrieved from https://ojs.aaai.org/index.php/AAAI/article/view/21320Google ScholarCross Ref
[20] Hedayatnia Behnam, Gopalakrishnan Karthik, Kim Seokhwan, Liu Yang, Eric Mihail, and Hakkani-Tür Dilek. 2020. Policy-driven neural response generation for knowledge-grounded dialog systems. In Proceedings of the 13th International Conference on Natural Language Generation, Davis Brian, Graham Yvette, Kelleher John D., and Sripada Yaji (Eds.). Association for Computational Linguistics, 412–421. Retrieved from https://aclanthology.org/2020.inlg-1.46/Google Scholar
[21] Kawano Seiya, Yoshino Koichiro, and Nakamura Satoshi. 2019. Neural conversation model controllable by given dialogue act based on adversarial learning and label-aware objective. In Proceedings of the International Conference on Natural Language Generation. Association for Computational Linguistics, 198–207.Google ScholarCross Ref
[22] Kendall Alex, Gal Yarin, and Cipolla Roberto. 2018. Multi-task learning using uncertainty to weigh losses for scene geometry and semantics. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Computer Vision Foundation/IEEE Computer Society, 7482–7491. DOI:Google ScholarCross Ref
[23] Kim Byeongchang, Ahn Jaewoo, and Kim Gunhee. 2020. Sequential latent knowledge selection for knowledge-grounded dialogue. In Proceedings of the 8th International Conference on Learning Representations. OpenReview.net.Google Scholar
[24] Lewis Mike, Liu Yinhan, Goyal Naman, Ghazvininejad Marjan, Mohamed Abdelrahman, Levy Omer, Stoyanov Veselin, and Zettlemoyer Luke. 2020. BART: Denoising sequence-to-sequence pre-training for natural language generation, translation, and comprehension. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Jurafsky Dan, Chai Joyce, Schluter Natalie, and Tetreault Joel R. (Eds.). Association for Computational Linguistics, 7871–7880. Retrieved from https://www.aclweb.org/anthology/2020.acl-main.703/Google ScholarCross Ref
[25] Li Jiwei, Galley Michel, Brockett Chris, Gao Jianfeng, and Dolan Bill. 2016. A diversity-promoting objective function for neural conversation models. In Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. The Association for Computational Linguistics, 110–119.Google ScholarCross Ref
[26] Li Yanran, Su Hui, Shen Xiaoyu, Li Wenjie, Cao Ziqiang, and Niu Shuzi. 2017. DailyDialog: A manually labelled multi-turn dialogue dataset. In Proceedings of the 8th International Joint Conference on Natural Language Processing, Kondrak Greg and Watanabe Taro (Eds.). Asian Federation of Natural Language Processing, 986–995. Retrieved from https://aclanthology.org/I17-1099/Google Scholar
[27] Li Zekang, Niu Cheng, Meng Fandong, Feng Yang, Li Qian, and Zhou Jie. 2019. Incremental transformer with deliberation decoder for document grounded conversations. In Proceedings of the 57th Conference of the Association for Computational Linguistics, Korhonen Anna, Traum David R., and Màrquez Lluís (Eds.). Association for Computational Linguistics, 12–21. DOI:Google ScholarCross Ref
[28] Lin Chin-Yew. 2004. Rouge: A package for automatic evaluation of summaries. In Text Summarization Branches Out. Association for Computational Linguistics, 74–81.Google Scholar
[29] Liu Yinhan, Ott Myle, Goyal Naman, Du Jingfei, Joshi Mandar, Chen Danqi, Levy Omer, Lewis Mike, Zettlemoyer Luke, and Stoyanov Veselin. 2019. RoBERTa: A robustly optimized BERT pretraining approach. CoRR abs/1907.11692 (2019).Google Scholar
[30] Meng Chuan, Ren Pengjie, Chen Zhumin, Monz Christof, Ma Jun, and Rijke Maarten de. 2020. RefNet: A reference-aware network for background based conversation. In Proceedings of the 34th AAAI Conference on Artificial Intelligence (AAAI’20) 32nd Innovative Applications of Artificial Intelligence Conference (IAAI’20), 10th AAAI Symposium on Educational Advances in Artificial Intelligence (EAAI’20). AAAI Press, 8496–8503. Retrieved from https://ojs.aaai.org/index.php/AAAI/article/view/6370Google ScholarCross Ref
[31] Meng Chuan, Ren Pengjie, Chen Zhumin, Sun Weiwei, Ren Zhaochun, Tu Zhaopeng, and Rijke Maarten de. 2020. DukeNet: A dual knowledge interaction network for knowledge-grounded conversation. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval, Huang Jimmy, Chang Yi, Cheng Xueqi, Kamps Jaap, Murdock Vanessa, Wen Ji-Rong, and Liu Yiqun (Eds.). ACM, 1151–1160. DOI:Google ScholarDigital Library
[32] Mezza Stefano, Cervone Alessandra, Stepanov Evgeny A., Tortoreto Giuliano, and Riccardi Giuseppe. 2018. ISO-standard domain-independent dialogue act tagging for conversational agents. In Proceedings of the 27th International Conference on Computational Linguistics, Bender Emily M., Derczynski Leon, and Isabelle Pierre (Eds.). Association for Computational Linguistics, 3539–3551. Retrieved from https://aclanthology.org/C18-1300/Google Scholar
[33] Moghe Nikita, Arora Siddhartha, Banerjee Suman, and Khapra Mitesh M.. 2018. Towards exploiting background knowledge for building conversation systems. In Proceedings of the Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 2322–2332.Google ScholarCross Ref
[34] Papineni Kishore, Roukos Salim, Ward Todd, and Zhu Wei-Jing. 2002. BLEU: A method for automatic evaluation of machine translation. In Proceedings of the Association for Computational Linguistics. ACL, 311–318.Google Scholar
[35] Prabhumoye Shrimai, Hashimoto Kazuma, Zhou Yingbo, Black Alan W., and Salakhutdinov Ruslan. 2021. Focused attention improves document-grounded generation. In Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Toutanova Kristina, Rumshisky Anna, Zettlemoyer Luke, Hakkani-Tür Dilek, Beltagy Iz, Bethard Steven, Cotterell Ryan, Chakraborty Tanmoy, and Zhou Yichao (Eds.). Association for Computational Linguistics, 4274–4287. DOI:Google ScholarCross Ref
[36] Qin Lianhui, Galley Michel, Brockett Chris, Liu Xiaodong, Gao Xiang, Dolan Bill, Choi Yejin, and Gao Jianfeng. 2019. Conversing by reading: Contentful neural conversation with on-demand machine reading. In Proceedings of the Association for Computational Linguistics Conference. Association for Computational Linguistics, 5427–5436.Google ScholarCross Ref
[37] Radford Alec, Wu Jeffrey, Child Rewon, Luan David, Amodei Dario, and Sutskever Ilya. 2019. Language models are unsupervised multitask learners. OpenAI Blog 1, 8 (2019).Google Scholar
[38] Rashkin Hannah, Reitter David, Tomar Gaurav Singh, and Das Dipanjan. 2021. Increasing faithfulness in knowledge-grounded dialogue with controllable features. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, Zong Chengqing, Xia Fei, Li Wenjie, and Navigli Roberto (Eds.). Association for Computational Linguistics, 704–718. DOI:Google ScholarCross Ref
[39] Ren Pengjie, Chen Zhumin, Monz Christof, Ma Jun, and Rijke Maarten de. 2020. Thinking globally, acting locally: Distantly supervised global-to-local knowledge selection for background based conversation. In Proceedings of the 34th AAAI Conference on Artificial Intelligence. AAAI Press, 8697–8704. Retrieved from https://ojs.aaai.org/index.php/AAAI/article/view/6395Google ScholarCross Ref
[40] Ren Pengjie, Liu Zhongkun, Song Xiaomeng, Tian Hongtao, Chen Zhumin, Ren Zhaochun, and Rijke Maarten de. 2021. Wizard of search engine: Access to information through conversations with search engines. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, Diaz Fernando, Shah Chirag, Suel Torsten, Castells Pablo, Jones Rosie, and Sakai Tetsuya (Eds.). ACM, 533–543. DOI:Google ScholarDigital Library
[41] Saha Sougata, Das Souvik, and Srihari Rohini K.. 2022. Stylistic response generation by controlling personality traits and intent. In Proceedings of the 4th Workshop on NLP for Conversational AI, Liu Bing, Papangelis Alexandros, Ultes Stefan, Rastogi Abhinav, Chen Yun-Nung, Spithourakis Georgios, Nouri Elnaz, and Shi Weiyan (Eds.). Association for Computational Linguistics, 197–211. DOI:Google ScholarCross Ref
[42] Sankar Chinnadhurai and Ravi Sujith. 2019. Deep reinforcement learning for modeling chit-chat dialog with discrete attributes. In Proceedings of the 20th Annual SIGdial Meeting on Discourse and Dialogue, Nakamura Satoshi, Gasic Milica, Zuckerman Ingrid, Skantze Gabriel, Nakano Mikio, Papangelis Alexandros, Ultes Stefan, and Yoshino Koichiro (Eds.). Association for Computational Linguistics, 1–10. DOI:Google ScholarCross Ref
[43] Sutton Richard S., McAllester David A., Singh Satinder, and Mansour Yishay. 1999. Policy gradient methods for reinforcement learning with function approximation. In Proceedings of the NIPS Conference, Solla Sara A., Leen Todd K., and Müller Klaus-Robert (Eds.). The MIT Press, 1057–1063. Retrieved from http://papers.nips.cc/paper/1713-policy-gradient-methods-for-reinforcement-learning-with-function-approximationGoogle Scholar
[44] Vaswani Ashish, Shazeer Noam, Parmar Niki, Uszkoreit Jakob, Jones Llion, Gomez Aidan N., Kaiser Lukasz, and Polosukhin Illia. 2017. Attention is all you need. In Proceedings of the NIPS Conference. 5998–6008.Google Scholar
[45] Wang Kai, Tian Junfeng, Wang Rui, Quan Xiaojun, and Yu Jianxing. 2020. Multi-domain dialogue acts and response co-generation. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Jurafsky Dan, Chai Joyce, Schluter Natalie, and Tetreault Joel R. (Eds.). Association for Computational Linguistics, 7125–7134. DOI:Google ScholarCross Ref
[46] Wang Xinyi, Weston Jason, Auli Michael, and Jernite Yacine. 2019. Improving conditioning in context-aware sequence to sequence models. CoRR abs/1911.09728 (2019).Google Scholar
[47] Wu Zeqiu, Lu Bo-Ru, Hajishirzi Hannaneh, and Ostendorf Mari. 2021. DIALKI: Knowledge identification in conversational systems through dialogue-document contextualization. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, Moens Marie-Francine, Huang Xuanjing, Specia Lucia, and Yih Scott Wen-tau (Eds.). Association for Computational Linguistics, 1852–1863. DOI:Google ScholarCross Ref
[48] Xu Lin, Zhou Qixian, Fu Jinlan, Kan Min-Yen, and Ng See-Kiong. 2022. CorefDiffs: Co-referential and differential knowledge flow in document grounded conversations. In Proceedings of the 29th International Conference on Computational Linguistics, Calzolari Nicoletta, Huang Chu-Ren, Kim Hansaem, Pustejovsky James, Wanner Leo, Choi Key-Sun, Ryu Pum-Mo, Chen Hsin-Hsi, Donatelli Lucia, Ji Heng, Kurohashi Sadao, Paggio Patrizia, Xue Nianwen, Kim Seokhwan, Hahm Younggyun, He Zhong, Lee Tony Kyungil, Santus Enrico, Bond Francis, and Na Seung-Hoon (Eds.). International Committee on Computational Linguistics, 471–484. Retrieved from https://aclanthology.org/2022.coling-1.38Google Scholar
[49] Yu Dian, Cohn Michelle, Yang Yi Mang, Chen Chun-Yen, Wen Weiming, Zhang Jiaping, Zhou Mingyang, Jesse Kevin, Chau Austin, Bhowmick Antara, Iyer Shreenath, Sreenivasulu Giritheja, Davidson Sam, Bhandare Ashwin, and Yu Zhou. 2019. Gunrock: A social bot for complex and engaging long conversations. In Proceedings of the Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, Padó Sebastian and Huang Ruihong (Eds.). Association for Computational Linguistics, 79–84. DOI:Google ScholarCross Ref
[50] Yu Zhou, Xu Ziyu, Black Alan W., and Rudnicky Alexander I.. 2016. Strategy and policy learning for non-task-oriented conversational systems. In Proceedings of the 17th Annual Meeting of the Special Interest Group on Discourse and Dialogue. The Association for Computer Linguistics, 404–412. DOI:Google ScholarCross Ref
[51] Zhan Haolan, Shen Lei, Chen Hongshen, and Zhang Hainan. 2021. CoLV: A collaborative latent variable model for knowledge-grounded dialogue generation. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, Moens Marie-Francine, Huang Xuanjing, Specia Lucia, and Yih Scott Wen-tau (Eds.). Association for Computational Linguistics, 2250–2261. DOI:Google ScholarCross Ref
[52] Zhan Haolan, Zhang Hainan, Chen Hongshen, Ding Zhuoye, Bao Yongjun, and Lan Yanyan. 2021. Augmenting knowledge-grounded conversations with sequential knowledge transition. In Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Toutanova Kristina, Rumshisky Anna, Zettlemoyer Luke, Hakkani-Tür Dilek, Beltagy Iz, Bethard Steven, Cotterell Ryan, Chakraborty Tanmoy, and Zhou Yichao (Eds.). Association for Computational Linguistics, 5621–5630. DOI:Google ScholarCross Ref
[53] Zhao Xueliang, Wu Wei, Xu Can, Tao Chongyang, Zhao Dongyan, and Yan Rui. 2020. Knowledge-grounded dialogue generation with pre-trained language models. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, Webber Bonnie, Cohn Trevor, He Yulan, and Liu Yang (Eds.). Association for Computational Linguistics, 3377–3390. DOI:Google ScholarCross Ref
[54] Zhou Hao, Young Tom, Huang Minlie, Zhao Haizhou, Xu Jingfang, and Zhu Xiaoyan. 2018. Commonsense knowledge aware conversation generation with graph attention. In Proceedings of the International Joint Conference on Artificial Intelligence. ijcai.org, 4623–4629.Google ScholarCross Ref
[55] Zhou Kangyan, Prabhumoye Shrimai, and Black Alan W.. 2018. A dataset for document grounded conversations. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, Riloff Ellen, Chiang David, Hockenmaier Julia, and Tsujii Jun’ichi (Eds.). Association for Computational Linguistics, 708–713. DOI:Google ScholarCross Ref

Index Terms

Policy-driven Knowledge Selection and Response Generation for Document-grounded Dialogue
1. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing

Recommendations

Knowledge-Grounded Dialogue Generation with Contrastive Knowledge Selection
Web Information Systems Engineering – WISE 2023
Abstract
Knowledge selection is the key component in knowledge-ground dialogues, which aims to choice correct knowledge based on external knowledge for dialogue generation. The quality of knowledge selection depend on knowledge representation methods. ...
Read More
Aspect-Aware Response Generation for Multimodal Dialogue System
Survey Paper and Regular Paper

Multimodality in dialogue systems has opened up new frontiers for the creation of robust conversational agents. Any multimodal system aims at bridging the gap between language and vision by leveraging diverse and often complementary information from ...
Read More
Prediction, selection, and generation: a knowledge-driven conversation system
Abstract
In conversational systems, we can use external knowledge to generate more diverse sentences and make these sentences contain actual knowledge. Leveraging knowledge for conversation system is important but challenging. Firstly, the conversation ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Published in
ACM Transactions on Information Systems Volume 42, Issue 2
March 2024
897 pages
ISSN:1046-8188
EISSN:1558-2868
DOI:10.1145/3618075
Editor:
Min Zhang
Tsinghua University, China
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 8 November 2023
- Online AM: 29 August 2023
- Accepted: 19 August 2023
- Revised: 23 June 2023
- Received: 10 March 2023
Published in tois Volume 42, Issue 2

Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
Document-grounded dialogue
knowledge selection
response generation
Qualifiers
- research-article
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 323
  Total Downloads
- Downloads (Last 12 months)323
- Downloads (Last 6 weeks)25
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Full Text

View this article in Full Text.

View Full Text

Policy-driven Knowledge Selection and Response Generation for Document-grounded Dialogue

ACM Transactions on Information Systems

Abstract

REFERENCES

Cited By

Index Terms

Recommendations

Knowledge-Grounded Dialogue Generation with Contrastive Knowledge Selection

Aspect-Aware Response Generation for Multimodal Dialogue System

Prediction, selection, and generation: a knowledge-driven conversation system