skip to main content
research-article

Conco-ERNIE: Complex User Intent Detect Model for Smart Healthcare Cognitive Bot

Published:23 February 2023Publication History
Skip Abstract Section

Abstract

The outbreak of Covid-19 has exposed the lack of medical resources, especially the lack of medical personnel. This results in time and space restrictions for medical services, and patients cannot obtain health information all the time and everywhere. Based on the medical knowledge graph, healthcare bots alleviate this burden effectively by providing patients with diagnosis guidance, pre-diagnosis, and post-diagnosis consultation services in the way of human-machine dialogue. However, the medical utterance is more complicated in language structure, and there are complex intention phenomena in semantics. It is a challenge to detect the single intent, multi-intent, and implicit intent of a patient’s utterance. To this end, we create a high-quality annotated Chinese Medical query (utterance) dataset, CMedQ (about 16.8k queries in medical domain which includes single, multiple, and implicit intents). It is hard to detect intent on such a complex dataset through traditional text classification models. Thus, we propose a novel detect model Conco-ERNIE, using concept co-occurrence patterns to enhance the representation of pre-trained model ERNIE. These patterns are mined using Apriori algorithm and will be embedded via Node2Vec. Their features will be aggregated with semantic features into Conco-ERNIE by using an attention module, which can catch user explicit intents and also predict user implicit intents. Experiments on CMedQ demonstrates that Conco-ERNIE achieves outstanding performance over baseline. Based on Conco-ERNIE, we develop an intelligent healthcare bot, MedicalBot. To provide knowledge support for MedicalBot, we also build a Chinese medical graph, CMedKG (about 45k entities and 283k relationships).

REFERENCES

  1. [1] Adlassnig Klaus-Peter. 1986. Fuzzy set theory in medical diagnosis. IEEE Transactions on Systems, Man, and Cybernetics 16, 2 (1986), 260265.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. [2] Agrawal Rakesh, Srikant Ramakrishnan, et al. 1994. Fast algorithms for mining association rules. In Proc. 20th Int. Conf. Very Large Data Bases, VLDB, Vol. 1215. 487499.Google ScholarGoogle Scholar
  3. [3] Bai Guirong, He Shizhu, Liu Kang, and Zhao Jun. 2022. Incremental intent detection for medical domain with contrast replay networks. In Findings of the Association for Computational Linguistics: ACL, Dublin, Ireland, May 22–27. 35493556.Google ScholarGoogle Scholar
  4. [4] ballero Jovan Carlo S. Ca, Ching Carl Benedick D., Co Sean Patrick S., Noble Hazel O., and Barcelo Arne B.. 2021. LifeDoc: Availability and monitoring system of online medical consultation. In 11th IEEE International Conference on Control System, Computing and Engineering, ICCSCE 2021, Penang, Malaysia, August 27–28, 2021. 103108.Google ScholarGoogle Scholar
  5. [5] Cai Ruichu, Zhu Binjun, Ji Lei, Hao Tianyong, Yan Jun, and Liu Wenyin. [n.d.]. An CNN-LSTM attention approach to understanding user query intent from online health communities. In 2017 IEEE International Conference on Data Mining Workshops, ICDM Workshops 2017, New Orleans, LA, USA, November 18–21, 2017. 430437.Google ScholarGoogle Scholar
  6. [6] Chen Nan, Su Xiangdong, Liu Tongyang, Hao Qizhi, and Wei Ming. 2020. A benchmark dataset and case study for Chinese medical question intent classification. BMC Medical Informatics Decis. Mak. 20-S, 3 (2020), 125.Google ScholarGoogle ScholarCross RefCross Ref
  7. [7] Chen Ziheng and Ren Jiangtao. 2021. Multi-label text classification with latent word-wise label information. Appl. Intell. 51, 2 (2021), 966979.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. [8] Devlin Jacob, Chang Ming-Wei, Lee Kenton, and Toutanova Kristina. 2018. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018).Google ScholarGoogle Scholar
  9. [9] Grover Aditya and Leskovec Jure. 2016. node2vec: Scalable feature learning for networks. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 855864.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. [10] Guo Zhaohan Daniel, Tür Gökhan, Yih Wen-tau, and Zweig Geoffrey. [n.d.]. Joint semantic utterance classification and slot filling with recursive neural networks. In 2014 IEEE Spoken Language Technology Workshop, SLT 2014, South Lake Tahoe, NV, USA, December 7–10, 2014. 554559.Google ScholarGoogle Scholar
  11. [11] Haihong E., Niu Peiqing, Chen Zhongfu, and Song Meina. 2019. A novel bi-directional interrelated model for joint intent detection and slot filling. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. 54675471.Google ScholarGoogle Scholar
  12. [12] Huang Zhiheng, Xu Wei, and Yu Kai. 2015. Bidirectional LSTM-CRF models for sequence tagging. CoRR abs/1508.01991 (2015). http://arxiv.org/abs/1508.01991.Google ScholarGoogle Scholar
  13. [13] Johnson Rie and Zhang Tong. 2016. Supervised and semi-supervised text categorization using LSTM for region embeddings. In Proceedings of the 33rd International Conference on Machine Learning, ICML 2016, Balcan Maria-Florina and Weinberger Kilian Q. (Eds.), Vol. 48. 526534.Google ScholarGoogle Scholar
  14. [14] Kingma Diederik P. and Ba Jimmy. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).Google ScholarGoogle Scholar
  15. [15] Lai Siwei, Xu Liheng, Liu Kang, and Zhao Jun. 2015. Recurrent convolutional neural networks for text classification. In Twenty-ninth AAAI Conference on Artificial Intelligence.Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. [16] Lin C. Y., Wu Y. H., and Chen Alp. 2021. Selecting the most helpful answers in online health question answering communities. Journal of Intelligent Information Systems3 (2021).Google ScholarGoogle Scholar
  17. [17] Maeda Haruhisa, Saiki Sachio, Nakamura Masahide, and Yasuda Kiyoshi. 2019. Recording daily health status with chatbot on mobile phone - A preliminary study. In Twelfth International Conference on Mobile Computing and Ubiquitous Network, ICMU 2019, Kathmandu, Nepal, November 4–6, 2019. IEEE, 16.Google ScholarGoogle ScholarCross RefCross Ref
  18. [18] Mao Song, Zhang Lu-Lu, and Guan Zhen-Guo. 2021. An LSTM&Topic-CNN model for classification of online Chinese medical questions. IEEE Access (2021), 5258052589.Google ScholarGoogle ScholarCross RefCross Ref
  19. [19] Minaee Shervin, Kalchbrenner Nal, Cambria Erik, Nikzad Narjes, Chenaghlu Meysam, and Gao Jianfeng. 2021. Deep learning-based text classification: A comprehensive review. ACM Comput. Surv. 54, 3 (2021), 62:1–62:40.Google ScholarGoogle Scholar
  20. [20] Neuraz Antoine, Llanos Leonardo Campillos, Burgun Anita, and Rosset Sophie. 2018. Natural language understanding for task oriented dialog in the biomedical domain in a low resources context. arXiv preprint arXiv:1811.09417 (2018).Google ScholarGoogle Scholar
  21. [21] Peters Matthew E., Neumann Mark, Iyyer Mohit, Gardner Matt, Clark Christopher, Lee Kenton, and Zettlemoyer Luke. 2018. Deep contextualized word representations. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2018, New Orleans, Louisiana, USA, June 1–6, 2018, Volume 1 (Long Papers). 22272237.Google ScholarGoogle ScholarCross RefCross Ref
  22. [22] Shim Heereen, Lowet Dietwig, Luca Stijn, and Vanrumste Bart. 2021. Building blocks of a task-oriented dialogue system in the healthcare domain. In Proceedings of the Second Workshop on Natural Language Processing for Medical Conversations. 4757.Google ScholarGoogle ScholarCross RefCross Ref
  23. [23] Stanton Isabelle, Ieong Samuel, and Mishra Nina. 2014. Circumlocution in diagnostic medical queries. In Proceedings of the 37th International ACM SIGIR Conference on Research & Development in Information Retrieval. 133142.Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. [24] Sun Yu, Wang Shuohuan, Li Yu-Kun, Feng Shikun, Tian Hao, Wu Hua, and Wang Haifeng. 2020. ERNIE 2.0: A continual pre-training framework for language understanding. In The Thirty-Fourth AAAI Conference on Artificial Intelligence, AAAI 2020. AAAI Press, 89688975.Google ScholarGoogle ScholarCross RefCross Ref
  25. [25] Wang Jixuan, Wei Kai, Radfar Martin, Zhang Weiwei, and Chung Clement. 2021. Encoding syntactic knowledge in transformer encoder for intent detection and slot filling. In Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021. 1394313951.Google ScholarGoogle ScholarCross RefCross Ref
  26. [26] Wei Zhongyu, Liu Qianlong, Peng Baolin, Tou Huaixiao, Chen Ting, Huang Xuan-Jing, Wong Kam-Fai, and Dai Xiang. 2018. Task-oriented dialogue system for automatic diagnosis. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). 201207.Google ScholarGoogle ScholarCross RefCross Ref
  27. [27] Wu Chaochen, Luo Guan, Guo Chao, Ren Yin, Zheng Anni, and Yang Cheng. 2020. An attention-based multi-task model for named entity recognition and intent analysis of Chinese online medical questions. J. Biomed. Informatics 108 (2020), 103511.Google ScholarGoogle ScholarCross RefCross Ref
  28. [28] Wu Di, Ding Liang, Lu Fan, and Xie Jian. 2020. SlotRefine: A fast non-autoregressive model for joint intent detection and slot filling. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, EMNLP 2020, Online, November 16–20, 2020. 19321937.Google ScholarGoogle ScholarCross RefCross Ref
  29. [29] Xu Lin, Zhou Qixian, Gong Ke, Liang Xiaodan, Tang Jianheng, and Lin Liang. 2019. End-to-end knowledge-routed relational dialogue system for automatic diagnosis. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 33. 73467353.Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. [30] Xu Mengting, Cao Yanrong, Sun Guozi, and Li Huakang. 2019. CRQA: Credibility retrieval for medical question answer service. In 2019 IEEE International Conference on Real-time Computing and Robotics (RCAR). IEEE, 347350.Google ScholarGoogle ScholarCross RefCross Ref
  31. [31] Yuan Man, Ouyang Yuan Xin, and Xiong Zhang. 2013. A text categorization method using extended vector space model by frequent term sets. Journal of Information Science and Engineering 29, 1 (2013), 99114.Google ScholarGoogle Scholar
  32. [32] Zhang Chenwei, Du Nan, Fan Wei, Li Yaliang, Lu Chun-Ta, and Philip S. Yu. 2017. Bringing semantic structures to user intent detection in online medical queries. In 2017 IEEE International Conference on Big Data (Big Data). IEEE, 10191026.Google ScholarGoogle ScholarCross RefCross Ref
  33. [33] Zhang Chenwei, Fan Wei, Du Nan, and Yu Philip S.. 2016. Mining user intentions from medical queries: A neural network based heterogeneous jointly modeling approach. In Proceedings of the 25th International Conference on World Wide Web, WWW 2016, Montreal, Canada, April 11–15, 2016. ACM, 13731384.Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. [34] Zhang Ningyu, Chen Mosha, Bi Zhen, Liang Xiaozhuan, Li Lei, Shang Xin, Yin Kangping, Tan Chuanqi, Xu Jian, Huang Fei, Si Luo, Ni Yuan, Xie Guotong, Sui Zhifang, Chang Baobao, Zong Hui, Yuan Zheng, Li Linfeng, Yan Jun, Zan Hongying, Zhang Kunli, Tang Buzhou, and Chen Qingcai. [n.d.]. CBLUE: A Chinese biomedical language understanding evaluation benchmark. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), ACL 2022, Dublin, Ireland, May 22–27, 2022. 78887915.Google ScholarGoogle Scholar
  35. [35] Zhou Binggui, Yang Guanghua, Shi Zheng, and Ma Shaodan. 2021. Natural language processing for smart healthcare. arXiv preprint arXiv:2110.15803 (2021).Google ScholarGoogle Scholar
  36. [36] Zhu Wei, Ni Yuan, Wang Xiaoling, and Xie Guotong. 2021. Discovering better model architectures for medical query understanding. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Industry Papers. 230237.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Conco-ERNIE: Complex User Intent Detect Model for Smart Healthcare Cognitive Bot

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in

        Full Access

        • Published in

          cover image ACM Transactions on Internet Technology
          ACM Transactions on Internet Technology  Volume 23, Issue 1
          February 2023
          564 pages
          ISSN:1533-5399
          EISSN:1557-6051
          DOI:10.1145/3584863
          • Editor:
          • Ling Liu
          Issue’s Table of Contents

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 23 February 2023
          • Online AM: 8 December 2022
          • Accepted: 27 November 2022
          • Revised: 20 September 2022
          • Received: 6 October 2021
          Published in toit Volume 23, Issue 1

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article
        • Article Metrics

          • Downloads (Last 12 months)258
          • Downloads (Last 6 weeks)25

          Other Metrics

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Full Text

        View this article in Full Text.

        View Full Text

        HTML Format

        View this article in HTML Format .

        View HTML Format