ABSTRACT
Matching is the key problem in both search and recommendation, that is to measure the relevance of a document to a query or the interest of a user on an item. Previously, machine learning methods have been exploited to address the problem, which learns a matching function from labeled data, also referred to as "learning to match''. In recent years, deep learning has been successfully applied to matching and significant progresses have been made. Deep semantic matching models for search and neural collaborative filtering models for recommendation are becoming the state-of-the-art technologies. The key to the success of the deep learning approach is its strong ability in learning of representations and generalization of matching patterns from raw data (e.g., queries, documents, users, and items, particularly in their raw forms). In this tutorial, we aim to give a comprehensive survey on recent progress in deep learning for matching in search and recommendation. Our tutorial is unique in that we try to give a unified view on search and recommendation. In this way, we expect researchers from the two fields can get deep understanding and accurate insight on the spaces, stimulate more ideas and discussions, and promote developments of technologies. The tutorial mainly consists of three parts. Firstly, we introduce the general problem of matching, which is fundamental in both search and recommendation. Secondly, we explain how traditional machine learning techniques are utilized to address the matching problem in search and recommendation. Lastly, we elaborate how deep learning can be effectively used to solve the matching problems in both tasks.
- Nicholas J. Belkin and W. Bruce Croft . 1992. Information Filtering and Information Retrieval: Two Sides of the Same Coin? Commun. ACM Vol. 35, 12 (1992), 29--38. Google ScholarDigital Library
- Adam Berger and John Lafferty . 1999. Information Retrieval As Statistical Translation. In Proceedings of the 22Nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '99). ACM, New York, NY, USA, 222--229. Google ScholarDigital Library
- Jingyuan Chen, Hanwang Zhang, Xiangnan He, Liqiang Nie, Wei Liu, and Tat-Seng Chua . 2017. Attentive Collaborative Filtering: Multimedia Recommendation with Item- and Component-Level Attention. In Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '17). ACM, New York, NY, USA, 335--344. Google ScholarDigital Library
- Paul Covington, Jay Adams, and Emre Sargin . 2016. Deep Neural Networks for YouTube Recommendations. In Proceedings of the 10th ACM Conference on Recommender Systems (RecSys '16). ACM, New York, NY, USA, 191--198. Google ScholarDigital Library
- Nick Craswell, W Bruce Croft, Maarten de Rijke, Jiafeng Guo, and Bhaskar Mitra . 2017. SIGIR 2017 Workshop on Neural Information Retrieval (Neu-IR'17) Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '17). ACM, New York, NY, USA, 1431--1432. Google ScholarDigital Library
- Zhuyun Dai, Chenyan Xiong, Jamie Callan, and Zhiyuan Liu . 2018. Convolutional Neural Networks for Soft-Matching N-Grams in Ad-hoc Search Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining (WSDM '18). ACM, New York, NY, USA, 126--134. Google ScholarDigital Library
- Jianfeng Gao, Xiaodong He, and Jian-Yun Nie . 2010. Clickthrough-based Translation Models for Web Search: From Word Models to Phrase Models. In Proceedings of the 19th ACM International Conference on Information and Knowledge Management (CIKM '10). ACM, New York, NY, USA, 1139--1148. Google ScholarDigital Library
- Hector Garcia-Molina, Georgia Koutrika, and Aditya Parameswaran . 2011. Information seeking: convergence of search, recommendations, and advertising. Commun. ACM Vol. 54, 11 (2011), 121--130. Google ScholarDigital Library
- Julio Gonzalo, Hang Li, Alessandro Moschitti, and Jun Xu . 2014. SIGIR 2014 Workshop on Semantic Matching in Information Retrieval Proceedings of the 37th International ACM SIGIR Conference on Research & Development in Information Retrieval (SIGIR '14). ACM, New York, NY, USA, 1296--1296. Google ScholarDigital Library
- Xiangnan He and Tat-Seng Chua . 2017. Neural Factorization Machines for Sparse Predictive Analytics Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '17). ACM, New York, NY, USA, 355--364. Google ScholarDigital Library
- Xiangnan He, Xiaoyu Du, Xiang Wang, Feng Tian, Jinhui Tang, and Tat-Seng Chua . 2018. Out Product-based Neural Collaborative Filtering. In Proceedings of the 27th International Joint Conference on Artificial Intelligence (IJCAI'18). AAAI Press.Google ScholarDigital Library
- Xiangnan He, Lizi Liao, Hanwang Zhang, Liqiang Nie, Xia Hu, and Tat-Seng Chua . 2017. Neural Collaborative Filtering. In Proceedings of the 26th International Conference on World Wide Web (WWW '17). International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva, Switzerland, 173--182. Google ScholarDigital Library
- Xiangnan He, Hanwang Zhang, Min-Yen Kan, and Tat-Seng Chua . 2016. Fast Matrix Factorization for Online Recommendation with Implicit Feedback Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '16). ACM, New York, NY, USA, 549--558. Google ScholarDigital Library
- Balázs Hidasi, Alexandros Karatzoglou, Oren Sar-Shalom, Sander Dieleman, Bracha Shapira, and Domonkos Tikk . 2017. DLRS 2017: Second Workshop on Deep Learning for Recommender Systems Proceedings of the Eleventh ACM Conference on Recommender Systems (RecSys '17). ACM, New York, NY, USA, 370--371. Google ScholarDigital Library
- Baotian Hu, Zhengdong Lu, Hang Li, and Qingcai Chen . 2014. Convolutional Neural Network Architectures for Matching Natural Language Sentences. In Advances in Neural Information Processing Systems 27, bibfieldeditorZ. Ghahramani, M. Welling, C. Cortes, N. D. Lawrence, and K. Q. Weinberger (Eds.). Curran Associates, Inc., 2042--2050. Google ScholarDigital Library
- Po-Sen Huang, Xiaodong He, Jianfeng Gao, Li Deng, Alex Acero, and Larry Heck . 2013. Learning Deep Structured Semantic Models for Web Search Using Clickthrough Data Proceedings of the 22Nd ACM International Conference on Information & Knowledge Management (CIKM '13). ACM, New York, NY, USA, 2333--2338. Google ScholarDigital Library
- Yehuda Koren, Robert Bell, and Chris Volinsky . 2009. Matrix Factorization Techniques for Recommender Systems. Computer Vol. 42, 8 (Aug. . 2009), 30--37. Google ScholarDigital Library
- Hang Li and Jun Xu . 2012 a. Beyond Bag-of-words: Machine Learning for Query-document Matching in Web Search. In Proceedings of the 35th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '12). ACM, New York, NY, USA, 1177--1177. Google ScholarDigital Library
- Hang Li and Jun Xu . 2012 b. Machine Learning for Query-document Matching in Search Proceedings of the Fifth ACM International Conference on Web Search and Data Mining (WSDM '12). ACM, New York, NY, USA, 767--768. Google ScholarDigital Library
- Hang Li and Jun Xu . 2012 c. Semantic Matching in Search. In Proceedings of the 21st international conference on World Wide Web (WWW '12).Google Scholar
- Hang Li and Jun Xu . 2014. Semantic Matching in Search. Foundations and Trends® in Information Retrieval Vol. 7, 5 (2014), 343--469. Google ScholarDigital Library
- Jing Li, Pengjie Ren, Zhumin Chen, Zhaochun Ren, Tao Lian, and Jun Ma . 2017. Neural Attentive Session-based Recommendation. In Proceedings of the 2017 ACM on Conference on Information and Knowledge Management (CIKM '17). ACM, New York, NY, USA, 1419--1428. Google ScholarDigital Library
- Hamid Palangi, Li Deng, Yelong Shen, Jianfeng Gao, Xiaodong He, Jianshu Chen, Xinying Song, and Rabab Ward . 2016. Deep Sentence Embedding Using Long Short-term Memory Networks: Analysis and Application to Information Retrieval. IEEE/ACM Trans. Audio, Speech and Lang. Proc. Vol. 24, 4 (2016), 694--707. Google ScholarDigital Library
- Liang Pang, Yanyan Lan, Jiafeng Guo, Jun Xu, Shengxian Wan, and Xueqi Cheng . 2016. Text Matching As Image Recognition. In Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence (AAAI'16). AAAI Press, 2793--2799. Google ScholarDigital Library
- Liang Pang, Yanyan Lan, Jiafeng Guo, Jun Xu, Jingfang Xu, and Xueqi Cheng . 2017. DeepRank: A New Deep Architecture for Relevance Ranking in Information Retrieval Proceedings of the 26th International Conference on Information and Knowledge Mangement (CIKM'17). Google ScholarDigital Library
- Ankur P. Parikh, Oscar T"ackström, Dipanjan Das, and Jakob Uszkoreit . 2016. A Decomposable Attention Model for Natural Language Inference Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, EMNLP 2016, Austin, Texas, USA, November 1--4, 2016. 2249--2255. deftempurl%http://aclweb.org/anthology/D/D16/D16--1244.pdf tempurlGoogle Scholar
- Xipeng Qiu and Xuanjing Huang . 2015. Convolutional Neural Tensor Network Architecture for Community-based Question Answering. In Proceedings of the 24th International Conference on Artificial Intelligence (IJCAI'15). AAAI Press, 1305--1311. Google ScholarDigital Library
- Yelong Shen, Xiaodong He, Jianfeng Gao, Li Deng, and Grégoire Mesnil . 2014. A Latent Semantic Model with Convolutional-Pooling Structure for Information Retrieval. In Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management (CIKM '14). ACM, New York, NY, USA, 101--110. Google ScholarDigital Library
- Shengxian Wan, Yanyan Lan, Jun Xu, Jiafeng Guo, Liang Pang, and Xueqi Cheng . 2016. Match-SRNN: Modeling the Recursive Matching Structure with Spatial RNN Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence (IJCAI'16). AAAI Press, 2922--2928. Google ScholarDigital Library
- Xiang Wang, Xiangnan He, Fuli Feng, Liqiang Nie, and Tat-Seng Chua . 2018. TEM: Tree-enhanced Embedding Model for Explainable Recommendation Proceedings of the 2018 World Wide Web Conference (WWW '18). International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva, Switzerland, 1543--1552. Google ScholarDigital Library
- Wei Wu, Hang Li, and Jun Xu . 2013. Learning Query and Document Similarities from Click-through Bipartite Graph with Metadata. In Proceedings of the Sixth ACM International Conference on Web Search and Data Mining (WSDM '13). ACM, New York, NY, USA, 687--696. Google ScholarDigital Library
- Jun Xiao, Hao Ye, Xiangnan He, Hanwang Zhang, Fei Wu, and Tat-Seng Chua . 2017. Attentional Factorization Machines: Learning the Weight of Feature Interactions via Attention Networks. In Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI'17). AAAI Press, 3119--3125. deftempurl%http://dl.acm.org/citation.cfm?id=3172077.3172324 tempurl Google ScholarDigital Library
- Chenyan Xiong, Zhuyun Dai, Jamie Callan, Zhiyuan Liu, and Russell Power . 2017. End-to-End Neural Ad-hoc Ranking with Kernel Pooling Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '17). ACM, New York, NY, USA, 55--64. Google ScholarDigital Library
- Fuzheng Zhang, Nicholas Jing Yuan, Defu Lian, Xing Xie, and Wei-Ying Ma . 2016. Collaborative Knowledge Base Embedding for Recommender Systems Proceedings of the 22Nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '16). ACM, New York, NY, USA, 353--362. Google ScholarDigital Library
- Qian Zhao, Yue Shi, and Liangjie Hong . 2017. GB-CENT: Gradient Boosted Categorical Embedding and Numerical Trees Proceedings of the 26th International Conference on World Wide Web (WWW '17). International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva, Switzerland, 1311--1319. Google ScholarDigital Library
Index Terms
- Deep Learning for Matching in Search and Recommendation
Recommendations
Deep Learning for Matching in Search and Recommendation
WSDM '19: Proceedings of the Twelfth ACM International Conference on Web Search and Data MiningMatching is the key problem in search and recommendation, that is to measure the relevance of a document to a query or the interest of a user on an item. Previously, machine learning methods have been exploited to address the problem, which learn a ...
DLIR: a deep learning-based initialization recommendation algorithm for trust-aware recommendation
AbstractIn collaborative filtering recommendations, a good local minimum depends largely on the initialization of the latent feature vectors of users and items. However, many existing methods initialize them by random initialization or zero ones, which ...
CoDFi-DL: a hybrid recommender system combining enhanced collaborative and demographic filtering based on deep learning
AbstractThe cold start problem has always been a major challenge for recommender systems. It arises when the system lacks rating records for new users or items. Addressing the challenge of providing personalized recommendations in the cold start scenario ...
Comments