skip to main content
10.1145/3404835.3463240acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
short-paper

Booking.com Multi-Destination Trips Dataset

Published:11 July 2021Publication History

ABSTRACT

We introduce a novel dataset of real multi-destination trips booked through Booking.com's online travel platform. The dataset consists of 1.5 million reservations representing 359,000 unique journeys made across 39,000 destinations. As such, the data is particularly well suited to model sequential recommendation and retrieval problems in a high cardinality target space. To preserve user privacy and protect business-sensitive statistics, the data is fully anonymized, sampled and limited to five user origin markets. Even so, the dataset is representative of the general travel purchase behavior and therefore presents a uniquely valuable resource for Machine Learning and information retrieval researchers. This work provides an overview of the dataset. It reports several benchmark results for relevant recommendation problems obtained as part of the recently held Booking.com data challenge during the WSDM WebTour workshop.

Skip Supplemental Material Section

Supplemental Material

sigir_bda.mp4

mp4

101.9 MB

References

  1. Jens Adamczak, Gerard-Paul Leyson, Peter Knees, Yashar Deldjoo, Farshad Bakhshandegan Moghaddam, Julia Neidhardt, Wolfgang Wörndl, and Philipp Monreal. 2019. Session-based hotel recommendations: Challenges and future directions. arXiv preprint arXiv:1908.00071 (2019).Google ScholarGoogle Scholar
  2. Mart'in Baigorria Alonso. 2021. Data Augmentation Using Many-To-Many RNNs for Session-Aware Recommender Systems. In Proceedings of the ACM WSDM Workshop on Web Tourism (WSDM Webtour '21) .Google ScholarGoogle Scholar
  3. Alex Beutel, Paul Covington, Sagar Jain, Can Xu, Jia Li, Vince Gatto, and Ed H Chi. 2018. Latent cross: Making use of context in recurrent recommender systems. In Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining. 46--54.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Qiwei Chen, Huan Zhao, Wei Li, Pipei Huang, and Wenwu Ou. 2019. Behavior sequence transformer for e-commerce recommendation in alibaba. In Proceedings of the 1st International Workshop on Deep Learning Practice for High-Dimensional Sparse Data. 1--4.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Junyoung Chung, Caglar Gulcehre, KyungHyun Cho, and Yoshua Bengio. 2014. Empirical evaluation of gated recurrent neural networks on sequence modeling. arXiv preprint arXiv:1412.3555 (2014).Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Barbara Rychalska, Konrad Gouchowski, and Jacek Dbrowski. 2021. Modeling Multi-Destination Trips with Sketch-Based Model. In Proceedings of the ACM WSDM Workshop on Web Tourism (WSDM Webtour'21) .Google ScholarGoogle Scholar
  7. Dmitri Goldenberg, Kostia Kofman, Javier Albert, Sarai Mizrachi, Adam Horowitz, and Irene Teinemaa. 2021 a. Personalization in Practice: Methods and Applications. In Proceedings of the 14th International Conference on Web Search and Data Mining .Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Dmitri Goldenberg, Kostia Kofman, Pavel Levin, Sarai Mizrachi, Maayan Kafry, and Guy Nadav. 2021 b. Booking.com WSDM WebTour 2021 Data Challenge. http://www.bookingchallenge.com. In Proceedings of the ACM WSDM Workshop on Web Tourism (WSDM Webtour '21) .Google ScholarGoogle Scholar
  9. Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long short-term memory. Neural computation, Vol. 9, 8 (1997), 1735--1780.Google ScholarGoogle Scholar
  10. Shotaro Ishihara, Shuhei Goda, and Yuya Matsumura. 2021. Weighted Averaging of Various LSTM Models for Next Destination Recommendation. In Proceedings of the ACM WSDM Workshop on Web Tourism (WSDM Webtour '21) .Google ScholarGoogle Scholar
  11. Paweł Jankiewicz, Liudmyla Kyrashchuk, Paweł Sienkowski, and Magdalena Wójcik. 2019. Boosting algorithms for a session-based, context-aware recommender system in an online travel domain. In Proceedings of the Workshop on ACM Recommender Systems Challenge. 1--5.Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Dietmar Jannach, Gabriel de Souza P. Moreira, and Even Oldridge. 2020. Why Are Deep Learning Models Not Consistently Winning Recommender Systems Competitions Yet? A Position Paper. In Proceedings of the Recommender Systems Challenge 2020. 44--49.Google ScholarGoogle Scholar
  13. Tsvi Kuflik, Catalin Mihai Barbu, Amra Deli?, Dmitri Goldenberg, Julia Neidhardt, Ludocik Coba, and Markus Zanker. 2021. WebTour 2021 Workshop on Web and Tourism. In Proceedings of the 14th International Conference on Web Search and Data Mining .Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Tobias Lang and Matthias Rettenmeier. 2017. Understanding consumer behavior with recurrent neural networks. In Workshop on Machine Learning Methods for Recommender Systems .Google ScholarGoogle Scholar
  15. Leland McInnes, John Healy, and James Melville. 2018. Umap: Uniform manifold approximation and projection for dimension reduction. arXiv preprint arXiv:1802.03426 (2018).Google ScholarGoogle Scholar
  16. Sarai Mizrachi and Pavel Levin. 2019. Combining Context Features in Sequence-Aware Recommender Systems.. In RecSys (Late-Breaking Results). 11--15.Google ScholarGoogle Scholar
  17. Aleksandr Petrov and Yuriy Makarov. 2021. Attention-based neural re-ranking approach for next city in trip recommendations. In Proceedings of the ACM WSDM Workshop on Web Tourism (WSDM Webtour '21) .Google ScholarGoogle Scholar
  18. Massimo Quadrana, Paolo Cremonesi, and Dietmar Jannach. 2018. Sequence-aware recommender systems. ACM Computing Surveys (CSUR), Vol. 51, 4 (2018), 1--36.Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. C Quoc and Viet Le. 2007. Learning to rank with nonsmooth cost functions. Proceedings of the Advances in Neural Information Processing Systems, Vol. 19 (2007), 193--200.Google ScholarGoogle Scholar
  20. Barbara Rychalska, Piotr Bka bel, Konrad Gołuchowski, Andrzej Michałowski, and Jacek Dka browski. 2021. Cleora: A Simple, Strong and Scalable Graph Embedding Scheme. arXiv preprint arXiv:2102.02302 (2021).Google ScholarGoogle Scholar
  21. Marlesson RO Santana and Anderson Soares. 2021. Hybrid Model with Time Modeling for Sequential Recommender Systems. In Proceedings of the ACM WSDM Workshop on Web Tourism (WSDM Webtour '21) .Google ScholarGoogle Scholar
  22. Benedikt Schifferer, Chris Deotte, Jean-Francois Puget, Gabriel de Souza Pereira Moreira, Gilberto Titericz, Jiwei Liu, and Ronay Ak. 2021. Using Deep Learning to Win the Booking.com WSDM WebTour21 Challenge on Sequential Recommendations. In Proceedings of the ACM WSDM Workshop on Web Tourism (WSDM Webtour '21) .Google ScholarGoogle Scholar
  23. Benedikt Schifferer, Gilberto Titericz, Chris Deotte, Christof Henkel, Kazuki Onodera, Jiwei Liu, Bojan Tunguz, Even Oldridge, Gabriel De Souza Pereira Moreira, and Ahmet Erdem. 2020. GPU Accelerated Feature Engineering and Training for Recommender Systems. In Proceedings of the Recommender Systems Challenge 2020. 16--23.Google ScholarGoogle Scholar
  24. Gourav G Shenoy, Mangirish A Wagle, and Anwar Shaikh. 2017. Kaggle competition: Expedia hotel recommendations. arXiv preprint arXiv:1703.02915 (2017).Google ScholarGoogle Scholar
  25. Fei Sun, Jun Liu, Jian Wu, Changhua Pei, Xiao Lin, Wenwu Ou, and Peng Jiang. 2019. BERT4Rec: Sequential recommendation with bidirectional encoder representations from transformer. In Proceedings of the 28th ACM International Conference on Information and Knowledge Management. 1441--1450.Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, and Illia Polosukhin. 2017. Attention is All You Need. https://arxiv.org/pdf/1706.03762.pdfGoogle ScholarGoogle Scholar
  27. Yuanzhe Zhou, Shikang Wu, and Chenyang Zheng. 2021. Explore next destination prediction. In Proceedings of the ACM WSDM Workshop on Web Tourism (WSDM Webtour '21).Google ScholarGoogle Scholar

Index Terms

  1. Booking.com Multi-Destination Trips Dataset

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        SIGIR '21: Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval
        July 2021
        2998 pages
        ISBN:9781450380379
        DOI:10.1145/3404835

        Copyright © 2021 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 11 July 2021

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • short-paper

        Acceptance Rates

        Overall Acceptance Rate792of3,983submissions,20%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader