Dataset Creation Framework for Personalized Type-Based Facet Ranking Tasks Evaluation

Ali, Esraa; Caputo, Annalina; Lawless, Séamus; Conlan, Owen

doi:10.1007/978-3-030-85251-1_3

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 12880))

Included in the following conference series:

International Conference of the Cross-Language Evaluation Forum for European Languages

893 Accesses
1 Citations

Abstract

Faceted Search Systems (FSS) have gained prominence in many existing vertical search systems. They provide facets to assist users in allocating their desired search target quickly. In this paper, we present a framework to generate datasets appropriate for simulation-based evaluation of these systems. We focus on the task of personalized type-based facet ranking. Type-based facets (t-facets) represent the categories of the resources being searched in the FSS. They are usually organized in a large multilevel taxonomy. Personalized t-facet ranking methods aim at identifying and ranking the parts of the taxonomy which reflects query relevance as well as user interests. While evaluation protocols have been developed for facet ranking, the problem of personalising the facet rank based on user profiles has lagged behind due to the lack of appropriate datasets. To fill this gap, this paper introduces a framework to reuse and customise existing real-life data collections. The framework outlines the eligibility criteria and the data structure requirements needed for this task. It also details the process to transform the data into a ground-truth dataset. We apply this framework to two existing data collections in the domain of Point-of-Interest (POI) suggestion. The generated datasets are analysed with respect to the taxonomy richness (variety of types) and user profile diversity and length. In order to experiment with the generated datasets, we combine this framework with a widely adopted simulated user-facet interaction model to evaluate a number of existing personalized t-facet ranking baselines.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
In the scope of this work, the term ‘documents’ is used to refer to the information objects being searched. According to the FSS domain, documents can be places, web pages, products, books or images, etc.
2.
How the document ranking is performed is outside scope of this research.
3.
User picks are the user’s interaction with the system that expresses a preference, like a rating, review, or feedback.
4.
https://www.yelp.com/dataset, accessed June 2021.
5.
https://developer.foursquare.com/docs/resources/categories, version:20180323.
6.
https://www.yelp.com/developers/documentation/v3/all_category_list/categories.json.
7.
https://github.com/csurfer/rake-nltk.

References

Abel, F., Celik, I., Houben, G.J., Siehndel, P.: Leveraging the semantics of tweets for adaptive faceted search on twitter. The Semantic Web (2011)
Google Scholar
Aliannejadi, M., Mele, I., Crestani, F.: A cross-platform collection for contextual suggestion. In: SIGIR. ACM (2017)
Google Scholar
Bayomi, M., Lawless, S.: ADAPT_TCD: an ontology-based context aware approach for contextual suggestion. In: TREC (2016)
Google Scholar
Chantamunee, S., Wong, K.W., Fung, C.C.: Collaborative filtering for personalised facet selection. In: IAIT. ACM (2018)
Google Scholar
Ali, E., Annalina Caputo, S.L., Conlan, O.: Personalizing type-based facet ranking using BERT embeddings. In: SEMANTiCS (2021)
Google Scholar
Ali, E., Caputo, A., Lawless, S., Conlan, O.: A probabilistic approach to personalize type-based facet ranking for POI suggestion. In: Brambilla, M., Chbeir, R., Frasincar, F., Manolescu, I. (eds.) ICWE 2021. LNCS, vol. 12706, pp. 175–182. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-74296-6_14
Chapter Google Scholar
Hashemi, S.H., Clarke, C.L., Kamps, J., Kiseleva, J., Voorhees, E.M.: Overview of the TREC 2016 contextual suggestion track. In: TREC (2016)
Google Scholar
Koren, J., Zhang, Y., Liu, X.: Personalized interactive faceted search. In: WWW. ACM (2008)
Google Scholar
Tunkelang, D.: Faceted search. Synth. Lect. Inf. Concepts Retrieval Serv. 1, 1–80 (2009)
Google Scholar
Vandic, D., Aanen, S., Frasincar, F., Kaymak, U.: Dynamic facet ordering for faceted product search engines. IEEE Trans. Knowl. Data Eng. PP(99), 1 (2017). https://doi.org/10.1109/TKDE.2017.2652461
Article Google Scholar
Vandic, D., Frasincar, F., Kaymak, U.: Facet selection algorithms for web product search. In: Proceedings of the 22nd ACM International Conference on Conference on Information & Knowledge Management, pp. 2327–2332. ACM (2013)
Google Scholar
Wang, Q., Ramírez, G., Marx, M., Theobald, M., Kamps, J.: Overview of the INEX 2011 data-centric track. In: Geva, S., Kamps, J., Schenkel, R. (eds.) INEX 2011. LNCS, vol. 7424, pp. 118–137. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-35734-3_10
Chapter Google Scholar

Download references

Acknowledgements

This work was supported by the ADAPT Centre, funded by Science Foundation Ireland Research Centres Programme (Grant 13/RC/2106; 13/RC/2106_P2) and co-funded by the European Regional Development Fund.

Author information

Authors and Affiliations

ADAPT Centre, School of Computer Science and Statistics, Trinity College Dublin, Dublin, Ireland
Esraa Ali, Séamus Lawless & Owen Conlan
ADAPT Centre, School of Computing, Dublin City University, Dublin, Ireland
Annalina Caputo

Authors

Esraa Ali
View author publications
You can also search for this author in PubMed Google Scholar
Annalina Caputo
View author publications
You can also search for this author in PubMed Google Scholar
Séamus Lawless
View author publications
You can also search for this author in PubMed Google Scholar
Owen Conlan
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Esraa Ali or Annalina Caputo .

Editor information

Editors and Affiliations

Arizona State University, Tempe, AZ, USA
K. Selçuk Candan
Politehnica University of Bucharest, Bucharest, Romania
Bogdan Ionescu
Université Grenoble Alpes, Saint-Martin-d’Hères, France
Lorraine Goeuriot
Aalborg University Copenhagen, Copenhagen, Denmark
Birger Larsen
HES-SO Valais-Wallis, Sierre, Switzerland
Henning Müller
University of Montpellier, Montpellier, France
Alexis Joly
University of Copenhagen, Copenhagen, Denmark
Maria Maistro
TU Wien, Vienna, Austria
Florina Piroi
University of Padua, Padova, Italy
Guglielmo Faggioli
University of Padua, Padova, Italy
Nicola Ferro

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ali, E., Caputo, A., Lawless, S., Conlan, O. (2021). Dataset Creation Framework for Personalized Type-Based Facet Ranking Tasks Evaluation. In: Candan, K.S., et al. Experimental IR Meets Multilinguality, Multimodality, and Interaction. CLEF 2021. Lecture Notes in Computer Science(), vol 12880. Springer, Cham. https://doi.org/10.1007/978-3-030-85251-1_3

Download citation

DOI: https://doi.org/10.1007/978-3-030-85251-1_3
Published: 14 September 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-85250-4
Online ISBN: 978-3-030-85251-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics