Abstract
The task of expert finding has been getting increasing attention in information retrieval literature. However, the current state-of-the-art is still lacking in principled approaches for combining different sources of evidence in an optimal way. This paper explores the usage of learning to rank methods as a principled approach for combining multiple estimators of expertise, derived from the textual contents, from the graph-structure with the citation patterns for the community of experts, and from profile information about the experts. Experiments made over a dataset of academic publications, for the area of Computer Science, attest for the adequacy of the proposed approaches.
This work was partially supported by the Fundação para a Ciência e Tecnologia (FCT), through project grant PTDC/EIA-EIA/115346/2009 (SMARTIES), and by the ICP Competitiveness and Innovation Framework Program of the European Commission, through the European Digital Mathematics Library (EuDML) project – http://www.eudml.eu/
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Balog, K., Azzopardi, L., de Rijke, M.: Formal models for expert finding in enterprise corpora. In: Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (2006)
Banks, M.: An extension of the Hirsch index: Indexing scientific topics and compounds. Scientometrics 69(1) (2006)
Batista, P.D., Campiteli, M.G., Kinouchi, O., Martinez, A.S.: Is it possible to compare researchers with different scientific interests? Scientometrics 68(1) (2006)
Cao, Y., Liu, J., Bao, S., Li, H.: Research on expert search at enterprise track of TREC 2005. In: Proceedings of the 14th Text Retrieval Conference (2006)
Chen, P., Xie, H., Maslov, S., Redner, S.: Finding scientific gems with Google’s page rank algorithm. Journal of Informetrics 1(1) (2007)
Craswell, N., de Vries, A.P., Soboroff, I.: Overview of the TREC-2005 enterprise track. In: Proceedings of the 14th Text Retrieval Conference (2006)
de Vries, A.P., Vercoustre, A.-M., Thom, J.A., Craswell, N., Lalmas, M.: Overview of the INEX 2007 Entity Ranking Track. In: Fuhr, N., Kamps, J., Lalmas, M., Trotman, A. (eds.) INEX 2007. LNCS, vol. 4862, pp. 245–251. Springer, Heidelberg (2008)
Deng, H., King, I., Lyu, M.R.: Formal models for expert finding on DBLP bibliography data. In: Proceedings of the 8th IEEE International Conference on Data Mining (2008)
Egghe, L.: Theory and practise of the g-index. Scientometrics 69(1) (2006)
Fang, H., Zhai, C.: Probabilistic models for expert finding. In: Proceedings of the 29th European Conference on Information Retrieval Research (2007)
Hirsch, J.E.: An index to quantify an individual’s scientific research output. Proceedings of the National Academy of Sciences USA 102(46) (2005)
Joachims, T.: Training linear SVMs in linear time. In: Proceedings of the ACM Conference on Knowledge Discovery and Data Mining, KDD (2006)
Liu, T.: Learning to rank for information retrieval. Foundations and Trends in Information Retrieval 3(3) (2009)
Liu, X., Bollen, J., Nelson, M.L., Van de Sompel, H.: Co-authorship networks in the digital library research community. Information Processing and Management 41(6) (2005)
Macdonald, C., Ounis, I.: Voting techniques for expert search. Knowledge and Information Systems 16(3) (2008)
Petkova, D., Croft, W.B.: Proximity-based document representation for named entity retrieval. In: Proceedings of the 16th ACM Conference on Information and Knowledge Management (2007)
Serdyukov, P.: Search for Expertise: Going Beyond Direct Evidence. PhD thesis, University of Twente (2009)
Sidiropoulos, A., Katsaros, D., Manolopoulos, Y.: Generalized h-index for disclosing latent facts in citation networks. Scientometrics (2006)
Sidiropoulos, A., Manolopoulos, Y.: A citation-based system to assist prize awarding. ACM SIGMOD Record 34(4) (2005)
Sidiropoulos, A., Manolopoulos, Y.: Generalized comparison of graph-based ranking algorithms for publications and authors. Journal for Systems and Software 79(12) (2006)
Soboroff, I., de Vries, A.P., Craswell, N.: Overview of the TREC-2006 enterprise track. In: Proceedings of the 15th Text Retrieval Conference (2007)
Zhu, J., Song, D., Rüger, S., Huang, X.: Modeling document features for expert finding. In: Proceedings of the 17th ACM Conference on Information and Knowledge Management (2008)
Tsochantaridis, I., Joachims, T., Hofmann, T., Altun, Y.: Large margin methods for structured and interdependent output variables. Journal of Machine Learning Research 6 (2005)
Yang, Y., Tang, J., Wang, B., Guo, J., Li, J., Chen, S.: Expert2Bole: From expert finding to bole search. In: Theeramunkong, T., Kijsirikul, B., Cercone, N., Ho, T.-B. (eds.) PAKDD 2009. LNCS, vol. 5476. Springer, Heidelberg (2009)
Yue, Y., Finley, T., Radlinski, F., Joachims, T.: A support vector method for optimizing average precision. In: Proceedings of the 30th ACM SIGIR international Conference on Research and Development in Information Retrieval (2007)
Zhang, C.-T.: The e-index, complementing the h-index for excess citations. Public Library of Science One 4 (2009)
Zhu, J., Song, S., Rüger, S., Eisenstadt, M., Motta, E.: The open university at TREC 2006 enterprise track expert search task. In: Proceedings of the 15th Text Retrieval Conference (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Moreira, C., Calado, P., Martins, B. (2011). Learning to Rank for Expert Search in Digital Libraries of Academic Publications. In: Antunes, L., Pinto, H.S. (eds) Progress in Artificial Intelligence. EPIA 2011. Lecture Notes in Computer Science(), vol 7026. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24769-9_32
Download citation
DOI: https://doi.org/10.1007/978-3-642-24769-9_32
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-24768-2
Online ISBN: 978-3-642-24769-9
eBook Packages: Computer ScienceComputer Science (R0)