Abstract
In this paper we propose combining information retrieval (IR) and OLAP (On-Line Analytical Processing) technologies to exploit a warehouse of text-rich XML documents. In the system we plan to develop, a multidimensional implementation of a document model based on relevance modeling will support interactive querying of the warehouse, allowing users to navigate both the structure of the documents and a concept hierarchy of query terms. The facts described in the relevant documents will then be ranked and analyzed in a novel OLAP cube model that can represent and manage facts annotated with relevance indexes.
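To make the idea of facts carrying relevance indexes concrete, the following is a minimal sketch and not the cube model proposed in the paper. It assumes each fact inherits a relevance score in [0, 1] from the document that describes it, and that a roll-up weights the measure by that score and ranks the resulting groups by accumulated relevance. The `Fact` fields, the `roll_up` function, and the sample values are all hypothetical and chosen only for illustration.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Fact:
    region: str       # hypothetical dimension value
    year: int         # hypothetical dimension value
    amount: float     # hypothetical measure
    relevance: float  # assumed relevance of the source document, in [0, 1]


def roll_up(facts, key):
    """Group facts by key(fact).

    Returns (group, relevance-weighted amount, total relevance) tuples,
    ranked by total relevance in descending order.
    """
    amount = defaultdict(float)
    relevance = defaultdict(float)
    for f in facts:
        g = key(f)
        amount[g] += f.amount * f.relevance  # weight the measure by relevance
        relevance[g] += f.relevance          # accumulate evidence for the group
    return sorted(
        ((g, amount[g], relevance[g]) for g in amount),
        key=lambda row: row[2],
        reverse=True,
    )


# Illustrative facts, as if extracted from three retrieved documents.
facts = [
    Fact("Europe", 2004, 100.0, 0.5),
    Fact("Europe", 2005, 200.0, 0.25),
    Fact("Asia", 2004, 300.0, 1.0),
]

# Roll up along the (hypothetical) region dimension.
by_region = roll_up(facts, key=lambda f: f.region)
```

Under these assumptions, groups backed by more relevant documents appear first. Other aggregation choices, such as taking the maximum relevance per group, would fit the same interface.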
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Pérez, J.M., Pedersen, T.B., Berlanga, R., Aramburu, M.J. (2005). IR and OLAP in XML Document Warehouses. In: Losada, D.E., Fernández-Luna, J.M. (eds) Advances in Information Retrieval. ECIR 2005. Lecture Notes in Computer Science, vol 3408. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-31865-1_43
DOI: https://doi.org/10.1007/978-3-540-31865-1_43
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-25295-5
Online ISBN: 978-3-540-31865-1
eBook Packages: Computer Science, Computer Science (R0)