Copyright © 2007 Elsevier B.V. All rights reserved.
Updating the partial singular value decomposition in latent semantic indexing
Available online 17 December 2006.
References and further reading may be available for this article. To view references and further reading you must purchase this article.
Abstract
Latent semantic indexing (LSI) is a method of information retrieval (IR) that relies heavily on the partial singular value decomposition (PSVD) of the term-document matrix representation of a data set. Calculating the PSVD of large term-document matrices is computationally expensive; hence in the case where terms or documents are merely added to an existing data set, it is extremely beneficial to update the previously calculated PSVD to reflect the changes. It is shown how updating can be used in LSI to significantly reduce the computational cost of finding the PSVD without significantly impacting performance. Moreover, it is shown how the computational cost can be reduced further, again without impacting performance, through a combination of updating and folding-in.
Keywords: Latent semantic indexing; Singular value decomposition; Updating; Folding-in
Article Outline
- 1. Introduction
- 2. Background
- 2.1. SVD
- 2.2. Folding-in
- 3. Updating methods
- 3.1. Updating documents
- 3.2. Updating terms
- 3.3. Updating term weights
- 4. Folding-up
- 5. Experiments
- 5.1. Medline examples
- 5.2. Cranfield examples
- 5.3. HARD examples
- 6. Conclusions
- Acknowledgements
- References







E-mail Article
Add to my Quick Links

Cited By in Scopus (0)






