ScienceDirect® Home Skip Main Navigation Links
You have guest access to ScienceDirect. Find out more.
 
Home
Browse
My Settings
Alerts
Help
 Quick Search
 Search tips (Opens new window)
    Clear all fields    
advertisementadvertisement
Computational Statistics & Data Analysis
Volume 52, Issue 1, 15 September 2007, Pages 174-183
 
Font Size: Decrease Font Size  Increase Font Size
 Abstract - selected
Article
Purchase PDF (202 K)

  E-mail Article   
  Add to my Quick Links   
Bookmark and share in 2collab (opens in new window)
Request permission to reuse this article
  Cited By in Scopus (0)
 
 
 
Related Articles in ScienceDirect
View More Related Articles
 
View Record in Scopus
 
doi:10.1016/j.csda.2006.12.018    How to Cite or Link Using DOI (Opens New Window)
Copyright © 2007 Elsevier B.V. All rights reserved.

Updating the partial singular value decomposition in latent semantic indexing

Jane E. Tougasa, Corresponding Author Contact Information, 1, E-mail The Corresponding Author and Raymond J. Spiterib, 2, E-mail The Corresponding Author

aFaculty of Computer Science, Dalhousie University, Halifax, NS, Canada B3H 1W5 bDepartment of Computer Science, University of Saskatchewan, Saskatoon, SK, Canada S7N 5C9

Available online 17 December 2006.

Purchase the full-text article



References and further reading may be available for this article. To view references and further reading you must purchase this article.

Abstract

Latent semantic indexing (LSI) is a method of information retrieval (IR) that relies heavily on the partial singular value decomposition (PSVD) of the term-document matrix representation of a data set. Calculating the PSVD of large term-document matrices is computationally expensive; hence in the case where terms or documents are merely added to an existing data set, it is extremely beneficial to update the previously calculated PSVD to reflect the changes. It is shown how updating can be used in LSI to significantly reduce the computational cost of finding the PSVD without significantly impacting performance. Moreover, it is shown how the computational cost can be reduced further, again without impacting performance, through a combination of updating and folding-in.

Keywords: Latent semantic indexing; Singular value decomposition; Updating; Folding-in

Article Outline

1. Introduction
2. Background
2.1. SVD
2.2. Folding-in
3. Updating methods
3.1. Updating documents
3.2. Updating terms
3.3. Updating term weights
4. Folding-up
5. Experiments
5.1. Medline examples
5.2. Cranfield examples
5.3. HARD examples
6. Conclusions
Acknowledgements
References







 
Home
Browse
My Settings
Alerts
Help
Elsevier.com (Opens new window)
About ScienceDirect  |  Contact Us  |  Information for Advertisers  |  Terms & Conditions  |  Privacy Policy
Copyright © 2008 Elsevier B.V. All rights reserved. ScienceDirect® is a registered trademark of Elsevier B.V.