Copyright © 1976 Published by Elsevier Science Ltd. All rights reserved.
Compression of large inverted files with hyperbolic term distribution
Available online 17 July 2002.
References and further reading may be available for this article. To view references and further reading you must purchase this article.
Abstract
The storage requirements for retrieval systems utilizing inverted files are calculated assuming different storage modes. Various methods for compression of these large files are analyzed. Binary vectors compressed by run-length coding as well as lists of document numbers were found to be suitable. The problem of minimal storage requirements for the inverted file is solved for different assumptions about index term distributions. A representation combining run-length coded binary vectors with list of document numbers was found to be the most economical. Parameter values for this minimum storage form are calculated and specified in tables as well as displayed graphically.







E-mail Article
Add to my Quick Links

Cited By in Scopus (4)





