Copyright © 2002 Elsevier Science Ltd. All rights reserved.
Strong similarity measures for ordered sets of documents in information retrieval
Received 6 December 2000;
References and further reading may be available for this article. To view references and further reading you must purchase this article.
Abstract
A general method is presented to construct ordered similarity measures (OS-measures), i.e., similarity measures for ordered sets of documents (as, e.g., being the result of an IR-process), based on classical, well-known similarity measures for ordinary sets (measures such as Jaccard, Dice, Cosine or overlap measures). To this extent, we first present a review of these measures and their relationships.
The method given here to construct OS-measures extends the one given by Michel in a previous paper so that it becomes applicable on any pair of ordered sets. Concrete expressions of this method, applied to the classical similarity measures, are given.
Some of these measures are then tested in the IR-system Profil-Doc. The engine SPIRIT© extracts ranked document sets in three different contexts, each for 550 requests. The practical usability of the OS-measures is then discussed based on these experiments.
Article Outline
- 1. Introduction
- 2. General properties of similarity measures (on ordinary sets)
- 3. Ordered similarity measures (OS-measures)
- 3.1. Statement of the problem
- 3.2. General theorem on the construction of strong OS-measures
- 3.3. Strong OS-measures derived from strong similarity measures for ordinary sets
- 4. Experimentation
- 5. Conclusion
- Appendix A
- References







E-mail Article
Add to my Quick Links

Cited By in Scopus (6)







