ScienceDirect® Home Skip Main Navigation Links
You have guest access to ScienceDirect. Find out more.
 
Home
Browse
My Settings
Alerts
Help
 Quick Search
 Search tips (Opens new window)
    Clear all fields    
advertisementadvertisement
Information and Software Technology
Volume 45, Issue 10, 15 July 2003, Pages 633-649
 
Font Size: Decrease Font Size  Increase Font Size
 Abstract - selected
Article
Purchase PDF (269 K)

 
 
 
Related Articles in ScienceDirect
View More Related Articles
 
View Record in Scopus
 
doi:10.1016/S0950-5849(03)00002-8    How to Cite or Link Using DOI (Opens New Window)
Copyright © 2003 Elsevier B.V. All rights reserved.

An experiment in software component retrieval

Hafedh MiliCorresponding Author Contact Information, E-mail The Corresponding Author, Estelle Ah-Ki, Robert Godin and Hamid Mcheick

Département d'Informatique, Université du Québec à Montréal, Case Postale 8888 (A), Montréal, Que., PQ, Canada H3C 3P8

Received 29 September 2002; 
revised 11 November 2002; 
accepted 30 December 2002. ;
Available online 1 July 2003.

Purchase the full-text article



References and further reading may be available for this article. To view references and further reading you must purchase this article.

Abstract

Our research centers around exploring methodologies for developing reusable software, and developing methods and tools for building inter-enterprise information systems with reusable components. In this paper, we focus on an experiment in which different component indexing and retrieval methods were tested. The results are surprising. Earlier work had often shown that controlled vocabulary indexing and retrieval performed better than full-text indexing and retrieval [IEEE Trans. Software Engng (1994) 1, IEEE Trans. Software Engng 17 (1991) 800], but the differences in performance were often so small that some questioned whether those differences were worth the much greater cost of controlled vocabulary indexing and retrieval [Commun. Assoc. Comput. Mach. 28 (1985) 289, Commun. Assoc. Comput. Mach. 29 (1986) 648]. In our experiment, we found that full-text indexing and retrieval of software components provided comparable precision but much better recall than controlled vocabulary indexing and retrieval of components. There are a number of explanations for this somewhat counter-intuitive result, including the nature of software artifacts, and the notion of relevance that was used in our experiment. We bring to the fore some fundamental questions related to reuse repositories.

Author Keywords: Software reuse; Multi-faceted classification; Boolean retrieval; Plain-text retrieval; Retrieval evaluation; Approximate retrieval

Article Outline

1. Introduction
1.1. Component retrieval: do we still care?
1.2. The component retrieval problem
2. ClassServer: an experimental component repository
2.1. Overview
2.2. A multi-faceted classification of components
2.3. Software component retrieval in ClassServer
2.3.1. Free text indexing and search
2.3.2. Multi-faceted controlled-vocabulary retrieval
2.4. The component library
3. Constructing domain vocabulary
3.1. Extracting a set of concepts
3.2. Constructing a hierarchy of important domain concepts
3.2.1. Principles
3.2.2. A case-study: the Genbank experiment
3.2.3. Constructing the graph based on OSEs on-line documentation
4. Automatic indexing from controlled vocabulary
4.1. The algorithm
4.2. Results
5. Retrieval experiments
5.1. Experimental design
5.2. Queries
5.3. Component relevance: a performance-based evaluation
5.4. Performance results
6. Conclusion and directions
Acknowledgements
References




 
Home
Browse
My Settings
Alerts
Help
Elsevier.com (Opens new window)
About ScienceDirect  |  Contact Us  |  Information for Advertisers  |  Terms & Conditions  |  Privacy Policy
Copyright © 2008 Elsevier B.V. All rights reserved. ScienceDirect® is a registered trademark of Elsevier B.V.