ABSTRACT
We describe Rosetta, a digital library system for scientific literature. Rosetta makes it easy for people to find the information for which they are looking even when using short, imprecise queries. Rosetta indexes research articles based on the way they have been described when cited in other documents. The concise descriptions that occur in citations are similar to the short queries people typically form when searching; therefore, citations make a better basis for indexing than do the words used within a research article itself. Using this indexing technique, we are able to provide a user interface that presents users with an automatically generated directory of the information space surrounding a query. Our objective with this interface is to present people with the information for which they have asked as well as the information for which they may have intended to ask.
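The indexing idea described above can be illustrated with a minimal sketch. This is not Rosetta's implementation: the function names `build_citation_index` and `search`, the sample data, and the simple term-overlap scoring are all assumptions made for illustration. The sketch only shows the core idea of indexing a document under the words other authors use when citing it, rather than under its own text.

```python
from collections import defaultdict


def build_citation_index(citations):
    """Map each term to the cited documents whose citing text used that term.

    `citations` is an iterable of (cited_doc_id, citing_text) pairs.
    (Hypothetical input format, chosen for this sketch.)
    """
    index = defaultdict(set)
    for cited_id, citing_text in citations:
        for term in citing_text.lower().split():
            index[term].add(cited_id)
    return index


def search(index, query):
    """Rank documents by how many query terms occur in text that cites them."""
    scores = defaultdict(int)
    for term in query.lower().split():
        for doc_id in index.get(term, ()):
            scores[doc_id] += 1
    # Highest score first; ties broken alphabetically for determinism.
    return sorted(scores, key=lambda d: (-scores[d], d))


# Made-up citing sentences, for illustration only.
citations = [
    ("paperA", "vector space model for automatic indexing"),
    ("paperA", "classic term weighting in the vector space model"),
    ("paperB", "citation indexing of scientific literature"),
]
index = build_citation_index(citations)
results = search(index, "vector space indexing")
```

A real system would presumably add stop-word removal and term weighting; the sketch omits both to keep the citation-based indexing step visible.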
Index Terms
- Guiding people to information: providing an interface to a digital library using reference as a basis for indexing