ScienceDirect® Home Skip Main Navigation Links
You have guest access to ScienceDirect. Find out more.
 
Home
Browse
My Settings
Alerts
Help
 Quick Search
 Search tips (Opens new window)
    Clear all fields    
advertisementadvertisement
Information Processing & Management
Volume 40, Issue 2, March 2004, Pages 239-255
 
Font Size: Decrease Font Size  Increase Font Size
 Abstract - selected
Purchase PDF (418 K)

 
 
 
Related Articles in ScienceDirect
View More Related Articles
 
Special issue
View Record in Scopus
 
doi:10.1016/S0306-4573(03)00039-6    How to Cite or Link Using DOI (Opens New Window)
Copyright © 2003 Elsevier Ltd. All rights reserved.

Automatic topics discovery from hyperlinked documents

Kuo-Jui WuCorresponding Author Contact Information, E-mail The Corresponding Author, a, Meng-Chang ChenE-mail The Corresponding Author, a, 1 and Yeali SunE-mail The Corresponding Author, b, 2

a Institute of Information Science, Academia Sinica, 128, Section 2, Academic Road, Nankang 115, Taipei, Taiwan b Department of Information Management, National Taiwan University, No. 50, Lane 144, Kee-Lung Road, Section 4, Taipei, Taiwan

Received 20 August 2002; 
accepted 29 April 2003. ;
Available online 21 November 2003.

Purchase the full-text article



References and further reading may be available for this article. To view references and further reading you must purchase this article.

Abstract

Topic discovery is an important means for marketing, e-Business and social science studies. As well, it can be applied to various purposes, such as identifying a group with certain properties and observing the emergence and diminishment of a certain cyber community. Previous topic discovery work (J.M. Kleinberg, Proceedings of the 9th Annual ACM-SIAM Symposium on Discrete Algorithms, San Francisco, California, p. 668) requires manual judgment of usefulness of outcomes and is thus incapable of handling the explosive growth of the Internet. In this paper, we propose the Automatic Topic Discovery (ATD) method, which combines a method of base set construction, a clustering algorithm and an iterative principal eigenvector computation method to discover the topics relevant to a given query without using manual examination. Given a query, ATD returns with topics associated with the query and top representative pages for each topic. Our experiments show that the ATD method performs better than the traditional eigenvector method in terms of computation time and topic discovery quality.

Author Keywords: Topic discovery; Hyperlink analysis; Authority; Hub

Corresponding Author Contact InformationCorresponding author. Tel.: +886-2-27883799x1614

1 Tel.: +886-2-27883799x1802.

2 Tel.: +886-2-23630231x2870.


 
Home
Browse
My Settings
Alerts
Help
Elsevier.com (Opens new window)
About ScienceDirect  |  Contact Us  |  Information for Advertisers  |  Terms & Conditions  |  Privacy Policy
Copyright © 2008 Elsevier B.V. All rights reserved. ScienceDirect® is a registered trademark of Elsevier B.V.