Abstract
In text mining framework, data-driven indices are used as importance indices of words and phrases. Although the values of these indices are influenced by usages of terms, many conventional emergent term detection methods did not treat these indices explicitly. In order to detect research keys in academic researches, we propose a method based on temporal patterns of technical terms by using several data-driven indices and their temporal clusters. The method consists of an automatic term extraction method in given documents, three importance indices from text mining studies, and temporal patterns based on results of temporal clustering. Then, we assign abstracted sense of the temporal patterns of the terms based on their linear trends of centroids. Empirical studies show that the three importance indices are applied to the titles of four annual conferences about data mining field as sets of documents. After extracting the temporal patterns of automatically extracted terms, we compared the emergent patterns and one of the keyword of this article between the four conferences.
Chapter PDF
Similar content being viewed by others
Keywords
References
Lent, B., Agrawal, R., Srikant, R.: Discovering trends in text databases. In: KDD 1997: Proceedings of the third ACM SIGKDD international conference on Knowledge discovery in data mining, pp. 227–230. AAAI Press, Menlo Park (1997)
Kontostathis, A., Galitsky, L., Pottenger, W.M., Roy, S., Phelps, D.J.: A survey of emerging trend detection in textual data mining. A Comprehensive Survey of Text Mining (2003)
Keogh, E., Chu, S., Hart, D., Pazzani, M.: Segmenting time series: A survey and novel approach. In: An Edited Volume, Data mining in Time Series Databases, pp. 1–22. World Scientific, Singapore (2003)
Liao, T.W.: Clustering of time series data: a survey. Pattern Recognition 38, 1857–1874 (2005)
The dblp computer science bibliography, http://www.informatik.uni-trier.de/~ley/db/
Nakagawa, H.: Automatic term recognition based on statistics of compound nouns. Terminology 6(2), 195–210 (2000)
Ohsaki, M., Abe, H., Yamaguchi, T.: Numerical time-series pattern extraction based on irregular piecewise aggregate approximation and gradient specification. New Generation Comput. 25(3), 213–222 (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 IFIP International Federation for Information Processing
About this paper
Cite this paper
Abe, H., Tsumoto, S. (2010). Mining Temporal Patterns of Technical Term Usages in Bibliographical Data. In: Shi, Z., Vadera, S., Aamodt, A., Leake, D. (eds) Intelligent Information Processing V. IIP 2010. IFIP Advances in Information and Communication Technology, vol 340. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16327-2_18
Download citation
DOI: https://doi.org/10.1007/978-3-642-16327-2_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-16326-5
Online ISBN: 978-3-642-16327-2
eBook Packages: Computer ScienceComputer Science (R0)