Abstract
Patent claims usually embody the core technological scope and the most essential terms to define the protection of an invention, which makes them the ideal resource for patent topic identification and theme changes analysis. However, conducting content analysis manually on massive technical terms is very time-consuming and laborious. Even with the help of traditional text mining techniques, it is still difficult to model topic changes over time, because single keywords alone are usually too general or ambiguous to represent a concept. Moreover, term frequency that used to rank keywords cannot separate polysemous words that are actually describing a different concept. To address this issue, this research proposes a topic change identification approach based on latent dirichlet allocation, to model and analyze topic changes and topic-based trend with minimal human intervention. After textual data cleaning, underlying semantic topics hidden in large archives of patent claims are revealed automatically. Topics are defined by probability distributions over words instead of terms and their frequency, so that polysemy is allowed. A case study using patents published in the United States Patent and Trademark Office (USPTO) from 2009 to 2013 with Australia as their assignee country is presented, to demonstrate the validity of the proposed topic change identification approach. The experimental result shows that the proposed approach can be used as an automatic tool to provide machine-identified topic changes for more efficient and effective R&D management assistance.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Data accessed in March 2014.
- 2.
All plant patents are seen as having one same USPC for calculation convenience.
References
Abbas, A., Zhang, L., & Khan, S. U. (2014). A literature review on the state-of-the-art in patent analysis. World Patent Inf, 37, 3–13.
Batagelj, V., & Mrvar, A. (2004). Pajek—Analysis and visualization of large networks. Berlin: Springer.
Blei, D. M. (2012). Probabilistic topic models. Communications of the ACM, 55, 77–84.
Blei, D. M., Ng, A. Y., & Jordan, M. I. (2003). Latent dirichlet allocation. The Journal of Machine Learning research, 3, 993–1022.
Campbell, R. S. (1983). Patent trends as a technological forecasting tool. World Patent Information, 5, 137–143.
Camus, C., & Brancaleon, R. (2003). Intellectual assets management: From patents to knowledge. World Patent Information, 25, 155–159.
Chen, H., Zhang, G., Zhu, D., & Lu, J. (2015). A patent time series processing component for technology intelligence by trend identification functionality. Neural Computing and Applications, 26, 345–353.
Daim, T. U., Kocaoglu, D. F., & Anderson, T. R. (2011). Using technological intelligence for strategic decision making in high technology environments. Technological Forecasting and Social Change, 78, 197–198.
David, D., Lewis, Y. Y., Rose, T. G., Li, F. (2004). SMART stopword list [Online]. Cambridge: MIT Press. Available: http://jmlr.csail.mit.edu/papers/volume5/lewis04a/a11-smart-stop-list/english.stop
Ernst, H. (1997). The use of patent data for technological forecasting: The diffusion of CNC-technology in the machine tool industry. Small Business Economics, 9, 361–381.
Griffiths, T. L., & Steyvers, M. (2004). Finding scientific topics. Proceedings of the National Academy of Sciences of the United States of America, 101, 5228–5235.
Halkidi, M., Batistakis, Y., & Vazirgiannis, M. (2001). On clustering validation techniques. Journal of Intelligent Information Systems, 17, 107–145.
Haywood, S. (2003). Academic vocabulary [Online]. Nottingham: Nottingham University. Available: http://www.nottingham.ac.uk/alzsh3/acvocab/wordlists.htm, 2014
Heinrich, G. (2005). Parameter estimation for text analysis, version 2.9 ed. Darmstadt, Germany: Fraunhofer IGD.
Kim, D., & Oh, A. (2011). Topic chains for understanding a news corpus. In A. Gelbukh (Ed.), Computational linguistics and intelligent text processing. Berlin, Heidelberg: Springer.
Koltcov, S., Koltsova, O., & Nikolenko, S. (2014). Latent dirichlet allocation: stability and applications to studies of user-generated content. In Proceedings of the 2014 ACM conference on Web science. Bloomington, Indiana, USA: ACM.
Lai, K.-K., & Wu, S. J. (2005). Using the patent co-citation approach to establish a new patent classification system. Information Processing and Management, 41, 313–330.
Lukins, S. K., Kraft, N. A., & Etzkorn, L. H. (2010). Bug localization using latent Dirichlet allocation. Information and Software Technology, 52, 972–990.
Nishijima, Y., Anzai, T., & Sengoku, S. (2013). Application of bibliometric analysis to market analysis. In Proceedings of the 2013 Portland International Conference on Management of Engineering & Technology (pp. 2365–2377).
Noel, G. E., & Peterson, G. L. (2014). Applicability of Latent Dirichlet Allocation to multi-disk search. Digital Investigation.
Novelli, E. (2014). An examination of the antecedents and implications of patent scope. Research Policy.
Office U.S.P.A.T. (2015). United States Patent and Trademark Office [Online]. Available: http://www.uspto.gov/
Porter, L. A. (2005). QTIP: Quick technology intelligence processes. Technological Forecasting and Social Change, 72, 1070–1081.
Sheikh, N., Gomez, F. A., Yonghee, C., & Siddappa, J. (2011). Forecasting of advanced electronic packaging technologies using bibliometric analysis and Fisher-Pry diffusion model. In Proceedings of the 2011 Portland International Conference on Management of Engineering & Technology (pp. 1–20).
Sheldon, J. G. (1995). How to write a patent application. Practising Law Institute.
Steinbach, M., Karypis, G., & Kumar, V. (2000). A comparison of document clustering techniques. KDD workshop on text mining (pp. 525–526), Boston.
Steyvers, M., & Griffiths, T. (2007). Probabilistic topic models. In T. Landauer, D. S. McNamara, S. Dennis, & W. Kintsch (Ed.), Latent semantic analysis: A road to meaning. Laurence Erlbaum.
Tong, X., & Frame, J. D. (1994). Measuring national technological performance with patent claims data. Research Policy, 23, 133–141.
Trippe, A. J. (2003). Patinformatics: Tasks to tools. World Patent Information, 25, 211–221.
Tseng, Y.-H., Lin, C.-J., & Lin, Y.-I. (2007). Text mining techniques for patent analysis. Information Processing and Management, 43, 1216–1247.
USPTO. (2012). Manual of patent examining procedure: Claim interpretation [Online]. USPTO. Available: http://www.uspto.gov/web/offices/pac/mpep/s2111.html
Watts, R. J., & Porter, A. L. (1997). Innovation forecasting. Technological Forecasting and Social Change, 56, 25–47.
Wikipedia. (2014). Transitional phrase [Online]. Wikipedia. Available: http://en.wikipedia.org/wiki/Transitional_phrase, 2014.
WIPO. (2002). Patent cooperation treaty (PCT) Article 6 [Online]. Washington: WIPO. Available: http://www.wipo.int/pct/en/texts/articles/a6.htm
WIPO. (2004). WIPO intellectual property handbook: Policy, law and use.
Xie, Z., & Miyazaki, K. (2013). Evaluating the effectiveness of keyword search strategy for patent identification. World Patent Information, 35, 20–30.
Yang, L., Qiu, M., Gottipati, S., Zhu, F., Jiang, J., Sun, H., & Chen, Z. (2013). Cqarank: Jointly model topics and expertise in community question answering. In Proceedings of the 22nd ACM international conference on Conference on information & knowledge management (pp. 99–108). ACM.
Yang, S., & Soo, V. (2012). Extract conceptual graphs from plain texts in patent claims. Engineering Applications of Artificial Intelligence, 25, 874–887.
Yoon, B. (2008). On the development of a technology intelligence tool for identifying technology opportunity. Expert Systems with Applications, 35, 124–135.
Yoon, B., & Park, Y. (2005). A systematic approach for identifying technology opportunities: Keyword-based morphology analysis. Technological Forecasting and Social Change, 72, 145–160.
Yoon, J., & Kim, K. (2012). TrendPerceptor: A property–function based technology intelligence system for identifying technology trends from patents. Expert Systems with Applications, 39, 2927–2938.
Zhang, Y., Porter, A. L., Hu, Z., Guo, Y., & Newman, N. C. (2014). “Term clumping” for technical intelligence: A case study on dye-sensitized solar cells. Technological Forecasting and Social Change, 85, 26–39.
Zhu, D., & Porter, A. L. (2002). Automated extraction and visualization of information for technological intelligence and forecasting. Technological Forecasting and Social Change, 69, 495–506.
Acknowledgments
The work presented in this paper is partly supported by the Australian Research Council (ARC) under Discovery Project DP140101366 and the National High Technology Research and Development Program of China (Grant No. 2014AA015105).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Appendix
Appendix
See Table 11.3.
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Chen, H., Zhang, Y., Zhu, D. (2016). Identifying Technological Topic Changes in Patent Claims Using Topic Modeling. In: Daim, T., Chiavetta, D., Porter, A., Saritas, O. (eds) Anticipating Future Innovation Pathways Through Large Data Analysis. Innovation, Technology, and Knowledge Management. Springer, Cham. https://doi.org/10.1007/978-3-319-39056-7_11
Download citation
DOI: https://doi.org/10.1007/978-3-319-39056-7_11
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-39054-3
Online ISBN: 978-3-319-39056-7
eBook Packages: Business and ManagementBusiness and Management (R0)