Skip to main content

Identifying Technological Topic Changes in Patent Claims Using Topic Modeling

  • Chapter
  • First Online:
Anticipating Future Innovation Pathways Through Large Data Analysis

Part of the book series: Innovation, Technology, and Knowledge Management ((ITKM))

Abstract

Patent claims usually embody the core technological scope and the most essential terms to define the protection of an invention, which makes them the ideal resource for patent topic identification and theme changes analysis. However, conducting content analysis manually on massive technical terms is very time-consuming and laborious. Even with the help of traditional text mining techniques, it is still difficult to model topic changes over time, because single keywords alone are usually too general or ambiguous to represent a concept. Moreover, term frequency that used to rank keywords cannot separate polysemous words that are actually describing a different concept. To address this issue, this research proposes a topic change identification approach based on latent dirichlet allocation, to model and analyze topic changes and topic-based trend with minimal human intervention. After textual data cleaning, underlying semantic topics hidden in large archives of patent claims are revealed automatically. Topics are defined by probability distributions over words instead of terms and their frequency, so that polysemy is allowed. A case study using patents published in the United States Patent and Trademark Office (USPTO) from 2009 to 2013 with Australia as their assignee country is presented, to demonstrate the validity of the proposed topic change identification approach. The experimental result shows that the proposed approach can be used as an automatic tool to provide machine-identified topic changes for more efficient and effective R&D management assistance.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    Data accessed in March 2014.

  2. 2.

    All plant patents are seen as having one same USPC for calculation convenience.

References

  • Abbas, A., Zhang, L., & Khan, S. U. (2014). A literature review on the state-of-the-art in patent analysis. World Patent Inf, 37, 3–13.

    Article  Google Scholar 

  • Batagelj, V., & Mrvar, A. (2004). Pajek—Analysis and visualization of large networks. Berlin: Springer.

    Google Scholar 

  • Blei, D. M. (2012). Probabilistic topic models. Communications of the ACM, 55, 77–84.

    Article  Google Scholar 

  • Blei, D. M., Ng, A. Y., & Jordan, M. I. (2003). Latent dirichlet allocation. The Journal of Machine Learning research, 3, 993–1022.

    Google Scholar 

  • Campbell, R. S. (1983). Patent trends as a technological forecasting tool. World Patent Information, 5, 137–143.

    Article  Google Scholar 

  • Camus, C., & Brancaleon, R. (2003). Intellectual assets management: From patents to knowledge. World Patent Information, 25, 155–159.

    Article  Google Scholar 

  • Chen, H., Zhang, G., Zhu, D., & Lu, J. (2015). A patent time series processing component for technology intelligence by trend identification functionality. Neural Computing and Applications, 26, 345–353.

    Article  Google Scholar 

  • Daim, T. U., Kocaoglu, D. F., & Anderson, T. R. (2011). Using technological intelligence for strategic decision making in high technology environments. Technological Forecasting and Social Change, 78, 197–198.

    Article  Google Scholar 

  • David, D., Lewis, Y. Y., Rose, T. G., Li, F. (2004). SMART stopword list [Online]. Cambridge: MIT Press. Available: http://jmlr.csail.mit.edu/papers/volume5/lewis04a/a11-smart-stop-list/english.stop

  • Ernst, H. (1997). The use of patent data for technological forecasting: The diffusion of CNC-technology in the machine tool industry. Small Business Economics, 9, 361–381.

    Article  Google Scholar 

  • Griffiths, T. L., & Steyvers, M. (2004). Finding scientific topics. Proceedings of the National Academy of Sciences of the United States of America, 101, 5228–5235.

    Article  Google Scholar 

  • Halkidi, M., Batistakis, Y., & Vazirgiannis, M. (2001). On clustering validation techniques. Journal of Intelligent Information Systems, 17, 107–145.

    Article  Google Scholar 

  • Haywood, S. (2003). Academic vocabulary [Online]. Nottingham: Nottingham University. Available: http://www.nottingham.ac.uk/alzsh3/acvocab/wordlists.htm, 2014

  • Heinrich, G. (2005). Parameter estimation for text analysis, version 2.9 ed. Darmstadt, Germany: Fraunhofer IGD.

    Google Scholar 

  • Kim, D., & Oh, A. (2011). Topic chains for understanding a news corpus. In A. Gelbukh (Ed.), Computational linguistics and intelligent text processing. Berlin, Heidelberg: Springer.

    Google Scholar 

  • Koltcov, S., Koltsova, O., & Nikolenko, S. (2014). Latent dirichlet allocation: stability and applications to studies of user-generated content. In Proceedings of the 2014 ACM conference on Web science. Bloomington, Indiana, USA: ACM.

    Google Scholar 

  • Lai, K.-K., & Wu, S. J. (2005). Using the patent co-citation approach to establish a new patent classification system. Information Processing and Management, 41, 313–330.

    Article  Google Scholar 

  • Lukins, S. K., Kraft, N. A., & Etzkorn, L. H. (2010). Bug localization using latent Dirichlet allocation. Information and Software Technology, 52, 972–990.

    Article  Google Scholar 

  • Nishijima, Y., Anzai, T., & Sengoku, S. (2013). Application of bibliometric analysis to market analysis. In Proceedings of the 2013 Portland International Conference on Management of Engineering & Technology (pp. 2365–2377).

    Google Scholar 

  • Noel, G. E., & Peterson, G. L. (2014). Applicability of Latent Dirichlet Allocation to multi-disk search. Digital Investigation.

    Google Scholar 

  • Novelli, E. (2014). An examination of the antecedents and implications of patent scope. Research Policy.

    Google Scholar 

  • Office U.S.P.A.T. (2015). United States Patent and Trademark Office [Online]. Available: http://www.uspto.gov/

  • Porter, L. A. (2005). QTIP: Quick technology intelligence processes. Technological Forecasting and Social Change, 72, 1070–1081.

    Article  Google Scholar 

  • Sheikh, N., Gomez, F. A., Yonghee, C., & Siddappa, J. (2011). Forecasting of advanced electronic packaging technologies using bibliometric analysis and Fisher-Pry diffusion model. In Proceedings of the 2011 Portland International Conference on Management of Engineering & Technology (pp. 1–20).

    Google Scholar 

  • Sheldon, J. G. (1995). How to write a patent application. Practising Law Institute.

    Google Scholar 

  • Steinbach, M., Karypis, G., & Kumar, V. (2000). A comparison of document clustering techniques. KDD workshop on text mining (pp. 525–526), Boston.

    Google Scholar 

  • Steyvers, M., & Griffiths, T. (2007). Probabilistic topic models. In T. Landauer, D. S. McNamara, S. Dennis, & W. Kintsch (Ed.), Latent semantic analysis: A road to meaning. Laurence Erlbaum.

    Google Scholar 

  • Tong, X., & Frame, J. D. (1994). Measuring national technological performance with patent claims data. Research Policy, 23, 133–141.

    Article  Google Scholar 

  • Trippe, A. J. (2003). Patinformatics: Tasks to tools. World Patent Information, 25, 211–221.

    Article  Google Scholar 

  • Tseng, Y.-H., Lin, C.-J., & Lin, Y.-I. (2007). Text mining techniques for patent analysis. Information Processing and Management, 43, 1216–1247.

    Article  Google Scholar 

  • USPTO. (2012). Manual of patent examining procedure: Claim interpretation [Online]. USPTO. Available: http://www.uspto.gov/web/offices/pac/mpep/s2111.html

  • Watts, R. J., & Porter, A. L. (1997). Innovation forecasting. Technological Forecasting and Social Change, 56, 25–47.

    Article  Google Scholar 

  • Wikipedia. (2014). Transitional phrase [Online]. Wikipedia. Available: http://en.wikipedia.org/wiki/Transitional_phrase, 2014.

  • WIPO. (2002). Patent cooperation treaty (PCT) Article 6 [Online]. Washington: WIPO. Available: http://www.wipo.int/pct/en/texts/articles/a6.htm

  • WIPO. (2004). WIPO intellectual property handbook: Policy, law and use.

    Google Scholar 

  • Xie, Z., & Miyazaki, K. (2013). Evaluating the effectiveness of keyword search strategy for patent identification. World Patent Information, 35, 20–30.

    Article  Google Scholar 

  • Yang, L., Qiu, M., Gottipati, S., Zhu, F., Jiang, J., Sun, H., & Chen, Z. (2013). Cqarank: Jointly model topics and expertise in community question answering. In Proceedings of the 22nd ACM international conference on Conference on information & knowledge management (pp. 99–108). ACM.

    Google Scholar 

  • Yang, S., & Soo, V. (2012). Extract conceptual graphs from plain texts in patent claims. Engineering Applications of Artificial Intelligence, 25, 874–887.

    Article  Google Scholar 

  • Yoon, B. (2008). On the development of a technology intelligence tool for identifying technology opportunity. Expert Systems with Applications, 35, 124–135.

    Article  Google Scholar 

  • Yoon, B., & Park, Y. (2005). A systematic approach for identifying technology opportunities: Keyword-based morphology analysis. Technological Forecasting and Social Change, 72, 145–160.

    Article  Google Scholar 

  • Yoon, J., & Kim, K. (2012). TrendPerceptor: A property–function based technology intelligence system for identifying technology trends from patents. Expert Systems with Applications, 39, 2927–2938.

    Article  Google Scholar 

  • Zhang, Y., Porter, A. L., Hu, Z., Guo, Y., & Newman, N. C. (2014). “Term clumping” for technical intelligence: A case study on dye-sensitized solar cells. Technological Forecasting and Social Change, 85, 26–39.

    Article  Google Scholar 

  • Zhu, D., & Porter, A. L. (2002). Automated extraction and visualization of information for technological intelligence and forecasting. Technological Forecasting and Social Change, 69, 495–506.

    Article  Google Scholar 

Download references

Acknowledgments

The work presented in this paper is partly supported by the Australian Research Council (ARC) under Discovery Project DP140101366 and the National High Technology Research and Development Program of China (Grant No. 2014AA015105).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hongshu Chen .

Editor information

Editors and Affiliations

Appendix

Appendix

See Table 11.3.

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this chapter

Cite this chapter

Chen, H., Zhang, Y., Zhu, D. (2016). Identifying Technological Topic Changes in Patent Claims Using Topic Modeling. In: Daim, T., Chiavetta, D., Porter, A., Saritas, O. (eds) Anticipating Future Innovation Pathways Through Large Data Analysis. Innovation, Technology, and Knowledge Management. Springer, Cham. https://doi.org/10.1007/978-3-319-39056-7_11

Download citation

Publish with us

Policies and ethics