Skip to main content
Log in

Delineating the scientific footprint in technology: Identifying scientific publications within non-patent references

  • Published:
Scientometrics Aims and scope Submit manuscript

Abstract

Indicators based on non-patent references (NPRs) are increasingly being used for measuring and assessing science–technology interactions. But NPRs in patent documents contain noise, as not all of them can be considered ‘scientific’. In this article, we introduce the results of a machine-learning algorithm that allows identifying scientific references in an automated manner. Using the obtained results, we analyze indicators based on NPRs, with a focus on the difference between NPR- and scientific non-patent references-based indicators. Differences between both indicators are significant and dependent on the considered patent system, the applicant country and the technological domain. These results signal the relevancy of delineating scientific references when using NPRs to assess the occurrence and impact of science–technology interactions.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2

Similar content being viewed by others

Notes

  1. See Salton et al. (1975) on vector space models as well as Magerman et al. (2010) for a more elaborated account on vector space models for patent and publication documents.

  2. This is because the denominators are adapted to the considered subset. Whereas the volume of patents with NPRs is used as the denominator for the NPR intensity, the volume of patents with scientific NPRs is used as a denominator for the SNPR intensity indicator. If, alternatively, the same denominator is used for both (NPR and SNPR) intensities—namely the total number of patents—then both intensities do differ significantly.

  3. The intensity indicator for EPO becomes higher because of differential drops in the denominator (number of patents containing SNPRs) and the nominator (number of SNPRs).

References

  • Bishop, C. M. (2006). Pattern recognition and machine learning. New York: Springer.

    MATH  Google Scholar 

  • Callaert, J., Grouwels, J., & Van Looy, B. (2011), Delineating the scientific footprint in technology: Identifying scientific publications within non-patent references. In: E. Noyons, P. Ngulube & J. Leta (Eds.), Proceedings of ISSI 2011—The 13th International Conference on Scientometrics and Informetrics, Durban (pp. 13–18). San Juan: ISSI.

  • Callaert, J., Van Looy, B., Verbeek, A., Debackere, K., & Thijs, B. (2006). Traces of prior art: An analysis of non-patent references found in patent documents. Scientometrics, 69(1), 3–20.

    Article  Google Scholar 

  • Guan, J. C., & He, Y. (2007). Patent-bibliometric analysis on the Chinese science—Technology linkages. Scientometrics, 72(3), 403–425.

    Article  MathSciNet  Google Scholar 

  • Harhoff, D., Scherer, F. M., & Vopel, K. (2003). Citations, family size, opposition and the value of patent rights. Research Policy, 32(8), 1343–1363.

    Article  Google Scholar 

  • Hastie, T., & Friedman, J. (2009). The elements of statistical learning—Data mining, inference, and prediction (2nd ed.). New York: Springer-Verlag.

    MATH  Google Scholar 

  • He, Z. L., & Deng, M. (2007). The evidence of systematic noise in non-patent references: A study of New Zealand companies’ patents. Scientometrics, 72(1), 149–166.

    Article  Google Scholar 

  • Kotsiantis, S. B. (2007). Supervised machine learning: A review of classification techniques. In I. E. Magogliannis, et al. (Eds.), Emerging Artificial Intelligence Applications in Computer Engineering. Amsterdam: IOS Press.

    Google Scholar 

  • Lo, S. C. S. (2010). Scientific linkage of science research and technology development: A case of genetic engineering research. Scientometrics, 82(1), 109–120.

    Article  Google Scholar 

  • Magerman, T., Van Looy, B., & Song, X. (2010). Exploring the feasibility and accuracy of latent semantic analysis based text mining techniques to detect similarity between patent documents and scientific publications. Scientometrics, 82(2), 289–306.

    Article  Google Scholar 

  • Meyer, M. (2000a). Does science push technology? Patents citing scientific literature. Research Policy, 29, 409–434.

    Article  Google Scholar 

  • Meyer, M. (2000b). What is special about patent citations? Differences between scientific and patent citations. Scientometrics, 49(1), 93–123.

    Article  Google Scholar 

  • Michel, J., & Bettels, B. (2001). Patent citation analysis: A closer look at the basic input data from patent search reports. Scientometrics, 51(1), 185–201.

    Article  Google Scholar 

  • Narin, F., & Noma, E. (1985). Is technology becoming science? Scientometrics, 7, 369–381.

    Article  Google Scholar 

  • Nelson, A. J. (2009). Measuring knowledge spillovers: What patents, licenses and publications reveal about innovation diffusion. Research Policy, 38, 994–1005.

    Article  Google Scholar 

  • Salton, G., Wong, A., & Yang, C. S. (1975). A vector space model for automatic indexing. Communications of the ACM, 18(11), 613–620.

    Article  MATH  Google Scholar 

  • Schmoch, U. (2008). Concept of a technology classification for country comparisons. Final report to the world intellectual property organization.

  • Tijssen, R. J. W., Buter, R. K., & Van Leeuwen, T. N. (2000). Technological relevance of science: Validation and analysis of citation linkages between patents and research papers. Scientometrics, 47, 389–412.

    Article  Google Scholar 

  • Van Looy, B., Callaert, J., Debackere, K., & Verbeek, A. (2002). Patent-related indicators for assessing knowledge-generating institutions: Towards a contextualised approach. The Journal of Technology Transfer, 28(1), 53–61.

    Article  Google Scholar 

  • Van Looy, B., Magerman, T., & Debackere, K. (2007). Developing technology in the vicinity of science: An examination of the relationship between science intensity (of patents) and technological productivity within the field of biotechnology. Scientometrics, 70(2), 441–458.

    Article  Google Scholar 

  • Van Looy, B., Zimmermann, E., Veugelers, R., Verbeek, A., Mello, J., & Debackere, K. (2003). Do science–technology interactions pay off when developing technology? Scientometrics, 57(3), 355–367.

    Article  Google Scholar 

  • Van Vianen, B., Moed, H., & Van Raan, A. (1990). An exploration of the science base of recent technology. Research Policy, 19, 61–81.

    Article  Google Scholar 

  • Verbeek, A., Debackere, K., Luwel, M., & Zimmermann, E. (2002). Measuring progress and evolution in science and technology-I: The multiple uses of bibliometric indicators. International Journal of Management Reviews, 4(2), 179–211.

    Article  Google Scholar 

  • Verspagen, B. (2008). Knowledge flows, patent citations and the impact of science on technology. Economic Systems Research, 20(4), 266–339.

    Google Scholar 

Download references

Acknowledgments

This article is an extended version of a paper presented at the 13th International Conference on Scientometrics and Informetrics, Durban (South Africa), 4–7 July 2011 Callaert et al. 2011). The authors want to acknowledge conference participants who contributed with comments and remarks.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Julie Callaert.

Additional information

Authors appear in alphabetical order.

Appendix

Appendix

See Table 7.

Table 7 Parameters linear discriminant function (excerpt: top 20 terms in absolute coefficient value)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Callaert, J., Grouwels, J. & Van Looy, B. Delineating the scientific footprint in technology: Identifying scientific publications within non-patent references. Scientometrics 91, 383–398 (2012). https://doi.org/10.1007/s11192-011-0573-9

Download citation

  • Received:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11192-011-0573-9

Keywords

Mathematics Subject Classification (2000)

JEL Classification

Navigation