Elsevier

Journal of Informetrics

Volume 4, Issue 1, January 2010, Pages 74-82
Journal of Informetrics

Can epidemic models describe the diffusion of topics across disciplines?

https://doi.org/10.1016/j.joi.2009.08.002Get rights and content

Abstract

This paper introduces a new approach to describe the spread of research topics across disciplines using epidemic models. The approach is based on applying individual-based models from mathematical epidemiology to the diffusion of a research topic over a contact network that represents knowledge flows over the map of science—as obtained from citations between ISI Subject Categories. Using research publications on the protein class kinesin as a case study, we report a better fit between model and empirical data when using the citation-based contact network. Incubation periods on the order of 4–15.5 years support the view that, whilst research topics may grow very quickly, they face difficulties to overcome disciplinary boundaries.

Introduction

How concepts, ideas, technologies and/or innovations spread across heterogeneous communities has long been one of the central questions of the sociology of science and technology (Mulkay, 1974, Rogers, 1962). In recent years, studying the diffusion of scientific topics has become much more feasible due to the wider availability of a variety of databases, fast and cheap computing power and efficient search and model-fitting algorithms. There are a number of ways in which the diffusion of topics can be tracked (e.g., Chen & Hicks, 2004). In terms of transmission dynamics, the similarities between the spread of research topics and the spread of infectious diseases have not gone unnoticed (Bettencourt, Cinrón-Arias, Kaiser, & Castillo-Chávez, 2006). In the spread of a disease through a population, contact between an infectious and a susceptible individual can lead to the transmission of infection. In a similar way, individuals or groups working on a particular research topic or topics can motivate other individuals or groups to start work based on the same or similar research topics with citation being evidence of motivation.

Though models of social contagion date back to the mid 20th century1, the use of epidemiological models to capture the diffusion of research topics through scientific publications was recently discussed by Bettencourt et al. (2006), who found a good fit between suitably adapted epidemic models and data for the spread of a specific research topic (Feynman diagrams in theoretical physics). They further showed that this good fit is not dependent on the particular topic chosen and that epidemic models provide good descriptions of the spread of other topics in both theoretical and experimental physics (Bettencourt, Kaiser, Kaur, Castillo-Chávez, & Wojick, 2008).

However, the epidemiological models investigated so far have been of the simple differential-equation-based compartmental type. While compartmental models are transparent and allow the derivation of some analytical results, they are limited in their capability to capture heterogeneities at the individual level and in the interaction between individual epidemiological units, both of which we expect to see in citation networks (see model description below). As a potentially useful alternative method, we have developed an individual-based directed and weighted network model.

The second novelty of the approach we present here is that, whereas previous studies have investigated the growth of a topic in terms of number of published papers or publishing authors, we inquire here into how a research topic spreads over an existing network of disciplines. In other words, whereas previous studies had focused on growth dynamics, this novel perspective captures the diffusion of topics over the network of connections between scientific disciplines, as assigned by the ISI Web of Science's classification in terms of Subject Categories (SCs), following Leydesdorff and Rafols’ approach (2009). This underlying network of citations among SCs represents the knowledge flows over the “backbone” of the map of science (Boyack, Klavans, & Börner, 2005). The weight of a link (i.e., the normalised number of citations between SCs) in this network is taken to be a good indicator of the likelihood of a SC becoming research-active in a certain area given that some other related SCs are already research-active in this specific area. We can then ask whether a novel topic (a newly discovered phenomenon, material, method or piece of instrumentation) seeded at one particular node or vertex in the network will diffuse through it following to some extent the weighted connections between SCs.

In this exploratory study we examined the spread of research on kinesin. Kinesin represents a class of eukaryotic motor proteins (often referred to as a molecular motor or “nano-engine”) that functions by moving actively along microtubules (Block, 1998). Kinesin research first emerged in 1985, with the report of its discovery published in the areas of Biochemistry and Cell Biology. In the 1990s, research on kinesin spread broadly to other fields in the biological sciences and in the 2000s it reached other biomedical research, on the one hand, and chemistry, physics and materials sciences, on the other, as illustrated in Fig. 1. This later development is associated with potential bionanotechnology applications, which made kinesin an interesting case for the study of interdisciplinary research (Rafols, 2007, Rafols and Meyer, 2009).

Here, we show that the spread of kinesin-related research over a network of disciplines can be well approximated by models used in the context of the transmission of infectious diseases (Anderson and May, 1991, Diekmann and Heesterbeek, 2000, Keeling and Rohani, 2008). Similar network models have been successfully used to explain and predict the pattern of infectious disease transmission (Green et al., 2008, Keeling et al., 2001, Kiss et al., 2005, Kiss et al., 2006a, Kiss et al., 2006b, Eames and Keeling, 2002), and such models are well researched in the context of mathematical epidemiology (Keeling & Eames, 2005).

The paper is organised as follows: we first introduce the data and methods; second, we describe the model; then we present the results of the quality of fit for two different disease transmission models (i.e., Susceptible-Infected or SI, and Susceptible-Exposed-Infected or SEI). Results based on the directed and weighted empirical network are compared to the case of homogeneous disciplinary spread (i.e., equal weights) on the same network. The discussion and conclusions briefly explore possible future improvements of the model and its applications in science policy.

Section snippets

Methods and data

A set of publications (articles, reviews and letters) related to the molecular motor kinesin was constructed by searching the term “kinesin*” in the bibliographic field “Topic” of the ISI Web of Science database. This search yielded 4021 publications starting from 1985 (2 publications) to 2007 (394).2 Each publication was assigned to one or more disciplines

The model

In classic disease transmission models that assume homogeneous random mixing, the population is divided into different compartments based on the disease status of the individuals and other characteristics such as age, gender or risk. Thereafter, the rates of all possible transitions between the compartments are determined. Based on this, a system of differential equations can be derived. In our model, we use a different approach and consider each SC as a node in a network along with all its

Results

This section presents the results of the simulations. First we examined the SI model with the empirical weighted network. Second, we explored how results are affected by stronger simplifications of the base network. Finally, we investigated the SEI model.

Discussion

The results of the simulations showed that relatively simple models with the directed and weighted network of SCs can produce good fits and deliver estimates of transmission rates and incubation times. As the present study is essentially a proof of concept, these results should be treated with a degree of caution but they do suggest directions in which this methodology could be further developed. We envisage two aspects in which the models might be incrementally improved.

First, we have

Conclusions

This paper has demonstrated the feasibility of applying individual-based epidemic models to the spread of a research topic over the map of science. It has made two contributions beyond previous epidemic models (Bettencourt et al., 2006, 2008): the use of a directed and weighted network of disciplines to describe the spread of topics, and the introduction of individual-based models. Using research on kinesin as a case study, we have confirmed that the agreement between model output and empirical

Acknowledgements

IR thanks Loet Leydesdorff for fruitful discussions. IR acknowledges support from the US National Science Foundation (Award #0830207, “Measuring and Tracking Research Knowledge Integration”). The findings and observations contained in this paper are those of the authors and do not necessarily reflect the views of the National Science Foundation.

References (29)

  • O. Diekmann et al.

    The epidemic in a closed population

    Mathematical epidemiology of infectious diseases: Model building, analysis and interpretation

    (2000)
  • K.T.D. Eames et al.

    Modeling dynamic and network heterogeneities in the spread of sexually transmitted diseases

    PNAS

    (2002)
  • D.M. Green et al.

    Estimates for local and movement-based transmission of bovine tuberculosis in British cattle

    Proceedings of the Royal Society B

    (2008)
  • M.J. Keeling et al.

    Dynamics of the 2001 UK foot and mouth epidemic: Stochastic dispersal in a heterogeneous landscape

    Science

    (2001)
  • Cited by (75)

    • A dynamics model of two kinds of knowledge transmission on duplex networks

      2022, Physica A: Statistical Mechanics and its Applications
    • Evaluating tacit knowledge diffusion with algebra matrix algorithm based social networks

      2022, Applied Mathematics and Computation
      Citation Excerpt :

      Through further comparison with algebra matrix evaluation method, the role of states of tacit knowledge owner and explicit knowledge carrier on tacit knowledge evaluation is estimated Because knowledge diffusion often depends on different diffusion mechanisms [18,47], we compare the states’ influence of tacit knowledge owners and explicit knowledge carriers on the diffusion thresholds with some popular methods, such as the epidemic dynamic diffusion models of susceptibility-infection-susceptibility model(SIS) [19], susceptibility-infection-remove model(SIR) [20] and susceptibility-exposed-infection-recovery model(SEIR) [21], which are proven to be effective to estimate the diffusion thresholds of tacit knowledge with different mechanism. Comparing the algebra matrix evaluation with those popular dynamic methods is an effective way to verify the necessary factors of tacit knowledge owners’ and explicit knowledge carriers’ states for the tacit knowledge diffusion.

    View all citing articles on Scopus
    View full text