Measuring national capability over big science’s multidisciplinarity: A case study of nuclear fusion research

Hyunuk Kim; Inho Hong; Woo-Sung Jung

doi:10.1371/journal.pone.0211963

Abstract

In the era of big science, countries allocate big research and development budgets to large scientific facilities that boost collaboration and research capability. A nuclear fusion device called the “tokamak” is a source of great interest for many countries because it ideally generates sustainable energy expected to solve the energy crisis in the future. Here, to explore the scientific effects of tokamaks, we map a country’s research capability in nuclear fusion research with normalized revealed comparative advantage on five topical clusters—material, plasma, device, diagnostics, and simulation—detected through a dynamic topic model. Our approach captures not only the growth of China, India, and the Republic of Korea but also the decline of Canada, Japan, Sweden, and the Netherlands. Time points of their rise and fall are related to tokamak operation, highlighting the importance of large facilities in big science. The gravity model points out that two countries collaborate less in device, diagnostics, and plasma research if they have comparative advantages in different topics. This relation is a unique feature of nuclear fusion compared to other science fields. Our results can be used and extended when building national policies for big science.

Citation: Kim H, Hong I, Jung W-S (2019) Measuring national capability over big science’s multidisciplinarity: A case study of nuclear fusion research. PLoS ONE 14(2): e0211963. https://doi.org/10.1371/journal.pone.0211963

Editor: Jefferson Stafusa Elias Portela, Julius-Maximilians-Universitat Wurzburg, GERMANY

Received: September 22, 2018; Accepted: January 24, 2019; Published: February 8, 2019

Copyright: © 2019 Kim et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: The data underlying the results presented in the study are available from Web of Science (http://webofknowledge.com).

Funding: This work was supported by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (2016R1D1A1B03932590). H.K. acknowledges NRF (National Research Foundation of Korea) Grant funded by the Korean Government (NRF-2017H1A2A1044205-Global Ph.D. Fellowship Program). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: The authors have declared that no competing interests exist.

Introduction

Big science is characterized by its big budgets, manpower, and machines. It includes a number of multidisciplinary fields such as nuclear fusion, particle accelerators, and space science [1]. Most of them originated for military reasons in World War II and were mainly led by superpowers. In recent decades, as these fields become more demanding, countries actively collaborate to utilize the resources of others and build shared infrastructure [2–4]. In this sense, compared to little science, big science requires more international collaboration and resource accessibility [5].

A large facility is considered the core resource of big science. From construction to operation, it requires participation of various stakeholders under the leadership of national government, resulting in economic spillovers to society [6–8]. A large facility also stimulates scientific advancements by supporting research activities that are hard to conduct in a laboratory. It attracts researchers of diverse disciplines and enhances scientific collaborations. Despite its scientific importance, little attention has been paid to examining how large facilities raise national research capacities because of difficulties in unraveling the multidisciplinarity of big science [9–11]. Moreover, national research capacity is difficult to quantify as it is built on the complex interactions between private and public domains [12, 13]. Depending on science and technology policies, countries have different goals, such as training experts, publishing papers, or granting patents, that constitute the national research capacity [14, 15].

Among many aspects of the national research capacity, this study focuses on academic publishing to estimate the capacity quantitatively [16–22], which we term “research capability,” by implementing topic modeling and revealed comparative advantage on the bibliographic information of research papers. The dynamic topic model [23, 24] first detects subject fields from paper abstracts and distributes publication counts over the detected fields in real values. Normalized revealed comparative advantage (NRCA) [25] is applied to fractional publication counts for projecting a country’s research capability as well as its changes by facility construction. Based on NRCA, we measure how similar two countries’ research capabilities are and include the distance in a gravity model to show its impact on international collaboration.

For a case study, we investigate nuclear fusion, in which the construction of large facilities and international collaborations are crucial. Nuclear fusion is a field that countries have interest in as it produces clean, affordable, and sustainable energy [26, 27]. The history of nuclear fusion consists of the footprints of major successes in tokamaks [28]. After the nuclear fusion reaction of hydrogen was identified as the source of solar energy in the 1920s [29], scientists began to study controlled thermonuclear fusion for sustainable energy production in the 1950s [30]. The tokamak is a device that magnetically confines high-temperature plasmas essential for steady thermonuclear reactions [31], and now it is the most dominant and actively studied device for nuclear fusion research [32]. Tokamaks are composed of strong magnets for confining plasmas, several wall-components in a vacuum vessel for protection, heating devices, and diagnostic devices, which require knowledge across diverse fields: plasma physics, numerical simulations, diagnostics, material science, and engineering [31]. The performance of tokamaks positively scales with size, thus tokamaks have become greater, better, and more expensive [33–36]. The large budgets for tokamaks have increased international collaborations since the 1990s, as seen in the cases of JET (Joint European Torus) [37] and ITER (International Thermonuclear Experimental Reactor) construction [34].

Our approach successfully captures various aspects of nuclear fusion from a bibliographic database over 40 years, 1976–2016. The dynamic topic model disentangles multidisciplinarity and classifies 41 topics grouped into five topical clusters: material, plasma, device, diagnostics, and simulation. Furthermore, the revealed comparative advantage identifies leading countries that participate in international projects or have their own tokamak. The rise and fall of these countries match well with tokamak operation. With the gravity model of scientific collaboration, we additionally address whether complementarity leads to collaboration in nuclear fusion research. The regression results show that countries collaborate less if they have research capability in different topics. It is a unique characteristic of nuclear fusion compared to other sciences in which complementarity enhances collaborations [38–42]. This paper provides quantitative evidence for establishing strategic policies that initiate and evaluate big science projects.

Data and methods

Bibliographic data

We analyzed 25,085 nuclear fusion research papers published during 1976-2016. They were collected from the Scopus database (document type: article) and contain the term “tokamak” in the title, abstract, or keyword fields. Papers without affiliation information were manually filled by checking their original documents. When an author had multiple affiliations, we considered the first one as her/his nationality. We used the fractional counting method to obtain the number of papers for each country. For example, if a paper was written by three American and two Korean researchers, 0.6 and 0.4 were assigned to both countries’ paper counts.

The fractional counting method gives more weight to leading countries, so that would embrace their inherent academic leadership. Nevertheless, the fractional counting method gives less biased results than the full counting method that assigns an equal weight to all countries in a paper. The full counting method could overrepresent some countries (e.g. the United States) which participate in many international projects. Systemic comparisons of the two methods recommend the fractional counting method in co-authorship analysis [43, 44], especially for scientific fields conducting large-scale international experiments. For this reason, we chose the fractional counting method to estimate research capability as well as the degree of collaborations.

Among 75 countries in our dataset, we focused on the top 14 countries that published more than 250 papers in our time scope. The distribution of paper counts was highly skewed. These 14 countries published more than 90% of the research articles. The top 14 countries were the United States, Japan, China, Germany, the United Kingdom, Russia, France, Italy, the Republic of Korea, Switzerland, India, Sweden, Canada, and the Netherlands. The basic statistics of these countries are listed in Table 1. A paper written by more than two authors in different countries is classified as a collaborative paper.

Download:

Table 1. Summary statistics of 14 leading countries in nuclear fusion research.

All values are real numbers as we count the number of papers by the fractional counting method. Ratio is the proportion of collaborative papers to total papers.

https://doi.org/10.1371/journal.pone.0211963.t001

Topic modeling and clustering

The dynamic topic model (DTM) conceptualizes the knowledge in nuclear fusion research [23, 24]. The DTM specifies topics in a set of documents based on latent Dirichlet allocation (LDA) [45], and it also describes the temporal evolution of detected topics by updating consequent input hyperparameters α_t and β_t by each year. α_t affects the topic distribution of a document, and β_t indicates the word distribution in a topic. The DTM infers both parameters to reproduce the empirical word distribution under the assumption that a document is made by both processes in year t, choosing a topic for a document by α_t and sampling words in that topic by β_t. α_t and β_t are used as references to estimate α_t+1 and β_t+1.

In our DTM implementation, insignificant words were filtered out if their term frequency–inverse document frequency (tf-idf) values were less than 0.01. Then, we used the words that appeared more than 10 times in the whole document. As a result, our dictionary contained 7,851 unique words, and the documents contained 1,619,233 words in total. The number of topics K needed to be determined before running the DTM. Following the recent approach [46], we specified the number of topics K = 41 (see S1 Appendix and S1 Fig). Open source codes were written by the authors of the DTM paper and available at https://github.com/blei-lab/dtm. We manually labelled 41 topics from their word frequencies (see S1 Table).

The DTM provides an article’s topic distribution based on the learned parameters. As we set the number of topics to 41, the topic distribution of an article was given as a vector of length 41. Topic distribution was allocated to countries in proportion to their contributions on each article. For instance, if an article was written by American authors only, the topic distribution of the article was fully given to the United States. For another article written by three American and two Korean researchers, 60% of the topic distribution would be added to the United States. In this way, a country’s research capability over 41 topics was estimated for each year from 1976–2016.

Fractional publication and collaboration counts by topics

The fractional counting method was used for calculating a country’s publication and collaboration counts (Fig 1). For year t when n_t papers are published, we have two matrices, the fractional publication counts by countries (A_t: n_t papers × 75 countries) and the topic distributions of papers (B_t: n_t papers × 41 topics). represents the fractional publication counts of 75 countries by 41 topics at year t. Based on the five topical clusters that we found (Fig 2), the fractional counts were summed into five columns to obtain the discriminant power for further analysis. We will explain these topical clusters in the result section. We hereafter call this summarized matrix as national research capability over 5 topical clusters at year t, R_t (75 countries × 5 topical clusters). Collaborations were also counted in fractions. We multiplied the country profile of a paper and its transpose to obtain the collaboration matrix. The matrix was distributed over five matrices in proportion to topical cluster weights.

Download:

Fig 1. Schematics of the fractional counting method for publication and collaboration counts.

Two matrices, the fractional publication counts by countries A_t and the topic distributions of papers B_t, were extracted from the document set of year t. (1) represents the fractional publication counts by topics at year t. For further analysis, based on the hierarchical tree of clusters in Fig 2, the fractional publications by 41 topics are grouped into five topical clusters: material, plasma, device, diagnostics, and simulation. R_t is the aggregated matrix and is transposed in the figure to match with the hierarchical tree of 41 clusters. (2) The country profile of a paper is transformed into a collaboration matrix W₁, which was distributed over the five topical clusters by weights. For each year, by aggregating the collaboration matrices of all published papers, we had five fractional collaboration matrices.

https://doi.org/10.1371/journal.pone.0211963.g001

Download:

Fig 2. Hierarchical tree of 41 topics detected from the dynamic topic model.

Topics were agglomerated by the ward.D method [50]. The distance between topics was measured by the Jensen-Shannon distance [51], a square root of the Jensen-Shannon divergence. Five topical clusters—material, plasma, device, diagnostics, and simulation—are revealed. The branches are colored by the corresponding topical clusters.

https://doi.org/10.1371/journal.pone.0211963.g002

Normalized revealed comparative advantage (NRCA)

Normalized revealed comparative advantage (NRCA) [25], one of revealed comparative advantage indices, represents how much an entity’s value exceeds expectations. When comparing longitudinal RCA values, NRCA outperforms the Balassa index (BRCA) [47], the most popular RCA index that defines comparative advantage as a ratio of observations to expectations. Let be country i’s research capability on topical cluster j at year t. , the NRCA of country i on topical cluster j at year t, is calculated as (1) where is the sum of country i’s research capability across five topical clusters at year t (), R_j,t is the sum of all countries’ research capabilities on topical cluster j at year t (), and R_t is the sum of all countries’ research capabilities on five topical clusters at year t, denoted by . A positive value means that country i has a comparative advantage on topical cluster j at year t.

Countries have comparative advantages on different topics as it is almost impossible to be competitive in all topics. We measured how similar two countries’ research capabilities are as follows. First, the NRCA of each country was transformed into the binary vector by changing positive NRCA values to 1 and negative values to 0 to identify the topics with significant comparative advantages. Second, the Jaccard distance between two countries’ binary NRCA vectors was calculated for determining their topical dissimilarity (Eq 2). We call this distance between country m and n on topical cluster j at year t the capability distance c_mn,j,t. A high c_mn,j,t represents that two countries are in complementary relation where their differences in research capability generate synergy by collaborations. (2)

Gravity model of scientific collaboration

Scientific collaboration between country m and n in topical cluster j at year t, w_mn,j,t, is related to the number of publications of the two (P_m,j,t and P_n,j,t) and their geographical distance (d_mn). The gravity model explains their relationships in many scientific fields [48, 49]. P_m,j,t and P_n,j,t positively and d_mn negatively affects w_mn,j,t. We added the capability distance to the gravity model for checking whether complementarity increases collaboration. Our basic model is written as (3) where d_mn is the Haversine distance (km) between capitals. For two countries m and n, we counted w_mn,j,t, P_m,j,t, and P_n,j,t in real values, and calculated c_mn,j,t from the binary transformed NRCA vectors. A positive λ indicates that complementarity stimulates collaboration.

Results

Knowledge structure of nuclear fusion research

The DTM detected 41 topics in the dataset. Each topic had its word distribution indicating the extent of word assignments to the topic. We assumed that two topics were close if their word distributions were similar. The topic distance between topic k₁ and k₂ was obtained by the Jensen-Shannon distance [51], a square root of the Jensen-Shannon divergence. For simplicity, we used the word distribution at the last year, and . A knowledge structure of nuclear fusion research was drawn by agglomerating 41 topics with the ward.D method [50]. The hierarchical tree consists of five distinguishable topical clusters: material, plasma, device, diagnostics, and simulation (Fig 2).

Each cluster is clearly characterized by its topics. We observe the details of each branch from the top of the tree. The “material” cluster is described by tokamak edge plasmas and components as plasmas interact with wall materials at the edge. The “plasma” cluster contains general plasma-related topics (i.e., plasma flow, magnetohydrodynamics, and discharge), major instabilities in tokamak configurations (i.e., Alfvén eigenmode, neoclassical tearing mode, and edge-localized mode), and heating methods (i.e., lower hybrid current drive and electron cyclotron resonance heating). The “device” cluster includes mechanical components in tokamaks (i.e., coil, power supply, vessel, magnet, and blanket) and several tokamaks (i.e., Tore Supra, KSTAR, and EAST). The “diagnostics” cluster is composed of plasma diagnostics methods such as soft X-ray, neutron detector, and spectroscopy. Finally, the “simulation” cluster focuses on analytic calculations and computations.

National research capability and its overall trends

Normalized revealed comparative advantage (NRCA) on the fractional publication counts extracted national research capability over 40 years (Fig 3). In all countries, NRCA changes are in good agreement with tokamak construction and operation, representing the scientific effects of large facilities across multiple domains. The United States and Japan have led nuclear fusion research, while Japan’s influence has been decreasing since the 2000s. It may be due to the upgrade of their major tokamak JT-60 which was disassembled in 2009-2012 and is being upgraded to JT-60SA for first plasma in 2020. China rapidly develops research capability overall except in material-related topics. Even though we consider the rise of China in all science and technology fields, their pace in nuclear fusion research is surprisingly fast. China’s tokamaks, HT-7 and HL-2A, raise research capability in device, diagnostics, and simulation. At the point of EAST (Experimental Advanced Superconducting Tokamak) operation in 2006, they also began to equip plasma capability as well. The other countries operating their own tokamaks, Germany, the United Kingdom, Russia, France, Italy, and Switzerland, actively engage in nuclear fusion research. However, the countries without their own tokamak operation, Sweden and the Netherlands, are losing their research capabilities. Canada’s fall seems plausible as they left tokamak projects in the early 2000s [52]. There are two interesting countries, the Republic of Korea and India, that obtain research capability in all fields. Their rises coincide with the ITER project and construction of tokamaks, KSTAR (first plasma in 2008) and SST-1 (first plasma in 2013).

Download:

Fig 3. Ranks of normalized revealed comparative advantages for the top 14 countries.

Rank series of the countries are smoothed with LOESS (locally estimated scatterplot smoothing) and colored by the topical clusters.

https://doi.org/10.1371/journal.pone.0211963.g003

Negative relation between complementarity and collaboration

Complementarity positively affects collaboration in many science fields [38–42]. Researchers and countries find collaborators that exchange knowledge as well as resources they do not have. We assume complementarity boosts collaboration even in big science because countries have limited budgets and manpower. To observe whether our assumption holds, we implemented the gravity model of collaboration with the capability distance, a Jaccard distance of the binary NRCA vectors in five topical clusters (Eq 3). The OLS regression results with fixed time effects are given in Table 2. The coefficients of publication counts of two countries are the same because they are symmetric in the collaboration matrix.

Download:

Table 2. Gravity model OLS regression results.

https://doi.org/10.1371/journal.pone.0211963.t002

In all topical clusters, as expected, the number of publications had a positive coefficient, and the geographical distance had a negative coefficient. This means that collaborations occur frequently when two countries have high research capability and locate closely. In contrast to our assumption, the capability distance negatively affects collaboration, indicating that countries collaborate less if they have research capabilities in different topics. This tendency is found in three clusters, plasma, device, and diagnostics, with respect to fusion reaction in tokamak facilities. Collaborations on material and simulation are not related to the capability distance. The regression results suggest that complementarity would affect collaborations differently by topics in big science. International collaborations in core knowledge fields happen when two countries mutually benefit based on similar research capability.

Discussion and conclusion

Large facilities and international collaboration, two core components of big science, were investigated with bibliographic data, the dynamic topic model, and revealed comparative advantage. In this study, we chose nuclear fusion for a case study. Word similarity between topics unfolded the knowledge structure of nuclear fusion comprising five multidisciplinary topical clusters: material, plasma, device, diagnostics, and simulation. Different countries have different comparative advantages over these clusters. The time points that the comparative advantage trend changes match well with tokamak operation. Catching-up countries that have built their own tokamaks have developed their research capability while countries that do not operate a tokamak miss their productivity.

Revealed comparative advantage can be used as a new indicator of big science project evaluation. Through time series analysis [53], we can examine the connections between facility construction and revealed comparative advantages in different topical clusters. The time series analysis addresses whether knowledge spillover occurs in various scales from facilities to countries [54–56]. In addition, with external information such as the amount of funding, the number of employees, and instrument specifications, we can investigate the impact of facility construction and international collaboration in detail. The publishing policy of large facilities also needs to be considered when interpreting the comparative advantage. Large facilities that restrict the publication of academic papers for the purpose of secrecy [57] have low research capability in our study, relative to others that promote academic publishing. These qualitative factors of facilities require further evaluations to estimate their scientific impacts accurately as the measure for policy making, investment, and education [58].

The international collaboration in nuclear fusion was estimated by the gravity model with the capability distance that represents how similar two countries’ research capabilities are. The regression results show high capability distance distracts the international collaborations in fusion reaction related clusters: plasma, device, and diagnostics. This tendency contrasts with that of other science fields favoring collaborators that have complementary comparative advantages [38–42]. Real collaborations in nuclear fusion governed by this pattern are worth studying. Countries may have distinct motivations to collaborate with other countries and to participate in international projects. Political and societal factors would also be involved in the policy making process. Understanding the history of nuclear fusion research gives us insights into what science policy a country has to take depending on the development stage.

Our approach can be applied to other fields of big science. Particle physics and Antarctic science are the potential targets. They depend on large facilities, particle accelerators, and research stations in Antarctica. In particle physics, we expect that the dynamic topic model differentiates various types of particle accelerators [59]. A country’s strategic decisions for particle accelerators can be traced with comparative advantages on topical clusters. In Antarctic science, research stations may increase research capabilities on geography-dependent topics [60, 61] because its location expands the range of research activities. An increasing comparative advantage on spatial topics will support this idea. Antarctic science, especially, has interesting aspects that affect the gravity model of collaboration. Collaboration in Antarctica would occur frequently between close research stations, not between close capitals, so the geographical distance of the model should be defined in a different way. The Antarctic Treaty System, which enforces the peaceful usage of Antarctica and freedom of scientific investigation [62], can encourage countries to collaborate with others having complementary comparative advantages. It is necessary to determine in particle physics and Antarctic science whether collaboration in big science decreases by complementarity as in the case of nuclear fusion. More studies are needed to understand the nature of big science.

Supporting information

S1 Appendix. Determining the number of topics from static LDA model.

https://doi.org/10.1371/journal.pone.0211963.s001

(PDF)

S1 Fig. Topic usage distribution for static LDA model.

We used the topic usage distribution for static K = 500 model to calculate the cutoff that specifies sufficiently used topics. The minimum of KDE (blue line) derivative determines the cutoff (red dashed line), and the number of topics above this point, K = 41, is used for the DTM.

https://doi.org/10.1371/journal.pone.0211963.s002

(TIF)

S1 Table. The top 10 words for 41 topics in nuclear fusion research.

https://doi.org/10.1371/journal.pone.0211963.s003

(PDF)

Acknowledgments

This work was supported by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (2016R1D1A1B03932590). H.K. acknowledges the NRF Grant funded by the Korean Government (2017H1A2A1044205, Global Ph.D. Fellowship Program).

References

1. Weinberg AM. Impact of large-scale science on the United States. Science. 1961;134(3473):161–164. pmid:17818712
- View Article
- PubMed/NCBI
- Google Scholar
2. Capshew JH, Rader KA. Big science: Price to the present. Osiris. 1992;7:2–25.
- View Article
- Google Scholar
3. Xin H, Yidong G. China bets big on big science. Science. 2006;311(5767):1548–1549. pmid:16543433
- View Article
- PubMed/NCBI
- Google Scholar
4. Fortin JM, Currie DJ. Big science vs. little science: how scientific impact scales with funding. PLoS ONE. 2013;8(6):e65263. pmid:23840323
- View Article
- PubMed/NCBI
- Google Scholar
5. Sonnenwald DH. Scientific collaboration. Annual Review of Information Science and Technology. 2007;41(1):643–681.
- View Article
- Google Scholar
6. Autio E, Hameri AP, Vuola O. A framework of industrial knowledge spillovers in big-science centers. Research Policy. 2004;33(1):107–126.
- View Article
- Google Scholar
7. Choi W, Tho H, Kim Y, Hwang S, Kang D. The economic benefits of big science R&D: With a focus on fusion R&D program in Korea. Fusion Engineering and Design. 2017;124:1263–1268.
- View Article
- Google Scholar
8. Castelnovo P, Florio M, Forte S, Rossi L, Sirtori E. The economic impact of technological procurement for large-scale research infrastructures: Evidence from the Large Hadron Collider at CERN. Research Policy. 2018;47(9):1853–1867.
- View Article
- Google Scholar
9. Heidler R, Hallonsten O. Qualifying the performance evaluation of Big Science beyond productivity, impact and costs. Scientometrics. 2015;104(1):295–312.
- View Article
- Google Scholar
10. Hallonsten O. Use and productivity of contemporary, multidisciplinary Big Science. Research Evaluation. 2016;25(4):486–495.
- View Article
- Google Scholar
11. Qiao L, Mu R, Chen K. Scientific effects of large research infrastructures in China. Technological Forecasting and Social Change. 2016;112:102–112.
- View Article
- Google Scholar
12. Freeman C. The ‘National System of Innovation’ in historical perspective. Cambridge Journal of Economics. 1995;19(1):5–24.
- View Article
- Google Scholar
13. Etzkowitz H, Leydesdorff L. The dynamics of innovation: from National Systems and “Mode 2” to a Triple Helix of university–industry–government relations. Research Policy. 2000;29(2):109–123.
- View Article
- Google Scholar
14. Feller I. Federal and state government roles in science and technology. Economic Development Quarterly. 1997;11(4):283–295.
- View Article
- Google Scholar
15. Larédo P, Mustar P. Research and innovation policies in the new global economy: An international comparative analysis. Edward Elgar Publishing; 2001.
16. Chubin DE. Research evaluation and the generation of big science policy. Knowledge. 1987;9(2):254–277.
- View Article
- Google Scholar
17. Börner K, Klavans R, Patek M, Zoss AM, Biberstine JR, Light RP, et al. Design and update of a classification system: The UCSD map of science. PLoS ONE. 2012;7(7):e39464. pmid:22808037
- View Article
- PubMed/NCBI
- Google Scholar
18. Sinha A, Shen Z, Song Y, Ma H, Eide D, Hsu BJP, et al. An overview of microsoft academic service (MAS) and applications. In: Proceedings of the 24th international conference on world wide web. ACM; 2015. p. 243–246.
19. Wang Q, Waltman L. Large-scale analysis of the accuracy of the journal classification systems of Web of Science and Scopus. Journal of Informetrics. 2016;10(2):347–364.
- View Article
- Google Scholar
20. Chen G, Xiao L, Hu Cp, Zhao Xq. Identifying the research focus of Library and Information Science institutions in China with institution-specific keywords. Scientometrics. 2015;103(2):707–724.
- View Article
- Google Scholar
21. Guevara MR, Hartmann D, Aristarán M, Mendoza M, Hidalgo CA. The research space: using career paths to predict the evolution of the research output of individuals, institutions, and nations. Scientometrics. 2016;109(3):1695–1709.
- View Article
- Google Scholar
22. Li N. Evolutionary patterns of national disciplinary profiles in research: 1996–2015. Scientometrics. 2017;111(1):493–520.
- View Article
- Google Scholar
23. Blei DM, Lafferty JD. Dynamic topic models. In: Proceedings of the 23rd international conference on Machine learning. ACM; 2006. p. 113–120.
24. Gerrish S, Blei DM. A language-based approach to measuring scholarly impact. In: ICML. vol. 10. Citeseer; 2010. p. 375–382.
25. Yu R, Cai J, Leung P. The normalized revealed comparative advantage index. The Annals of Regional Science. 2009;43(1):267–282.
- View Article
- Google Scholar
26. Chen F. An indispensable truth: how fusion power can save the planet. Springer Science & Business Media; 2011.
27. Clery D. A piece of the sun. Gerald Duckworth & Co; 2013.
28. Braams CM, Stott PE. Nuclear fusion: half a century of magnetic confinement fusion research. CRC Press; 2002.
29. Eddington AS. The internal constitution of the stars. University Press Cambridge; 1926.
30. Smirnov V. Tokamak foundation in USSR/Russia 1950–1990. Nuclear Fusion. 2009;50(1):014003.
- View Article
- Google Scholar
31. Wesson J, Campbell DJ. Tokamaks. vol. 149. Oxford University Press; 2011.
32. Kikuchi M. A review of fusion and Tokamak research towards steady-state operation: A JAEA contribution. Energies. 2010;3(11):1741–1789.
- View Article
- Google Scholar
33. Lawson JD. Some criteria for a power producing thermonuclear reactor. Proceedings of the Physical Society Section B. 1957;70(1):6.
- View Article
- Google Scholar
34. Aymar R, Barabaschi P, Shimomura Y. The ITER design. Plasma Physics and Controlled Fusion. 2002;44(5):519.
- View Article
- Google Scholar
35. Ikeda K. ITER on the road to fusion energy. Nuclear Fusion. 2009;50(1):014002.
- View Article
- Google Scholar
36. Grandoni D. Why it’s taking the US so long to make fusion energy work. Huffington Post. 2015;.
37. Rebut P, Bickerton R, Keen BE. The Joint European Torus: installation, first results and prospects. Nuclear Fusion. 1985;25(9):1011.
- View Article
- Google Scholar
38. Oh W, Choi JN, Kim K. Coauthorship dynamics and knowledge capital: The patterns of cross-disciplinary collaboration in information systems research. Journal of Management Information Systems. 2005;22(3):266–292.
- View Article
- Google Scholar
39. Barjak F, Robinson S. International collaboration, mobility and team diversity in the life sciences: impact on research performance. Social Geography. 2008;3(1):23–36.
- View Article
- Google Scholar
40. Heinze T, Kuhlmann S. Across institutional boundaries?: Research collaboration in German public sector nanoscience. Research Policy. 2008;37(5):888–899.
- View Article
- Google Scholar
41. Acosta M, Coronado D, Ferrándiz E, Léon MD. Factors affecting inter-regional academic scientific collaboration within Europe: The role of economic distance. Scientometrics. 2011;87(1):63–74.
- View Article
- Google Scholar
42. Zhang C, Guo J. China’s international research collaboration: evidence from a panel gravity model. Scientometrics. 2017;113(2):1129–1139.
- View Article
- Google Scholar
43. Perianes-Rodriguez A, Waltman L, van Eck NJ. Constructing bibliometric networks: A comparison between full and fractional counting. Journal of Informetrics. 2016;10(4):1178–1195.
- View Article
- Google Scholar
44. Park HW, Yoon J, Leydesdorff L. The normalization of co-authorship networks in the bibliometric evaluation: the government stimulation programs of China and Korea. Scientometrics. 2016;109(2):1017–1036.
- View Article
- Google Scholar
45. Blei DM, Ng AY, Jordan MI. Latent Dirichlet allocation. Journal of Machine Learning Research. 2003;3(Jan):993–1022.
- View Article
- Google Scholar
46. Gerow A, Hu Y, Boyd-Graber J, Blei DM, Evans JA. Measuring discursive influence across scholarship. Proceedings of the National Academy of Sciences. 2018;p. 201719792.
- View Article
- Google Scholar
47. Balassa B. Trade liberalisation and “revealed” comparative advantage. The Manchester School. 1965;33(2):99–123.
- View Article
- Google Scholar
48. Ponds R, Van Oort F, Frenken K. The geographical and institutional proximity of research collaboration. Papers in Regional Science. 2007;86(3):423–443.
- View Article
- Google Scholar
49. Hoekman J, Frenken K, Tijssen RJ. Research collaboration at a distance: Changing spatial patterns of scientific collaboration within Europe. Research Policy. 2010;39(5):662–673.
- View Article
- Google Scholar
50. Murtagh F, Legendre P. Ward’s hierarchical agglomerative clustering method: which algorithms implement Ward’s criterion? Journal of Classification. 2014;31(3):274–295.
- View Article
- Google Scholar
51. Endres DM, Schindelin JE. A new metric for probability distributions. IEEE Transactions on Information Theory. 2003;.
52. Brumfiel G. Canada prepares to pull the plug on fusion project. Nature. 2003;425(887).
- View Article
- Google Scholar
53. Granger CW. Investigating causal relations by econometric models and cross-spectral methods. Econometrica: Journal of the Econometric Society. 1969;p. 424–438.
- View Article
- Google Scholar
54. Hameri AP. Innovating from big science research. The Journal of Technology Transfer. 1997 Sep;22(3):27–35.
- View Article
- Google Scholar
55. Horlings E. The societal footprint of big science: A literature review in support of evidence-based decision making. Rathenau Intituut; 2012.
56. Wylie R, Markowski S, Hall P. Big science, small country and the challenges of defence system development: An Australian case study. Defence and Peace Economics. 2006;17(3):257–272.
- View Article
- Google Scholar
57. Resnik DB. Openness versus secrecy in scientific research. Episteme. 2006;2(3):135–147.
- View Article
- Google Scholar
58. Hallonsten O. Introducing ‘facilitymetrics’: a first review and analysis of commonly used measures of scientific leadership among synchrotron radiation facilities worldwide. Scientometrics. 2013;96(2):497–513.
- View Article
- Google Scholar
59. Wiedemann H. Particle accelerator physics. Springer; 2015.
60. Fogg GE. A history of Antarctic science. Cambridge University Press; 1992.
61. Kim H, Jung WS. Bibliometric analysis of collaboration network and the role of research station in Antarctic science. Industrial Engineering & Management Systems. 2016;15(1):92–98.
- View Article
- Google Scholar
62. Berkman PA, Lang MA, Walton DW, Young OR. Science diplomacy. Antarctica, Science and the Governance of International Spaces. 2011;.

[ref1] 1. Weinberg AM. Impact of large-scale science on the United States. Science. 1961;134(3473):161–164. pmid:17818712
View Article
PubMed/NCBI
Google Scholar

[2] View Article

[3] PubMed/NCBI

[4] Google Scholar

[ref2] 2. Capshew JH, Rader KA. Big science: Price to the present. Osiris. 1992;7:2–25.
View Article
Google Scholar

[6] View Article

[7] Google Scholar

[ref3] 3. Xin H, Yidong G. China bets big on big science. Science. 2006;311(5767):1548–1549. pmid:16543433
View Article
PubMed/NCBI
Google Scholar

[9] View Article

[10] PubMed/NCBI

[11] Google Scholar

[ref4] 4. Fortin JM, Currie DJ. Big science vs. little science: how scientific impact scales with funding. PLoS ONE. 2013;8(6):e65263. pmid:23840323
View Article
PubMed/NCBI
Google Scholar

[13] View Article

[14] PubMed/NCBI

[15] Google Scholar

[ref5] 5. Sonnenwald DH. Scientific collaboration. Annual Review of Information Science and Technology. 2007;41(1):643–681.
View Article
Google Scholar

[17] View Article

[18] Google Scholar

[ref6] 6. Autio E, Hameri AP, Vuola O. A framework of industrial knowledge spillovers in big-science centers. Research Policy. 2004;33(1):107–126.
View Article
Google Scholar

[20] View Article

[21] Google Scholar

[ref7] 7. Choi W, Tho H, Kim Y, Hwang S, Kang D. The economic benefits of big science R&D: With a focus on fusion R&D program in Korea. Fusion Engineering and Design. 2017;124:1263–1268.
View Article
Google Scholar

[23] View Article

[24] Google Scholar

[ref8] 8. Castelnovo P, Florio M, Forte S, Rossi L, Sirtori E. The economic impact of technological procurement for large-scale research infrastructures: Evidence from the Large Hadron Collider at CERN. Research Policy. 2018;47(9):1853–1867.
View Article
Google Scholar

[26] View Article

[27] Google Scholar

[ref9] 9. Heidler R, Hallonsten O. Qualifying the performance evaluation of Big Science beyond productivity, impact and costs. Scientometrics. 2015;104(1):295–312.
View Article
Google Scholar

[29] View Article

[30] Google Scholar

[ref10] 10. Hallonsten O. Use and productivity of contemporary, multidisciplinary Big Science. Research Evaluation. 2016;25(4):486–495.
View Article
Google Scholar

[32] View Article

[33] Google Scholar

[ref11] 11. Qiao L, Mu R, Chen K. Scientific effects of large research infrastructures in China. Technological Forecasting and Social Change. 2016;112:102–112.
View Article
Google Scholar

[35] View Article

[36] Google Scholar

[ref12] 12. Freeman C. The ‘National System of Innovation’ in historical perspective. Cambridge Journal of Economics. 1995;19(1):5–24.
View Article
Google Scholar

[38] View Article

[39] Google Scholar

[ref13] 13. Etzkowitz H, Leydesdorff L. The dynamics of innovation: from National Systems and “Mode 2” to a Triple Helix of university–industry–government relations. Research Policy. 2000;29(2):109–123.
View Article
Google Scholar

[41] View Article

[42] Google Scholar

[ref14] 14. Feller I. Federal and state government roles in science and technology. Economic Development Quarterly. 1997;11(4):283–295.
View Article
Google Scholar

[44] View Article

[45] Google Scholar

[ref15] 15. Larédo P, Mustar P. Research and innovation policies in the new global economy: An international comparative analysis. Edward Elgar Publishing; 2001.

[ref16] 16. Chubin DE. Research evaluation and the generation of big science policy. Knowledge. 1987;9(2):254–277.
View Article
Google Scholar

[48] View Article

[49] Google Scholar

[ref17] 17. Börner K, Klavans R, Patek M, Zoss AM, Biberstine JR, Light RP, et al. Design and update of a classification system: The UCSD map of science. PLoS ONE. 2012;7(7):e39464. pmid:22808037
View Article
PubMed/NCBI
Google Scholar

[51] View Article

[52] PubMed/NCBI

[53] Google Scholar

[ref18] 18. Sinha A, Shen Z, Song Y, Ma H, Eide D, Hsu BJP, et al. An overview of microsoft academic service (MAS) and applications. In: Proceedings of the 24th international conference on world wide web. ACM; 2015. p. 243–246.

[ref19] 19. Wang Q, Waltman L. Large-scale analysis of the accuracy of the journal classification systems of Web of Science and Scopus. Journal of Informetrics. 2016;10(2):347–364.
View Article
Google Scholar

[56] View Article

[57] Google Scholar

[ref20] 20. Chen G, Xiao L, Hu Cp, Zhao Xq. Identifying the research focus of Library and Information Science institutions in China with institution-specific keywords. Scientometrics. 2015;103(2):707–724.
View Article
Google Scholar

[59] View Article

[60] Google Scholar

[ref21] 21. Guevara MR, Hartmann D, Aristarán M, Mendoza M, Hidalgo CA. The research space: using career paths to predict the evolution of the research output of individuals, institutions, and nations. Scientometrics. 2016;109(3):1695–1709.
View Article
Google Scholar

[62] View Article

[63] Google Scholar

[ref22] 22. Li N. Evolutionary patterns of national disciplinary profiles in research: 1996–2015. Scientometrics. 2017;111(1):493–520.
View Article
Google Scholar

[65] View Article

[66] Google Scholar

[ref23] 23. Blei DM, Lafferty JD. Dynamic topic models. In: Proceedings of the 23rd international conference on Machine learning. ACM; 2006. p. 113–120.

[ref24] 24. Gerrish S, Blei DM. A language-based approach to measuring scholarly impact. In: ICML. vol. 10. Citeseer; 2010. p. 375–382.

[ref25] 25. Yu R, Cai J, Leung P. The normalized revealed comparative advantage index. The Annals of Regional Science. 2009;43(1):267–282.
View Article
Google Scholar

[70] View Article

[71] Google Scholar

[ref26] 26. Chen F. An indispensable truth: how fusion power can save the planet. Springer Science & Business Media; 2011.

[ref27] 27. Clery D. A piece of the sun. Gerald Duckworth & Co; 2013.

[ref28] 28. Braams CM, Stott PE. Nuclear fusion: half a century of magnetic confinement fusion research. CRC Press; 2002.

[ref29] 29. Eddington AS. The internal constitution of the stars. University Press Cambridge; 1926.

[ref30] 30. Smirnov V. Tokamak foundation in USSR/Russia 1950–1990. Nuclear Fusion. 2009;50(1):014003.
View Article
Google Scholar

[77] View Article

[78] Google Scholar

[ref31] 31. Wesson J, Campbell DJ. Tokamaks. vol. 149. Oxford University Press; 2011.

[ref32] 32. Kikuchi M. A review of fusion and Tokamak research towards steady-state operation: A JAEA contribution. Energies. 2010;3(11):1741–1789.
View Article
Google Scholar

[81] View Article

[82] Google Scholar

[ref33] 33. Lawson JD. Some criteria for a power producing thermonuclear reactor. Proceedings of the Physical Society Section B. 1957;70(1):6.
View Article
Google Scholar

[84] View Article

[85] Google Scholar

[ref34] 34. Aymar R, Barabaschi P, Shimomura Y. The ITER design. Plasma Physics and Controlled Fusion. 2002;44(5):519.
View Article
Google Scholar

[87] View Article

[88] Google Scholar

[ref35] 35. Ikeda K. ITER on the road to fusion energy. Nuclear Fusion. 2009;50(1):014002.
View Article
Google Scholar

[90] View Article

[91] Google Scholar

[ref36] 36. Grandoni D. Why it’s taking the US so long to make fusion energy work. Huffington Post. 2015;.

[ref37] 37. Rebut P, Bickerton R, Keen BE. The Joint European Torus: installation, first results and prospects. Nuclear Fusion. 1985;25(9):1011.
View Article
Google Scholar

[94] View Article

[95] Google Scholar

[ref38] 38. Oh W, Choi JN, Kim K. Coauthorship dynamics and knowledge capital: The patterns of cross-disciplinary collaboration in information systems research. Journal of Management Information Systems. 2005;22(3):266–292.
View Article
Google Scholar

[97] View Article

[98] Google Scholar

[ref39] 39. Barjak F, Robinson S. International collaboration, mobility and team diversity in the life sciences: impact on research performance. Social Geography. 2008;3(1):23–36.
View Article
Google Scholar

[100] View Article

[101] Google Scholar

[ref40] 40. Heinze T, Kuhlmann S. Across institutional boundaries?: Research collaboration in German public sector nanoscience. Research Policy. 2008;37(5):888–899.
View Article
Google Scholar

[103] View Article

[104] Google Scholar

[ref41] 41. Acosta M, Coronado D, Ferrándiz E, Léon MD. Factors affecting inter-regional academic scientific collaboration within Europe: The role of economic distance. Scientometrics. 2011;87(1):63–74.
View Article
Google Scholar

[106] View Article

[107] Google Scholar

[ref42] 42. Zhang C, Guo J. China’s international research collaboration: evidence from a panel gravity model. Scientometrics. 2017;113(2):1129–1139.
View Article
Google Scholar

[109] View Article

[110] Google Scholar

[ref43] 43. Perianes-Rodriguez A, Waltman L, van Eck NJ. Constructing bibliometric networks: A comparison between full and fractional counting. Journal of Informetrics. 2016;10(4):1178–1195.
View Article
Google Scholar

[112] View Article

[113] Google Scholar

[ref44] 44. Park HW, Yoon J, Leydesdorff L. The normalization of co-authorship networks in the bibliometric evaluation: the government stimulation programs of China and Korea. Scientometrics. 2016;109(2):1017–1036.
View Article
Google Scholar

[115] View Article

[116] Google Scholar

[ref45] 45. Blei DM, Ng AY, Jordan MI. Latent Dirichlet allocation. Journal of Machine Learning Research. 2003;3(Jan):993–1022.
View Article
Google Scholar

[118] View Article

[119] Google Scholar

[ref46] 46. Gerow A, Hu Y, Boyd-Graber J, Blei DM, Evans JA. Measuring discursive influence across scholarship. Proceedings of the National Academy of Sciences. 2018;p. 201719792.
View Article
Google Scholar

[121] View Article

[122] Google Scholar

[ref47] 47. Balassa B. Trade liberalisation and “revealed” comparative advantage. The Manchester School. 1965;33(2):99–123.
View Article
Google Scholar

[124] View Article

[125] Google Scholar

[ref48] 48. Ponds R, Van Oort F, Frenken K. The geographical and institutional proximity of research collaboration. Papers in Regional Science. 2007;86(3):423–443.
View Article
Google Scholar

[127] View Article

[128] Google Scholar

[ref49] 49. Hoekman J, Frenken K, Tijssen RJ. Research collaboration at a distance: Changing spatial patterns of scientific collaboration within Europe. Research Policy. 2010;39(5):662–673.
View Article
Google Scholar

[130] View Article

[131] Google Scholar

[ref50] 50. Murtagh F, Legendre P. Ward’s hierarchical agglomerative clustering method: which algorithms implement Ward’s criterion? Journal of Classification. 2014;31(3):274–295.
View Article
Google Scholar

[133] View Article

[134] Google Scholar

[ref51] 51. Endres DM, Schindelin JE. A new metric for probability distributions. IEEE Transactions on Information Theory. 2003;.

[ref52] 52. Brumfiel G. Canada prepares to pull the plug on fusion project. Nature. 2003;425(887).
View Article
Google Scholar

[137] View Article

[138] Google Scholar

[ref53] 53. Granger CW. Investigating causal relations by econometric models and cross-spectral methods. Econometrica: Journal of the Econometric Society. 1969;p. 424–438.
View Article
Google Scholar

[140] View Article

[141] Google Scholar

[ref54] 54. Hameri AP. Innovating from big science research. The Journal of Technology Transfer. 1997 Sep;22(3):27–35.
View Article
Google Scholar

[143] View Article

[144] Google Scholar

[ref55] 55. Horlings E. The societal footprint of big science: A literature review in support of evidence-based decision making. Rathenau Intituut; 2012.

[ref56] 56. Wylie R, Markowski S, Hall P. Big science, small country and the challenges of defence system development: An Australian case study. Defence and Peace Economics. 2006;17(3):257–272.
View Article
Google Scholar

[147] View Article

[148] Google Scholar

[ref57] 57. Resnik DB. Openness versus secrecy in scientific research. Episteme. 2006;2(3):135–147.
View Article
Google Scholar

[150] View Article

[151] Google Scholar

[ref58] 58. Hallonsten O. Introducing ‘facilitymetrics’: a first review and analysis of commonly used measures of scientific leadership among synchrotron radiation facilities worldwide. Scientometrics. 2013;96(2):497–513.
View Article
Google Scholar

[153] View Article

[154] Google Scholar

[ref59] 59. Wiedemann H. Particle accelerator physics. Springer; 2015.

[ref60] 60. Fogg GE. A history of Antarctic science. Cambridge University Press; 1992.

[ref61] 61. Kim H, Jung WS. Bibliometric analysis of collaboration network and the role of research station in Antarctic science. Industrial Engineering & Management Systems. 2016;15(1):92–98.
View Article
Google Scholar

[158] View Article

[159] Google Scholar

[ref62] 62. Berkman PA, Lang MA, Walton DW, Young OR. Science diplomacy. Antarctica, Science and the Governance of International Spaces. 2011;.

Figures

Abstract

Introduction

Data and methods

Bibliographic data

Topic modeling and clustering

Fractional publication and collaboration counts by topics

Normalized revealed comparative advantage (NRCA)

Gravity model of scientific collaboration

Results

Knowledge structure of nuclear fusion research

National research capability and its overall trends

Negative relation between complementarity and collaboration

Discussion and conclusion

Supporting information

S1 Appendix. Determining the number of topics from static LDA model.

S1 Fig. Topic usage distribution for static LDA model.

S1 Table. The top 10 words for 41 topics in nuclear fusion research.

Acknowledgments

References