Scientific success from the perspective of the strength of weak ties

Fronczak, Agata; Mrowinski, Maciej J.; Fronczak, Piotr

doi:10.1038/s41598-022-09118-8

Download PDF

Article
Open access
Published: 24 March 2022

Scientific success from the perspective of the strength of weak ties

Agata Fronczak¹,
Maciej J. Mrowinski¹ &
Piotr Fronczak¹

Scientific Reports volume 12, Article number: 5074 (2022) Cite this article

3925 Accesses
11 Citations
4 Altmetric
Metrics details

Subjects

Abstract

We present the first complete verification of Granovetter’s theory of social networks using a massive dataset, i.e. DBLP computer science bibliography database. For this purpose, we study a coauthorship network, which is considered one of the most important examples that contradicts the universality of this theory. We achieve this goal by rejecting the assumption of the symmetry of social ties. Our approach is grounded in well-established heterogeneous (degree-based) mean-field theory commonly used to study dynamical processes on complex networks. Granovetter’s theory is based on two hypotheses that assign different roles to interpersonal, information-carrying connections. The first hypothesis states that strong ties carrying the majority of interaction events are located mainly within densely connected groups of people. The second hypothesis maintains that these groups are connected by sparse weak ties that are of vital importance for the diffusion of information—individuals who have access to weak ties have an advantage over those who do not. Given the scientific collaboration network, with strength of directed ties measured by the asymmetric fraction of joint publications, we show that scientific success is strongly correlated with the structure of a scientist’s collaboration network. First, among two scientists, with analogous achievements, the one with weaker ties tends to have the higher h-index, and second, teams connected by such ties create more cited publications.

Principal component analysis

Article 22 December 2022

Entropy, irreversibility and inference at the foundations of statistical physics

Article 01 May 2024

Assembly theory explains and quantifies selection and evolution

Article Open access 04 October 2023

Social networks (SN), representing patterns of human interactions, have been the subject of both empirical and theoretical research since at least the middle of the last century¹. At the beginning of the twenty-first century, there was a breakthrough in social network analysis (SNA)^2,3. With the era of widespread digitization, which provided access to huge electronic databases, new empirical methods of SNA have emerged and replaced traditional approaches based on questionnaires and interviews. These new methods, rooted in big data mining, finally allowed for the verification of many well-established theoretical SN ideas, in some cases confirming their validity and in others failing to do so⁴. In this regard, the present status of Granovetter’s weak-tie theory^5,6 of SN, one of the oldest and most influential theories in sociology, is still vague. There are convincing studies that show the validity of its selected aspects (e.g.,^7,8,9,10), but there are also many that question it (e.g.,^11,12,13). Our analysis presented in this paper is unique because, using a massive dataset, not only do we confirm Granovetter’s weak tie theory in its full spectrum but also indicate a possible source of problems related to research questioning its validity.

Granovetter’s theory is based on two hypotheses. The first pertains to the structure of social networks and the second to their dynamics (the way in which the afore-mentioned structure influences the flow of information in the network). It is significant that although most empirical studies have focused on the first hypothesis, far less research has been undertaken to verify the second. One possible reason is that the second hypothesis involves notions relative to the nature and importance of information that are hard to quantify and measure. In this study, we clearly confirm both hypotheses—and Granovetter’s theory in its entirety—in the context of a scientific collaboration network.

The scientific collaboration network^{14,15,16,17,18} is particularly well suited to the overarching goal of this paper (i.e., complete confirmation of Granovetter’s theory) because: (i) connections (ties) between network nodes (scientists) are well defined, and their weight¹⁹ (strength of ties) is easy to measure (e.g., through joint publications); (ii) scientific publications themselves are also a specific proxy of information flow in the studied network (diffusion of innovations²⁰); and (iii) the number of citations is an obvious measure of their significance. Easy access to large datasets is also important, making our conclusions statistically reliable.

The network we investigated has all the features of a complex network²¹. In particular, it shows the scale-free node degree distribution $P(k)\propto k^{-\gamma }$ with the characteristic exponent $\gamma \simeq 2.3$. In the theory of complex networks, this value of $\gamma $ is alarming in the sense that it indicates that the network requires special treatment, including methods of results averaging different to the ones used in homogeneous systems. In relation to Granovetter’s theory, this means that in such networks, basic concepts, such as tie strength and neighbourhood overlap, should be defined in a more careful manner than in homogeneous networks. Their incorrect definition may, instead of confirming the theory, result in its contradiction. In all known empirical studies on Granovetter’s theory, interpersonal ties are assumed to be positive and symmetric. However, it is obvious that social relations do not usually follow this assumption (see, for example, the theory of social balance^22,23 or the concept of multirelational organization of SN^24,25). For example, the scientific collaboration between a young scientist and an established one can hardly be called symmetric.

In his original paper⁵, Granovetter treated ties as if they were positive and symmetric, but he also noted that “the comprehensive theory might require discussion of negative and/or asymmetric ties”. We follow this suggestion in this study and reject the assumption about the symmetry of social ties, which is omnipresent in the literature on the subject. The validity of this approach can be explained by intuition trained in the field of complex networks. Granovetter argued that “the degree of overlap of two individuals’ friendship networks varies directly with the strength of their tie to one another”. However, from the theory of complex networks, we know that in social networks with a high degree of heterogeneity (e.g., due to scale-free node degree distribution), the sizes of ego-networks of two connected nodes may differ drastically. Therefore, their common neighbours can be a significant part of the neighbourhood of one node and an insignificant part of the neighbourhood of the other, resulting in a completely different perception of the strength of the link on both ends.

In what follows, we show that the above reasoning, which assumes the asymmetry of tie strength, allows for a quantitative validation of Granovetter’s theory in scientific collaboration networks, that have resisted such verification so far. We use the DBLP Computer Science Bibliography dataset, which includes information on nearly five million computer science papers (i.e., their publication dates, lists of authors and citation records) authored by over four million scientists (see “Data availability” section for more details).

Results

In the standard approach to scientific collaboration networks, the nodes represent authors, and an undirected internode connection occurs when two authors have published at least one paper together. When considered as binary networks—without any additional features assigned to nodes and connections—these networks show numerous structural similarities to other SNs (e.g. high clustering, small-world effect, skewed degree distribution and clear community structure; Fig. 1a,b,c)^14,15,16,17. However, when edges are assigned weights representing, for example, the number of joint publications, then, although macroscopic characteristics of scientific collaboration networks (e.g., distributions of connection weights and node strengths; Fig. 1d) still correspond to those observed in typical SNs^7,26, their microscopic structure related to the location of strong and weak ties is completely different. Dense, local neighbourhoods of nodes consist of weak ties, while strong ties act as bridges between local research groups. The atypical properties of scientific collaboration networks have been confirmed in several independent studies^9,11,27.

Specifically, as shown in Ref.¹¹, these unusual weight-topology correlations can be seen by analysing the relationship between the tie strength, $w_{ij}$, of two scientists i and j, and the overlap, $O_{ij}$, of their ego-networks. As indicated by Onnela et al.⁷, the overlap of two connected individuals is the ratio of the number of their common neighbours, $n_{ij}$, to the number of all their neighbours:

$$\begin{aligned} O_{ij}=\frac{n_{ij}}{(k_i-1)+(k_j-1)-n_{ij}}, \end{aligned}$$

(1)

where $k_i$ and $k_{j}$ represent degrees of the considered individuals. In typical SNs^30,31,32,33, the above-defined overlap is an increasing function of the tie strength, $w_{ij}$, while analyses of scientific collaboration networks show something completely different. As can be seen in Fig. 2a, in the studied network of computer scientists, with $w_{ij}$ standing for the number of joint publications³⁴, for the vast majority of connections ($98\%$), the overlap decreases with connection weight. This relationship indicates that weak ties mainly reside inside dense network neighbourhoods, whereas strong ties act as connectors between them. It has been hypothesized that this counterintuitive observation could be attributed to different driving mechanisms of tie formation and reinforcement in scientific collaboration networks in comparison to other social networks¹¹. In what follows, we argue that the observation is related to the definitions of the tie strength and neighbourhood overlap that are not properly suited to the structure of the studied network.

First, let us deal with the definition of the overlap (1) (referred to as symmetric overlap). In Fig. 3a, this local measure is shown in the case of a link connecting nodes with significantly different degrees. In such cases, for $k_i\ll k_j$, Eq. (1) can be simplified to $O_{ij}\simeq n_{ij}/k_j$, which shows that it is strongly biased towards nodes with high degrees, distorting the image of the common neighbourhood as seen from the perspective of nodes with small degrees. This drawback of symmetric overlap gains importance in networks with highly skewed, fat-tailed node degree distributions P(k). In such networks, as brilliantly exploited by the degree-based mean-field theory of complex networks^35,36,37, node degree distributions for nearest neighbours are even more fat-tailed than the original distributions P(k). As a result, the number of edges in such networks connecting nodes with high and low degrees can be very high, leading to an unintended overrepresentation of strongly connected nodes by Eq. (1).

To overcome problems with symmetric overlap, we introduce the concept of asymmetric overlap:

$$\begin{aligned} Q_{ij}=\frac{n_{ij}}{k_i-1}\ne Q_{ji}. \end{aligned}$$

(2)

This can be used to describe the overlap between the neighbourhoods of two connected nodes from the perspective of each node separately. In the context of complex networks, this new definition is free from the shortcomings of the previous one. In particular, it copes well with connected nodes (collaborating scientists) whose degrees (ego-networks) differ significantly—that is, when their common neighbours (if any) are a significant part of the neighbourhood of one node and an insignificant part of the neighbourhood of the other. In such cases, the values of $Q_{ij}$ and $Q_{ji}$ corresponding to the same tie are different (see Fig. 3b,c). The value of $Q_{ij}$ that is close to 1 means that almost all neighbours of i are also neighbours of j. The value of $Q_{ji}$ close to 0 means that only a small part of the neighbourhood of j belongs to the neighbourhood of i.

The concept of asymmetric overlap naturally leads to the idea of directed networks and justifies the introduction of asymmetric tie strength:

$$\begin{aligned} v_{ij}=\frac{w_{ij}}{p_{i}}\ne v_{ji}, \end{aligned}$$

(3)

where $p_i$ stands for the number of all publications of the i-th scientist³⁸. The intuitive rationale behind Eq. (3) is as follows: For a young scientist, with a small number of publications, each publication makes a significant contribution to his or her publication output, just as each co-author is an important part of his or her research environment (cf. Eqs. (2) and (3)). However, the importance of each publication and collaboration from the perspective of an established scientist with a large number of publications and an extensive network of collaborators is completely different. Depending on the circumstances, a given number of joint publications (e.g., $w_{ij}=1$) may have a completely different meaning.

In Fig. 2b, the dependence of asymmetric overlap on asymmetric tie strength for the considered network of computer scientists is shown. Contrary to what can be seen in Fig. 2a, the relationship $Q_{ij}(v_{ij})$ is increasing in the entire range of variability of its parameters. The result indicates that, from the point of view of a single scientist (ego-network approach), strong ties mainly constitute dense local clusters, whereas weak ties connect these clusters or play the role of intermediary ties¹⁰. The observation clearly confirms the validity of Granovetter’s first hypothesis in scientific collaboration networks.

Now, using the concept of asymmetric tie strength, we will discuss Granovetter’s second hypothesis, which postulates that although weak ties do not carry as much communication as strong ties do, they often act as bridges, providing novel, non-redundant information, which guarantees weakly connected nodes generally understood social success.

In scientific collaboration networks, the validity of Granovetter’s second hypothesis has never been tested. Nevertheless, it is widely believed (see³⁹ and references therein) that information and expertise at the disposal of tightly connected research groups are often redundant, resulting in less creative collaborations and less innovative publications, while intergroup collaborations that bridge the so-called structural holes^40,41,42 can provide access to information and resources beyond those available in densely connected communities, thus leading to novel ideas and valuable publications. To quantitatively address these issues, we check whether the bibliometric indexes of scientists and publications are correlated with the tie strength of the scientific collaboration network. Specifically, we focus on two questions: (i) How does the researcher’s h-index depend on the structure of his/her local collaboration network? (ii) How does the strength of the ties between scientists influence the success of their joint publication?

To answer the first question, we examined how the h-index^43,44 of a scientist depends on his or her average asymmetric tie strength (see Fig. 4):

$$\begin{aligned} \langle v_i\rangle =\frac{1}{k_i}\sum _{j}v_{ij}. \end{aligned}$$

(4)

Equation (4) quantitatively measures the tendency of scientists to keep collaborating with the same people (cf. the concept of social inertia^45,46). Figure 5a shows that the averaged (over all scientists who have a similar average tie strength) h-index decreases with $\langle v_i\rangle $. It means that successful (double-digit h-index) scientists have significantly weaker ties than less successful (single-digit h-index) researchers. The result is consistent with Granovetter’s general understanding of the role of weak and strong ties. However, since some doubts may arise from the fact that the data presented in Fig. 5a are averaged over many different scientists (having a small and large number of all publications, with a small and very extensive network of collaborators), in Fig. 6, we demonstrate that the decreasing nature of the relationship between the h-index and tie strength is independent of the choice of a group of scientists. That is, it still decreases, even in very homogeneous (in terms of scientific achievements) groups of researchers. In particular, as one can see in the small graphs accompanying the colour histogram that represents the available scientists’ samples, of any two researchers who have the same number of publications and/or co-authors, the one with weaker ties tends to have the higher h-index. In a way, this suggests that being a good manager and skilfully planning one’s network of scientific contacts ensures success⁴⁷. This conclusion, however alarming as it may seem, finds its basis in the theory of social networks—the already mentioned concept of Burt’s structural holes and social capital^40,41.

The role of weak ties in scientific success is even more apparent in relation to scientific publications. Figure 5b shows how the number of citations of a scientific paper depends on the asymmetric tie strength (averaged over all co-authors of each article). The decreasing nature of this relationship indicates that publications created by teams of scientists linked by weak ties are better cited than those that arise in teams with strong ties. In Fig. 7, by analysing more homogeneous samples of publications (published in the same year and/or by the same number of co-authors), we clearly confirm the validity of the above finding. Furthermore, although the number of citations does not always translate into the quality of the research presented, it is undoubtedly a measure of the commercial success of a publication and a specific measure of the knowledge diffusion in scientific collaboration networks.

Discussion and concluding remarks

The purpose of this work is to thoroughly verify Granovetter’s weak-tie theory of social networks. As clearly stated in the abstract and in the introduction: Granovetter’s theory is based on two hypotheses that assign different roles to interpersonal, information-carrying connections. Not all those who deal with the Granovetter’s theory pay attention to this distinction, which is undoubtedly crucial. The first hypothesis states that strong ties carrying the majority of interaction events usually correspond to intra-group connections. The second hypothesis maintains that weak inter-group ties, although less active, are of particular importance for the exchange of relevant information. A review of the literature reveals a striking disproportion between the research on the two hypotheses. In fact, the vast majority of empirical research to date has dealt with the first hypothesis, completely ignoring and sometimes not fully correctly interpreting the second one. In this respect our work is unique, because we confirm Granovetter’s weak tie theory in its full spectrum. And although in the absence of other studies, the analysis of the second hypothesis may seem to be the most important result of this work, our research on the verification of the first hypothesis also deserves attention as it highlights some important (and sometimes questionable or not entirely correct) threads in previous studies.

In particular, using massive datasets, clear empirical evidence for the first hypothesis, supported by the positive correlation between the symmetric overlap and tie strength, $O_{ij}(w_{ij})$, were reported in: mobile communication networks^7,30, multiplayer online games^31,32, and dialogues-based online SN³³. On the other hand, the above mentioned methodology, exploiting symmetric network measures, failed in the analysis of scientific collaboration networks^9,11,27, incorrectly classifying them as contradicting Granovetter’s theory. In this paper, we identify the reason why scientific collaboration networks behave differently than other SN. We argue that the U-shaped relation between $w_{ij}$ and $O_{ij}$ observed in coauthorship networks (see Fig. 2a) is related to the definitions of tie strength $w_{ij}$ and neighbourhood overlap $O_{ij}$ that are not properly suited to networks with scale-free node degree distributions. In any of the networks that were considered in Refs.^{7,30,31,32,33} this problem did not exist, because these networks were not truly scale-free (e.g. in mobile communication networks $P(k)\sim k^{-\gamma }$, with $\gamma =8.4$).

In this paper, to overcome the aforementioned issue, we have paid attention to the role of asymmetry in social ties. We have introduced new measures: asymmetric overlap $Q_{ij}$ and asymmetric tie strength $v_{ij}$, which not only allowed the successful verification of the first Granovetter’s hypothesis in scientific collaboration networks (see Fig. 2b), but have also opened the possibility to verify the second hypothesis. Moreover, as for the second hypothesis, which involves concepts related to the nature and importance of information, coauthorship networks have proved to be an extremely accurate choice, because: (i) connections (ties) between network nodes (scientists) are well defined, and their weight (strength of ties) is easy to measure (e.g., through joint publications); (ii) scientific publications themselves are also a specific proxy of information flow in the studied network (diffusion of innovations); and (iii) the number of citations is an obvious measure of their significance.

To be concrete, with regard to the second Granovetter’s hypothesis our results quantify what most scientists know very well: Scientific success is strongly correlated with the structure of a scientist’s collaboration network. We have explicitly shown that publications created by teams of scientists with weak ties are better cited than those that arise in teams with strong ties. And although this result was to be expected, it may be surprising that the differences in the number of citations of works created by weakly tied research groups compared to strongly tied groups amount not to a few or a dozen, but several hundred percent (see Fig. 7). Of course, when looking at these results quantitatively, one should bear in mind the limitations of the DBLP database used for the study. The database covers publications from computer science and includes publications from hybrid fields, where they are considered pertinent to computer science research. Papers from other disciplines are present there only occasionally. It means that super weak inter-domain ties are not covered by our analysis and the differences presented in Figs. 6 and 7 may be underestimated. On the other hand, computer science is quite heterogeneous due to the presence of many subfields, with very different norms in terms of team size and citation standards. Therefore, the results presented in Figs. 6 and 7 are aggregated over different subfields. Keeping above in mind, using more comprehensive database (e.g. Scopus or Web of Science), for the analysis reported in this study, can act as a double-edged sword. It would solve the first problem, but aggravate the second one. In this sense, our choice of the source of data seems to be a golden middle way.

Finally, an important research direction that was not undertaken in this paper, although it directly refers results reported here, is the issue of two recently discovered empirical scaling laws for social networks which relate link weight $w_{ij}$, symmetric overlap $O_{ij}$, and link betweenness centrality⁴⁸ $b_{ij}$ in a non-linear way: $O_{ij}\propto \root 3 \of {w_{ij}}$ and $O_{ij}\propto 1/\sqrt{b_{ij}}$. Several studies (see e.g.^31,32,33) have confirmed universality of these “social laws”. As we have already shown (cf. Fig. 2a and the corresponding figures in^9,11,27), the first of these scaling laws—relating tie strength to the cube of the symmetric overlap—is not fulfilled in coauthorship networks. We have also checked that the same conclusion holds true for the second relation—expressing edge betweenness centrality as the inverse square of the overlap. In our case, the relationship $O_{ij}(b_{ij})$ is non-monotonic (non-increasing for small and intermediate values of betweenness and increasing for its large values, see Fig. S1 in Supplementary Information). Along these lines, we have also checked whether there is a clear correlation between tie strength and betweenness centrality and we have found no apparent dependency (see Fig. S2 in SI).

The additional analysis mentioned above provoke interesting research questions. The most controversial is whether the correlation between link betweenness centrality and symmetric overlap brings any relevant information about dynamical properties of social networks. In particular, whether the negative correlation between these measures provides quantitative evidence for the Granovetter’s theory. A kind of argument that supports these objections is that if we shuffle edge weights in a social network without changing the structure of its binary connections, then the weak ties hypothesis will surely cease to work, although the mentioned correlations will remain unchanged. Perhaps this argument could be refuted by using a kind of weighted/directed edge betweenness centrality, which, in combination with the asymmetric overlap $Q_{ij}$ introduced in this work, would allow for the formulation of more general laws of social dynamics than those formulated in³¹. An interesting way to overcome this problem has been proposed in⁴⁹, where the authors pointed out that classical betweenness centrality is not useful to measure the influence of a team that is composed of more than two people⁵⁰. Instead of this, a weighted hypergraph representation of the coauthorship network with higher-order interactions has been introduced and betweenness centrality measure has been adequately adapted to this new structure. In order to pursue studies on the role of weak ties in this direction, a new kind of overlap measure in hypergraphs has to be devised which itself seems to be challenging. The above considerations can be a starting point for interesting, new research on social networks.

Data availability

The research presented in this paper is based on the publicly and freely available Citation Network Dataset⁵¹. We used the 12th version of the dataset (DBLP-Citation-network V12) which contains detailed information (i.e., year of publication, journal, number of citations, references, list of authors) and approximately 5 million articles published mostly during the last 20 years.

It is important to note that our analysis is limited to the largest connected component (LCC) in the co-authorship network, which can be recreated using the dataset. LCC comprises of close to three million nodes (authors), which means it spans 65% of the entire network. These nodes are connected by more than 13 million bi-directional co-authorship edges.

While the dataset provides exhaustive information about published papers, it does not directly contain any bibliometric information about authors. However, it is possible to calculate various bibliometric indicators either by recreating the network of citations or by directly using article metadata available in the dataset for each article (such as the number of citations). In order to calculate the h-index for all authors in the LCC, we decided to rely on the latter method and use article metadata to determine the number of citations. Considering that the citation network recreated from the dataset is only a sample of the full citation network, this method is more reliable. The number of citations calculated by counting links in the citation network is, in general, underestimated when compared with the number of citations available in the article’s metadata.

Code availability

The code that supports the findings of this study is available from the corresponding author upon request.

References

Wasserman, S. & Faust, K. Social Network Analysis: Methods and Applications (Cambridge University Press, 1994).
Book MATH Google Scholar
Conte, R. et al. Manifesto of computational social science. Eur. Phys. J. Spec. Top. 214, 325–346 (2012).
Article Google Scholar
Lazer, D. M. J. et al. Computational social science: Obstacles and opportunities. Science 369, 1060–1062 (2020).
Article ADS CAS PubMed Google Scholar
Giles, J. Computational social science: Making the links. Nature 488, 448–450 (2012).
Article ADS CAS PubMed Google Scholar
Granovetter, M. The strength of weak ties. Am. J. Sociol. 78, 1360–1380 (1973).
Article Google Scholar
Granovetter, M. Getting a Job: A Study of Contacts and Careers 2nd edn. (University of Chicago Press, 1995).
Book Google Scholar
Onnela, J.-P. et al. Structure and tie strengths in mobile communication networks. Proc. Natl Acad. Sci. U. S. A. 104, 7332–7336 (2007).
Article ADS CAS PubMed PubMed Central Google Scholar
Eagle, N., Macy, M. & Claxton, R. Network diversity and economic development. Science 328, 1029–1031 (2010).
Article ADS MathSciNet CAS PubMed MATH Google Scholar
Pajevic, S. & Plenz, D. The organization of strong links in complex networks. Nat. Phys. 8, 429–436 (2012).
Article CAS PubMed PubMed Central Google Scholar
Grabowicz, P. A., Ramasco, J. J., Moro, E., Pujol, J. M. & Eguiluz, V. M. Social features of online networks: The strength of intermediary ties in online social media. PLoS ONE 7, e29358 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Pan, R. K. & Saramäki, J. The strength of strong ties in scientific collaboration networks. EPL 97, 18007 (2012).
Article ADS Google Scholar
Aral, S. The future of weak ties. Am. J. Sociol. 121, 1931–1939 (2016).
Article Google Scholar
Gee, L. K., Jones, J. & Burke, M. Social networks and labor markets: How strong ties relate to job finding on Facebook’s social network. J. Labor Econ. 35, 485–518 (2017).
Article Google Scholar
Newman, M. E. J. The structure of scientific collaboration networks. Proc. Natl Acad. Sci. U. S. A. 98, 404–409 (2001).
Article ADS MathSciNet CAS PubMed PubMed Central MATH Google Scholar
Newman, M. E. J. Scientific collaboration networks. I. Network construction and fundamental results. Phys. Rev. E 64, 016131 (2001).
Article ADS CAS Google Scholar
Newman, M. E. J. Scientific collaboration networks. II. Shortest paths, weighted networks, and centrality. Phys. Rev. E 64, 016132 (2001).
Article ADS CAS Google Scholar
Girvan, M. & Newman, M. E. J. Community structure in social and biological networks. Proc. Natl. Acad. Sci. U. S. A. 99, 7821–7826 (2002).
Article ADS MathSciNet CAS PubMed PubMed Central MATH Google Scholar
Barabási, A. L. et al. Evolution of the social network of scientific collaborations. Phys. A Stat. Mech. Appl. 311, 590–614 (2002).
Article MathSciNet MATH Google Scholar
In complex networks, the Granovetter’s concept of tie strength corresponds to edge weight, while the concept of strength refers to the network nodes and is defined as the total weight of their connections [20]. Due to historical reasons, in this paper the notions of: tie strength and edge weight are treated as equivalent and used interchangeably.
Rogers, E. M. Diffusion of Innovations 5th edn. (Simon & Schuster, 2003).
Google Scholar
Newman, M. E. J. Networks: An Introduction (Oxford University Press, 2010).
Book MATH Google Scholar
Heider, F. Attitudes and cognitive organization. J. Psychol. 21, 107–112 (1946).
Article CAS PubMed Google Scholar
Cartwright, D. & Harary, F. Structure balance: A generalization of Heider’s theory. Psychol. Rev. 63, 277–293 (1956).
Article CAS PubMed Google Scholar
Szell, M., Lambiotte, R. & Thurner, S. Multirelational organization of large-scale social networks in an online world. Proc. Natl. Acad. Sci. U. S. A. 107, 13636–13641 (2010).
Article ADS CAS PubMed PubMed Central Google Scholar
Gligorijević, V., Skowron, M. & Tadić, B. Structure and stability of online chat networks built on emotion-carrying links. Physica A 392, 538 (2013).
Article ADS Google Scholar
Barrat, A., Barthélemy, M., Pastor-Satorras, R. & Vespignani, A. The architecture of complex weighted networks. Proc. Natl. Acad. Sci. U. S. A. 101, 3747–3752 (2004).
Article ADS CAS PubMed PubMed Central MATH Google Scholar
Ke, Q. & Ahn, Y.-Y. Tie strength distribution in scientific collaboration networks. Phys. Rev. E 90, 032804 (2014).
Article ADS Google Scholar
Freeman, T. C. et al. Graphia: A Platform for the Graph-Based Visualisation and Analysis of Complex Data (Cold Spring Harbor Laboratory, 2020).
Google Scholar
Blondel, V. D., Guillaume, J.-L., Lambiotte, R. & Lefebvre, E. Fast unfolding of communities in large networks. J. Stat. Mech. Theory Exp. 2008, P10008 (2008).
Article MATH Google Scholar
Onnela, J.-P. et al. Analysis of a large-scale weighted network of one-to-one human communication. New J. Phys. 9, 179 (2007).
Article ADS Google Scholar
Szell, M. & Thurner, S. Measuring social dynamics in a massive multiplayer online game. Soc. Netw. 32, 313–329 (2010).
Article Google Scholar
Szell, M. & Thurner, S. Social dynamics in a large-scale online game. Adv. Complex Syst. 15, 1250064 (2012).
Article MathSciNet Google Scholar
Šuvakov, M., Mitrović, M., Gligorijević, V. & Tadić, B. How the online social networks are used: Dialogues-based structure of MySpace. J. R. Soc. Interface 10, 20120819 (2013).
Article PubMed PubMed Central Google Scholar
It should be noted that the number of joint publications, which corresponds to the number of times a collaboration between two scientists has been repeated, is not the only possible choice for the tie strength. For example, in Refs. [11, 29, 30] the formula introduced by Newman [16] is used: $w_{ij}=\sum _p\frac{1}{n_p-1}$, where $p$ is the set of papers co-authored by $n_p$ scientists, including $i$ and $j$. The motivation behind the Newman’s formula is that an author divides his/her time and other resources between $n_p-1$ collaborators, and thus the strength of the connection should vary inversely with $n_p-1$. However, in comparison to the definition we use: $w_{ij}=\sum _p 1$, Newman’s formula does not take into account synergy effects of working in a group, nor the effect of social inertia [36, 37] that measures the tendency of scientists to keep on collaborating with previous partners, which seem important in the context of scientific collaboration networks.
Dorogovtsev, S. N., Goltsev, A. V. & Mendes, J. F. F. Critical phenomena in complex networks. Eur. Phys. J. Spec. Top. 143, 47–50 (2007).
Google Scholar
Barrat, A., Barthélemy, M. & Vespignani, A. Dynamical Processes on Complex Networks (Cambridge University Press, 2008).
Book MATH Google Scholar
Pastor-Satorras, R., Castellano, C., Van Mieghem, P. & Vespignani, A. Epidemic processes in complex networks. Rev. Mod. Phys. 87, 925 (2015).
Article ADS MathSciNet Google Scholar
Note that the number of publications does not have to be equal to the strength of the node: $p_i\ne s_i=\sum _j w_{ij}$. It results from the definition of symmetric tie strength $w_{ij}$ adopted in this publication, which we commented on in the [35].
Wang, J. Knowledge creation in collaboration networks: Effects of tie configuration. Res. Policy 45, 68–80 (2016).
Article Google Scholar
Burt, R. Structural Holes: The Social Structure of Competition (Harvard University Press, 1992).
Book Google Scholar
Burt, R. Structural holes and good ideas. Am. J. Sociol. 110, 349–399 (2004).
Article Google Scholar
Goyal, S. & Vega-Redondo, F. Structural holes in social networks. J. Econ. Theory 137, 460–492 (2007).
Article MathSciNet MATH Google Scholar
Hirsch, J. E. An index to quantify an individual’s scientific research output. Proc. Natl. Acad. Sci. U. S. A. 102, 16569–16572 (2005).
Article ADS CAS PubMed PubMed Central MATH Google Scholar
Dorogovtsev, S. & Mendes, J. Ranking scientists. Nat. Phys. 11, 882–883 (2015).
Article CAS Google Scholar
Ramasco, J. J. & Morris, S. A. Social inertia in collaboration networks. Phys. Rev. E 73, 016122 (2006).
Article ADS Google Scholar
Ramasco, J. J. Social inertia and diversity in collaboration networks. Eur. Phys. J. Spec. Top. 143, 47–50 (2007).
Article Google Scholar
Petersen, A. M. Quantifying the impact of weak, strong, and super ties in scientific careers. Proc. Natl. Acad. Sci. U. S. A. 112, E4671–E4680 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
The link betweenness $b_{ij}$ is a measure of centrality within a connected graph that quantifies how many shortest paths pass through a given link [50].
Lee, J., Lee, Y., Oh, S. M. & Kahnga, B. Betweenness centrality of teams in social networks. Chaos 31, 061108 (2021).
Article ADS MathSciNet PubMed Google Scholar
Milojević, S. Towards a more realistic citation model: The key role of research team sizes. Entropy 22, 875 (2020).
Article ADS PubMed Central Google Scholar
Tang, J. et al. ArnetMiner: Extraction and mining of academic social networks. In Proceedings of the Fourteenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD’2008) 990–998 (2008).

Download references

Acknowledgements

Research was funded by (POB Cybersecurity and Data Science) of Warsaw University of Technology within the Excellence Initiative: Research University (IDUB) programme.

Author information

Authors and Affiliations

Faculty of Physics, Warsaw University of Technology, Koszykowa 75, 00-662, Warsaw, Poland
Agata Fronczak, Maciej J. Mrowinski & Piotr Fronczak

Authors

Agata Fronczak
View author publications
You can also search for this author in PubMed Google Scholar
Maciej J. Mrowinski
View author publications
You can also search for this author in PubMed Google Scholar
Piotr Fronczak
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.F. conceived and planned the study, wrote the manuscript, M.M. performed numerical analysis, all authors analysed the results and reviewed the manuscript.

Corresponding author

Correspondence to Agata Fronczak.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Fronczak, A., Mrowinski, M.J. & Fronczak, P. Scientific success from the perspective of the strength of weak ties. Sci Rep 12, 5074 (2022). https://doi.org/10.1038/s41598-022-09118-8

Download citation

Received: 07 October 2021
Accepted: 25 January 2022
Published: 24 March 2022
DOI: https://doi.org/10.1038/s41598-022-09118-8

This article is cited by

Interplay between tie strength and neighbourhood topology in complex networks
- Maciej J. Mrowinski
- Kamil P. Orzechowski
- Piotr Fronczak
Scientific Reports (2024)
Scaling theory of fractal complex networks
- Agata Fronczak
- Piotr Fronczak
- Maciej J. Mrowinski
Scientific Reports (2024)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.