Bringing the lab back in: Personnel composition and scientific output at the MIT Department of Biology☆
Introduction
“… if the scientists we shadow go inside laboratories, then we too have to go there, no matter how difficult the journey.” (Latour, 1988, p. 63)
The past two decades have witnessed an unprecedented advancement in a researcher's ability to collect and analyze large datasets. An exponential rise in computer storage and power, coupled with ready access to an ever increasing array of online data sources has enabled researchers to analyze datasets numbering in the millions of data points. Large-scale patent data have been used to study knowledge spillovers (Jaffe et al., 1993, Audretsch and Feldman, 1996, Breschi and Lissoni, 2001), inventor mobility (Marx et al., 2009, Singh and Agrawal, 2011), and inventor networks (Fleming and Sorenson, 2004), to name a few recent examples. More recently, a parallel easing of access to data on academic publications (e.g., Azoulay et al., 2006) has enabled the study of collaborative teams (Wuchty et al., 2007) and spillovers across individuals (Azoulay et al., 2010). Whether using patent or publication data, large-scale datasets allow the documentation of temporal trends across multiple fields, as well as the discovery of exogenous variation or the use of matched samples to aid in causal inference. Lastly, these empirical changes have been particularly pertinent for scholarship focused on the innovation economy, where the highly skewed distribution of productive individuals has been recognized for some time (Lotka, 1926). In short, the ability to access large datasets has engendered a revolution in the social studies of innovation, changing what questions social scientists might ask, as well as the way in which these new questions might be answered.
Despite the incontrovertible advantages of larger datasets (we doubt that any scholars would argue for fewer, rather than more data points), we fear that during this shift in the size and scope of data, critical supporting structures that underpin scientific productivity – in the context of this paper, the scientific laboratory – have fallen by the wayside. This oversight is remarkable given the central role of the laboratory in foundational studies ranging from the social construction of technology (Latour and Woolgar, 1979), mentorship and training (Zuckerman, 1977, Dasgupta and David, 1994), organizational structure and boundary spanning (Allen, 1984), as well as the coordination of innovative activities (Pelz and Andrews, 1976). More recently, burgeoning literatures on differences across scientists (Roach and Sauermann, 2010, Pezzoni et al., 2012) and incentive structures within firms (Cockburn et al., 1999, Liu and Stuart, 2014), as well as scientific careers and differences between graduate and postdoctoral stages (Stephan and Levin, 1992, Azoulay et al., 2009) reinforce the notion that there is considerable heterogeneity across scientists. As Stephan (2012) notes in her recent book, “Collaboration in science often occurs in a lab. The lab environment not only facilitates the exchange of ideas. It also encourages specialization…” (p. 67).
This paper's central goal is not to overturn laudable advances in data collection and analysis, but to urge greater attention to the study of scientific laboratories. As one avenue of motivation, in this paper we examine the personnel composition within laboratories. Specifically, we focus on laboratory members of varying characteristics and link changes in the number of these personnel types to the laboratory's scientific output. In particular, we ask the following set of related questions. First, in the biological sciences, how have laboratory personnel compositions changed as this field has grown in prominence through the twentieth century? Second, to what extent do laboratory members with different scientific experience (e.g., graduate students vs. postdocs), funding, or position (e.g., trainee vs. technician) affect the laboratory's scientific output? And in our examination of the laboratory's personnel composition, what might we learn about how different types of personnel members affect incremental versus breakthrough publications?
To answer these questions, we examine the laboratory compositions and scientific outputs for one elite set of scientists: principal investigators (PIs) running laboratories at the MIT Department of Biology. Using a complete personnel roster, we document an increase in the prevalence of postdoctoral scientists for the period 1966–2000, while the number of graduate students and technicians remained largely constant. Consistent with prior research, our analysis suggests that personnel are a critical determinant of laboratory productivity: larger laboratories have more publication outputs. Moreover, we find that experienced scientists (i.e., postdocs), particularly those with external funding (i.e., postdocs with fellowships), make greater contributions to the laboratory's publication outcomes, suggesting that both experience and funding are critical determinants of laboratory productivity.
However, when we focus solely on high-profile publications (i.e., publications in Science, Nature, or Cell), we present three unexpected findings. First, graduate students, who make only nominal contributions to overall publication counts, contribute as much to breakthroughs as postdocs with external funding. Second, postdocs without fellowships have no observable impact on breakthrough publications. In a final intriguing finding, technicians are instrumental to high-profile publications, but have no observable impact on lower impact publication output.
These results speak to the importance of composition, not just size, in a laboratory manager's consideration of potential laboratory members. Although larger laboratories result in more publications, only a subset of these personnel types appears to contribute to breakthrough publications. Moreover, our results have implications for the use of large-scale bibliometric data to study productivity. Our results suggest that the sole use of publication author lists to construct laboratory size may lead to severe biases in estimating a laboratory's productive resources. Coupled with recent, exponential advances in the collection of large datasets, our findings on laboratory composition provide motivation to revisit these ubiquitous social groups. We believe that the time is ripe for a large-scale examination of scientific laboratories.
This paper proceeds as follows. Section 2 reviews the literature on laboratories and their importance to knowledge production, as well as posing our research questions. In Section 3, we describe our setting and data, and Section 4 describes our measures and empirical strategy. Section 5 presents our findings. A final section concludes and discusses the implications of our findings for the current trend toward large bibliometric datasets.
Section snippets
History of industrial and academic laboratories
Across both commerce and the academe, laboratories are central organizational structures in the production of knowledge. Laboratories enable the division of scientific labor (e.g., Jones, 2009), serve as repositories of scientific materials (Furman and Stern, 2011), and transmit tacit knowledge to scientific novitiates (Latour, 1988), to name a few roles among many. Historically, laboratories have been physical spaces that serve both to separate potentially dangerous chemicals and reagents away
Setting and data
To address these topics, this paper presents a quantitative case study examining a dataset comprised of laboratories at the Massachusetts Institute of Technology (MIT) Department of Biology between 1966 and 2000. Although focused on one scientific department at a specific university, this setting has a number of advantages. First, it is an elite biology department that has consistently contributed to scientific breakthroughs since the 1960s. Mirroring other studies that focus on scientific
Publication outcomes
In this paper, we link a laboratory's personnel composition to its publication output. To do so, we examined two dependent variables. The first variable is simply a laboratory's yearly number of publications. An alternative measure of overall productivity, the impact-factor weighted publication count, did not affect this set of results.
A second set of regressions examines a laboratory's likelihood of achieving a “breakthrough” discovery. Specifically, we chose to focus on publications in
Results
We begin our results section with a description of the dataset (Table 1). Overall, our dataset includes 1482 laboratory-year observations, and 20,324 laboratory member-years that span 1966–2000.8 Within this dataset, there are 119 principal investigators and 5694 laboratory members, which include 1798 postdocs with fellowships, 1328 postdocs without fellowships, 1395 graduate
Discussion and conclusion
This study links the number of personnel types with varying experience, external funding, and positions to a laboratory's publication output. We suggest that for incremental publications, postdocs (i.e., those with experience), regardless of their funding level, dominate. However, for breakthrough publications, graduate students and postdocs with external funding make equally significant contributions. By contrast, postdocs without fellowships do not correlate with breakthroughs. Lastly, we
References (85)
- et al.
The anchor tenant hypothesis: exploring the role of large, local, R&D-intensive firms in regional innovation systems
Int. J. Ind. Organ.
(2003) - et al.
Entrepreneurial innovation: the importance of context
Res. Policy
(2014) - et al.
Publication Harvester: an open-source software tool for science policy research
Res. Policy
(2006) - et al.
Networks of inventors and the role of academia: an exploration of Italian patent data
Res. Policy
(2004) - et al.
The mechanisms of collaboration in inventive teams: composition, social networks, and geography
Res. Policy
(2011) From divergent meanings to common practices: the early institutionalization of technology transfer in the life scences at Stanford University
Res. Policy
(2007)- et al.
Today's Edisons or weekend hobbyists: technical merit and success of inventions by independent inventors
Res. Policy
(2004) - et al.
Academic science and the birth of industrial research laboratories in the U.S. pharmaceutical industry
J. Econ. Behav. Organ.
(2007) - et al.
Errors in variables in panel data
J. Econom.
(1986) - et al.
Proximity effects on the dynamics and outcomes of scientific collaborations
Res. Policy
(2014)
Positions and rewards: the allocation of resources within a science-based entrepreneurial firm
Res. Policy
Academic research and industrial innovation: an update of empirical findings
Res. Policy
The role of academic inventors in entrepreneurial firms: sharing the laboratory life
Res. Policy
Career progress in centralized academic systems: social capital and institutions in France and Italy
Res. Policy
A taste for science? PhD scientists’ academic orientation and self-selection into research careers in industry
Res. Policy
Collaboration, Stars, and the Changing Organiation of Science: Evidence From Evolutionary Biology. NBER Working Paper Series #19653
Managing the Flow of Technology
R&D spillovers and the geography of innovation and production
Am. Econ. Rev.
Superstar extinction
Q. J. Econ.
Social Influence Given (Partially) Deliberate Matching: Career Imprints in the Creation of Academic Entrepreneurs. Harvard Business School Working Paper 09-136
Matthew: effect or fable?
Manag. Sci.
Technicians in the workplace: ethnographic evidence for bringing work into organizational studies
Adm. Sci. Q.
In the backrooms of science: the work of technicians in science labs
Work Occup.
Knowledge spillovers and local innovation systems: a critical survey
Ind. Corpor. Change
Microgeography and the Direction of Inventive Activity. Working Paper
Absorptive capacity, coauthoring behavior, and the organization of research in drug discovery
J. Ind. Econ.
Balancing Incentives: The Tension Between Basic and Applied Research. NBER Working Paper Series w6882.
Absorptive capacity: a new perspective on learning and innovation
Adm. Sci. Q.
The (changing) knowledge production function: evidence from the MIT Department of Biology 1966–2000
Knowledge specialization in PhD student groups
IEEE Trans. Eng. Manage.
A Behavioral Theory of the Firm
Toward a new ecoomics of science
Res. Policy
The locational dynamics of the US biotech industry: knowledge externalities and the anchor hypothesis
Ind. Innov.
Initial advantage: the origis of the geographic concentration of the pharmaceutical industry in the mid-Atlantic region
Ind. Corp. Change
Science as a map in technological search
Strateg. Manag. J.
Competition and careers in biosciences
Science
Crafting Science: A Sociohistory of the Quest for the Genetics of Cancer
Climbing atop the shoulders of giants: the impact of institutions on cumulative research
Am. Econ. Rev.
Scale, scope, and spillovers: the determinants of research productivity in drug discovery
RAND J. Econ.
Real effects of academic research
Am. Econ. Rev.
Geographic localization of knowledge spillovers as evidenced by patent citations
Q. J. Econ.
The burden of knowledge and the “death of the renaissance man”: is innovation getting harder?
Rev. Econ. Stud.
Cited by (52)
Gender, race-ethnicity and postdoctoral hiring in STEMM fields
2023, Social Science ResearchThe interdisciplinarity dilemma: Public versus private interests
2022, Research PolicyAssessing differences between university and federal laboratory postdoctoral scientists in technology transfer
2022, Research PolicyCitation Excerpt :Some even reported a 300% increase since 1990 (Biancani et al., 2018). Not surprisingly, postdocs now account for a large portion of research publications and are aligned with external funding that affords labs high-profile, breakthrough innovations (Conti and Liu, 2015). Despite the growing importance of research labs, postdocs have experienced dwindling career prospects in academia.
Training across the academy: The impact of R&D funding on graduate students
2021, Research PolicyCitation Excerpt :We find positive results on the outcome – Cumulative Research Output (0.722 SE 0.135).14 We also estimate the weighted productivity metrics based on annual, rather than cumulative, publication activity (Rosenbloom et al., 2015; Conti & Liu, 2015). Operationally, we estimate the three-year average of author-weighted annual publications (col. 2) and three-year average of publication-weighted annual citations (col. 3).
Unravelling supplier-laboratory knowledge spillovers: Evidence from Toyota's central R&D laboratory and subsidiary R&D centers
2021, Research PolicyCitation Excerpt :In particular, it has been established that knowledge spillovers between firms can facilitate innovations, raise productivity, increase organizational performance, and spur economic development (Jaffe et al., 1993; Audretsch and Feldman, 1996, Jaffe and Rassenfosse, 2017; Audretsch and Belitski, 2020). Although the majority of research focuses on knowledge spillovers between inventors, firms, universities, regions, and countries (Jaffe and Rassenfosse, 2017), a growing number of studies are examining the frequency of knowledge spillovers that are associated with public R&D laboratories (Venturini et al., 2019) and university R&D laboratories (Conti and Liu, 2015). However, research has yet to study the frequency of knowledge spillovers that occur between an industrial R&D laboratory and its suppliers, which represents a significant gap in our understanding of how managers and policy makers can facilitate the flow of knowledge during the innovation process.
- ☆
Authorship is alphabetical. The authors give special thanks to the MIT Department of Biology for access to archival materials. We also thank Paula Stephan, Pierre Azoulay, Nico Lacetera, Jacques Mairesse, Fabiana Visentin, editor Martin Kenney and two anonymous reviewers, as well as participants at the NBER Changing Frontier Conference and the NSF Workshop on Science of Science Policy for their thoughtful comments. Chris Liu is thankful to the Kauffman Foundation for Dissertation Fellowship funding and support. The usual disclaimers apply. Direct all correspondence to Chris Liu at [email protected].