Elsevier

Research Policy

Volume 44, Issue 9, November 2015, Pages 1633-1644
Research Policy

Bringing the lab back in: Personnel composition and scientific output at the MIT Department of Biology

https://doi.org/10.1016/j.respol.2015.01.001Get rights and content

Highlights

  • We introduce a novel dataset on MIT Biology labs from 1966 to 2000.

  • We document trends in laboratory composition over more than three decades.

  • We document that postdocs (vs. grad students or technicians) account for the majority of laboratory publication output.

  • Relative to postdocs, grad students and technicians are equally important for breakthrough publications.

  • We propose a methodology to reconstruct laboratories from bibliometric data.

Abstract

We study the link between a laboratory's personnel composition, its number and types of graduate students and postdocs, and the laboratory's productive output. Building upon a fine-grained dataset with full personnel lists from the MIT Department of Biology from 1966–2000, we find that while postdocs account for the large majority of publication outputs, graduate students and postdocs with external funding contribute equally to breakthrough publications. Moreover, technicians are key contributors to breakthrough publications, but not to overall productivity. Taken together, this study contributes to our understanding of knowledge work, as well as reinforcing the importance of a laboratory's personnel composition.

Introduction

“… if the scientists we shadow go inside laboratories, then we too have to go there, no matter how difficult the journey.” (Latour, 1988, p. 63)

The past two decades have witnessed an unprecedented advancement in a researcher's ability to collect and analyze large datasets. An exponential rise in computer storage and power, coupled with ready access to an ever increasing array of online data sources has enabled researchers to analyze datasets numbering in the millions of data points. Large-scale patent data have been used to study knowledge spillovers (Jaffe et al., 1993, Audretsch and Feldman, 1996, Breschi and Lissoni, 2001), inventor mobility (Marx et al., 2009, Singh and Agrawal, 2011), and inventor networks (Fleming and Sorenson, 2004), to name a few recent examples. More recently, a parallel easing of access to data on academic publications (e.g., Azoulay et al., 2006) has enabled the study of collaborative teams (Wuchty et al., 2007) and spillovers across individuals (Azoulay et al., 2010). Whether using patent or publication data, large-scale datasets allow the documentation of temporal trends across multiple fields, as well as the discovery of exogenous variation or the use of matched samples to aid in causal inference. Lastly, these empirical changes have been particularly pertinent for scholarship focused on the innovation economy, where the highly skewed distribution of productive individuals has been recognized for some time (Lotka, 1926). In short, the ability to access large datasets has engendered a revolution in the social studies of innovation, changing what questions social scientists might ask, as well as the way in which these new questions might be answered.

Despite the incontrovertible advantages of larger datasets (we doubt that any scholars would argue for fewer, rather than more data points), we fear that during this shift in the size and scope of data, critical supporting structures that underpin scientific productivity – in the context of this paper, the scientific laboratory – have fallen by the wayside. This oversight is remarkable given the central role of the laboratory in foundational studies ranging from the social construction of technology (Latour and Woolgar, 1979), mentorship and training (Zuckerman, 1977, Dasgupta and David, 1994), organizational structure and boundary spanning (Allen, 1984), as well as the coordination of innovative activities (Pelz and Andrews, 1976). More recently, burgeoning literatures on differences across scientists (Roach and Sauermann, 2010, Pezzoni et al., 2012) and incentive structures within firms (Cockburn et al., 1999, Liu and Stuart, 2014), as well as scientific careers and differences between graduate and postdoctoral stages (Stephan and Levin, 1992, Azoulay et al., 2009) reinforce the notion that there is considerable heterogeneity across scientists. As Stephan (2012) notes in her recent book, “Collaboration in science often occurs in a lab. The lab environment not only facilitates the exchange of ideas. It also encourages specialization…” (p. 67).

This paper's central goal is not to overturn laudable advances in data collection and analysis, but to urge greater attention to the study of scientific laboratories. As one avenue of motivation, in this paper we examine the personnel composition within laboratories. Specifically, we focus on laboratory members of varying characteristics and link changes in the number of these personnel types to the laboratory's scientific output. In particular, we ask the following set of related questions. First, in the biological sciences, how have laboratory personnel compositions changed as this field has grown in prominence through the twentieth century? Second, to what extent do laboratory members with different scientific experience (e.g., graduate students vs. postdocs), funding, or position (e.g., trainee vs. technician) affect the laboratory's scientific output? And in our examination of the laboratory's personnel composition, what might we learn about how different types of personnel members affect incremental versus breakthrough publications?

To answer these questions, we examine the laboratory compositions and scientific outputs for one elite set of scientists: principal investigators (PIs) running laboratories at the MIT Department of Biology. Using a complete personnel roster, we document an increase in the prevalence of postdoctoral scientists for the period 1966–2000, while the number of graduate students and technicians remained largely constant. Consistent with prior research, our analysis suggests that personnel are a critical determinant of laboratory productivity: larger laboratories have more publication outputs. Moreover, we find that experienced scientists (i.e., postdocs), particularly those with external funding (i.e., postdocs with fellowships), make greater contributions to the laboratory's publication outcomes, suggesting that both experience and funding are critical determinants of laboratory productivity.

However, when we focus solely on high-profile publications (i.e., publications in Science, Nature, or Cell), we present three unexpected findings. First, graduate students, who make only nominal contributions to overall publication counts, contribute as much to breakthroughs as postdocs with external funding. Second, postdocs without fellowships have no observable impact on breakthrough publications. In a final intriguing finding, technicians are instrumental to high-profile publications, but have no observable impact on lower impact publication output.

These results speak to the importance of composition, not just size, in a laboratory manager's consideration of potential laboratory members. Although larger laboratories result in more publications, only a subset of these personnel types appears to contribute to breakthrough publications. Moreover, our results have implications for the use of large-scale bibliometric data to study productivity. Our results suggest that the sole use of publication author lists to construct laboratory size may lead to severe biases in estimating a laboratory's productive resources. Coupled with recent, exponential advances in the collection of large datasets, our findings on laboratory composition provide motivation to revisit these ubiquitous social groups. We believe that the time is ripe for a large-scale examination of scientific laboratories.

This paper proceeds as follows. Section 2 reviews the literature on laboratories and their importance to knowledge production, as well as posing our research questions. In Section 3, we describe our setting and data, and Section 4 describes our measures and empirical strategy. Section 5 presents our findings. A final section concludes and discusses the implications of our findings for the current trend toward large bibliometric datasets.

Section snippets

History of industrial and academic laboratories

Across both commerce and the academe, laboratories are central organizational structures in the production of knowledge. Laboratories enable the division of scientific labor (e.g., Jones, 2009), serve as repositories of scientific materials (Furman and Stern, 2011), and transmit tacit knowledge to scientific novitiates (Latour, 1988), to name a few roles among many. Historically, laboratories have been physical spaces that serve both to separate potentially dangerous chemicals and reagents away

Setting and data

To address these topics, this paper presents a quantitative case study examining a dataset comprised of laboratories at the Massachusetts Institute of Technology (MIT) Department of Biology between 1966 and 2000. Although focused on one scientific department at a specific university, this setting has a number of advantages. First, it is an elite biology department that has consistently contributed to scientific breakthroughs since the 1960s. Mirroring other studies that focus on scientific

Publication outcomes

In this paper, we link a laboratory's personnel composition to its publication output. To do so, we examined two dependent variables. The first variable is simply a laboratory's yearly number of publications. An alternative measure of overall productivity, the impact-factor weighted publication count, did not affect this set of results.

A second set of regressions examines a laboratory's likelihood of achieving a “breakthrough” discovery. Specifically, we chose to focus on publications in

Results

We begin our results section with a description of the dataset (Table 1). Overall, our dataset includes 1482 laboratory-year observations, and 20,324 laboratory member-years that span 1966–2000.8 Within this dataset, there are 119 principal investigators and 5694 laboratory members, which include 1798 postdocs with fellowships, 1328 postdocs without fellowships, 1395 graduate

Discussion and conclusion

This study links the number of personnel types with varying experience, external funding, and positions to a laboratory's publication output. We suggest that for incremental publications, postdocs (i.e., those with experience), regardless of their funding level, dominate. However, for breakthrough publications, graduate students and postdocs with external funding make equally significant contributions. By contrast, postdocs without fellowships do not correlate with breakthroughs. Lastly, we

References (85)

  • C.C. Liu et al.

    Positions and rewards: the allocation of resources within a science-based entrepreneurial firm

    Res. Policy

    (2014)
  • E. Mansfield

    Academic research and industrial innovation: an update of empirical findings

    Res. Policy

    (1998)
  • F. Murray

    The role of academic inventors in entrepreneurial firms: sharing the laboratory life

    Res. Policy

    (2004)
  • M. Pezzoni et al.

    Career progress in centralized academic systems: social capital and institutions in France and Italy

    Res. Policy

    (2012)
  • M. Roach et al.

    A taste for science? PhD scientists’ academic orientation and self-selection into research careers in industry

    Res. Policy

    (2010)
  • A. Agrawal et al.

    Collaboration, Stars, and the Changing Organiation of Science: Evidence From Evolutionary Biology. NBER Working Paper Series #19653

    (2013)
  • T.J. Allen

    Managing the Flow of Technology

    (1984)
  • D.B. Audretsch et al.

    R&D spillovers and the geography of innovation and production

    Am. Econ. Rev.

    (1996)
  • P. Azoulay et al.

    Superstar extinction

    Q. J. Econ.

    (2010)
  • P. Azoulay et al.

    Social Influence Given (Partially) Deliberate Matching: Career Imprints in the Creation of Academic Entrepreneurs. Harvard Business School Working Paper 09-136

    (2009)
  • P. Azoulay et al.

    Matthew: effect or fable?

    Manag. Sci.

    (2014)
  • S.R. Barley

    Technicians in the workplace: ethnographic evidence for bringing work into organizational studies

    Adm. Sci. Q.

    (1996)
  • S.R. Barley et al.

    In the backrooms of science: the work of technicians in science labs

    Work Occup.

    (1994)
  • S. Breschi et al.

    Knowledge spillovers and local innovation systems: a critical survey

    Ind. Corpor. Change

    (2001)
  • C. Catalini

    Microgeography and the Direction of Inventive Activity. Working Paper

    (2012)
  • I.M. Cockburn et al.

    Absorptive capacity, coauthoring behavior, and the organization of research in drug discovery

    J. Ind. Econ.

    (1998)
  • I.M. Cockburn et al.

    Balancing Incentives: The Tension Between Basic and Applied Research. NBER Working Paper Series w6882.

    (1999)
  • W.M. Cohen et al.

    Absorptive capacity: a new perspective on learning and innovation

    Adm. Sci. Q.

    (1990)
  • A. Conti et al.

    The (changing) knowledge production function: evidence from the MIT Department of Biology 1966–2000

  • A. Conti et al.

    Knowledge specialization in PhD student groups

    IEEE Trans. Eng. Manage.

    (2014)
  • R.M. Cyert et al.

    A Behavioral Theory of the Firm

    (1963)
  • P. Dasgupta et al.

    Toward a new ecoomics of science

    Res. Policy

    (1994)
  • M.P. Feldman

    The locational dynamics of the US biotech industry: knowledge externalities and the anchor hypothesis

    Ind. Innov.

    (2003)
  • M.P. Feldman et al.

    Initial advantage: the origis of the geographic concentration of the pharmaceutical industry in the mid-Atlantic region

    Ind. Corp. Change

    (1996)
  • L. Fleming et al.

    Science as a map in technological search

    Strateg. Manag. J.

    (2004)
  • R. Freeman et al.

    Competition and careers in biosciences

    Science

    (2001)
  • J.H. Fujimura

    Crafting Science: A Sociohistory of the Quest for the Genetics of Cancer

    (1996)
  • J.L. Furman et al.

    Climbing atop the shoulders of giants: the impact of institutions on cumulative research

    Am. Econ. Rev.

    (2011)
  • R.M. Henderson et al.

    Scale, scope, and spillovers: the determinants of research productivity in drug discovery

    RAND J. Econ.

    (1996)
  • A.B. Jaffe

    Real effects of academic research

    Am. Econ. Rev.

    (1989)
  • A.B. Jaffe et al.

    Geographic localization of knowledge spillovers as evidenced by patent citations

    Q. J. Econ.

    (1993)
  • B.F. Jones

    The burden of knowledge and the “death of the renaissance man”: is innovation getting harder?

    Rev. Econ. Stud.

    (2009)
  • Cited by (52)

    • Assessing differences between university and federal laboratory postdoctoral scientists in technology transfer

      2022, Research Policy
      Citation Excerpt :

      Some even reported a 300% increase since 1990 (Biancani et al., 2018). Not surprisingly, postdocs now account for a large portion of research publications and are aligned with external funding that affords labs high-profile, breakthrough innovations (Conti and Liu, 2015). Despite the growing importance of research labs, postdocs have experienced dwindling career prospects in academia.

    • Training across the academy: The impact of R&D funding on graduate students

      2021, Research Policy
      Citation Excerpt :

      We find positive results on the outcome – Cumulative Research Output (0.722 SE 0.135).14 We also estimate the weighted productivity metrics based on annual, rather than cumulative, publication activity (Rosenbloom et al., 2015; Conti & Liu, 2015). Operationally, we estimate the three-year average of author-weighted annual publications (col. 2) and three-year average of publication-weighted annual citations (col. 3).

    • Unravelling supplier-laboratory knowledge spillovers: Evidence from Toyota's central R&D laboratory and subsidiary R&D centers

      2021, Research Policy
      Citation Excerpt :

      In particular, it has been established that knowledge spillovers between firms can facilitate innovations, raise productivity, increase organizational performance, and spur economic development (Jaffe et al., 1993; Audretsch and Feldman, 1996, Jaffe and Rassenfosse, 2017; Audretsch and Belitski, 2020). Although the majority of research focuses on knowledge spillovers between inventors, firms, universities, regions, and countries (Jaffe and Rassenfosse, 2017), a growing number of studies are examining the frequency of knowledge spillovers that are associated with public R&D laboratories (Venturini et al., 2019) and university R&D laboratories (Conti and Liu, 2015). However, research has yet to study the frequency of knowledge spillovers that occur between an industrial R&D laboratory and its suppliers, which represents a significant gap in our understanding of how managers and policy makers can facilitate the flow of knowledge during the innovation process.

    View all citing articles on Scopus

    Authorship is alphabetical. The authors give special thanks to the MIT Department of Biology for access to archival materials. We also thank Paula Stephan, Pierre Azoulay, Nico Lacetera, Jacques Mairesse, Fabiana Visentin, editor Martin Kenney and two anonymous reviewers, as well as participants at the NBER Changing Frontier Conference and the NSF Workshop on Science of Science Policy for their thoughtful comments. Chris Liu is thankful to the Kauffman Foundation for Dissertation Fellowship funding and support. The usual disclaimers apply. Direct all correspondence to Chris Liu at [email protected].

    View full text