Introduction

Fibroblasts were first described in 1858 by Rudolf Virchow as cells located in connective tissue1, a term coined already in 1830 by Johannes Peter Müller for the matter that connects and mechanically supports other tissues2. In 1879, Matthias Duvall distinguished fibroblasts from epithelial cells within the mesenchyme of chick embryos, thereby adding to their demarcation3. Progressively, it became clear that fibroblasts reside within all dense and loose fibrous connective tissues. Dense fibrous connective tissue makes up tendons, ligaments and fasciae of the musculoskeletal system. Loose fibrous connective tissue is located at many sites, including visceral organ capsules, beneath the epithelial linings of airways, gastrointestinal and urogenital tracts (laminae propria) and skin (dermis), in central nervous system meninges, and between parenchymal cells in muscle and adipose tissue. Fibroblasts are thus a major cellular component of the diverse fibrous connective tissues that disperse throughout our organs2,4. Alongside endothelial and blood cells, fibroblasts are one of the most common and widespread cell types in our bodies2.

The term fibroblast, used already more than 100 years ago by Ernst Ziegler5 and Alexander Maximow6, deserves a comment, as the “-blast” suffix refers to morphological hallmarks of active protein synthesis. A key research goal of the early 20th century was to identify the cellular source of extracellular matrix (ECM) during tissue repair4,7,8,9. In these studies, the quiescent fibroblasts of uninjured organs were sometimes referred to as fibrocytes, which displayed a small cytoplasm, few ribosomes and condensed chromatin, together suggesting low levels of protein synthesis10. It was assumed that fibrocytes give rise to fibroblasts upon wounding, and conversely that fibroblasts differentiate into fibrocytes during healing10. Perhaps due to limited interest in the quiescent cells, the original meaning of fibrocyte was progressively lost and fibroblast adopted as the common name irrespective of its state of activity. Fibrocyte later reappeared as the name for a circulating monocytes/macrophages-related antigen-presenting cell involved in tissue repair11,12 (for review see ref. 13).

In the early days of cell culturing, fibroblasts turned out to be sturdy and easy to propagate on artificial surfaces such as glass and plastic. They regularly outgrew other cell types and thus became popular for basic studies in cell biology. In the 1950s, Michael Abercrombie and Joan Heaysman established the important concept of cell contact inhibition using fibroblasts14, and Leonard Hayflick showed in 1965 that fibroblasts stop dividing and senesce after a finite number of population doublings in vitro15, thereby disproving the view that cultured cells could be passaged indefinitely (see ref. 16). Fibroblasts were further utilized to identify cellular oncogenes in the 1970s and 1980s17, as well as for the more recent establishment of techniques for induced pluripotency18. Fibroblasts are also commonly used as feeder cells in co-culture experiments to provide cell contacts, ECM and growth factors needed for other cells, such as embryonic stem cells, to thrive19,20.

It is well-established that fibroblasts produce collagens and numerous other proteins that make up the ECM of fibrous connective tissues. In fact, one of the fibroblasts’ archetypical products, collagen type 1, is the most abundant protein in our bodies21. The ECM consists of more than 300 different core proteins and numerous additional bridging proteins and matrix-modifying enzymes, together referred to as the matrisome22. Each tissue has its unique ECM composition, and it was early recognized, based both on morphological and molecular criteria, including gene expression profiling, that fibroblasts from various anatomical locations differ23,24. In addition to ECM production, fibroblasts play a role in wound contraction. In 1971, Giulio Gabbiani and colleagues reported that fibroblasts respond to tissue injury by assuming a contractile, myofibroblast, phenotype25. Similar cells are implicated in fibrosis, which may be defined as unresolved repair after tissue damage (for review, see ref. 26). Besides fibrosis, fibroblasts are important also for other disease processes, including cancer where they make up a variable proportion of the tumor stroma and deposit ECM of various molecular compositions and secrete different cocktails of growth factors and cytokines that influence tumor growth and metastasis27,28. Secretion of soluble extracellular signaling molecules is also of profound physiological importance during development, where fibroblasts provide instructive paracrine signals during organogenesis and thus take part in reciprocal cell-cell signaling required for the differentiation of other—often epithelial—cell types, for example during epidermal, oral, intestinal and kidney development29,30,31,32,33,34. Finally, fibroblasts are known to hold positional memory in the body, notably by expression of specific sets of Hox transcription factors in various sectors along the craniocaudal axis of the vertebrate embryo35,36.

To better understand the various roles of fibroblasts, it is important to gain insights into how heterogenous they are, as well as the functional consequences emanating from fibroblast heterogeneity. At least in part, research into fibroblast heterogeneity has however been hampered by ambiguous and diversified cell classification with regard to, for example, lineage, morphology, location or growth characteristics. An important step towards a deeper and more detailed characterization of fibroblasts can however now be taken thanks to single-cell transcriptomics, which complements the detailed anatomical and morphological knowledge with insights into the molecular profiles of fibroblasts at unprecedented depth and scale.

Evolution from morphological to molecular characterization of fibroblasts

In normal adult tissues, fibroblasts appear spindle- or stellate-shaped with an oval nucleus and a distinct endoplasmic reticulum (ER)3,4 (for review see ref. 27). While these features are helpful to localize the cells anatomically, they provide few clues to origin, molecular composition, heterogeneity and relationship to other cell types. Although not uniquely expressed on fibroblast, certain proteins, such as vimentin (VIM (protein), Vim (gene/mRNA mouse)) and fibroblast specific protein 1 (FSP1, S100a4), have served as useful markers to identify fibroblasts by immunohistochemical techniques37,38. With the advent of technologies that allow the gene expression patterns of individual cells to be resolved at substantial depth, i.e., single-cell RNA-sequencing (scRNA-seq) (see Box 1 for a detailed description of the technology and potential caveats and pitfalls), an additional important step could be taken in the quest to better characterize fibroblasts along with all other cell types that make up our organs. Hence, the traditional morphological classification of cell types is now being complemented by molecular classifications, foremost based on gene expression.

Following from the first cDNA microarray-based study of fibroblast heterogeneity24, transcriptional analysis of fibroblast was conducted using bulk mRNA isolates obtained from multiple cells39,40,41. This work provided molecular information at the organ level and initial insights into inter-organ transcriptional heterogeneity of fibroblasts. From these and other studies, certain molecular markers were proposed to distinguish fibroblasts from other cell types, including VIM, platelet derived growth factor receptor-alpha (PDGFRA, Pdgfra), fibroblast activation protein-alpha (FAP, Fap), FSP1; and CD90 (Thy1). Subsequently, scRNA-seq datasets have been obtained from fibroblasts from essentially all major organs in the mouse and human (see Supplementary data 1 for a compilation of select scRNA-seq studies covering fibroblasts). When fibroblasts were collected and sequenced as part of broader atlas projects, they were occasionally called by other names, such as stromal cells, mesenchymal stem cells, myofibroblasts42 and unknown mesenchymal cells43, illustrating the prevailing ambiguities regarding markers and cell nomenclature. Some studies have been devoted more specifically to fibroblasts and their involvement in cancer, fibrotic diseases44,45 or development46. We have ourselves compared fibroblasts across healthy adult organs in the mouse47, whereas others have reported multi-organ comparisons between mice and humans48.

What have we learned from these scRNA-seq studies thus far? As a brief account (more specific examples and discussion are provided below), one lesson is that no single marker appears capable to discriminate all fibroblasts from all other cell types across organs; combinations of markers are needed. A second lesson is that organotypic fibroblast heterogeneity is profound47, although certain marker combinations may point to fibroblast subtypes that cross organ boundaries. Examples include peptidase inhibitor 16 (Pi16) and collagen type 15 alpha-1 (Col15a1) expression, which define subtypes present in several organs48, a Wif1+Comp1+ fibroblast subtype observed in heart valve as well as skeletal muscle perimysial fibroblasts, and two fibroblast subtypes present in both colon and bladder, defined by Tnc+Cd34 and Tnc-Cd34+ expression, respectively47. Thirdly, insights into the specialized physiological functions of various fibroblast subtypes are emerging, for example in human skin, where some fibroblast subtypes appear specialized for ECM production and others for immunological and antimicrobial activities49, or in mouse skin, where fibroblast subtypes have been identified that participate in fibroblast growth factor (FGF) and Wnt signaling with hair follicle epithelial stem cells during cycling of the hair follicle50. In the mouse intestine, specific intestinal fibroblast subtypes are involved in Wnt signaling that regulates intestinal epithelial stem cell differentiation32,47,51,52. Fourthly, insights into the specialized pathological functions of various fibroblast subtypes are emerging. For example, a rare intestinal fibroblast subtype that converts arachidonic acid into prostaglandin E2 appears to play a role in tumorigenesis in the mouse33. Fibroblasts in skeletal muscle express specific cytokines and proinflammatory factors, likely aiding in regeneration of myofibers following injury53. A fifth lesson relates to species similarities and differences, including how similar human and mouse fibroblasts are, which is of relevance for human translation of mouse data. While the number of studies comparing mouse and human fibroblasts “side-by-side” are still few48,54,55,56,57,58, analogous fibroblast subtypes and similar gene activation programs exist in mouse and human cancer-associated fibroblasts (CAFs)48. There also appears to be molecular similarities between perivascular brain fibroblasts in humans and mice59,60,61,62, as well as between human and mouse lung fibrosis-inducing CTHRC1+ fibroblast subtypes63,64. Notwithstanding these examples, more research will be required to precisely define the degree of fibroblast conservation between the humans and mice.

Given the current pace of acquisition of fibroblast scRNA-seq data, we will undoubtedly see rapid progress, resulting in more fine-grained views of fibroblast heterogeneity, as well as refinement of transcriptomic signatures for cell type demarcations, such as proposed pan-fibroblast signatures defined by Pdgfra, Dpt combined with Pi16 or Col15a148, or a 90-gene signature that demarcates fibroblasts from vascular mural cells47. Progress will also be aided by improvements on the bioinformatics side (see Box 1 regarding technological considerations for scRNA-seq studies). Going forward, it is important that novel fibroblast subtypes are rigorously characterized, not only bioinformatically through clustering algorithms, which can be somewhat arbitrary and should be seen as a framework for detailed analysis rather than final conclusions about cell types, but also anatomically using carefully annotated RNA and protein markers. Ideally, marker combinations that readily and reliably distinguish fibroblasts from other cell types in specific organs of interest should be identified and used.

Fibroblast heterogeneity within and between organs

As mentioned above, scRNA-seq data now allow fibroblast heterogeneity and relationship with as well as demarcation to other cell types to be decoded with unprecedented speed and precision. When discussing progress in these areas, we were inspired by the “The Ancestor’s Tale” by Richard Dawkins and Yan Wong, in which the authors describe our evolutionary history from the perspective of a human first discussing with its closest relatives, i.e., the extinct hominins, and then proceeding gradually to evolutionarily more distant relatives, ending with Archaea. In the same vein, we first discuss fibroblast heterogeneity within and between organs and then proceed to discuss relationships of fibroblasts to other connective tissue and vasculature cell types with whom fibroblasts share functional similarities and possible molecular relationships.

Intra-organ fibroblast heterogeneity

Fibroblast heterogeneity spans species, organ and developmental stage boundaries. It has long been recognized that different fibroblast populations reside simultaneously in the same organ (for review see refs. 65,66). For example, several studies have advanced our understanding of fibroblast subtypes in the skin (Fig. 1). In mice, fibroblasts residing in the upper (papillar) layer express alpha-8 integrin (Itga8) and dipeptidyl peptidase 4 (CD26, Dpp4), whereas fibroblasts in the lower (reticular) layer express podoplanin (Pdpn) and delta-like non-canonical Notch ligand 1 (Dlk1)67. One study identified four fibroblast subtypes in the adult mouse skin (FIB1-4), which based on marker expression (Dcn, Gpx3, Sparc, and Plac8, respectively), were found to occupy three distinct anatomical localization (dermis, hypodermis and adventitia)50. Two of the subtypes (FIB1 and 2), likely representing different states of the same fibroblast subtype, respond transcriptionally to different cell cycle stages in the hair follicle50. In humans, one study suggests that skin fibroblasts segregate into six transcriptionally distinct cell clusters, of which one (expressing DPP4) is the main ECM-producing cell type49. In a meta-analysis of human skin fibroblast transcriptomes, 10 subtypes formed three major groups (A-C, marked by MMP2, IGFBP7 and SFRP1 expression, respectively)68, of which group A appears specialized for ECM production and group B in immune surveillance68. In the mouse heart, two major fibroblast subtypes, presumably derived from endo- and epicardium, were identified and shown to express Lamb1 and Dkk3, respectively69. Heart fibroblasts also express Csf1, Vegfa, Igf1 and Fgf2, indicating active paracrine signaling to other cell types in the heart69. The human adult heart was found to contain seven fibroblast subtypes, two of which show enrichment in atria and ventricles and can be distinguished by expression of SCN7A and CFH, respectively, and two (called FB4 and FB5) which appear specialized in responding to transforming growth factor-β signaling and in ECM remodeling, respectively70. In the mouse lung, fibroblast subtypes have been characterized, one of which (a PDGFRa+Axin2+ subtype) responds by differentiating into myofibroblasts following injury71,72. An analysis of the human lung revealed five fibroblast subtypes, one of which (CTHRC1+) promotes lung fibrosis in COVID-19 patients64. Interestingly, a similar Cthrc1+ fibroblast subtype drives lung fibrosis in the mouse63. Two fibroblast subtypes with different injury-response features were identified in the synovial tissue of joints in the mouse: an immune effector subtype expressing fibroblast activation protein-α (FAPα)+ as well as THY1+ and a tissue destructive FAPα+THY1 subtype73. Depletion of the FAPα+ fibroblasts reduced bone erosion and inflammation in a mouse arthritis model73. Another study identified NOTCH3 activity as a driver of inflammation in THY1+ synovial fibroblasts44. Intra-organ fibroblast heterogeneity is not confined to mammals but has also been noted in zebrafish where different pdgfra-positive fibroblast subpopulations take part in lymphangiogenesis46, or in the axolotl where several fibroblast populations were found amongst blastema-cells during limb regeneration74,75.

Fig. 1: Schematic depiction of examples of intra-organ fibroblast heterogeneity in humans and mice based on scRNA-seq analysis.
figure 1

Examples of gene enrichment for specific fibroblast subtypes that were defined in each study are shown. Ascensión et al.68 summarizes four individual studies. AlvFB alveolar fibroblasts, AdvFB adventitial fibroblasts, LipFB lipofibroblasts, MyoFB myofibroblasts (sub-class: fibromyocytes), F-SH Sca1-high fibroblasts, F-SL Sca1-low fibroblasts, F-Trans transitory fibroblasts, F-WntX Wnt expressing fibroblasts. The annotations of fibroblast subtypes as well as their marker genes were compiled from the respective original works. If marker genes were not explicitly stated in the original paper, they were interpreted and derived from the available data presented in the figures. No additional analysis of the raw data was performed.

In addition to the distinction between fibroblast entities by the presence/absence of specific singular markers, intra-organ fibroblast heterogeneity may also reflect cell zonation, which is defined as gradual cellular phenotypic transitions along an anatomical or functional axis. For example, zonation of hepatocytes and endothelial cells occurs along the porto-central axis in the liver lobule76,77 and endothelial and mural cell zonation is observed along the arterial-venous axis in brain vasculature60. There are anatomical and physiological reasons to believe that fibroblast zonation (in addition to the Hox transcription code, discussed above) would make sense, for example along the muscle-tendon axis or along the crypt-villus and crypt-surface axes of the small and large intestine32,34. By identifying genes with gradient or nested transcription profiles across adjacent tissue areas/volumes, fibroblast zonation will potentially be uncovered and shown to be instrumental for generation and maintenance of proper tissue architecture in multiple organs.

Inter-organ fibroblast heterogeneity

The first genome-wide transcriptional information on inter-organ fibroblast heterogeneity was provided through cDNA microarray analysis of cultures of human skin fibroblasts established from different ages and anatomical locations24. Interestingly, despite the concern that in vitro culturing may have influenced gene expression patterns, these fibroblasts retained a certain degree of transcriptional memory of their origin, including HOX gene expression patterns24. In a cross-comparison of mouse fibroblasts isolated directly from different muscular organs and analyzed by scRNA-seq, it was evident that differential expression of genes related to the ECM—the matrisome—is a major feature of inter-organ fibroblast transcriptomic heterogeneity47 (Fig. 2). This information is consistent with previous evidence that fibroblasts at different anatomical locations and at different ages tailor a unique composition of the ECM24. An important topic for further analysis is to understand the genetic and/or epigenetic underpinnings of how fibroblasts of different organs regulate their gene expression in order to achieve spatial/temporal-specific ECM profiles.

Fig. 2: Schematic depiction of examples of inter-organ fibroblast heterogeneity in humans and mice, based on scRNA-seq analysis.
figure 2

Examples of gene enrichment for specific fibroblast subtypes that were defined in each study are shown. Studies that specifically investigate the inter-organ heterogeneity of fibroblasts were selected. a alveolar, p peribronchial, hi PDGFRα-high, rp red pulp. The annotations of fibroblast subtypes as well as their marker genes were compiled from the respective original works. If marker genes were not explicitly stated in the original paper, they were interpreted and derived from the available data presented in the figures. No additional analysis of the raw data was performed.

The relationship between fibroblasts and other cell types in connective tissue and vasculature

Specialized connective tissue cell types

Several connective tissues harbor highly specialized cell types: adipocytes in fat, chondrocytes in cartilage, and osteocytes/blasts in bone. There are both in vivo and in vitro data suggesting that these cell types are lineage-related and share a common mesenchymal and fibroblast-like origin, and it is therefore of interest to explore how related the differentiated cells are to each other and to fibroblasts at the molecular level. Adipocytes are designed to store, release, and burn lipids at different relative rates depending on their subtype (white, brown, and beige/brite). Several scRNA-seq studies of adipocytes have been conducted, although adult white adipocytes are challenging to handle due to their large size and fragility (for review see ref. 78). From available data, it is apparent that the transcriptomes of differentiated fibroblasts and adipocytes are readily distinguishable by the adipocyte transcripts that encode proteins involved in lipid handling. The resemblance between the adipocyte precursor cell (preadipocyte) and fibroblasts is however close, although relationships between preadipocytes and pericytes, discussed below, have also been proposed (see ref. 79 and references therein). Chondrocytes are morphologically highly similar to fibroblasts, but express several unique proteins, notably a number of ECM proteins contained within cartilage, including collagen type 280 (for review see ref. 81). Beyond the highly specific markers of adipocytes, chondrocytes and osteoblasts, further analyses are needed to define transcriptional similarities and differences between these cells, their precursors, and fibroblasts.

Vascular mural cells

Vasculature courses through and is sometimes even counted as a component of connective tissue. Blood vessels consist of an inner tubing of endothelial cells surrounded by mural cells. The latter are either vascular smooth muscle cells (VSMCs), located in larger vessels (arteries, arterioles and veins), or pericytes, which coat capillaries and some venules and are transcriptomically related to venous VMSCs60. VSMCs contain a contractile machinery involving several proteins, among them alpha-smooth muscle actin (ASMA, Acta2), which is a highly expressed and often-used smooth muscle cell marker. Pericytes probably have multiple and organ-specific functions, for example in the brain where they are essential for the maturation and maintenance of the blood-brain barrier82,83. Molecularly, pericytes can be distinguished from VSMCs by for example abundant expression of potassium inwardly rectifying channel subfamily J member 8 (Kcnj8) and absent or very low (relative to VSMCs) expression of transcripts encoding the smooth muscle contractile machinery60. Fibroblasts reside on the outside of the mural coat in many vessel types. In the largest arteries and veins, fibroblasts form an anatomically discernable vascular layer, the adventitia. Pericytes have a morphology somewhat resembling that of fibroblasts, and it has been proposed that pericytes are in fact specialized perivascular fibroblasts84. Live imaging in zebrafish embryos indeed suggest that mural cells derive from perivascular fibroblast-like cells85. ScRNA-seq analysis of pericytes from several organs, including brain, lung, skeletal muscle, colon, bladder and heart has illustrated their relationship to fibroblasts47,60. From these studies, it seems clear that a distinct demarcation between fibroblast and pericyte transcriptomes can be made, including differences in ECM-related transcription profiles. For example, pericytes express specific ECM-binding proteins, such as basal cell adhesion molecule (BCAM, Bcam), but generally low levels of collagens in comparison to fibroblasts, with the exception of collagen type 4 (Col4a1, Col4a2), which are components of the microvascular basement membrane to which the pericytes contribute47. The molecular demarcation between VSMCs and fibroblasts also seems distinct, with several genes uniquely expressed in one or the other cell type. Regarding ECM production, VSMCs express low amounts of collagens, similar to pericytes, but high amounts of elastin (Eln), and certain other ECM proteins of presumed relevance for the resilience of the arterial wall47. Genes encoding proteins of the smooth muscle contractile machinery, such as Acta2, myosin heavy chain 11 (Myh11), transgelin (Tagln), calponin 1 (Cnn1), leiomodin 1 (Lmod1) and regulator of calcineurin 2 (Rcan2), are highly expressed by VSMCs, distinguishing them from hitherto analyzed quiescent fibroblasts, although these genes may be re-expressed (albeit at lower levels than in VSMCs) when fibroblasts become activated into myofibroblasts during pathological conditions or in normal fibroblasts located in organs with high deformation capacity, such as the urinary bladder47,57.

Cell types with features of both fibroblasts and pericytes

While in most cases fibroblasts and pericytes can be readily distinguished by their distinct gene expression profiles, certain cell types appear to exhibit characteristics of both. In the kidney glomerulus (Fig. 3A), mesangial cells display intermediate features between those of pericytes and fibroblasts, VSMC and myofibroblasts86. Mesangial cells have indeed been proposed to be a specialized type of pericytes87,88, a notion supported by their lineage-dependence on PDGFB-PDGFRB signaling89,90,91, which is shared with other mural cell lineages. Other reports however claim that mesangial cells are distinct from pericytes92. Mesangial cells lack the typical cellular morphology and vascular basement membrane embedment of other capillary pericytes, and the fact that they deposit a rich ECM (called mesangial matrix) may be more in line with a fibroblastic than pericytic nature. However, no other pericyte-like cells are present within the glomerulus, and if a generic pericyte function were needed there, it would have to be provided by the mesangial cells. Currently, several kidney scRNA-seq studies have claimed to identify (and hence molecularly characterize) mesangial cells, but links between cellular heterogeneity to precise anatomical location were lacking93,94,95. However, one study succeeded in distinguishing between intra- and extraglomerular mesangial cells through their unique scRNA-seq profile and in situ analysis of relevant markers58. Notably, the intraglomerular mesangial cells expressed desmin (Des) and Acta2, i.e., markers of mural cells, but also Pdgfra and Pi16, which are fibroblast markers typically absent in mural cells58.

Fig. 3: Cell types with features of both fibroblasts and pericytes.
figure 3

A Schematic depiction of the kidney with the nephron and glomerulus. In the glomerulus, the position of the mesangial cells close to a capillary loop is shown. B Schematic depiction of the liver with the lobules and portal triads. The position of the hepatic stellate cells along the sinusoid is shown.

Hepatic stellate cells (HSCs) were originally discovered in the 1870s by Karl Wilhelm von Kupffer, who called them Sternzellen (star cells)96 and portrayed them as a specialized type of pericyte by virtue of their anatomical position along the sinusoidal capillaries and in the perisinusoidal space (the space of Disse) between the endothelial cells and hepatocytes97,98,99,100 (Fig. 3B). Like pericytes in other organs, the HSCs contact capillary endothelial cells, but unlike both typical pericytes and glomerular mesangial cells, HSCs do not depend on PDGFB-PDGFRB signaling for their embryonic development101, possibly suggesting a non-mural identity. HSCs were recently characterized by scRNA-seq and found to be heterogeneous across the healthy liver lobule, with distinct molecular markers expressed at the periportal and central locations, respectively102. Similar to mesangial cells, HSCs appear to express markers of both mural cells (Des) and fibroblasts (Pdgfra), suggesting an identity intermediate between these two cell types.

Annotation of ambiguous cell types

The emerging transcriptomic profiles of fibroblasts and the other cell types discussed above can be useful to shed light on the nature of recently identified cell types, for which annotation has been problematic, or remains controversial. A novel type of pericyte involved in spinal cord repair has been reported103,104. While these cells share some markers with brain pericytes and locate in the proximity of blood vessels, other markers may argue that these cells instead are perivascular fibroblasts105, a notion receiving support from scRNA-seq data60. Lineage-tracing studies likewise suggest a fibroblast origin of cells forming spinal cord fibrotic scars106, but a final conclusion will have to await a more extensive analysis of all cell types present in the spinal cord vasculature. Kidney fibroblasts were recently annotated in a kidney scRNA-seq study based on the expression of a singular marker, FSP1 (S100a4)107. However, cross-organ comparison provides little support for these cells being fibroblasts, but there is instead ample evidence for similarity with macrophages47,108, which, along with smooth muscle cells, also express FSP1. Another study instead annotated kidney fibroblasts in both human and mouse as positive for PDGFRA, PDGFRB, maternally expressed 3 (MEG3), Scavenger Receptor Class A Member 5 (SCARA5) and numerous collagens109, which are markers found also in fibroblasts from several other organs47. Further studies are thus needed to establish the identity and possible heterogeneity of kidney fibroblasts. Some of the annotations in the large-scale analysis presented in Tabula Muris42 may also need to be revised, notably some cell populations defined as myofibroblasts should rather be referred to as mural cells (pericytes and/or VSMCs). A novel lung cell type characterized by high PDGFRB expression and expansion in response to experimental lung injury was recently identified110. The authors posed the question whether these cells were pericytes, a notion supported by the extensive transcriptomic similarity to cells independently annotated as lung pericytes60. These examples illustrate how the improved understanding of fibroblasts, pericytes and other vascular or connective tissue cell types may assist in fine-tuning annotations of cell types from various datasets.

Lineage relationships between fibroblasts and other cell types

The recent advances in transcriptomics can be productively interfaced with insights from lineage-tracing analysis to better understand fibroblast lineage relationships. Connective tissue is largely derived from mesoderm that via epithelial-to-mesenchymal transition (EMT) of the primitive epithelium generates embryonic mesenchyme, which in turns forms most connective tissues in the adult organism, including fibroblasts65,111 (Fig. 4). The vascular mural cells of most trunk tissues likewise have a mesodermal origin, arguing that a common progenitor for fibroblasts and vascular mural cells can be found somewhere in the mesodermal lineage65, a view supported by lineage-tracing analysis112. Fibroblasts and mural cells in the head are however in part derived from a different origin, the neuroectoderm (neural crest)84, and it will be interesting to learn to what extent transcriptomic differences between fibroblasts and mural cells in the head versus organs in the trunk reflect these different cellular origins. To establish fibroblast transcriptomes from different developmental stages will also be important to understand how different fibroblast subtypes dynamically change during development.

Fig. 4: Schematic representation of the fibroblast lineage, with the origin in mesoderm and from ectoderm (via the neural crest) for fibroblasts in the head.
figure 4

Other cell types originating from the mesoderm are also depicted. MSC are shown as a transitory cell type that yield fibroblasts. In fibrosis, wound healing and cancer, fibroblasts likely progress further to become fibrogenic cells, myofibroblasts and CAFs, respectively.

In a discussion on connective tissue lineage and possible stem/progenitor populations, it is difficult to avoid touching upon mesenchymal stem cells (MCS; a.k.a. mesenchymal stromal cells). MSCs were first mentioned in the 1960s in conjunction with bone marrow transplantation and explant studies113,114, and cells with the potential of differentiating into multiple connective tissue phenotypes, including adipose, cartilage and bone, were subsequently established in vitro from multiple tissue sources (for review see ref. 115). There has however been a lack of rigorous criteria to define MSCs116 and although efforts are made to define a set of commonly agreed markers115,117, there is still a somewhat bewildering literature regarding MSC cell lineage and differentiation capabilities. It should also be considered that most MSC differentiation experiments have been conducted in vitro, which makes it difficult to yet establish their true existence in vivo, both regarding abundance and locations, and to what extent they contribute to repair and remodeling of connective tissue, including fibroblasts. Several studies propose that pericytes may be in vivo equivalents to MSCs based on that MSCs can be established from explanted vascular fragments and by using a limited set of markers118,119,120. A strong similarity between MCSs and fibroblasts has however also been proposed121. It is increasingly clear that fibroblasts coat also small caliber vessels without a discernable adventitia and that fibroblasts and mural cells share many markers including the ones used for MSCs47,60. In sum, MCSs have been extracted from various tissues, but their true in vivo origin, i.e., whether they exist as a bona fide resident stem/progenitor cell in the tissue or derive through dedifferentiation of a tissue-resident fibroblast, pericyte or other cells, remains to be worked out.

Lineage-tracing in the mouse has shed new light on relationships between fibroblast subtypes and between fibroblasts and other cell types, although there are still some technical caveats and pitfalls with the technology, as discussed in Box 2. In skin, progenitors for both papillar and reticular fibroblasts, as well as for the papillar and reticular lineages individually, have been identified by the use of Pdgfra-CRE, Blimp1-CRE and Dlk1-CRE drivers, respectively67 (for review see ref. 122). Skin fibrosis has been lineage traced to an En1+ fibroblast population123. During wound healing, cells descending from a Hic1+ mesenchymal progenitor population generate extrafollicular fibroblasts124, and Hic1 marks mesenchymal progenitors with functions in regeneration also in skeletal muscle53 and heart125. Interestingly, these cells express a number of pericyte markers (e.g., Rgs5, Mcam, Notch3 and Kcnj8), possibly suggesting a lineage relationship between pericytes and regenerating fibroblasts53. In heart, mesenchymal progenitors yield Pdgfra+Sca1 fibroblasts upon cardiac injury, and the transition towards the fibroblast state is regulated by Hic1125. Lineage-tracing of a Pdgfra+Ly6a+(Sca1) mesenchymal stromal progenitor cell in skeletal muscle indicates that it can give rise to both fibroblasts and adipocytes126,127 and be diverted towards an osteogenic fate by bone morphogenetic protein stimulation128. This may suggest a common mesenchymal progenitor for fibroblasts, adipocytes and osteoblasts.

Fibroblasts and pathological conditions

Fibroblasts have been strongly implicated as the major fibrogenic cell type in pathological tissue fibrosis. 50 years ago, it was demonstrated that fibroblasts convert to myofibroblasts expressing certain smooth muscle cell markers, notably Acta2, when activated by tissue injury or stress25. While this transition is part of a normal wound-healing process that resolves over time, it can also result in persistent fibrosis, a severe medical condition with impaired function of organs, including the kidney, lung, liver and heart. A hallmark of fibrosis is the accumulation of myofibroblasts and excessive ECM deposition resulting in abnormal fibrous connective tissue (for review see refs. 27,129). While many studies point to fibroblasts as the source of the myofibroblast, the situation may not be that simple. Alternative suggested origins of myofibroblasts include endothelial cells, macrophages, mesothelial cells (the squamous epithelial lining of body cavities and their organs), pericytes, HSCs and MCSs. Summarizing the extensive body of original papers regarding the potential diverse origins of myofibroblasts goes beyond the scope of this review, and we refer interested readers to other literature on the topic66,129,130. Lineage-tracing has identified a Gli1+ mesenchymal progenitor population, which generates myofibroblasts and drives fibrosis in a sonic hedgehog (Shh)-dependent manner in kidney, lung, liver, heart and bone marrow131,132,133. The localization of the Gli1+ mesenchymal cells close to endothelial cells in the perivascular niche and their expression of Cspg4 (NG2) may suggest a relationship to pericytes131. The partially conflicting propositions for the various cellular origins of myofibroblasts rest on lineage-tracing experiments in mice and may to some extent reflect methodological pitfalls, including leakiness in inducible CRE/loxP systems134 or aberrant expression of CRE recombinase (Box 2). It however remains a possibility that there is a multi-cell type (including non-fibroblastic) origin generating functionally distinct subpopulations of myofibroblasts. Heterogeneity of fibroblast origin and differentiation during tissue fibrosis is indeed supported by data from heart, lung and kidney109,135,136. Further careful assessment of transcriptomic data from resting fibroblasts and other normal cell types, comparisons with myofibroblast transcriptomes, and the analysis of transcriptional profile changes during myofibroblast generation109 should bring clarity to both origin and heterogeneity amongst pathological myofibroblasts.

In cancer research, it is increasingly realized that it is not only the properties of the genetically abnormal tumor cells that dictates the cancer’s severity, growth potential and ability to metastasize; the outcome is also influenced by the genetically normal stroma cells that populate all solid tumors at variable abundance. Among the latter, CAFs constitute an important cell population (for a recent consensus statement paper on CAFs, see ref. 137). Together with their produced matrix, CAFs often make up a significant volume of carcinomas, and they influence tumor growth and metastasis27,28. Similar to the situation with myofibroblasts and fibrosis, our knowledge about the origin and molecular characteristics of CAFs is still limited—they are in part “negatively” characterized by not expressing markers for other well-annotated cell types and not carrying the chromosomal abnormalities observed in the neoplastic tumor cells. To gain insights into the origin of CAFs is complicated, particularly in humans, and relies on inference from studies of earlier stages of cancer (hyperplasia and adenomas), and based on such studies, normal tissue fibroblasts have been implicated as a founding cell type for CAFs (for review see ref. 138). Lineage-tracing experiments in mice have resulted in different views on the cell type of origin for CAFs137, but the conclusions may suffer from problems with CRE expression specificity (Box 2), as discussed above for myofibroblasts. Cell transplantation experiments provide an additional source of information regarding the cellular origin of CAFs. In such experiments, asking whether a particular transplanted cell type ends up as CAFs in the tumor stroma, both MSCs and adipocytes appear to be able to generate CAFs in xenograft mouse tumor models139,140,141,142,143. As for myofibroblasts, it is an emerging view that distinct subpopulations of CAFs can be identified in a particular tumor stroma45, possibly indicating a multi-cell type origin of CAFs.

Conclusions and future outlooks

At present, our understanding of fibroblasts is rapidly improving, and large-scale transcriptomic analyses at single-cell resolution allow us to better define fibroblasts, their heterogeneity and relationship to other types and classes of cells. While this undoubtedly is transforming our current understanding, it is important to also consider that there still is room for improvement regarding the current transcriptomic and transgenic technologies (see Box 1 and Box 2 for examples). Transcriptomic technologies indeed rapidly improve and become more affordable, which will lead to an acceleration in the accumulation of gene expression information with increasing granularity. Similarly, novel lineage-tracing methodologies are at the horizon, which rely less on promoter specificity but more on clonal evolution based on naturally occurring somatic mutations in cells144,145 or the introduction of traceable genetic markers146. Improved bioinformatic algorithms to identify cell state transitions and advances in spatial transcriptomics will likewise help deciphering lineage relationships and the exact anatomic location of fibroblast subsets147. Such studies may shed new light on lineage relationship and provide insights into whether differentiation yields only divergent fibroblast subsets from a founding cell population or whether there may also be convergence from different lineages towards a similar molecular phenotype, for example when fibroblasts of different origins experience similar physiological conditions and challenges.

Undoubtedly, gene expression signatures that define and demarcate fibroblasts will be refined and reveal further intra- and inter-organ fibroblast heterogeneity, advancing our functional understanding of fibroblasts and its related cell types. While this progress in principle is applaudable, it should be remembered that transcriptomic information, even when available with perfect fidelity for single cells, is not the sole basis of cellular heterogeneity. Cellular protein levels may deviate from the corresponding mRNA levels, and posttranslational protein modifications provide an additional level of molecular complexity and heterogeneity. Additionally, lipids, sugars and low molecular weight metabolites and solutes are likely to contribute to cellular heterogeneity independent of transcriptional control. Whereas single-cell analysis of other molecules than DNA and RNA currently has limited depth, we may expect technological advances also in these areas, and therefore our view and understanding of cellular (including fibroblast) heterogeneity will continue to evolve. Secondly, while the wealth of single-cell transcriptomic data can inspire efforts to define an expanding number of molecularly distinct fibroblast subpopulations, it must all the time be asked to what extent the molecular differences are physiologically meaningful; some of the identified cell “subtypes” may reflect states on a developmental trajectory or responses to stress or injury rather than stable functional specializations at steady state. To tease out specific physiological functions for steady state differences in fibroblasts will undoubtedly take longer, but it remains an important future research avenue and is increasingly feasible. It is possible that the fibroblast cell class in due time may be subdivided into functionally meaningful fibroblast subtypes, some of which execute organ-specific physiological functions, while other subtypes may transcend organ boundaries and carry out functions that are not organ-specific, such as the involvement in immune responses or wound healing. The latter may lead to a terminology similar to the one established for T-cells, for which we now recognize a set of functionally distinct subtypes, including T-regs, CD8+ T-cells, CD4+ T-cells and double-negative T-cells. It will be interesting to see how these endeavors will pan out, but it is certainly likely that the fibroblast will be a well-defined and annotated class of cells for which heterogeneities across and within organs as well as developmental and pathological states are understood in a foreseeable future.