1 Introduction

Cells within the body are constantly bombarded with a large repertoire of molecules that must be dealt with as potential stimuli. Most of the time, these molecular inputs are measured by receptors at the surface of the cells. State of these receptors are thus informative on the outside world, and experimental and theoretical biophysicists [61] have extensively used information theory to estimate how much information (in the Shannon sense [55]) can be encoded (see e.g. in a developmental context work of Gregor and coworkers [26]).

Over longer time-scales, information processing eventually leads to a decision, that we define as a change in physiological macroscopic behavior or in the steady states of a gene regulatory network. Decision is by essence a computation process, based on sometimes limited information. For instance, in the well-studied example of bacterial chemotaxis, a cell might decide to switch behavior between tumbling or swimming [6], a strategy that can be well-explained by a Max/Min game theory model [9]. It is not even always desirable to optimize information collection if the environment changes too rapidly, as illustrated by the“info taxis” strategy [64]. Other examples include cellular commitment to a given fate in response to dynamical signaling pathways [11], or decision to take action in the case of immune responses, characterized by binary Erk phosphorylation, cytokine release, and cell proliferation [2]. The later decisions are irreversible, indicating that computation is accompanied by information erasure and thus energy dissipation [48]. Additionally, information processing may be multi-tiered in order to retrieve different features of the input stimuli: rapid decision could discriminate between ligands of different nature, while slower decision could report the quantity of ligands.

In this review, we will focus on one specific cellular decision where similar considerations apply: the ability of T cells to discriminate very specifically between self and not-self ligands. The functional significance of such ligand discrimination is quite obvious. If a T cell “recognizes” a (potentially single) ligand as foreign, a large set of responses is triggered to eradicate the pathogenic infection that generated this stimuli. Conversely, if a T cell interacts with (many present) self ligands it should remain quiescent to avoid auto-immune catastrophe.

This discrimination task is particularly daunting as T cells are constantly exposed to a large number of molecular stimuli at once. This issue of signalling pleiotropy is potentially a very generic problem in biology and we will coin the term “absolute discrimination” to describe it. Multitudes of receptors are indeed shared in examples as different as BMP signalling, olfaction, endocrine signalling, etc... More theoretical works have suggested that organization of immune repertoire requires strong overlapping signals [46, 50]. In the context of this review, absolute discrimination thus is the specific and sensitive recognition of foreign ligands, independent of ligand quantity. Absolute discrimination of massive amounts of self vs low amount of foreign ligands is expected to be challenging since some self-ligands might be very close biochemically to foreign ligands.

The first part of this review will be devoted to a formal introduction to the problem of immune recognition by T cells, presenting current experimental understanding, past and present attempts to model this decision problem, and introducing the paradigm of adaptive sorting. In the second part we will introduce our current model for absolute immune discrimination, at the cellular scale. Finally, we will discuss how tools borrowed from statistical physics are needed to understand the higher level of processing in the immune system, at the cellular population scale.

2 Theoretical Approaches for Absolute Ligand Discrimination

T cells probe their environment in search of potential foreign peptides. This is done via the interaction of their T cell receptors (TCR) with ligands (pMHC), presented by Antigen Presenting Cells. At a given time, these cells “present” a repertoire of oligopeptides (embedded within an MHC) that is representative of the current proteome (i.e. a mix of peptides from the self genome as well as a potential genome of the pathogen). The core function of T cells is to scan such repertoire and detect the presence of pathogen-derived ligands and respond, while not responding to self-derived ligands.

This decision must be, by essence, absolute in the sense that it must be determined by ligand quality (here self or not-self), independently of ligand quantity (i.e. how many ligands there are). In that context, decision has been shown to be logically all-or-none, via binary/bistable response in Erk phosphorylation [2, 3, 43] or in NFAT translocation to the nucleus. A first difficulty is that there is no qualitative biochemical difference (e.g. a clear-cut structural distinction) between self-derived pMHC and pathogen-derived pMHC. Another natural hypothesis would be that foreign ligands lead to specific allosteric modifications (conformational changes) at the level of T cells receptors, which would be an ideal way to confer extreme sensitivity and specificity to immune recognition. Molecular immunology has made a lot of progress in listing all components implicated in this early response, but could not find evidence for such a direct qualitative sensing in the general case. Hence T cells must make a discrimination decision based on continuous quantitative biophysical differences between self and not-self, explaining why mathematical and physical modeling must be called upon to address how continuous variation in ligand characteristics gets processed with absolute discrimination.

2.1 Insight from Biophysics: The Lifetime Dogma of Antigen Discrimination

2.1.1 Antigen Discrimination is Set by the Lifetime of the Antigen/Receptor Complex

The exact molecular events associated with self/not-self ligand discrimination by T cells remain elusive. However, immunologists, structural biologists and biophysicists have made great progress to extract key parameters that physicists can build upon to tackle the issue of specific immune sensing (Fig. 1a).

The first insight came in the 90s when researchers measured the biophysical characteristics of ligand-receptor interactions using purified proteins assayed in vitro for binding/debinding (e.g. detection by surface-plasmon resonance or by calorimetry). Kersh et al. established a hierachy of ligands with similar binding activities, where the life-time of the ligand-receptor complex determines their ability to trigger response (Table 2 in [36]). Qualitatively, there exists a threshold of binding time (around 3–5 s) so that for ligands with a lower binding time, T cells do not respond, while for ligands with higher binding time T cells do respond (see in particular Table 2 and Fig. 3 in  [24]), thus realizing absolute discrimination based on binding. Such experimentally-derived rule (so-called “lifetime dogma” [18]) was well established by the turn of the millennium such that it became the springboard for many modeling efforts.

It should be immediately pointed out that like any dogma, this one is not absolute, and there are exceptions to the rule (see [18] for a discussion of some exceptions, [16] for an experimental approach on cell populations). For instance, it has been seen that some ligands could somehow “compensate” a small lifetime with a very high \(k_{on}\) [25], which has been interpreted as an effect due to constant rebinding [10] . More recent measurements have been carried in the context of more complete biological systems (e.g. T cells reading their ligands on the surface of antigen presenting cells), with single ligand resolution: these brought about a correction on parameters for association and dissociation rates, but concurred qualitatively with the previously-acquired in vitro measurements.

Recent work by Cheng Zhu and coworkers has challenged the lifetime dogma, using a Molecular Force Spectroscopy to interrogate individual pMHC–TCR interactions. Such technique relies on micropipette manipulation of pMHC-coated beads and exquisite measurement of the force induced by the engagement with one individual TCR on the surface of T cells to resolve the dynamics of ligand–receptor engagement. Zhu et al.’s measurements lead to paradoxical results at first: strong ligands that trigger T cells were found to be weaker binder to the receptor, thus inverting the life-time dogma [33]. Subsequent studies tested how force loading on the ligand-receptor complex would alter its lifetime, and the more intuitive hierarchy of ligands was recovered, with better binders inducing better signaling responses [44]. Zhu and colleagues thus proposed a dynamics structural model, whereby agonist ligands induce a conformational change in the complex (so-called catch bonds) that triggers T cell activation. Alternatively, non-activating ligands (e.g. self-derived peptide MHC) would not induce such conformational change, would be released rapidly (so-called slip bond) and would fail to activate a significant signaling response. Hence, there would in the end be qualitative differences between activating and non-activating ligands. This result, while potentially establishing absolute ligand discrimination at the structural level, must be reconciled with the observation that non-activating ligands may be sufficient to activate T cells when proper external cues are provided in the form of cytokines (cf Sect. 4.3). Ultimately, the mechanical differences between catch bonds and slip bonds for pMHC-TCR pairs, as uncovered by Zhu et al., remain to be interpreted in their capacity to trigger signal transduction.

From the Physics point of view, one intriguing aspect of these force measurements would be to add mechanical aspects to ligand discrimination. Understanding the forces associated with ligand-receptor interactions and the coupling with the mechanics of membrane deformation would be critical to account for the differential potency of ligands to activate T cells. Such quantitative models have been introduced [53], based on Ginzburg–Landau equations coupling the biochemistry of ligand-receptor interactions with the energetic cost of membrane deformation. Such models established that biochemical/mechanical coupling could be sufficient to physically sort membrane proteins on the T:APC cell interface, and generate a threshold of activation. Such physical models generated intriguing predictions that were subsequently validated experimentally: of note, it predicted that a family of ligands (with intermediate binding capacity) would abrogate the formation of so-called immunological synapse (a self-assembled bull-eye structure at the surface of T cells, where TCR aggregates at the center the synapse, and adhesion molecules occupy the periphery of synapse). In our context of immune recognition, one must point out that such synapse formation occurs downstream passed the initial signaling response associated with the ligand discrimination: it may constitute a reinforcing mechanism to anchor ligand discrimination over longer timescales, rather than the core cell-decision we are focusing on in this review. Recent models have explored how membrane stiffness influences effective binding times via supradiffusive effects  [1].

Two additional lines of work must be added to the biophysical conundrum of self/not-self ligand discrimination by T cells. First, in the field of immunotherapy, researchers have engineered T cells with synthetic chimeric-antigen receptors (CAR) whose extracellular domain is composed of an antibody recognizing a protein on the surface of tumors to be targeted (e.g. CD19 for B cell lymphoma), and whose intracellular domain is derived from signaling components of T cells (Fig. 1b): engagement of these receptors (with non-physiological ligands of surface antigens with very large lifetime) has been shown to be necessary and sufficient to activate T cells. In fact, examples of supra-physiological lifetimes for antigen/receptor complexes that lead to T cell activation were derived experimentally by in vitro evolution of the TCR/pMHC complex [30]. In the context of modeling early immune detection, this is relevant as the biophysics of ligand-receptor interaction are very different (with very large binding affinities), yet consistent with the lifetime dogma: antigen/receptor pairs with very strongly-held complexes, and very large lifetimes are indeed very stimulatory.

Another line of experimental evidence has recently been reinforcing the lifetime dogma. Taylor et al. [59] engineered a new class of chimeric antigen receptors, whose extracellular recognition unit is composed of single-stranded DNA 1c). Antigens for these T cells are composed of complementary single-strands of DNA (e.g. an oligomer of adenosines and cytokines, to avoid secondary structures). Hence immune detection in that context is highly tunable, quantifiable and easy to model: it is essentially the biophysics of DNA hybridization that drives the engagement of this artificial antigen receptor. In that context, Taylor et al. demonstrated that the association rates of these artificial receptor/ligand pair were essentially constant as it is limited by the nucleation of double stranding between two complementary DNA pairs. However, the dissociation rates are highly variable and essentially dominated by the free energy of double-strand formation. Hence, these DNA-based chimeric antigen receptor and ligands recapitulate the biophysical characteristics of ligand-receptor interaction in the natural immune detection context. Most strinkingly, Taylor et al. found that the lifetime dogma holds with a threshold of activation set around 3s for the lifetime of the antigen-receptor complex [59].

As of 2015, although the structural details of the early events in immune recognition by T cells remain elusive, the consensus around the lifetime dogma is thus holding and it is enabling physicists to build biochemically-explicit or phenotypic models of good biological significance [2, 23, 42]. It constitutes a rich paradigm for both theoretical and experimental biophysical considerations, and most of our discussion will be within this framework.

Fig. 1
figure 1

Three examples of immune recognition by T cells. a The natural system of antigen discrimination by T lymphocytes relies on the T cell Receptor. It is composed of an extracellular a/b domains that interact with peptide-MHC complex on the surface of antigen-presenting cells, and 6 intracellular domains containing 10 Immunoreceptor Tyrosine-based Activation Motifs (ITAMs) that get phosphorylated upon engagement with not-self ligands, and trigger T cell activation. b Recent developments in the field of immunotherapy introduced Chimeric Antigen Receptor (CAR): its extracellular domain is composed of a monomeric antibody that is specific for an antigen on the surface of the targeted tumor; its intracellular domain concatenates one ITAM-containing domain (z) and additional costimulatory domains (CD28 and 4-1BB) to induce robust T cell activation upon CAR engagement with its ligand. Such CAR combines the specificity of an antibody-based recognition, with robust signaling response. c A DNA-based Chimeric Antigen Receptor was recently proposed by Ron Vale and coworkers [59]: its intracellular domain is based on the concatenation of signaling domains-similarly to the CAR described in b. Its recognition platform is composed of a single-stranded oligonucleotide, that recognizes another single-stranded oligonucleotide by sequence complementarity. Rather than relying on natural pMHC ligands whose biophysical characteristics are not tunable, Vale et al.’s design can be engineered to achieve variable lifetime for the ligand-receptor complex. Such ingenious experimental design will be critical to probe the sufficiency and limits of the lifetime dogma

2.2 Setting the Problem for Physicists: What Does Absolute Immune Discrimination Entails?

In recent years, quantitative immunology has partially characterized the “phenotypic space” of T cells as a function of these parameters. A “golden triangle” characterizing immune response can be drawn [18], Fig. 2a. The first vertex of this triangle is ligand specificity, as encapsulated in the lifetime dogma described in the previous section: there exists an absolute discrimination threshold on ligand binding time, around 3–5 s.

The second vertex of the triangle is ligand sensitivity. Minute amounts of ligand are able to trigger response. Actually, there are strong experimental evidence that one foreign ligand can trigger immune response [32], so that the physical limit of detection is reached biologically, a situation reminiscent of other famous examples such as photon sensing [5]. Such high sensitivity might be functionally critical as the immune system can “snip” a pathogenic infection before it has a chance to expand.

The last vertex of this triangle is decision speed. We know from experiments that immune decision at the single cell level is taken within a couple of minutes  [68]. Note that this decision time depends quite strongly on ligand concentration [2] yet it is relatively fast to accommodate the limited time T cells spend scanning the surface of one antigen–presenting cell.

To reformulate this problem in a generic way, imagine a cell with a given set of identical receptors is suddenly exposed to L ligands, with identical binding time \(\tau \). We can plot in the \((L,\tau )\) plane a “response line”, characterizing the boundary between responding regions (“agonist” ligands) and non-responding ones (“non-agonist” ligands). The life-time dogma states that below some critical time \(\tau _c\), for all L, there is no response, while above \(\tau _c\), for any L there is response. This defines a vertical line in the \((L, \tau )\) plane. In biological terms, this is usually called a “specific” response to a category of ligands, and thus we call this line “specificity line”. This line corresponds to the first vertex of the golden triangle.

It should be noted immediately that it is not obvious how such response can be realized, especially it seems a priori impossible to have such specific response for very small L (we can not have response without signal!). Indeed, in any kind of biological settings, an obvious physical limitation is that there can not be less than 1 ligand presented at any given time. Immune cells can nevertheless trigger response when exposed to 1–3 agonist ligands: this defines a (horizontal) sensitivity line. The corresponding idealized lifetime dogma response line is displayed on Fig. 2b. A third dimension would be necessary to account for the third element of this triangle, speed, but is not drawn here.

Fig. 2
figure 2

Absolute discrimination for physicists. a Golden triangle for immune recognition. b Idealized response line corresponding to the immune golden triangle. c Two possible distributions for an Output variable directing immune decision, for networks exposed to random concentrations of ligands with identical binding times. In the top example, typical values for ligands with \(\tau =3\) s and \(\tau =10\) s overlap, so that it is not possible to discriminate between these ligands. In the bottom example, those distributions are well separated so that it is possible to choose a thresholding procedure on this Output to ensure absolute discrimination. d Typical response lines for various models discussed in this review. Decision threshold was adjusted to have \(\tau _c \sim 5\) s. Parameters for KPR and immune model from [23], and from [40] for adaptive sorting

2.3 Early Attempts at Modeling Immune Recognition: Kinetic Proofreading

Historically, McKeithan [47] was the first to propose a mechanistic model to underly the early events in immune recognition, accounting for such qualitative ligand recognition. The control of the quality of immune response by a single kinetic parameter \(\tau \) is reminiscent of the famous Hopfield-Ninio kinetic-proofreading (KPR) paradigm, first proposed in the context of DNA replication and protein translation [31, 52]. In the immune context, McKeithan pointed out that subsequent to TCR-pMHC interactions, the receptor does internally go through several rounds of phosphorylation. Calling \(C_n\) the ligand-receptor complex that has reached the \(n^{th}\) degree of phosphorylation in the cascade, L the ligand and R the receptor, simplified equations for a continuous model of this process are:

$$\begin{aligned} \dot{C}_0=&-(\phi +\tau ^{-1} ) C_0+\kappa \Big (L-\sum _{i} C_i\Big ) \Big (R-\sum _{i} C_i\Big ) \end{aligned}$$
(1)
$$\begin{aligned} \dot{C}_n=&-(\phi +\tau ^{-1} ) C_n+\phi C_{n-1}, 1<n<N\end{aligned}$$
(2)
$$\begin{aligned} \dot{C}_N=&- \tau ^{-1} C_N+\phi C_{N-1} \end{aligned}$$
(3)

where \(\kappa \) is the association rate between ligand and receptor and \(\phi \) the phosphorylation rate in the cascade (here and in the following, we make a mean field approximation and concentrations are measued in units of “molecule per cell”). Parameter \(\tau \) corresponds to binding time of ligand to the receptor. It appears in equations for all \(C_n\)s, accounting for the hypothesis that after unbinding of the pMHC ligand, receptor would be quickly dephosphorylated and the phosphorylation cascade would need to restart from “ground zero” (i.e. \(C_0\)).

At steady state, assuming R is far from saturation and that \(\phi <<\tau ^{-1}\), one can easily derive that the last complex \(C_N\) thus has concentration scaling as \(L \tau ^{N+1}\): this is the usual geometric dependency characteristic of kinetic proof–reading with N steps. Its role is to amplify difference between ligands: with only 2 steps, for the same initial ligand concentration, one can get up to 6 orders of magnitude in the difference of concentration of \(C_N\) for self ligands (\(\tau =0.1 \quad s\) ) vs agonist ligand (\(\tau =10 \quad s \)). So if immune decision is taken via a downstream thresholding mechanism, physiological concentration of self ligands can not trigger immune responses even though one single agonist ligands theoretically can (see response line on Fig. 2d).

However, there are several quantitative shortcomings for a simple proofreading model. First, it is well known that to work efficiently, KPR needs to be very slow, which renders it incompatible with the observed fast response times of adaptive immune responses [2]. Second, the response line of KPR for any reasonable number of proofreading steps with realistic decision speed simply does not account for the observed specificity in terms of binding time \(\tau \), as illustrated on Fig. 2d. Finally, mixtures of ligand with different binding times will yield purely additive response to a kinetic proofreading mechanism, and thus would not account for more puzzling (but yet fundamental) aspects of immune response such as antagonism, where some non-reactive ligands can actually inhibit response of not self ligands  [2].

Thus, one needs to augment the traditional KPR scheme with feedback regulation in order to be able to account for our “golden triangle” (specificity, sensitivity and speed) that characterizes the early events of T cell activation.

2.4 In Silico Evolution and Adaptive Sorting

The most counter-intuitive and puzzling property of immune response is its high specificity. The reason is that we would expect a priori that some shorter binding time could be “compensated” by higher ligand concentrations. More quantitatively, similar to the kinetic proofreading model, one would expect in general that any output O of a general signaling would behave as \(O=f(L,\tau )\), where f is a monotonic function of both \(L, \tau \). But then how could we have a sharp process so that, on the response curve, a small decrease in \(\tau \) (from agonist to self) leads to a change of L or several orders of magnitude?

To answer this question, we turned to in silico evolution [40] (a review of this method can be found in [21]). The idea is to simulate a Darwinian process on a space of possible models to select for absolute discrimination. Considering a population of biochemical networks (typically 30), our algorithm randomly mutates networks. Possible evolutionary moves are inspired by proofreading-based models of immune recognition, and consist in addition/removal of proofreading steps, as well as addition/removal of internal phosphorylations and dephosphorylations, or of kinase/phosphatases. For selection, we need to define a scoring or “fitness” function, and we chose to use mutual information as explained below.

Assume a cell is exposed to one type of ligands, binding time \(\tau \), with probability \(p_\tau (L)\). We want discrimination to be efficient over several orders of magnitude in L concentration. In the absence of any other information we choose \(p_\tau (L)\) to be uniform on a log scale, within physiological concentration range. Such a choice is also consistent with the well-known fact that concentrations in cells are distributed log-normally [2].

Consider an Output variable O.Footnote 1 To each couple \((L,\tau )\) corresponds a distribution of output variable O characteristic of the signalling pathway, that we call \(p_\tau (O|L)\). Since our problem for absolute ligand discrimination implies immune detection independently of ligand concentration, let us marginalize over all possible ligand concentrations and define a probability distribution for this Output associated to a binding time \(\tau \):

$$\begin{aligned} P_\tau (O)=\int p_\tau (O|L)p_\tau (L) dL \end{aligned}$$
(4)

This probability distribution is then a pure function of the binding time of the ligand. Good ligand discrimination will be possible only if there is very little overlap between distributions corresponding to different \(\tau \)s (see Fig. 2c for an example of distributions with two different ligand types).

A practical way to use this for in silico evolution is to consider a situation where cells can be exposed to two different types of ligands (say \(\tau =3s\) and \(\tau =10s\) to fix ideas). To maximize selection for efficient discrimination, we assume equal probability for observing these two \(\tau \)s. Then, based on computed distributions such as the ones on Fig. 2c, we compute mutual information between the Output and \(\tau \), based on \(P_\tau (O)\). A mutual information of 1 bit means that perfect discrimination is possible.

This evolutionary procedure quickly converges to a very simple scheme described in [40], that we called adaptive sorting (Fig. 3a). Simplified continuous equations for adaptive sorting are:

$$\begin{aligned} \dot{C}_0=&\,\kappa \Big (L-\sum _{i} C_i\Big ) \Big (R-\sum _{i} C_i\Big ) -(\phi _K(C_0)+\tau ^{-1} ) C_0,\end{aligned}$$
(5)
$$\begin{aligned} \dot{C}_1=&\,\phi _K(C_0) C_0 - \tau ^{-1} C_1, \end{aligned}$$
(6)
$$\begin{aligned} \phi _K(C_0)=&\,\frac{\phi }{C_0+C^*} \end{aligned}$$
(7)

The basis of this network is a one-step kinetic proofreading process, that is modulated by regulation of the phosphorylation rate \(\phi _K\) from \(C_0\) to \(C_1\). This term is a Michaelis-Menten function that can be interpreted as a repression by the first complex in the cascade (\(C_0\)) of kinase K responsible for its phosphorylation. Total contribution of phosphorylation \(\phi _K(C_0) C_0\) therefore contains two \(C_0\) dependency: a direct linear increasing contribution \(C_0\) (substrate of the phosphorylation), and an indirect decreasing \(\phi _K(C_0)\) (regulation of phosphorylation by substrate), thus encoding a so-called incoherent feedforward loop [45]. In the limit of high \(C_0\) (and thus high L), those \(C_0\) dependencies compensate yielding an output at steady state from Eqs. 67:

$$\begin{aligned} C_1=\tau \phi _K(C_0) C_0 \simeq \phi \tau \quad \mathrm{if} \quad C_0>>C^* \end{aligned}$$
(8)

The later expression is independent from the amount of ligand presented, and then is a pure function of binding time as illustrated on Fig. 3a. Any thresholding process on \(C_1\) can thus efficiently discriminate between binding times. Response line of network is illustrated on Fig. 2d for a simple thresholding process on \(C_1\) (“Adaptive sorting N=1”), in close agreement with the idealized response from Fig. 2b.

It should be stressed that there are two very important biochemical assumptions related to kinase K in adaptive sorting: i. it should diffuse rapidly inside the cell and ii. it should belong to a pool shared by all receptors. Since any bound receptor can deactivate K, total K thus aggregates global information over multiple bound receptors. K in turns tune local state of receptors, so that the total activity of the kinase K is a decreasing function \(\phi _K(C_0)\) of total \(C_0\). Overall, K effectively couples the different receptors, which explains the non-linearity of the output as a function of the ligand concentration.

Fig. 3
figure 3

Adaptive sorting networks. a One-step adaptive sorting topology evolved in [40]. Output concentration as a function of ligand for different values of \(\tau \) is represented. Dashed blue lines indicate thresholding for decision for deterministic systems used for simulations. b Adaptive sorting with more proofreading steps. Output concentration for agonist (\(\tau =10\) s) mixed with and \(10^4\) sub thresholds ligands illustrates antagonism ((response for pure ligands is qualitatively identical to the network of panel A and thus is not shown) (Color figure online)

Revisiting the golden triangle, by design, adaptive sorting realizes specificity (i.e. discriminates between different \(\tau \)s) in the limit of high ligand concentration. Sensitivity (to small ligand concentration) depends on the relative values of \(C_0\) and \(C^*\): asymptotic limit of Eq. 8 is realized only when \(C_0\) is higher than \(C^*\). For lower value of \(C_0\), \(\phi _K\) is approximately constant and one recovers a one-step kinetic proofreading regime. So while discrimination very close to threshold can not be fully absolute for small L, stronger agonists still trigger response at low concentration as indeed visible on Fig. 2d. Speed is related to low number of molecules: considering the immune example, if a T cell is able to perform detection for ligand concentration as low as one ligand per cell, then one would expect a potentially deleterious sensitivity to stochastic fluctuations. A natural answer to this problem is to time-average response, but then it is not clear any more if a quick decision can be made. In the adaptive sorting mechanism, if we assume that \(C_1\) is activating a downstream slow output, it can be shown that indeed, intrinsic fluctuations can be averaged out within tens of seconds, which is then compatible with the experimentally observed immune decision time and the appropriate sensitivity and specificity of the TCR signalling pathway [40].

2.5 Antagonism

In the previous section, we have considered discrimination between ligands with different binding times when only one type of ligands is presented. However, a more realistic immunological situation is that agonist ligands are presented simultaneously with many sub threshold (self) ligands. So cells not only need to discriminate agonists from self ligands, but should also detect agonists presented within many self ligands.

It turns out that the adaptive sorting scheme presented in Fig. 3a does not perform well: addition of few sub thresholds ligands considerably decreases output concentration for the same agonist concentration. The fundamental reason is due to the coupling of K between multiple receptors discussed in previous section. Imagine self ligands are added, calling Ds corresponding complex (quality \(\tau _s\)), then we have for the phosphorylation activity:

$$\begin{aligned} \phi _K(C_0+D_0)=\frac{\phi }{C_0+D_0+C^*} \end{aligned}$$
(9)

and the total output concentration is

$$\begin{aligned} C_1+D_1=\phi _K (C_0+D_0) (\tau C_0+\tau _s D_0) \end{aligned}$$
(10)

For \(\tau =\tau _s\), we naturally recover the same result as before, but if \(\tau _s<<\tau \), the total output concentration clearly is much lower than the asymptotic response \(\phi \tau \), an effect called antagonism. Intuitively, the “self” ligand titrates the kinase necessary for the proofreading step, akin to a “dog in the manger” effect described in other immune contexts [62]. If response is due to a thresholding effect, as a consequence, many more agonists ligands would be required to trigger response. In terms of response line of Fig. 2b, while specificity is conserved, the system loses sensitivity to minute concentrations of ligands.

It can be shown mathematically that antagonism is a necessary consequence of absolute discrimination, qualifying as a “phenotypic spandrel” [22]. Intuitively, absolute discrimination necessarily requires some internal variable (similar to kinase K) to discriminate between ligands with different \(\tau \)s irrespective of their concentration, and as a consequence antagonism will always occur when those internal variables are activated by subthreshold ligands. A model performing perfect absolute discrimination is presented in [22], as well as a simple categorization of antagonistic effects, that can be increased or mitigated as \(\tau \rightarrow 0\) depending on the model considered.

In particular there is a simple way to minimize the range of binding times with strong antagonism in the model of Fig. 3a, by the addition of a short upstream proofreading cascade as discussed in  [40]. Assuming now that kinase K is activated after m proofreading steps, we have

$$\begin{aligned} \phi _K(C_m+D_m)=\frac{\phi }{C_m+D_m+C^*} \end{aligned}$$
(11)

But we have \(C_m\propto \tau ^{m+1}\) and \(D_m \propto \tau _s^{m+1}\) so that \(C_m >>D_m\) if m is high enough, even if many self ligands are presented. As a consequence kinase K is barely influenced by complex \(D_m\). As said before, there nevertheless still is some antagonism close to threshold when \(\tau _s \sim \tau \) which actually constitutes a smoking gun for absolute discrimination mechanisms (see next section for experimental evidence and [22] for a more careful study) . Antagonism is illustrated on Fig. 3b for a model with 4 proofreading steps (response line is displayed on Fig. 2d).

While antagonism is reduced by addition of proofreading steps, there are however other trade-offs appearing in the system: for instance adding too many proofreading steps might reduce the final concentration of the output too much, which creates several downstream problems in terms of response times very similar to what happens in McKeithan’s KPR model (see Supplement of [41] for discussions of this effect).

3 Biologically Realistic Models for Immune Detection

As we have seen in previous section, generic solutions to the problem of absolute discrimination are now available within the simple adaptive sorting framework. But one naturally wonders if biochemical reactions in actual T cells correspond to any of such Platonician view, and, if not, if they can be related to it in any ways.

Adaptive sorting elaborates on a small number of kinetic proofreading steps. It should thus be first pointed out that many (if not all) molecular components of the original McKeithan model [47] are indeed present in actual cells. Such “realistic” model explains its popularity and its use as a blueprint for many current models of immune decision [42]. The phosphorylation cascade would correspond to the known phosphorylation of Internal Tyrosine Activation Motifs (a.k.a. ITAM) containing chains in the TCR complex  [35]. Rapid dephosphorylation upon unbinding would fit the kinetic segregation mechanism, specifying that generic phosphatases are segregated only upon ligand-receptor interaction [13] .

3.1 Negative Feedback and Antagonism

Adaptive sorting based models are expected to include an extra negative feedback, (i) buffering for ligand concentration to realize adaptation/absolute discrimination, and (ii) associated with ligand antagonism. This has been indeed observed and characterized in seminal papers by Dittel et al. [14], and Stefanova et al. [58]. These papers establish the existence of a negative component in ligand detection via the Tyrosine-protein phosphatase non-receptor type 6 (PTPN6), also known as Src homology region 2 domain-containing phosphatase-1 (SHP-1), and its role in ligand antagonism. In details, Dittel et al. quantified antagonism by measuring T cells responding to agonists in conjunction with increasing concentrations of antagonist ligands (which decrease the magnitude of immune response) in T cells endowed with two separate TCRs. Their measurement also suggested that antagonism is associated with SHP-1 association with TCR and the ensued dephosphorylation of ITAMs. Importantly, the antagonistic effect is “infectious” within the cells: recruitment by SHP-1 spreads from receptor bound to antagonistic ligands even to unbound receptors, suggesting a global coupling of receptors via SHP-1. Involvement of SHP-1 in antagonism is definitely established in [58]. They show in particular how moderate increase of SHP-1 activity gives several orders of magnitude increase of antagonism potency. Antagonistic ligands are also shown to recruit more rapidly SHP-1 compared to agonist ligands (which only recruit SHP-1 in a later time). They finally show that agonist ligand specifically trigger a positive feedback loop, mediated by ERK-1, and that ERK-1 specifically inhibit SHP-1 recruitment by the TCR, thus explaining the kinetic difference between ligands.

3.2 Modeling Negative Feedback, Approximate Adaptive Sorting

The first model combining these different aspects was published by one of us (G.A–B) in collaboration with R.N Germain in 2005 [2]. This work combined a kinetic proofreading backbone with a SHP-1 mediated feedback and an ERK-1 positive feedback. We included most known components of the system, including different co-receptors, kinases, and eventual phosphorylation cascade in a very complex mathematical framework including around 300 dynamical variables. This model succeeded in satisfying the previously described golden triangle as a modeling target. It also establishes a clear linear hierarchy for antagonism, where stronger antagonists are ligands with binding times just below critical threshold \(\tau _c\). Most importantly, the full-blown model was tested with new experiments quantifying precisely antagonism strength and decision time of the network, and validating predictions from the model.

Fig. 4
figure 4

A simplified model for immune detection. a Sketch of some of the molecular players and their interactions. b Simplified model based on kinetic proofreading combined with negative feedback. c \(C_N\) concentration as a function of ligand for different values of \(\tau \). d Total output concentration as a function of agonist (\(\tau =10\) s) ligand in presence of \(10^4\) sub thresholds ligands, showing antagonism. e Behaviour of the network when S total concentration is doubled, showing collapse at high ligand concentration. f Experimental quantification of antagonism, reproduced from [23]. OVA is agonist, E1 is weak antagonist, G4 is strong antagonist

While the Altan-Bonnet/Germain paper could explain many experimental features, it was not clear at that time if the full complexity of the model was required to understand the system. Can we see the “biological wood emerg[ing] from the molecular tree” as nicely formulated by Gunawardena  [27]? A first simplified model was proposed in 2008 by Lipniacki et al. [43], but still contained around 40 variables, and was especially focusing on possible bistable properties of the system via ERK positive feedback loop. In 2013, we published in collaboration with Voisinne et al. a considerably simplified version of this model [23], focusing on the part of the decision network upstream of ERK. Schematic of the model is displayed in Fig. 4b. Equations for this model are

$$\begin{aligned} \dot{S}= & {} \alpha C_1\left( S_T-S\right) -\beta S \end{aligned}$$
(12)
$$\begin{aligned} \dot{C_0}= & {} \kappa \Big (L_1-\sum _{i=0}^N C_i\Big )\Big (R-\sum _{i=0}^N C_i\Big ) \nonumber \\&+\,(b+\gamma S)C_1-(\phi +\tau ^{-1})C_0\end{aligned}$$
(13)
$$\begin{aligned} \dot{C}_j= & {} \phi C_{j-1}+(b+\gamma S)C_{j+1}-(\phi +b+\gamma S+\tau ^{-1})C_j\end{aligned}$$
(14)
$$\begin{aligned} \dot{C}_N= & {} \phi C_{N-1}-(b+\gamma S+\tau ^{-1})C_N \end{aligned}$$
(15)

Like many other models, the backbone of this coarse-grained model relies on a kinetic proofreading backbone. Then, a simple negative feedback is included, in the form of the activation of a global phosphatase called S (corresponding biologically to SHP-1) by a single complex in the phosphorylation cascade.

This model essentially recapitulates all observations made in 2005 [2], satisfies the golden triangle, and is semi-analytic as detailed in the Supplement of [23]. In details, linearity of Eqs. 1315 in the limit of unsaturated receptors allows to easily express concentrations of all complex \(C_j\) as a function of S concentration. Defining \(r_+>r_-\) the roots of the characteristic equation that corresponds to Eq. 14

$$\begin{aligned} 0 =\phi +(b+S)r^2-(\phi +b+S+\nu _1)r \end{aligned}$$
(16)

we can show that

$$\begin{aligned} C_j\simeq (1-r_-) \frac{\kappa R L \tau }{\kappa R \tau + 1}. r_-^j \end{aligned}$$
(17)

for \(0<j<N\) and :

$$\begin{aligned} C_N\simeq \frac{\kappa R L \tau }{\kappa R \tau + 1}\left( 1-\frac{r_-}{r_+}\right) r_-^{N} \end{aligned}$$
(18)

One can then use Eq. 12 that relates S to \(C_1\) to close the system, which gives a fourth order polynomial equation that can be most easily solved numerically.

Response line of the model is illustrated on Fig. 2d (“Immune model”). Interestingly, this network actually performs an approximate adaptive sorting: it flattens out the concentration of the output \(C_N\) over several orders of magnitude of ligand concentration L, on different plateaus as a function of binding time \(\tau \) as shown on Fig. 4c . Thus adaptive sorting appears to be the core principle of early immune detection, as could have been guessed in retrospect from first principles constraints evolved in silico. It is known that adaptation can be performed either via feedback or feedforward interactions, so that the network from Fig. 4b can be seen as a feedback version of the feedforward adaptive sorting network presented in Fig. 3 . Another difference with adaptive sorting as discussed before is that the same kinase and phosphatase is shared between all proofreading steps, while networks such as the ones displayed on Fig. 3 require one specific kinase for the step actually performing adaptation. Our model implements approximate adaptive sorting with a minimum set of shared unspecific kinases and phosphatases, and as such can be considered as optimum in terms of parsimony of biochemical species used. Full stochastic simulations show that decision times are consistent with data from [2] and can be as low as 30 s for strong agonists. Finally, only adaptive sorting models with several proofreading steps such as the one of Figs. 3b and 4 give antagonistic properties similar to experimental data Fig. 4f [23].

Intuititive analytical limits can be obtained when negative feedback does not saturate, which is biologically relevant given the observed phenotypic variability (see [20] and next section). For instance, if we assume that phosphatase S (corresponding to SHP-1) is in excess and that \(b<<1\) , in the limit of high ligand, we can show that \(C_1 \propto S \propto \sqrt{L}\) using Eqs. 12, 16 and 17, and characteristic equation 16 gives \(r_+\sim 1\) and \(r_-\propto \phi /S\). The latter expression of \(r_-\) is particularly interesting, being a ratio of the forward phosphorylation rate \(\phi \) over the backward dephosphorylation rate scaling like S. From Eq. 18, it appears that the balance of these two rates essentially determines level of response, which fits the intuitive biological idea that the main role of the negative feedback is to restrain progress in the proofreading cascade.

Putting everything together, for high concentration of agonist ligands, irrespective of binding times, a simple asymptotic relationship holds

$$\begin{aligned} C_N\propto L^{1-\frac{N}{2}} \end{aligned}$$
(19)

connecting ligand concentration L and \(C_N\), with N the number of proofreading steps. So if N is high enough, \(C_N\) decreases with ligand concentration L in this model. This seems rather counter-intuitive, and predicts that if feedback is strong enough, immune response at high agonist ligand concentration could thus disappear (see Fig. 4e, full stochastic treatment can be found in [23] ). Indeed, we tested and verified this prediction in cells with high level of SHP-1 [23]. Furthermore, if SHP-1 level is increased above a couple a few-fold, negative feedback essentially dominates for all ligand concentration, and response is fully abolished, a “digital effect” indeed experimentally observed in [20] and first predicted with the Altan-Bonnet Germain model [2].

3.3 Beyond Absolute Discrimination?

Other modelling strategies for immune recognition by single cells have been proposed. Major differences in most models is that \(\tau \) dependency is not considered as sharp as what is assumed here, and modulation by other parameters is considered. For instance it has been suggested that there might actually be an optimal time of interaction (i.e. response would decrease at higher \(\tau \)s) [34]. This effect is debated, since strong binding TCRs evolved in vitro, with \(\tau \)s of the order of 50 to 100 s conversely yield an exceptionally potent response [15]. Lever et al. recently reviewed and compared 5 families of models [42], including the one from previous section, and suggested that a mid-range \(\tau \) can be simply explained by a kinetic proofreading with “limited signalling”, meaning that the signalling complex in the cascade gets inactivated with constant rate. However, this model does not include the negative feedback described here, and thus do not display antagonism. Another issue of interest is the regime of fast “on rates”’: for such ligands, response is experimentally observed even for low \(\tau \)s. It has been suggested that competion between binding/rebinding and diffusion of receptors on the membrane would actually give an effective higher \(\tau \) [25], which therefore would not necessarily contradict the life-time dogma but rather complexify it. A recent review [10] summarizes molecular players of the systems and relationships to proofreading mechanisms, discussing in particular the role of co-receptors and various kinases, and possible other mechanisms that could explain better high sensitivity such as serial triggering by one ligand.

Other theoretical models have explored biophysical limits or more refined computations. For instance, a related question is the minimum theoretical decision time for such process ( “decision on the fly”). As said in introduction, a trade-off between accuracy and precision is expected, and this is a potentially acute issue in an immune context. If decision takes too much time, a foreign ligand might be gone before proper response is activated. If accuracy is decreased, there is potential for auto-immune response. Exact results for this problem have been recently obtained by Siggia and Vergassola  [56]. They study the problem of detection of change of composition of ligand mixtures. It turns out that Wald sequential probability test ratio on the log likelihood of a sequence of binding events can be used to take decision. Strikingly, simple phosphorylation networks reminiscent of network controlling early immune detection can naturally implement biological versions of this test [56]. An important aspect to optimize decision time is that decision here is made “dynamically”, while networks performing decision here essentially work at steady state (see nevertheless Supplement of [40] for discussions of transient behaviours). Such dynamical sensing could also help in the regime of very small ligand concentrations, maybe in conjunction with serial triggering [10]. An even more general problem is chemodetection of ligands of different qualities in fluctuating environments, where mixture composition can vary with time. If there is a high probability of observing ligands just subthresholds, antagonism is helpful to essentially buffer spurious fluctuations of ligands [41], reminiscent of what is observed in olfaction [63]. Finally, some models have recently explored theoretically the possible network topologies maximizing information transmission on ligand mixtures, and have pointed out the need of both proofreading and internal feedbacks [49, 57].

4 Ligand Recognition by a Population of Lymphocytes: More is Different?

4.1 Tackling Cell-to-Cell Variability in Immune Responses: Statistical Physics to the Rescue?

There are still many challenges to fully understand early immune response. In particular, once a cell has been activated, it appears that further processing occurs at the immune population level. Modeling the early events in immune detection potentially is very topical for statistical physics as it involves the accounting of cell-to-cell variabilities, modeling immune responses with distribution of cells, and the testing of their functional significance.

Starting with the seminal work of Elowitz et al. [17] a subfield of biological physics has grown to address the emergence of cell-to-cell variability.Footnote 2 Extrinsic variability stems from many sources (e.g. all sources that do not relate to the stochasticity and low-copy number in biochemical reactions): they can include heterogeneity in the cellular environment, as well as epigenetic variation and metabolic fluctuations from cell-to-cell. Intrinsic contributions can be best studied in an isogenic population of cells and are the physical consequences of stochasticity in the biochemical reactions. As a consequence, cells can display broad distributions of phenotypes based on the heterogeneity of abundance of key regulatory proteins (receptors, kinases and phosphatases in the context of signal transduction network; transcription factors in the context of gene regulation).

First forays to address the functional significance of cell-to-cell variability in biological systems were focused on bacterial responses. From chemotaxis (the ability to orient motions in gradients of nutrients or chemokines) to competence the ability to acquire new genomic materials), researchers demonstrated that, indeed, varied levels of key proteins could map into varied phenotypes [8, 37]. Such observations were used at first as new quantitative constraints to validate biochemical models of biological regulation.

Concomitantly, these observations demonstrated that a “mean-field” measurement and model of a population of cells might have serious shortcomings when predicting global responses. One example where such cell-to-cell variability was found to be critical is in the study of bacterial antibiotic resistance. For example, Balaban and coworkers introduced a microfluidic device to track the proliferation and death of bacteria under antibiotic treatment [4, 54]. A sub population of isogenic bacteria were found to resist to antibiotics, simply by being a different metabolic state compared to their sister cells at the time of exposure to antibiotics. Such process of distributed response based on distributed phenotype at stimulation time was described as an optimal strategy to tune responses in a fluctuating environment, by matching probability of phenotypic switching to probability of environmental changes  [39]. Indeed, there is such a fundamental mismatch between the necessary response time (cells must exclude antibiotics on very short timescales) and the evolutionary constraints (it would take a large amount of time and a very low probability for cells to generate a solution to the problem of antibiotics resistance), that cells are better off diversifying their phenotype pre-emptively such that a solution is readily accessible when the antibiotic perturbation applies.

4.2 Phenotypic Variability of T Cell Ligand Discrimination

Similar constraints are at play in the context of the immune response. There is a similar disconnect between the dynamics of biological problems at stake (eradicating a fast-replicating fast evolving pathogen vs. generating an adaptive immune response). In particular, recent measurements by the Jenkins and Davis lab [28, 51] have beautifully illustrated the number of constraints for a good immune response. Lymphocytes can rapidly proliferate (by factors of \(10^3\) to \(10^6\)) and relax back to low numbers for the memory pool. Such explosive proliferation is critical to match the challenge posed by fast-proliferating pathogens. However, surprisingly, the number of T cell clones that can recognize a specific antigen (e.g. flu peptide) is very small, with 10 to 100 clones per individual (mouse or human). This is particularly relevant as pathogens can be “cunning” and attempting to invoke multiple escape mechanisms. Hence, rather than a deterministic adaptation of the immune response to the pathogenic challenge and optimization of detection at the cellular scale, one can conjecture that the immune system relies on some degree of statistical randomness in order to diversify responses within an isogenic population of lymphocytes.

The functional underpinnings of such cell-to-cell variability can be readily detected in the context of the early events of immune detection. Indeed, researchers have been using the vast panoply of antibodies to quantify protein and phospho-protein levels in cells, as well as the single-cell resolution of cytometry (using fluorescence-based or mass-spectrometry-based approaches), in order to quantify the cell-to-cell variability of responses to external stimuli. In a nutshell, if the abundance of protein X is limiting in the activation of Y into Y* (e.g. phosphorylation), then measuring and correlating X with Y* abundances at the single cell level will reveal heterogeneity of the response. Such cell-to-cell variability analysis has been carried out in the context of immune detection to demonstrate the sensitivity of ligand discrimination to varied levels of signaling components (e.g. CD8 and SHP-1) [20], as well as the sensitivity of T cells response to the cytokine IL-2 to varied levels of cytokine receptors (e.g. CD25, CD122, and CD132 a.k.a. IL-2R\(\alpha \), IL-2R\(\beta \) and \(\gamma _{C}\)) [12]. Note that such parameter sensitivities were first predicted from the dynamical model of the signaling cascades at play, and Cell-to-Cell Variability Analysis (CCVA, a new methodology introduced in [12]) validated these predictions quantitatively. Similarly, single-cell analysis has been carried out in the space of phospho-proteins by the Pe’er & Nolan labs to quantify the strength of connections within the TCR signaling pathway [38]. There, analysis of single-cell measurements using overall, resolution of immune responses at the individual cell level has highlighted the large phenotypic variability in the signaling response of individual lymphocytes: such observations must then be interpreted functionally to map out how T cells diversify their response to optimize its detection capabilities.

Fig. 5
figure 5

Immune recognition over long timescales may involve collective synchronization of T cell activation. While T cells perform an absolute (immune) discrimination at the individual cell level, their heterogenous expression of signaling components leads to unreliability in their threshold of activation. Moreover, experimental work on lymphocyte signaling demonstrated that individual T cells respond in an all-or-none (digital) such that their output contains limited information. Additional mechanisms must be at play to “proofread” individual immune recognition and/or generate plastic immune responses. a Feedback regulations based on cytokine secretion and consumption constitute an efficient solution to the limitations of individual T cells [19, 29, 60, 67]. In particular, T cells can rely on such cell-to-cell communications to achieve higher levels of immune recognition. b Tkach et al. found a surprising scaling law, whereby the maximum concentration (\([IL-2]_{max}\)) of the IL-2 cytokine released by a population of T cells scales almost linearly with the amount of antigens that is present in the system, practically independently of the number of T cells present in the system. c Such analog scaling at the population level was found to derive through a coherent feed-forward loop of Type 4, using the nomenclature introduced by Alon and Mangan [45]

4.3 Deriving Reliable Immune Responses from Unreliable Responses of Individual Cells

The cell-to-cell variability of lymphocyte response presents a challenge for our current cell-centric understanding of self/not-self discrimination in the immune system. Indeed, if each individual T cell makes a rapid, sharp yet utterly variable decision to respond to a ligand (see 4.2), one could anticipate many “mistakes” whereby T cells would respond to self tissues and trigger an auto–immune disorder.

We conjecture that it is the integration of the responses of individual T cells over longer timescales (\(>\)hour) that may correct for the “sloppiness” of individual T cell on short timescales (\(\approx \) minutes). Such integration can be carried out through cell-cell communications e.g. through secretion and consumption of cytokines. In particular, Tkach et al. measured experimentally how much cytokine accumulates in the supernatant of T cells that were activated in vitro [60] (Fig. 5). A surprising scaling law for this \(IL-2_{max}\) capacity with the regards to the quantity of antigens \(\#(pMHC)\) and the size of the T cell population is observed

$$\begin{aligned}{}[IL\text {-}2]_{max} \propto \left( \#(pMHC)\right) ^{0.8}.\left( N_{T cells}\right) ^{-0.1} \end{aligned}$$
(20)

Hence, T cells are able to output cytokines in a near-linear scalable manner across four decades of \(\#(pMHC)\): this is quite a striking observation when considering the limited dynamic range of individual T cell response and more generally of any biological system. Moreover, the independence of output with the number of T cells \(N_{T cells}\) in the system is remarkable considering that cytokine secretion is a priori an extensive variable: this finding implies that the overall capacity of the system (in terms of maximal cytokine concentration) is an intensive variable. Deriving an intensive output (i.e. an output that is independent from the size of the system) for a population of T cells has been observed in very related experiments [29]. Hart et al. measured the cell expansion of a population of CD4\(^+\) T cells after activation through their antigen receptor pathway. Hart et al.’s model focused on the cytokine IL-2 whose divergent function (cell proliferation and apoptosis) would explain such robust system-size-independent output.

Understanding mechanistically the emergence of such scaling laws implied revisiting our classical biochemical understanding of the IL-2 cytokine pathway. It was well established that T cells would essentially shut down their IL-2 production as soon as they secreted and built up a pool of shared cytokine. Biochemically, this implied that the IL-2 output of a population of T cells would have a ceiling of 10 pM, as the concentration of cytokine required to induce IL-2 signaling and subsequent shutdown to IL-2 secretion [66]. However, detailed quantitation of the biochemistry of the system unraveled a negative cross-talk from the antigen signaling response to the IL-2 response pathway [60]. When measuring the phosphorylation of the STAT5 transcription factor downstream of IL-2 sensing, Tkach et al. uncovered a surprising convolution between response to pMHC antigens and \(IL\text {-}2\) cytokine:

$$\begin{aligned} \#(pSTAT5)\propto \frac{\#(IL\text {-}2R\beta /\gamma _C)}{1+\frac{K_D}{\#(IL\text {-}2R\alpha ).[IL-2]}}.\frac{1}{1+\frac{\#(pMHC)}{\#(pMHC)_0}} \end{aligned}$$
(21)

where \(\#(X)\) represents the number of X within the cell, \(\#(IL\text {-}2R\alpha )\) is the number of \(\alpha \) chain of the \(IL-2\) receptor, and \(\#(IL\text {-}2R\beta /\gamma _C)\) is the number of complexed \(IL\text {-}2R\beta \) and \(\gamma _C\) receptors on the surface of T cells –in a nutshell, \(\#(IL\text {-}2R\alpha )\) gets expressed in activated T cells to complete the signaling \(\#(IL\text {-}2R\beta /\gamma _C)\) pair into a signaling complex, upon \(IL\text {-}2\) binding [12]. Overall, this equation encapsulates the convolution between local and global responses (i.e. pMHC abundances and \(IL\text {-}2\) concentration respectively): this convolution is particularly significant as it regulates the off switch for \(IL\text {-}2\) production. This is also important because pSTAT5 in turns regulate IL-2, which means that there is a feedback between the local stimulation (by pMHC ) and the global readout (by \(IL\text {-}2\)) through the regulation of STAT5 phosphorylation. Overall, a combination of an incoherent feedforward loop was shown to be necessary and sufficient to explain the scaling law in the accumulation of \(IL\text {-}2\) in the milieu.

Such initial attempt by Tkach et al. [60] was based on detailed biochemical modeling with explicit biochemistry being implemented. Additional experiments revealed additional functional relevance for the secretion of IL-2, as a global regulator of self/not-self discrimination: Voisinne et al. probed the proliferation response of a population of T cells containing multiple clones of diverse specificity (i.e. a polyclonal population of cells). This study demonstrated that strongly activated T cell clones could induce the activation and proliferation of neighboring weakly activated T cell clones  [65]: this previously–undocumented phenomenon of propagating T cell activation was termed “T cell co-optation”. Similarly to previous studies [2, 19, 60], an experimentally-derived biochemically-explicit model of such multi scale integration was introduced to account for such lymphocyte co-optation. Overall, it was demonstrated that the addition of antigen signals (read locally through the TCR pathway) and cytokine signals (read globally through the IL-2 pathway) decides T cell fate. Such observations expand over longer timescales (days) and larger timescales (lymph nodes) what individual T cell can contribute in terms of immune response. This is particularly significant as it demonstrated that T cell activation can no longer be considered as a local property of antigen recognition, but gets decided by integrating multiple cues. The practical implication should not be understated when manipulating the response of a polyclonal population of T cells to force weak T cell clones to respond to antigens (e.g. in the context of tumors) may hold the key to tumor eradication. Future efforts will require phenomenological coarse-graining to allow better understanding of immune recognition at the level of the system.

Recent technical developments to monitor T cell responses at the individual cell level are enabling researchers to track the early events of immune activation, one cell at a time. Monitoring the differentiation of individual lymphocytes will certainly accelerate our study of the immune system, in particular when stochastic effects and phenotypic variability are necessary to explain the diversification in immune detection. Yet, in the context of the study of the immune system as a whole, it is the collective properties of the cells based on their cytokine communications and competition for antigen that shape the overall immune response.

Finally, on the pure theory side, some recent questions inspired by statistical physics include how the immune ligand landscape is matched by receptors/cellular diversity, and how collective decisions can be made within this framework, without entering into specific signalling details. A simplified probabilistic model of T-cell activation has been derived using extreme values theories (modelling binding energy of TCRs to single ligand), from which probability of false positive can be computed [7], and collective decision was shown to improve with the help of cell to cell communications. Mayer et al. very recently proposed a general framework, predicting that more receptors are needed for rate antigens, and strong cross-reactivity to correctly cover the full antigen landscape [46]. The future will certainly involve more statistical-physics based approaches that tackle the large number of lymphocytes and focus on such emergent properties.

5 Wrapping Up

To conclude, we presented an overview of some physics-inspired studies in immunology, focused on early recognition by T-Cells. Specifically, we discussed how the core problem of the adaptive immune response (the recognition of pathogen-derived antigens) must be addressed with quantitative models. Indeed, experimental results have now well established the biophysical underpinnings of self/not-self discrimination namely the fact that small increases in the lifetime of ligand-receptor complexes lead to large increases in ligand potency. Additionally, cells can respond very sensitively and with great speed. Quantitatively reconciling these three experimental aspects (so-called golden triangle) is a theoretical challenge with great conceptual and practical relevance (e.g. when fine-tuning T cell activation is required, as in cancer immunotherapies). We reviewed the current states of theoretical models, building upon the original kinetic proofreading scheme. In silico evolution of biochemical networks satisfying the golden triangle unraveled a minimal model that can reconcile all aspects of ligand discrimination. In particular, a proximal negative feedback (associated with the activation of a phosphatase) was found to be critically relevant to abrogate responses to self ligands (even in large quantities) while allowing responses to not-self ligands (even in small quantities). The functional pay-off of these models is to “predict” the existence of antagonism in immune recognition as well. Finally, we discussed how these models of ligand discrimination by T cells create new challenges in terms of understanding the phenotypic variability of isogenic populations of T cells, or in terms of accounting for the quantitative response to antigens when measuring T cell activation over long timescales. Similar collaborations between experimentalists and theoretical physicists will remain fruitful to expand our quantitative understanding of T cell activation to more complex issues in immunology (role of regulatory T cells, tuning of responsiveness according to inflammatory milieu etc.). More generally, we hope that these fundamental issues of immunology will spark the interest of statistical physicists, as the derivation and manipulation of large-scale immune response from the local activation of individual T cells remains poorly understood at the theoretical level.