Key words

1 Introduction

Optical methods provide powerful means to investigate the structure and function of many neurons simultaneously. Importantly, photons can be focused to, and imaged from, multiple neurons in parallel to control and detect their activity. Several new imaging strategies are developed every year, and neurophysiologists must navigate growing stacks of methods papers, microscope, and laser adverts to identify which modality can best achieve the scientific goals of their experiments. There are a broad set of competing requirements on the techniques used to image neuronal activity including speed, depth, spectral separation, robustness to scattering, and motion artifacts. The goal of this chapter is to distill the trade-offs driving choice of imaging strategy for all-optical experiments.

We begin by detailing two key challenges to imaging neuronal activity in intact brains. We then define signal-to-noise ratio and the concept of a “fluorescence budget,” which ultimately determines how small and fast of a transient signal can be resolved by a given imaging configuration. We then detail how the choice of fluorophore, in particular indicators of calcium and membrane potential, and imaging modality relate to the fluorescence budget, and the trade-offs inherent to the choice of each.

2 Key Challenges to Imaging Neuronal Activity

2.1 Challenge 1: Brains Are Three-Dimensional

Brains are three-dimensional (3D), while conventional microscopes image one plane at a time. A widefield fluorescence microscope (Fig. 1, left) excites fluorescence throughout a 3D volume and images fluorescence from a single plane. Unless the fluorophore itself is restricted to one plane (e.g., culture cell monolayers), fluorescence is excited outside of the plane of focus (Fig. 2, left). At the camera chip, this out-of-focus light reduces the contrast of the in-focus image. Worse, in the context of imaging calcium or voltage signals, the labeled out-of-focus cells have individually varying fluorescence time courses that contribute to the in-focus time course. This makes standard widefield fluorescence imaging with single-cell resolution often untenable in densely labeled 3D samples. Many techniques have been developed to restrict fluorescence excitation (two/three-photon, light-sheet microscopy) and/or collection (confocal variants) to a single plane of focus and/or remove out-of-focus fluorescence in post-processing by combining multiple images with structured illumination [22, 63, 73, 76]. The ability to localize fluorescence to a plane orthogonal to the optical axis is termed “optical sectioning.” This “optical section” can scan orthogonal to the plane of focus to build up a volumetric image plane by plane.

Fig. 1
Two diagrams illustrate a beam of laser passing through a mesh and observed through a tube lens and collectors for one photon epifluorescence and multiphoton epifluorescence.

Widefield, one-photon imaging (left) and multiphoton scanning (right) imaging modalities; not to scale

Fig. 2
Two diagrams, a and b, illustrate a ray of light passing through the image plane, tube lens, and objective lens, and touching the focal plane concerning the two different points of exits.

The challenges of 3D imaging in brain tissue. (a) Techniques that are not optically sectioning suffer from blur caused by the collection of light from out-of-focus planes. (b) Light rays can be scattered before exiting the tissue, changing their apparent origin. In aggregate, this leads to a blurring effect that drastically increases with depth. Reproduced from [88], CC BY-SA 4.0

2.2 Challenge 2: (Most) Brains Scatter Light

While certain model organisms (e.g., larval zebrafish, C. elegans) are transparent, most brains, especially mammalian brains, scatter light strongly. Fluorescence excitation efficiency decreases with increasing imaging depth as excitation light is scattered and absorbed, resulting in the degradation of the excitation point-spread function (PSF). Moreover, light excited in one spatial location can be scattered before collection and detected as though arising from somewhere else (Fig. 2, right). This degrades image contrast and confuses analysis of fluorescence time courses in adjacent areas.

Together, out-of-focus fluorescence and scattered fluorescence produce blurring, which drastically increases with imaging depth [46]. The need for optical sectioning and robustness to scattering has driven development and application of two- [30] and three-photon [45, 121] (or “multiphoton”) point scanning modalities, which achieve both. Due to non-linear dependence on excitation intensity, two- and three-photon fluorescence excited by high numerical aperture (NA) focusing generates femtoliter fluorescence volumes. Non-descanned fluorescence generated at the focus, scattered and non-scattered, is collected through the objective onto a large area detector (Fig. 1, right), commonly a photomultiplier tube. Two-dimensional images or 3D volumes are built up point by point by scanning the focus. This strategy enables calcium-based imaging of action potentials (APs) at depths up to 1 mm in mouse brains [59]. These techniques optically section and are robust to scattering; for structural imaging in live brains, this is as close as we have to perfect. Why, therefore, consider a different strategy for functional imaging? The next section introduces the key concept and guiding principle to designing and selecting a modality to image neuronal activity: the “fluorescence budget.”

3 Signal-to-Noise Ratio Is King: Fluorescence Budget

The principal consideration guiding optical neurophysiology systems is the signal-to-noise ratio (SNR) of time-varying fluorescence changes. Here the term signal-to-noise ratio most commonly refers to the ratio of the amplitude of a time-varying signal (S) transient to the “baseline noise” (σ), typically the root mean square of the signal that precedes or follows a transient peak (Fig.3). Note that this definition differs from how signal-to-noise ratio is often defined in scientific and engineering disciplines (see Note 1). Note also that in this chapter we are dealing specifically with the SNR of temporal, not spatial, fluorescence changes.

Fig. 3
A diagram of a waveform, where the horizontal axis is labeled as t and the vertical axis is labeled as s. The formula for the signal-to-noise ratio is given as s, divided by sigma.

Signal-to-noise ratio can be defined as the ratio of the amplitude of a time-varying signal (S) transient to the “baseline noise” (σ)

The temporal SNR quantifies the ease with which a fluorescence transient can be resolved from a noisy time-varying signal. The SNR is equal to the product of the fractional change in fluorescence with respect to baseline, or “indicator sensitivity,” multiplied by the total fluorescence collected per location (pixel or voxel), per integration period, the “fluorescence budget” (see Note 2):

$$ SNR=\frac{\Delta F}{F_0}\sqrt{F_0}. $$
(1)

ΔFF0 reflects the sensitivity of the indicator’s fluorescence to the physiological signal (e.g., membrane potential or calcium). The fluorescence budget, F0, is given by

$$ {F}_0=\phi \kern2.22198pt {F}_{gen}, $$
(2)

where ϕ (unitless) is the fluorescence collection efficiency and Fgen (in photons) is the fluorescence generated per location (pixel or voxel), per integration period. ϕ is determined by an assortment of factors dependent on the sample, such as imaging depth, wavelength-specific absorption, scattering length, and anisotropy, and specific to the collection optics, such as acceptance angle (numerical aperture, NA) and detector efficiency [129]. ϕ quantifies what fraction of fluorescence is collected by the imaging system. The fluorescence generated is

$$ {F}_{gen}={R}_{Fl}\kern2.22198pt {C}_{Fl}\kern2.22198pt {V}_{Fl}\kern2.22198pt \Delta t, $$
(3)

where RFl is the fluorescence rate (photons/s/molecule), CFl is the fluorophore concentration (molecules/m3), VFl is the per location fluorescence excitation volume (m3), and Δt is the integration period (s). The fluorescence rate is given by

$$ {R}_{Fl}={\sigma}_n<{I}^n> $$
(4)

in photons/s where σn is the n-photon brightness (1-, 2-, or 3- photon), the wavelength-dependent product of cross-section and quantum yield (m2n sn−1/photonsn−1), and < In >  is the time average of the fluorescence excitation intensity raised to the n-th power (photonsn/m2n/sn).Footnote 1 Ideally, < In >  is maximized to increase SNR up to the limit of photobleaching, heating, and in the case of ultrafast pulsed excitation, non-linear damage thresholds. Substituting gives

$$ SNR=\frac{\Delta F}{F_0}\sqrt{\phi \kern2.22198pt {\sigma}_n<{I}^n>{C}_{Fl}\kern2.22198pt {V}_{Fl}\kern2.22198pt \Delta t}. $$
(5)

The two imaging modalities introduced in the previous section, widefield and multiphoton scanning, are at opposite extremes in terms of fluorescence budget. Widefield integrates photons from all locations in a two-dimensional frame simultaneously throughout the frame period, and hence:

$$ \Delta t=\frac{1}{R_{acq}{N}_Z}, $$
(6)

where Racq is the frame acquisition rate and NZ is the number of planes imaged per volume. In contrast, multiphoton point rastering modalities integrate fluorescence from each location for only a small fraction of the acquisition period

$$ \Delta t=\frac{1}{R_{acq}{N}_X{N}_Y{N}_Z}, $$
(7)

where NX and NY are the lateral frame dimensions.

Consider the case where NZ = 1 to image a single plane, such that Racq is the frame rate, and compare the widefield to the multiphoton raster scan case. For the widefield case, Δt = 1∕Racq. For the multiphoton scanning case, digitizing each frame for example as NX = NY = 256 pixels, we see that Δt = 1∕(2562Racq). If we base our SNR comparison solely on differences in Δt, we see that the SNR for the widefield case is \( \sqrt{25{6}^2}=256 \) times that of multiphoton scanning! Note that this factor could be even bigger if the scanning modality involves significant “dead time” (i.e., galvonometric scanner turn around). Cameras also require read-out time, during which they are not integrating photons; however, this is typically around 10 μs per line for a modern sCMOS camera, a small fraction of the frame period, while scanning dead time can be around 10–20% of each frame.

The optical sectioning and robustness to scattering achieved by 2P and 3P point scanning modalities over widefield comes at high cost to the fluorescence budget, F0. Note that in the multiphoton point scanning case, F0 is inversely proportional to the product of the number of locations (pixels or voxels, NXNYNZ) monitored, and the rate at which they are monitored, Racq. This implies that F0 can be maintained by performing high Racq acquisition of few locations or low Racq acquisition of many locations. For example, at the high rate extreme, 10 kilohertz acquisition of voltage transients from a single “voxel” has been achieved in two photon by parking the beam over a dendritic spine [3]. At the other extreme, two-photon “mesoscopes” image FOVs multiple millimeters wide at low frame rates (0.1–10 Hz) [102, 106] using slow and highly sensitive calcium indicators [25, 29].

From Eq. 5, we see that SNR can be increased by: 1. use of a high sensitivity (large ΔFF0) fluorescence reporter and/or 2. increasing F0. F0 can be increased by: maximizing ϕ with efficient photo-sensors, high collection NA, etc.; selecting a bright fluorophore and exciting it at wavelengths for which its brightness, σn, is highest; maximizing fluorophore concentration, CFl, and exciting and collecting fluorescence over the largest possible voxel, VFl; exciting the fluorophore with the highest feasible intensity, < In > ; and maximizing integration time, Δt. Table 1 summarizes the parameters contributing to SNR of functional fluorescence signals. The next sections discuss trade-offs driving choice of fluorophore and imaging modality.

Table 1 Summary of variables determining temporal SNR for functional fluorescence imaging

4 Choosing a Fluorescent Indicator

A fluorescent indicator is an organic molecule or protein functionalized to transduce biophysical changes into changes in fluorescence. Most commonly used are indicators that respond to changes in membrane potential [11, 13] and calcium [7, 112], but others exist that respond to pH [93, 94], sodium [56, 61], and neurotransmitter concentration including glutamate [69, 70], acetylcholine [52], or GABA [71]. With regard to temporal SNR, the key parameters of such a molecule include its sensitivity (ΔFF0) and brightness (σn) as described above and its temporal profile.

4.1 Temporal Considerations

Here, indicator “temporal profile” refers to the convolution of the time course of a biophysical process (e.g., fast membrane potential variations vs. slow changes in calcium or neurotransmitter concentration) with the kinetics of the fluorescent indicator. Here we refer to τOff as a general term for the rate of decay back to baseline of the fluorescence transient induced by an action potential or other brief event. We note, however, that many indicators’ kinetics are not described by a simple mono-exponential decay. Importantly, τOff determines the minimum Racq (and hence maximum Δt) able to resolve the time-varying signal without aliasing. For example, some highly sensitive Förster resonance energy transfer (FRET)-based voltage indicators temporally low pass filter fast events such as action potentials due to slow translocation of mobile charges in the plasma membrane [41]. While considered a disadvantage in most scientific contexts, this low pass filtering is an advantage for imaging, as slower transient changes in fluorescence can be imaged at lower rates Racq without aliasing, thereby increasing Δt and F0. The minimum Racq is defined by the functional fluorescence temporal profile and also by the goals of the experiment. For instance, Racq can be relatively low for mere transient detection but must be much higher to characterize transient timing and kinetics [90]. The implications of differing τOff for imaging strategy design can be readily appreciated through comparison of calcium and membrane potential indicators for neuronal AP detection.

4.2 Membrane Potential vs. Calcium

The majority of all-optical neurophysiology experiments monitor neuronal membrane potential, either directly with voltage-sensitive fluorophores or indirectly through fluorophores sensitive to calcium concentration. In most cases, calcium transients are monitored to detect when a neuron fires an action potential; so why not always choose voltage-sensitive fluorophores to image membrane potential directly? The particular challenges of imaging membrane potential compared to calcium are summarized in Fig. 4. First, the physiological signal kinetics are fast, milliseconds in the case of action potentials, necessitating high Racq. High sampling rates limit the available photon integration time, Δt, demanding brighter (high σn) indicators for adequate SNR as discussed in Note 2.

Fig. 4
A diagram depicts a triangular structure with a circle at the center, and the nodes are extended. The triangle with labeled parts, membrane voltage indicator, and intracellular indicator.

The challenges of voltage imaging. Three issues make voltage imaging more challenging than calcium imaging. First, faster intrinsic kinetics limit the photon integration period. Second, voltage indicators must lie in the membrane or degrade the signal; this limits the volume of indicators that can be integrated to measure the signal. Lastly, membranes where signal arises are tightly packed in the brain; fluorescent signals from overlapping membranes wash out single-cell signals. Adapted from original in [88], CC BY-SA 4.0

While calcium indicators are distributed throughout the cytosolic volume, voltage indicators are confined to the membrane. The 100-mV membrane potential fluctuation during an action potential leads to an electric field change of around 3 × 107 V/m across the 3-nm-thick plasma membrane. Despite this large field, the fluorescent indicator’s sensor molecules must orient across the neuron’s external plasma membrane to sense any changes. This causes multiple challenges. First, the potential for physiological disruption by membrane indicator expression limits labeling density (and hence CFl). Membrane capacitance also increases with the number of charged or polarizable molecules in the plasma membrane, and over-labeling can even abolish action potentials completely [18]. Second, highly lipophilic dyes or poorly targeted genetic constructs can label membranes non-specifically, including internal membranes not exposed to changing fields. This increases background relative to the voltage-dependent signal, effectively reducing ΔFF0 and proportionally the SNR. Third, overlapping membranes from adjacent cellular processes in densely labeled samples are indistinguishable in most microscope images. This results in signal mixing, especially with widefield single-photon illumination. Voltage transients from individual cells are then “washed out” by the bright background from adjacent cells.

Calcium indicators are widely used in neuroscience to indirectly detect APs due to their relative ease of use compared to voltage indicators. These indicators change brightness when bound to Ca2+ ions. As AP firing in most neurons is accompanied by a rapid, transient increase in intracellular Ca2+ concentration, calcium sensors report a proxy for neuronal spiking. The increase in intracellular Ca2+ on AP firing due to the opening of voltage-gated channels can be “amplified” by release from internal stores [14] and results in up to an order of magnitude change with a slow decay over hundreds of milliseconds [14, 60]. This low pass filters AP waveforms and enables longer integration times (Δt) without signal aliasing, increasing photon counts. Many more calcium indicators can be loaded into the cell compared to voltage indicators as calcium concentration changes throughout the cytosolic volume, increasing signal brightness. Combined, these advantages have made calcium imaging a popular technique enabling high SNR optical recording of AP activity.

Despite these advantages, there remain significant drawbacks to calcium indicators that limit their applicability. Calcium transients are not exclusively linked to action potential firing in all neurons [42, 68, 74]. AP-evoked calcium dynamics are also slow compared to APs, and indicator kinetics often further reduce the fluorescence response speed. Although beneficial for imaging, this limits the accuracy with which the timing of the underlying electrophysiological activity can be estimated, even when the link between APs and calcium transients is clear. Further complications confusing the link between indicator brightness and electrophysiology include indicator saturation, intrinsic and indicator buffering, and diffusion biophysics [95]. Increased calcium buffering due to high indicator concentrations has also resulted in pathology [72, 105]. Possibly the most important disadvantage to calcium indicators is their inability to report subthreshold membrane potential changes. Calcium indicators can be used to image calcium influx related to synaptic events [17]; however, at the soma, they report suprathreshold AP activity. These factors have motivated development of better optical, chemical, and biological techniques for imaging voltage in neurons. Both the relatively low ΔFF0 of these indicators and high required RFl, however, necessitate careful consideration of the photon budget (F0) when developing imaging strategies.

In summary, voltage indicators feature small ΔFF0 and fast kinetics (short τOff) that require imaging with high fluorescence budget, especially long Δt, modalities (e.g., widefield, light field) to maintain SNR. Calcium indicators feature comparatively high ΔFF0 and calcium transients decay slowly, enabling imaging at low Racq. These features render calcium indicators eminently compatible with low fluorescence budget modalities such as multiphoton raster scanning.

4.3 Spectral Considerations

Of particular concern for all-optical experiments is avoiding spectral crosstalk between photostimulation and fluorescence excitation light. Such crosstalk comes in two types: imaging and physiological. Imaging crosstalk occurs when photons meant to stimulate neurons spuriously excite the functional fluorescent indicator. Imaging crosstalk can be avoided altogether if the photostimulation wavelength does not efficiently excite the indicator (e.g. [40]). If not, fortunately, transient artefacts due to imaging crosstalk can be predicted or measured, and subtracted, even in real time in some configurations [35], and generally do not affect the scientific integrity of the data. Physiological crosstalk occurs when light intended to excite the functional indicator fluorophore is spuriously absorbed by the opsin, causing neuronal de- or hyperpolarization. In contrast with optical crosstalk, physiological crosstalk, even subthreshold, undermines the scientific validity of the data by compromising the membrane potential, and in some cases action potential rate and timing, of the imaged neurons. Indeed the photocurrents spuriously generated by the imaging light cannot be subtracted; they must be prevented to realize the scientific potential of all-optical neurophysiology.

There are two ways to prevent physiological crosstalk in all-optical experiments: spectral separation and spatial separation. Spectral separation refers to exciting the indicator fluorophore at wavelengths at which the opsin actuator cross-section is so small that no photocurrents can be measured by whole-cell patch clamp in the opsin-transfected cells. Importantly, this control must be measured at the intensities and durations needed for imaging the fluorescent indicator. For one-photon excitation, this has been achieved by pairing blue–green-absorbing opsins with red-shifted calcium dyes [104], voltage dyes [64, 114, 119], GEVIs [1, 34, 47], and GECIs [7, 124]. Alternatively, red-shifted opsins, such as C1V1 [125], ReaChR [65], or Chrimson [57], can be combined with green-emitting calcium indicators (see also Chapters 3 and 4), although these red-shifted opsins exhibit 20%–30% actuation efficiency under blue-light excitation [116] and are thus more susceptible to spectral crosstalk than pairings where the opsin is excited at shorter wavelengths than the indicator. Spatial separation refers to limiting [107] or eliminating the fluorescence excitation light incident on opsin-expressing cells and substructures. For instance in cases where the opsin is targeted to the soma [10, 67, 99] or to a specific neuronal subpopulation, the fluorescence excitation light could be patterned exclusively over non-opsin-expressing structures and cells. Efforts to reduce crosstalk in one-photon excitation schemes with large spectral overlap between opsin actuators and indicators (e.g., the actuator channelrhodopsin-2 [ChR2] + GCaMP calcium reporters) have minimized read-out light intensities (< In > ) [44, 107] to the detriment of SNR.

Multiphoton schemes for opsin actuation and imaging have yet to demonstrate a configuration completely free of physiological crosstalk. Finding a scheme in which the imaging laser does not evoke spurious photocurrents, producing sub- and/or suprathreshold membrane potential changes, is especially difficult due to broad opsin two-photon action spectra. Spurious suprathreshold activation has been reduced or avoided by using opsins and indicators with partially separated absorption spectra (e.g., C1V1 with GCaMP6s [83]; ChR2, GtACR2, or stCoChR with jRCaMP1a [36, 37]) and by limiting the imaging dwell time, although subthreshold actuation may still occur. Broad multiphoton spectra can provide an advantage for imaging-only configurations (without an opsin) by exciting multiple fluorophores simultaneously with a single wavelength (e.g., green-emitting OGB-1 or GCaMP with red-emitting SR101 [16, 75]). Excitation to a higher-energy electronic excited state has also enabled multi-fluorophore three-photon imaging with a single wavelength [48].

4.4 Fluorophore Spatial Distribution

Of critical importance to SNR is the fluorescent indicator spatial distribution, both within a cell and within a population of cells. The fluorescent indicator properties determine the maximum ΔFF0, but this is effectively reduced in proportion to the amount of “useless” or “background” fluorescence in the cell and surrounding space. Background fluorescence reduces the SNR as the non-signal-containing photons are collected from the same ROI as signal-containing photons, reducing the fractional change in fluorescence. If the baseline fluorescence rate from signal-containing molecules is given by F0, and the rate of background fluorescence is given by FB, then the fractional change in fluorescence is reduced to

$$ \frac{\Delta F}{F_0+{F}_B}=\left(1-{f}_B\right)\frac{\Delta F}{F_0},\kern1em \textrm{where}\kern1em {f}_B=\frac{F_B}{F_0+{F}_B}, $$
(8)

the fraction of fluorescence contributed by the background. The SNR is then given by

$$ \textrm{SNR}=\left(1-{f}_B\right)\frac{\Delta F}{F_0}\sqrt{F_0+{F}_B}=\sqrt{1-{f}_B}\frac{\Delta F}{F_0}\sqrt{F_0}=\sqrt{1-{f}_B}{\textrm{SNR}}_0, $$
(9)

as \( {F}_0+{F}_B=\frac{F_0}{1-{f}_B} \) [58].

Dense fluorophore labeling poses problems especially for voltage indicators due to their membrane localization. Voltage signals in densely labeled samples cannot be resolved without indicator somatic restriction to reduce fluorescence contributions from overlapping adjacent processes (e.g., [2, 4, 117]) or imaging at sub-micron resolution, which has yet to be demonstrated. This problem is mitigated in preparations where the labeled cells are non-adjacent or “sparse.” For example, single neuron, single-trial action potential GEVI imaging has been achieved by expressing the GEVI strongly and sparsely in a subpopulation of cortical layer 2/3 excitatory neurons [90] due to the high effective ΔFF0.

In summary, the sensitivity, brightness, kinetics, and spatial distribution of a fluorescent indicator determine which imaging modalities can resolve the functional fluorescence transients. Bright, slow, and sensitive indicators can boost temporal SNR for low fluorescence budget imaging modalities. For example, if interested in membrane potential, but wanting to track the activity of many cells with scattering-robust two-photon imaging, one can compensate two photon’s low fluorescence budget with a slow, bright calcium indicator.

5 SNR and Imaging Modality

Fluorescence imaging systems are comprised of two subsystems: (1) the fluorescence excitation subsystem and (2) the fluorescence detection subsystem. Here we detail how the characteristics of each subsystem contributes to temporal SNR.

5.1 Fluorescence Excitation

Regarding SNR, the fluorescence excitation subsystem can be characterized in terms of light intensity, < In > , and the per pixel, per integration period excited volume, VFl. Together with the fluorophore cross-section (σn), concentration (CFl), integration period (Δt), and spatial distribution, these parameters determine the maximum available fluorescence excitation budget (Fgen; Eqs. 3, 4). The illumination wavelength and fluorophore cross-sections determine whether the system favors one-photon, two-photon, or three-photon fluorescence excitation. Two- and three-photon excitation requires ultrafast pulsed lasers with megahertzFootnote 2 repetition frequencies to achieve RFl sufficient for imaging. One-photon fluorescence excitation rates, RFl, vary widely depending on indicator brightness (σ1) and illumination intensity (< I > ) and generally exceed RFl for two- and three-photon modalities, which typically excite < 0.1 photons per laser pulse [101].

5.1.1 Fluorescence Excitation Volume, VFl

The fluorescence excitation rate critically depends on the degree to which the fluorescence excitation is parallelized. Widefield excitation, which excites fluorescence in all locations throughout a volume simultaneously, features the highest degree of parallelization, thus maximizing the fluorescence excitation budget. Focusing a laser beam to a diffraction-limited point and serially scanning that point correspond to lowest parallelization and excitation budget. In between widefield and scanned point excitation, the excitation light can be sculpted into many forms, including a large point (scanned-temporal focusing; S-TeFo, [87, 118]), multiple scanned (spinning disk confocal [108, 126], multifocal 2P [15, 55, 78, 89, 98, 120, 127]) or static (computer-generated holography, [23, 32, 85, 122]) points, a line (TeFo line scanning [28], SLAP [53], vTWINS [103], Bessel beams [19, 66]), whole planes [5, 20, 51, 54, 82, 96, 97, 128], and extended shapes patterned directly onto structures of interest [21, 39, 79, 109, 110]. It is important to note that not all fluorescence photons contribute useful signal. For example, a higher proportion of photons excited through two-photon point scanning contribute to image formation compared to widefield imaging, where photons excited outside the plane of focus are not imaged and can smear the temporal signals extracted from in-focus ROIs. It is also important to note that VFl introduced earlier in this chapter refers to the per location fluorescence volume, not the total spot, line, or sheet volume, and therefore depends on the spatial discretization performed by the collection subsystem.

5.1.2 Fluorescence Integration Time, Δt

The fluorescence excitation subsystem determines the relationship between Racq and Δt. In particular, for excitation that does not move or change shape during the acquisition period, Δt = 1∕Racq. Scanning generally reduces Δt in proportion to the number of locations scanned. Table 2 summarizes Δt for the different scanned excitation shapes. Bearing in mind that \( SNR\propto \sqrt{\Delta t} \), we appreciate the power of parallelization to boost the fluorescence budget (F0), enabling imaging of smaller and faster signals, and/or over larger fields of view. Fluorescence parallelization, however, reduces robustness to scattering, as discussed next.

Table 2 Δt for the different excitation volume shapes in scanning configurations

5.2 Fluorescence Detection

The fluorescence detection system determines how the fluorescence excitation budget is exploited to form images. Sensors for fluorescence detection fall into two categories: single and multi-channel.

5.2.1 Single-Channel Detectors

Single-channel detectors, including photodiodes and photomultiplier tubes (PMTs), read out fluorescence intensity (or photons) as a function of time. Imaging is achieved by combining single-channel detectors with scanned fluorescence excitation through the process of “temporal multiplexing”: the localization of fluorescence based on when it was detected. Moreover, temporal multiplexing can be used to scan multiple areas or z-planes by alternating the focus of time sequential laser pulses [12, 24, 26, 49, 106, 118]). The degree of temporal multiplexing, along with the total rate of fluorescence excited from the sample (RFl), is ultimately determined by the indicator’s fluorescence lifetime [26].

Single-channel detection of point-scanned two- and three-photon fluorescence excitation features the highest achievable robustness to scattering and finest optical sectioning, owing to temporal multiplexing. These advantages, as previously discussed, are achieved at the cost of fluorescence excitation bandwidth, even when fluorescence rates (RFl) are maximized through pulse energies and repetition rates increased to the maximum allowable by photo-damage thresholds and fluorophore lifetimes, due to Δt’s dependence on the number of voxels (NX × NY × NZ rastered, or Nlocs randomly accessed).

Intermediate techniques excite fluorescence from extended regions while collecting fluorescence on a single-channel detector and postprocess the signal to recover a 2- or 3-dimensional image. Notable examples include “Scanned Line Angular Projection” (SLAP) [53], Bessel beam scanning [66], “volumetric two-photon imaging of neurons using stereoscopy” (vTwINS) [103], and multiplane imaging [123]. These can considerably increase F0 for the same Δt compared to traditional single-point scanning while remaining robust to scattering. This comes with the caveat that the often complex and computationally expensive reconstruction techniques typically require the imaged sample or activity to be sparse.

With single-channel detection, the fluorescence excitation volume, VFl, is equal to the total volume excited by the spot, line, or sheet. Assuming that RFl remains constant, SNR increases in proportion to \( \sqrt{V_{Fl}} \). Hence, scanning with a large spot [87] increases fluorescence excitation budget at the cost of spatial resolution, which is also determined by the fluorescence spatial profile or “point-spread function” (PSF). The ability to attribute fluorescence to individual neurons depends on the PSF and the sparsity of the fluorescent indicator labeling.

5.2.2 Multi-channel Detectors

Multi-channel detectors for optical neurophysiology include one- or two-dimensional arrays of PMTs or photo diodes, and cameras, primarily charge-coupled device (CCD) and complementary metal oxide semiconductor (CMOS). Multi-channel devices enable “spatial multiplexing”: the localization of fluorescence based on where it was detected on the array. With the notable exception of computational reconstructions based on structural image priors [53], imaging parallelized fluorescence excitation (multiple points, lines, sheets, widefield) requires spatially multiplexed detection, in most cases imaging a two-dimensional plane onto an array detector or camera. For volumetric imaging, the imaging plane can be scanned with the fluorescence excitation by moving the objective or sample, with electrically or acoustic gradient tunable lenses, or by remote focusing [8, 19]. Alternatively, the imaging depth of field can be extended to encompass the entire volume through, for example, wavefront coding [81, 92] or intentional spherical aberration [113]. Volumetric imaging can also be achieved with light field microscopy, which uses a microlens array to encode positional and angular information, enabling reconstruction of full volumes from a single two-dimensional frame [77]. Light field microscopy’s high fluorescence budget has recently been exploited to image both neuronal calcium [43, 50, 80, 84, 86, 100] and membrane potential [6, 27, 91].

For multi-channel detection, VFl is equal to the volume of excited fluorescence “seen” by each pixel of the detector. Therefore, imaging at the lowest feasible magnification and/or binning the fluorescence detected by pixels into regions-of-interest post hoc, both benefit fluorescence budget and SNR at the cost of spatial resolution.

All strategies that combine fluorescence parallelization with multi-channel detection feature contrast that decreases quickly with depth in scattering brain, because with multiple detectors scattered photons can no longer be localized with certainty to a single location. Thus increasing the photon budget by parallelizing collection reduces the depth in scattering tissue at which functional fluorescence signals can be imaged.

6 Summary of Key Points

The choice of fluorescent indicator and imaging strategy determine SNR through each’s contributions to the fluorescence budget, summarized in Fig. 5. Scientific priorities ultimately drive whether the experiment is designed around an “ideal indicator” or “ideal imaging modality,” which then constrains the choice of the other to achieve sufficient SNR at the minimum required acquisition rate, Racq. Figure 5 describes three example modality/indicator combinations and situates them with respect to relative contributions of each to SNR. For example, two-photon mesoscopy serially scans many locations and hence features low Δt, which is compensated through GCaMP6s’s high sensitivity (ΔFF0), brightness (σ2), and long τOff, which accommodates low Racq [102]. In contrast, widefield imaging of fast voltage indicators, such as Di-4-ANEPPS analogs [9], relies on widefield’s large fluorescence budget to resolve membrane potential changes at kilohertz frame rates.

Fig. 5
A rectangular table, the top portion represents mathematical formulas and is subdivided into three properties with five aspects of imaging morality, indicator properties, modality, priorities, and limitations.

Balancing indicator and imaging strategy contributions to SNR. The imaging modality (upper triangle) and fluorescent indicator (lower triangle) properties together determine fluorescence transient SNR (horizontal axis; equation, top). Importantly, the SNR of a “low fluorescence budget” imaging modality can be compensated by a high ΔFF0 or “high fluorescence budget” indicator and vice versa. The vertical dashed lines situate three example indicator/modality pairings with respect to each’s relative contribution to SNR

While this chapter has focused on SNR for shot noise-limited imaging strategies, it is important to bear in mind other noise sources. Importantly, instrument noise can dominate in low light or low ΔFF0 regimes. In vivo, noise arising from the sample, including respiratory, cardiac, and other motion, can dominate [33]. However, physiological noise occurs in distinct frequency bands that can often be compensated or subtracted [38].

The problem of physiological crosstalk, in which imaging light spuriously actuates changes in membrane potential due to broad opsin action spectra, has not been fully addressed for multiphoton excitation. Physiological crosstalk could be completely avoided, in principle, by restricting fluorescence to structures or cell populations that are not illuminated by the imaging laser.

A key take home is that indicators and imaging modalities featuring the highest fluorescence budgets enable the highest acquisition rates over the largest number of locations. This concept is reviewed in detail for multiphoton modalities by [62]. However, high budget modalities also generally do the least to mitigate light scattering effects, limiting the depth at which functional fluorescence transients can be resolved. When comparing candidate imaging modalities for all-optical experiments, careful inspection, in particular of VFl and Δt, can enable reasonable prediction of how SNR would compare to that of alternative strategies. SNR is the most important figure of merit to consider when designing an optical physiology imaging strategy, as it encompasses a variety of variables and ultimately determines whether or not the experiment will be able to detect the biophysical phenomenon of interest.

7 Notes

  1. 1.

    Defining SNR: Variation Across Disciplines

    Although a ubiquitous concept in many scientific and engineering fields, the exact definition of SNR varies between fields. In functional neuroscientific imaging particularly, SNR is often defined in a way at odds with what is common in the signal processing world. SNR is commonly understood as the ratio of the level of the signal of interest to the level of the noise in the measurement. Precisely defining what we mean by level, however, and how to report the ratio, is where different fields and different studies within fields start to vary.

    The canonical signal processing definition of the SNR is given by [115]

    $$ {\textrm{SNR}}_{\textrm{SP}}=\frac{P_S}{P_N},\kern1em \textrm{where}\kern1em {P}_x=\frac{1}{N}\sum \limits_{i=0}^N{\left|{x}_i\right|}^2, $$
    (10)

    where PS is the signal power, PN is the noise power, and Px defines the power of a discrete signal of length N, xi. Calculating this SNR for a functional imaging trace requires measuring or estimating a noise-free signal and the signal-less noise. This, however, can often be difficult or near impossible for many common functional imaging paradigms when there is no simultaneous electrophysiology. The functional imaging community therefore often reports a different SNR measure (sometimes called peak SNR or PSNR, not to be confused with the PSNR measure used in image processing [111]), defined as

    $$ {\textrm{SNR}}_{\textrm{N}}=\frac{S}{\sigma },\kern1em \textrm{where}\kern1em S=\frac{F-{F}_0}{F_0},\kern1em \textrm{and}\kern1em {\sigma}^2=\textrm{Var}\left({F}_0\right). $$
    (11)

    S is the amplitude of the fluorescence change during the signal of interest, such as an AP, and σ is the estimated RMS noise from a section of the time course without any signal, approximately equal to \( \sqrt{P_N} \) from Eq. 10. This approach is straightforward for most neuroscience signals, as the activity is often temporally sparse, facilitating selection of time course sections with and without activity. This SNR is often commonly reported as a simple ratio, whereas SNRSP is often reported in dB.

    These differences can result in different values for SNR for the same traces, as the figure above demonstrates. Increasing indicator sensitivity will scale SNRSP quadratically, while SNRN scales linearly due to using signal amplitude, not power. Second, and more importantly, SNRSP captures information about signal duration which SNRN ignores. For signals that are temporally sparse, SNRSP can seem surprisingly low compared to SNRN as the noise is spread throughout the whole trace, while the signal is concentrated into short periods (Fig. 6).

    Fig. 6
    Four waveform diagrams represent two short waveforms and two high waveforms concerning percentages (50 percent and 100 percent) and time (200 milliseconds and 400 milliseconds).

    Variation in SNR due to calculation method

  2. 2.

    The Fundamental Limit on SNR in Optical Imaging

    The physical nature of photon detection limits the theoretical maximum SNR of functional optical imaging. We measure the fluorescence intensity, F, of an indicator to infer something about the underlying physiological process it reports. Poisson noise due to a collection of fluorescence photons dictates how well we can do this for a given number of photons collected. The Poisson distribution gives the probability of detecting k photons in an interval when the mean rate is F0 as

    $$ P\left(F=k|{F}_0\right)=\frac{e^{-{F}_0}{F}_0^k}{k!}. $$
    (12)

    Both the expected value and the variance of F are equal to F0. Traces are commonly normalized to the mean intensity, F0, enabling easier comparison of structures with a different labeling brightness, and the variance of the normalized variable, FF0, can be simply calculated due to the linearity of the expectation value as σ2 = 1∕F0. We assume here that the change in brightness is small compared to the baseline brightness, such that the noise during and outside the signal period is the same. Our normalized signal, S, is given by ΔFF0, the change in fluorescence brightness, and so our signal-to-noise ratio is given by

    $$ SNR=\frac{S}{\sigma }=\frac{\Delta F}{F_0}\sqrt{F_0}. $$
    (13)

    The figure above illustrates this by drawing samples from a Poisson distribution to simulate fluorescent signals. The rate is increased by 10% in the central samples, and the baseline brightness is 10000 counts/sample in the top trace, and only 1000 counts/sample in the bottom. This leads to an SNR clearly increased by a factor of \( \approx \sqrt{10}\approx 3 \) from the bottom to the top trace. On the right, a graph shows the theoretical maximum achievable SNR for a given brightness, for different relative changes in fluorescence from the signal of interest, ΔFF0. This demonstration assumes our imaging system is Poisson noise-limited, which is typically true for bright fluorescent samples (Fig. 7).

    Fig. 7
    Diagram (a) represents two waveforms, where the sound-to-noise ratio is 10 and 3 respectively. Line graph (b) plots SNR against mean pixel intensity, concerning 1, 10, and 100 percent with a rising trend.

    Demonstration of the effect of Poisson noise on SNR. Reproduced from [88], CC BY-SA 4.0. Inspired by [31]