Cortical long-range interactions embed statistical knowledge of natural sensory input: a voltage-sensitive dye imaging study

Selim Onat; Dirk Jancke; Peter König

doi:10.12688/f1000research.2-51.v2

Home Browse Cortical long-range interactions embed statistical knowledge of natural...

ALL Metrics

Views

Downloads

Get PDF

Get XML

Export

▬

✚

Research Article

Updated

Cortical long-range interactions embed statistical knowledge of natural sensory input: a voltage-sensitive dye imaging study

[version 2; peer review: 2 approved]

Selim Onat¹, Dirk Jancke^2,3, Peter König¹

PUBLISHED 02 Aug 2013

Author details Author details

¹ Institute of Cognitive Science, Department of Neurobiopsychology, University of Osnabrück, Osnabrück, 49069, Germany
² Cognitive Neurobiology, Bernstein Group for Computational Neuroscience, Ruhr-University Bochum, Bochum, Germany
³ Optical Imaging Group, Institut für Neuroinformatik, Ruhr-University Bochum, Bochum, Germany

OPEN PEER REVIEW

REVIEWER STATUS

Abstract

How is contextual processing as demonstrated with simplified stimuli, cortically enacted in response to ecologically relevant complex and dynamic stimuli? Using voltage-sensitive dye imaging, we captured mesoscopic population dynamics across several square millimeters of cat primary visual cortex. By presenting natural movies locally through either one or two adjacent apertures, we show that simultaneous presentation leads to mutual facilitation of activity. These synergistic effects were most effective when both movie patches originated from the same natural movie, thus forming a coherent stimulus in which the inherent spatio-temporal structure of natural movies were preserved in accord with Gestalt principles of perceptual organization. These results suggest that natural sensory input triggers cooperative mechanisms that are imprinted into the cortical functional architecture as early as in primary visual cortex.

Corresponding authors: Selim Onat, Dirk Jancke, Peter König

Competing interests: No competing interests were disclosed.

Grant information: This work has been financed by the Bundesministerium für Bildung und Forschung (BMBF), the Deutsche Forschungsgemeinschaft (DFG) SFB-874 (TP A2 Eysel, Jancke), the Bernstein Group for Computational Neuroscience Bochum, the German-Israeli Project Cooperation (JA 945/3-1, SL 185/1-1), the International Graduate
School of Neuroscience (IGSN) Ruhr-University Bochum, Germany and the EU commission (FP7-ICT-270212, eSMCs).
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Copyright: © 2013 Onat S et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Data associated with the article are available under the terms of the Creative Commons Zero "No rights reserved" data waiver (CC0 1.0 Public domain dedication).

How to cite: Onat S, Jancke D and König P. Cortical long-range interactions embed statistical knowledge of natural sensory input: a voltage-sensitive dye imaging study [version 2; peer review: 2 approved]. F1000Research 2013, 2:51 (https://doi.org/10.12688/f1000research.2-51.v2) First published: 15 Feb 2013, 2:51 (https://doi.org/10.12688/f1000research.2-51.v1) Latest published: 02 Aug 2013, 2:51 (https://doi.org/10.12688/f1000research.2-51.v2)

Updated Changes from Version 1

This article has been updated to include a new paragraph in the discussion about the potential benefits of using natural stimuli to investigate contextual effects. Additionally, we added a paragraph highlighting the relationship between VSD imaging signals and spiking activity.

See the authors' detailed response to the review by Marcello Rosa, Nicholas S. C. Price and Hsin-Hao Yu
See the authors' detailed response to the review by Diego Contreras

Introduction

The early visual cortex comprises an extended and densely interwoven network, acting on millisecond time scales¹. Radially, activity is rapidly distributed by local feedback loops^2,3. Tangentially, long, horizontal fibers enable neurons to sense regions beyond their receptive field borders^4–7. Investigating the dynamics of this circuitry with simple parametric stimuli reveals well-defined selective response properties. In carnivores and primates, where these neurons are further organized in overlaid maps^8–11, it is possible to satisfactorily predict changes in the layout of these maps in accord with the distribution of the stimulus energy across different visual features¹², but see¹³.

However, these stimulus-response relationships can flexibly be modified by the specific connectivity patterns between distant neurons^4,14,15. For example, a surrounding stimulus which is in itself not sufficient to drive cortical neurons above firing thresholds may exert strong contextual influences on local processing^16–19. These integrative phenomena are conceived as the functional backbone of various Gestalt criteria of perceptual organization²⁰ and naturally occurring visual tasks such as contour forming, figure-ground separation, object segmentation, or perceptual completion. Hence, local-to-local interactions are intrinsically tied to the integrative functionality of cortical operation. They can be conceptualized as biases originating from the cortical architecture that foster optimal coordination of large numbers of neurons in accord with the statistics of incoming signals^21–23.

Today, a large body of evidence indicates that the functional properties of neurons are specifically adapted to process signals that are of ecological relevance^24–26. Neuronal stimulus-response properties exhibit higher sensitivity²⁷, selectivity²⁸ and reliability^29,30 in response to visual features when these are presented within their natural sensory context. However, direct functional evidence showing that cortical connectivity mediating local-to-local coupling embeds empirical statistical knowledge of natural inputs is at best scarce and it is not clear whether these interactions studied with simple stimulus configurations extrapolate to complex dynamic conditions that mimic natural input.

We recorded cortical activity using voltage-sensitive dye imaging³¹ in response to locally presented natural movies recorded by cats in a natural habitat²⁶. We characterized contextual effects by manipulating the spatiotemporal statistical regularities between two movie-patches; we tested the hypothesis that cortical circuits responsible of contextual effects are functionally adapted to natural input statistics. Our results show that, under dynamic natural stimulation conditions, facilitatory interactions across distances beyond the classical receptive field characterize contextual effects, demonstrating that cortical circuits embed functional knowledge about the spatiotemporal relationships inherent in natural scenes.

Materials and methods

Stimulus conditions

Stimulus acquisition and presentation hardware was the same as in a previous study²⁶. Natural movies (see Figure 1A, for two example frames) were recorded at a sampling rate of 25 Hz by freely moving cats exploring a natural habitat. The recorded natural movies were presented locally, through a single or a pair of Gaussian apertures (referred to as patches in the text) for a duration of 2 s including the 200 ms prestimulus interval. We included in the analysis presented here only the initial 750 ms. Local patches were created by modulating the contrast of the movies as a function of space according to a two dimensional Gaussian function with full width at half maximum (FWHM) (~3–4°). The FWHM depended on the distance of the center point to the area centralis in individual experiments (~3–6°). The average luminance value of pixels overlapping with the two dimensional Gaussian mask were first subtracted and then multiplied pixel-wise with the Gaussian mask. This effectively reduced the luminance contrast from center to periphery. Following this step the average values were set back to the background luminance level, which was kept constant across the whole experiment.

Figure 1. Acquisition and presentation of natural movies.

(A) Movie frames extracted from two different movies (marked orange/green) are shown. Prior to optical recordings retinotopy across the imaged cortical region was evaluated by hand mapping of receptive field locations (colored rectangles) at various penetration sites. This ensured that the distance between the mid-point of apertures was bigger than the range of classical receptive fields. (B) For each experiment, two base natural movies (marked orange/green) were used to create different stimulation conditions. Movies were presented through either one or two Gaussian masks (3° or 4° FWHM, see Materials and Methods) centered at positions A (top position) or B (bottom position), illustrated by white circles. According to the position label (A or B) and the index of the natural movies Movie 1 or Movie 2, the following conditions were displayed: single (A1, B1, A2, B2), in which only one local patch was shown; coherent (A1B1, A2B2), in which two local patches belonged to the same full-field movie; and incoherent (A1B2 and A2B1), with local patches derived from different movies (see rightmost column).

Local conditions were indexed by two parameters: position on the screen (A or B) and movie index (1 or 2) specifying which full-field movies were to be masked. We used different movies in different experimental sessions to increase the generalizability of our results. By displaying either one or simultaneously two local movies, the conditions A1, B1, A1B1, and A2, B2, A2B2 were created (Figure 1B and Supplemental Movie S1). For conditions with a pair of patches, the distance between the centers of the two Gaussian apertures was equal to 2.5 FWHM. Local movies were corrected for mean luminance so that the average of the pixels within the Gaussian apertures was always equal to the brightness of the background in which they were embedded. However, we did not equalize the contrast within each aperture, as it is not possible to do so without introducing strong artifacts, particularly in cases where the local portion of a movie frame contains regions with homogenous brightness values belonging to object surfaces.

Prior to optical recordings, the topographic mapping between the cortical surface and the visual field were scrutinized by means of several electrode penetrations, and local stimuli were positioned so that the upper movie-patch matched the receptive field position of the simultaneously recorded multiunit activity. This ensured that the distance between the centers of the Gaussian masks extended beyond the borders of classical receptive fields.

Experimental setup

Animals were initially anesthetized with ketamine (15 mg kg^-1 intramuscularly (i.m.)) and xylazine (1 mg kg^-1 i.m.), supplemented with atropine (0.05 mg kg^-1 i.m.). After tracheotomy, animals were artificially respirated, continuously anaesthetized with 0.8–1.5% isoflurane in a 1:1 mixture of O₂/N₂O, and fed intravenously. Heart rate, intratracheal pressure, expired CO₂, body temperature, and electroencephalograms (EEG) were monitored during the entire experiment. The skull was opened above the primary visual cortex and the dura was resected. Paralysis was induced and maintained by alcuronium dichloride (Alloferin^®). Eyes were covered with zero-power contact lenses for protection. External lenses were used to focus the eyes on the screen. To control for eye drift, the position of the area centralis and receptive field positions were repeatedly measured. A stainless steel chamber was mounted and the cortex was stained for 2–3 hours with voltage-sensitive dye (RH-1691), and unbound dye was subsequently washed out with artificial cerebrospinal fluid. All surgical and experimental procedures were approved by the German Animal Care and Use Committee (AZ 9.93.2.10.32.07.032) in accordance with the Deutsche Tierschutzgesetz and NIH guidelines.

Data acquisition and preprocessing

Optical imaging was accomplished using an Imager 3001 (Optical Imaging Inc, Mountainside, NY) and a tandem lens macroscope³², 85 mm/1.2 toward camera and 50 mm/1.2 toward subject, attached to a CCD camera (DalStar, Dalsa, Colorado Springs). The camera was focused ~400 µm below the cortical surface. For detection of changes in fluorescence, the cortex was illuminated with light of 630 ± 10 nm wavelength and emitted light was high-pass filtered with a cutoff of 665 nm using a dichroic filter system. Cortical images were acquired at a frame rate of 220 Hz and covered regions of approximately 10 × 5 mm² of primary visual cortex. The relevant retinotopic region of area 18 was captured (lower contralateral quadrant of the visual field), and parts of area 17 were also occasionally captured. For electrophysiological recordings, a custom-built device was used that allowed targeted penetrations at different locations without opening the sealed recording chamber.

The raw data was processed in two steps²⁶. First, in order to remove differences in illumination across different pixels, divisive normalization was performed on all the recorded raw samples of a given pixel by its mean during prestimulus period. Second, heart-beat and respiration-related artifacts were removed by subtracting the average blank signal recorded in the absence of stimulation. These differences were later normalized by the blank signal in order to gain independence from the global activity level fluctuations occurring during the course of an experiment. As our recordings were synchronized with the heart-beat cycle of the animal, this blank subtraction step effectively removes these artifacts. Moreover, this method is preferred over the cocktail blank correction because our conditions were not composed of orthogonal stimuli. These steps were applied for each trial separately and the outcome was averaged across trials. The number of trials ranged from 25 to 35 for different experiments.

Model fitting and statistical evaluation

We computed spatial profiles by averaging the evoked data across the temporal dimension. These were fitted with a two dimensional Gaussian function of the form:

G(x, y) = A exp((x – μ_x)² / σ_x + (x – μ_y)² / σ_y + (x – μ_x)(y – μ_y) / ρ)

Where A, σ_x, σ_y, μ_x, μ_y, and ρ represent the peak value, the horizontal (medio-lateral) and vertical (antero-posterior) spreads, the position of the Gaussian function (medio-lateral or antero-posterior), and the rotational parameter, respectively. We used lsqcurvefit function provided by Matlab (2007b, The MathWorks, Natick, MA) using a large-scale trust-region-reflective algorithm. Prior to optimization runs, initial parameters of the Gaussian function were roughly estimated using different heuristics for each experiment and condition separately. As the fitting function we used superposition of two Gaussian functions G₁(x,y) + G₂(x,y) centered roughly on separate activation spots. Almost always, the activity spots were clearly distant and well-isolated from each other. All parameters were estimated simultaneously. Using two Gaussians in all stimulation conditions, including single conditions, ensured that the activation at the distant locations did not influence the fit, therefore possible cross-contributions were negligible. Furthermore the center position of each single Gaussian fit was constrained around the location of the peak responses at locations A and B. This helped avoiding complications where Gaussians could overfit noise during single stimulation conditions and this was necessary to estimate the value of the indirect activation. We also constrained the value of different parameters to avoid fitting to irrelevant cortical activity; this was most useful in the single condition case, where there was only one single activity spot.

For the statistical evaluation of confidence intervals, we used the bootstrapping method with 1000 repetitions and an alpha value of 0.05. The confidence intervals are provided within square brackets following average values. We tested the significance of median activities using the Wilcoxon sign rank test; the p-values are provided within brackets.

Results

We performed voltage-sensitive dye imaging (VSDI) in the primary visual cortex of anesthetized cats (n = 4). Multi-unit recordings complemented these measurements and provided information on receptive field properties and localization and spatial extent (see Materials and methods). In order to investigate long-range cortical interactions under ecologically relevant dynamic stimulation conditions, natural movies were presented locally by applying either one or two Gaussian masks (3–4° FWHM) to the original full-field movies (Figure 1A). Presentation of two movies (Movie 1 and Movie 2) viewed through apertures at two different positions (position A and B) creates a total of 8 different local stimulation conditions (Figure 1B, please see also Video S1): Single conditions (A1, B1, A2 and B2) consisted of isolated local movie-patches that provided no contextual information (Figure 1B, first and second column). In contrast, coherent (A1B1 and A2B2) and incoherent (A1B2 and A2B1) conditions provided contextual information in the form of another distant movie-patch. Here, two movie-patches stemming from either the same (coherent) or different (incoherent) original natural movies were presented simultaneously at two locations that were larger than the typical classical receptive field sizes (Figure 1A, colored boxes representing receptive fields). Whereas coherent conditions leave the spatiotemporal characteristics of natural movies intact, incoherent stimulation eliminates naturally occurring correlations between apertures and induces an evident dissonance (please see Video S1).

Effect of contextual stimulation on activity amplitudes

Cortical responses to two different movies presented in single and coherent conditions for a duration of 750 ms are shown as space-time plots (Figure 2, top and bottom rows for Movie 1 and Movie 2, see also Video S2). Conditions are indicated on the left-most column. Upon localized stimulation by natural movie-patches, activity emerged from baseline level with variable delays among conditions. The cortical dynamics induced by the individual stimuli show different temporal profiles and suggest that instantaneous activity levels were determined by the specific properties of each natural movie. Each single movie evoked well-separated spots of activity on the cortical surface indicating that the thalamic input was spatially well resolved at this stimulus configuration. Furthermore, we observed large differences in the activity levels between single and coherent conditions (note the difference in color scale). In this experiment, while the peak activity (maximum activity level across all pixels) during single conditions (bottom two rows in each panel) was 6.67×10^-4 ΔF/F, coherent conditions (top row in each panel) led to a value of 7.86×10^-4 ΔF/F, corresponding to an increase of 18%. As the direct input to the recorded cortical region was identical during single and coherent conditions, we attribute these differences in activity to the impact of long-range interactions on cortical dynamics under natural stimulation conditions.

Figure 2. Dynamics of cortical activity evoked by locally presented natural stimuli.

(A) Spatiotemporal activity patterns produced by locally presented natural movies is shown overlaid on the vascular image. Leftmost column schematically shows the corresponding stimulus condition and the movie used (orange and green). Each single frame represents the average activity across trials (n = 35) within 50 ms of non-overlapping segments of recorded data. The cortical representations of these stimuli were located along the antero-posterior axis, reflecting the vertical positioning of the movie-patches in the visual field. Different spatial extents and peak values of cortical activation were observed along the presentation duration at different frames. Please note that different color-scales are used for different conditions in order to emphasize the detailed spatial structure. Color scale represents ΔF/F (x10^-4). Color scale maximum is set to 3.3 standard deviations of activity levels. Only the most active pixels corresponding to highest 25^th percentile are shown. For a video presentation please refer to the Video S2. Scale bar (shown in the first row in the frame zero) = 1 mm.

To quantify long-range interactions we computed spatial profiles of activity levels under stimulation conditions with or without context (Figure 3, first and third rows). These maps represent the average activity of each single pixel across stimulation duration and demonstrate clearly the spatially restricted non-overlapping foci of activity. Note that different movies produced different amplitudes of activity. Thus, occasionally, incoherent conditions, incorporating a movie that drives the cortex more strongly, can appear more highly activated than in coherent conditions. Therefore, cross-wise comparisons (as in Figure 5B) between both incoherent conditions are needed to calculate the net interaction effects between coherent and incoherent movies.

Figure 3. Observed and modeled spatial profiles of activity patterns.

Spatial activity profiles (first and third rows) in response to two different natural movies (upper and lower panels) presented at various stimulation conditions with (third and fourth columns) or without (first and second columns) contextual information. Icons schematically represent the stimulation conditions. These are computed by time-averaging the data presented in Figure 2. Activity during no-context conditions was used to define a pair of interest regions per movie. This was done by selecting the pixels with highest activity located within the top 5^th percentile (see contour lines). Within these regions of interest we derived direct (cyan), indirect (yellow) and coherent (black) and incoherent (magenta) activity levels. The observed activity profiles were parametrically fitted using a composite function involving two 2-D Gaussians (second and fourth rows). The goodness of fit values characterized by the correlation coefficient is shown for the whole data set in Figure 4D. Color scale represents ΔF/F (x10^-4).

The precise shape of spatial activity profiles shown in Figure 3 varied considerably across different experiments. This is to be expected, as the location and extent of activity spots depend strongly on the recording conditions specific for each experimental session. It was therefore not straightforward to compare these spatial activity profiles across different experiments. We used a parametric approach in order to circumvent this problem and modeled spatial activity profiles recorded during different experiments using two-dimensional Gaussian functions with 6 free parameters. These parameters consisted of peak value (A), its horizontal and vertical position (μ_x and μ_y), horizontal and vertical spread (σ_x and σ_y) as well as a rotational parameter (ρ) (see Materials and methods). The modeled spatial activity profiles are presented together with the empirical data in Figure 3 (second and fourth rows, same colorbar). The correlation coefficient between these fits and the empirical data was on average 0.88 [0.82, 0.91] (average, [95% bootstrap confidence intervals], same convention in the following). For the whole data set, the distribution of correlation coefficients (Figure 4D) was negatively skewed and equal to 0.82 [0.78, 0.86] on average. Hence, compared to many thousands of pixels typically recorded in optical imaging, our parametric approach provided a major reduction in dimensionality without compromising the precise characterization of response patterns.

Figure 4. Distant natural stimulus exerts facilitatory effects on local activity levels.

(A) Modeled activity profiles averaged across all experiments (n = 4) and movies in response to indirect, direct, coherent and incoherent stimulation types are depicted (first row, three dimensional depiction; second row, top view). Transparent orange plane (third and fourth columns) marks the peak level of the Gaussian function representing the responses to direct stimulation at the absence of contextual information (second column). We compared different parameters of the Gaussian fits across different stimulation types in order to characterize different contextual interactions (see red, green, blue arrows). (B–D) For each of the comparisons, peak (B) and joint spread values (C,D) of Gaussian fits are presented as scatter plots (see matching arrow and dot colors). Each experiment contributes 4 different points. The plus sign represents the mean. (D) The distribution of correlation coefficient between fits and the observed data for the whole dataset.

We computed 4 types of characteristic spatial activity profiles from 3 different stimulation conditions (Figure 4A, first row, three-dimensional depiction; second row, top view representation). From activity during single conditions we derived the characteristic activity profiles for direct (cyan border) and indirect (yellow border) stimulation types. Whereas the direct activity represents the baseline responses to a single movie-patch in the absence of any contextual stimuli, the indirect activity captures the influence of an isolated distant movie-patch. Similarly we computed the characteristic activity patterns in response to movie-patches presented either in coherent (dark gray border) or incoherent conditions (magenta border). In Figure 4A, we visualize these four characteristic activity profiles after normalizing separately peak and spread parameters by their corresponding values obtained during direct stimulation. This was done for each experiment separately and the median fitted values were computed subsequently (this was necessary in order to eliminate outliers that originated from the normalization procedure).

We observed major changes in the characteristic activity profiles that were reflected in peak and spread parameters (compare different columns in Figure 4A). Concerning the peak activity, the indirect effect (Figure 4A, first column, yellow borders) of a single movie-patch presented at a distant location was on average slightly excitatory; however, this was statistically not significant (sign-test = 0.8). However, it should be noted that we observed net-excitatory effects as frequently as suppressive effects with similar amplitudes at the distant non-stimulated locations. The occasional occurrence of suppression of net activity below baseline levels in the far periphery have been shown with VSDI when presenting local stimuli without contextual surround (see Figure 1 in³³). During the two conditions where context was present (Figure 4A, third and fourth columns, magenta and dark gray borders), the peaks were higher than during stimulation without context (compare to second column, cyan border). Importantly, among those conditions where context was present, coherent context resulted in higher peak activity values (compare third and fourth columns).

We quantified the total facilitation effect by comparing the activity induced by direct and coherent stimulation (Figure 4A, see blue lines). This is depicted in (Figure 4B, blue dots) for each individual comparison (4 dots per experiments). In nearly all cases, the peak activity during coherent stimulation was higher than direct activity measured during single conditions. In 2 cases, a positive activation was observed only during coherent conditions. While direct drive evoked an average peak activity of 0.23×10^-3 ΔF/F [0.16×10^-3, 0.29×10^-3 ΔF/F] (average, [95% bootstrap confidence intervals], same convention in the following) across experiments, a value of 0.34×10^-3 ΔF/F [0.27×10^-3, 0.39×10^-3 ΔF/F] was observed during coherent input. The pairwise difference between peak values was equal to 0.11×10^-3 ΔF/F [0.04×10^-3, 0.14×10^-3 ΔF/F], corresponding to an increase of 45.2% [19.8%, 62.3%] in peak value and this was significantly different than zero (p = 0.0011, pairwise t-test). We therefore conclude that contextual stimuli presented at distant locations have a substantial modulatory effect on local activity. We further compared the peak values between direct and incoherent stimulation conditions (not shown as a scatter plot). The presence of an incoherent context resulted in an increase of 24.5%; this increase was, however, not significantly different from zero (sign-test, p = 0.8; t-test, p = 0.08).

To what extent can the total facilitatory effect be accounted for by the indirect additive effect of the distant movie-patch? We compared the activity during coherent stimulation conditions to the predicted activity by the sum of direct and indirect responses (Figure 4A, green lines). We found a superadditive effect of contextual stimulation in nearly all comparisons (Figure 4B, green dots). The superadditive facilitatory effect quantified as the difference between coherent conditions and the sum of single and indirect activations corresponded to 41.9% [20.9%, 76.2%] (signtest, p = 0.004; t-test, p = 0.009), hence only about 3.3% of the contextual effect was accounted for by linear interactions. This shows that long-range interactions result to a large extent from non-linear interactions between cortical sites.

To what extent are the non-linear contextual influences adapted to the statistical regularities of natural movies? The total facilitatory effect quantified above incorporates both the specific and unspecific influences originating from contextual stimulation. While the modulation of peak activity by an incoherent context can be attributed to the unspecific effect of the distant stimuli, any incremental effect of a coherent context can be attributed to the specific adaptation of these interactions to the statistics of natural movies. To evaluate the specificity of these interactions we compared the peak values between coherent and incoherent conditions (Figure 4A, red line; Figure 4B, red dots). We observed an increase of 20.7% [4.18%, 38.12%] in the peak activity level, and this was found to be marginally significant (sign-test, p = 0.21; t-test, p = 0.053). However, compared to the 45.2% observed for total facilitation, this analysis shows that about 54.2% of the facilitation results from specific interactions. Therefore, the non-linear facilitatory effects were fully effective only when the two movie-patches complied with the statistical regularities specific to natural movies.

Effect of contextual stimulation on spatial extent of activity

It is possible that long-range facilitation by the contextual sources of information are accompanied by a modification of the total spatial extent of cortical activity. For example, the presence of contextual information could result in a more tuned spatial activity profile leading to a decrease in spread parameters with the presence of context. Alternatively, contextual information could potentially cause a larger number of cortical neurons to be allocated. In order to test these different hypotheses, we evaluated the influence of context on the spatial extent of activated cortical space and compared the average spread parameters (σ_x and σ_y) between single and coherent conditions. The cortical activation extended larger surfaces during conditions of stimulation where context was present (Figure 4C). We found that the presence of a coherent context increased 10.9% [1.4, 29.2%] the joint spread parameter (Figure 4C, left panel). Considering each dimension separately we found that contextual information increased the spread parameter only along the direction of the context (along anterio-posterior axis). Consequently while an increase of 18.1% [5.3, 39.4%] in σ_y was observed, there was no significant change in σ_x [-1.9%, 24.0%]. This effect was further boosted by the presence of a coherent context. Comparing incoherent and coherent activations (Figure 4C, right panel), we found a similar result. Here again only the σ_y parameter was significantly different and an increase of 8.8% [0.5, 19.9%] was observed. Therefore, as in the case of peak activity modulations, coherent context had a stronger impact on the spread parameter compared to the case where the contextual information was absent or incoherent. We conclude that rather than a sharpening of the spatial profile, more cortical space is allocated when contextual information is present. Furthermore, the direction of this increase is biased towards the location of the contextual information.

Effect of contextual stimulation on time-course of activity levels

In order to have a better grasp on the temporal unfolding of long-range interactions, we next characterized the time-course of the facilitatory effects. To this end we used the evoked activity values and limited the analysis to pixels that were most strongly driven by the movie-patches. Based on the activity profiles during two single conditions (Figure 3, first and second rows), we defined two non-overlapping regions of interest for each movie condition by choosing those pixels that lay within the highest 5^th percentile of activity (see contour lines). These most responsive pixels were typically located centrally with respect to the activity spot. For each of the afore-mentioned activation types (indirect, direct, incoherent and coherent) we computed the mean time-course of activity across all movies and experiments within these most strongly driven pixels (Figure 5A, same colors as in Figure 4A and Figure 3). Please note that here experiments were conducted using different natural movies leading to the loss of a specific temporal profile. Samples of the time course of the facilitatory effect with mean activity significantly different than zero are depicted with filled circles (t-test).

Figure 5. Time-course of contextual interactions.

(A) The temporal evolution of activity in response to indirect (yellow), direct (cyan), coherent (black) and incoherent (magenta) stimulation (left panel). Average activity within the regions of interest (depicted in Figure 3) is used. For each temporal window the median across experiments and movies is presented. Median values that are statistically different (sign-rank test, alpha = 0.05) from zero are depicted with filled circles. The pair-wise difference of amplitudes between coherent and single conditions (black line) as well as between coherent and incoherent (gray line) conditions is shown in the right panel. (B) The similarity between the time-courses of activity recorded at two distant regions of interest is evaluated for each movie separately under different conditions (2 dots per experiment). r_coherent measures the correlation between two time-courses of activity evoked by the simultaneous presentation of two local movie-patches. r_incoherent and r_single were computed using the activity evoked by exactly the same input however presented at different conditions (see icons and arrows). The scatter plot depicts the comparison of r_coherent to r_single and r_incoherent.

As noted before, the indirect influence of the distant single movie-patch was slightly excitatory (Figure 5A, yellow line). However, contrary to the previous parametric analysis, which was not temporally resolved, we detected here a significant effect of the indirect input at ~100 ms (p = 0.04, see filled circle). This confirms that the indirect influence of a movie-patch presented in isolation to its neighboring regions is of excitatory nature and occurs quickly.

During direct stimulation in the absence of context (Figure 5A, cyan line), activity increased with stimulus onset and quickly reached a plateau at 100 ms, exactly where the indirect drive reached the significance level. At this point, the activity was 3.7-times stronger than the indirect drive. All samples following stimulus onset were statistically different from zero (p < 0.002). As expected, with the presence of a coherent context, the facilitatory interactions caused stronger activity levels throughout stimulus presentation. These were effective as early as 100 ms following stimulus onset (Figure 5A, left panel, cyan). We computed the pair-wise differences between coherent and single conditions and evaluated whether these deviated significantly from zero level (Figure 5A, right panel, black line). These facilitatory effects quickly followed after stimulus onset and reached significance around 300 ms (p < 0.049). The presence of an incoherent context had a smaller impact on activity levels (left panel, magenta line) and consequently the time-course of activity was similar to conditions where no context was present. The difference between coherent and incoherent conditions (right panel, gray line) computed over the stimulus presentation was positive throughout the stimulus presentation and reached the significance levels at two time frames (p < 0.042, see filled circles). The time-resolved analysis presented here complements the parameteric approach. We conclude that the interactions between different cortical locations occur quickly following stimulus onset and they persist across the stimulus presentation.

During single conditions, the activity within the region of interest is mainly determined by direct sub-cortical input and, therefore, the bottom-up characteristics of the input stream are presumably the sole determinants of the precise time-course. Additionally, as natural movies contain non-zero correlations across long distances, it is expected that activity profiles at locations A and B exhibit certain amount of similarity that would lead into correlations in the activity time-courses. We quantified these similarities at locations A and B during two single conditions by measuring the correlation coefficient (Figure 5B, schematic representation). The correlations, r_single, between the time-courses of activity recorded in both locations were never negative. Temporal resolution of the time-courses in this analysis was 200 Hz in order to capture its detailed structure. We observed an average correlation of r_single = 0.57 [0.47, 0.67], suggesting that low-level characteristics of movies were to a large extent common to both locations. How do the lateral interactions, which are effective during simultaneous presentation of two movie-patches, influence the precise time-course of activity? To answer this question, we computed r_coherent by quantifying the correlation between activities evoked by two simultaneously presented movie patches (Figure 5B, see arrows). All correlation values were higher than corresponding r_single values (Figure 5, red dots). r_coherent was equal to 0.84 [0.74, 0.89], resulting in an increase of 46.9%. This result suggests that long-range interactions increase the similarity of the activity time-course. In accord with this conclusion, we observed that an incoherent movie-patch presented simultaneously had a detrimental effect on the correlation values. Consequently, r_incoherent was 30% smaller than r_coherent and equal to 0.58, ([0.38, 0.74], Figure 5B, black dots). This result suggests that long-range interactions, in addition to their facilitatory effects, lead to an increase in the similarity of the time-course.

Discussion

We used VSDI to investigate long-range cortical interactions during processing of natural images in the primary visual cortex at the mesoscopic population level. By using "keyhole-like" presentations of the original natural movies through either one or two distant Gaussian masks, we quantified the effect of surrounding stimulation on local activity. We provide evidence that contextual integrative mechanisms are indeed operative under natural stimulus conditions. We show that under these conditions the horizontal cortical network^34–37 forms the basis for synergistic interactions across several millimeters of cortex. Contextual stimulation led to a net facilitatory effect compared to the case when the movies were shown in isolation. An important attribute of these interactions was their sensitivity to the intrinsic spatiotemporal regularities of natural movies^38,39. Moreover these contextual interactions led to an increased similarity of the population dynamics across long-range cortical distances.

Contextual processing has been investigated extensively both experimentally at the single neuronal level⁴⁰ and in recent modeling approaches⁴¹. A large variety of facilitatory and/or inhibitory contextual effects have been observed, however the final outcome crucially depends on the precise configuration of the parametrized stimulus used to stimulate center and surround regions. While the surround effect was found to be mainly inhibitory^35,42,43 and spatially asymmetrically organized⁴⁴, the precise nature of the effect depends on the contrast of contextual stimuli relative to the contrast threshold of the recorded neuron^45–47.

An important cornerstone of long-range facilitation is its dependence on the precise spatial configuration of the surrounding context⁴⁸. It has been shown that facilitatory effects increase proportionally with the congruency of the contextual stimuli with respect to the center stimulus^18,49. Using static stimuli, such coherence is generally controlled parametrically by changing the orientation difference between center and surround patches^17,18,50,51. Since we here used natural movies recorded by cats that freely explored a natural habitat, our stimuli were complex and contained simultaneous multiple features. The head and body movements of the cats added to this complexity as the recorded visual stimuli contained motion cues that were correlated across large visual distances. In order to control the coherency of the stimuli between the apertures, we adopted a non-parametric method by exploiting the unique spatiotemporal characteristics of each original movie.

With surround gratings-dependent on their relative orientation, distance, and contrast-both facilitatory^{17,18,42,45,48,49,52,53} and suppressive neuronal effects^{17,18,42,45,50,52,54–57} on a center stimulus have been described in the literature. In contrast, using natural stimuli in our study, we exclusively observed facilitation. Interestingly in this regard, when using randomly placed oriented lines as contextual stimuli, Kapadia and colleagues (1995) found that “inhibition could be eliminated by changing the orientation of a few of these elements”. This led them to suggest that with the “appropriate configuration of contours surrounding the RF, the cell is lifted from a rather profound level of inhibition, and its excitatory inputs are unmasked”. In this sense, the specific regularities in our naturalistic movies could be viewed as “appropriate configuration” and were shown here to trigger facilitatory interactions across long-range cortical distances. Whether these super-additive interactions result from disinhibition⁴⁹ or from additional excitatory drive through “cross-orientation” mechanisms remains to be explored.

When the same movie was presented through both apertures they were perceptually grouped without effort, and appeared to belong to a single scenery. On the other hand, when two differing movies were used, the content within both apertures appeared to be immediately incompatible (please see Video S1). There are a number of factors that determine coherence between patches taken from the same movie. First, stimulus motion was similar between the two distant apertures. This was due to the body- and head-motions of the recording cat, which induced large and equal motion fields across the visual scene captured by the camera. It has been earlier noted that such temporal phase relationships across distant regions are perceptually salient and enable object segmentation even in the absence of any spatial information⁵⁸. Second, natural images tend to possess large spatial correlations because of the dominance of low spatial frequencies in their spectrum²¹. Moreover auto-correlations of orientations may cover large portions of visual field reaching up to 8 degrees⁵⁹. Therefore, our stimulus paradigm can be conceived as a dynamic illustration of Gestalt criteria of good continuation.

There are different idealized mechanisms, each based on different anatomical substrates, which could mediate the observed facilitatory long-range interactions. Overlapping feed-forward thalamo-cortical input could be an explanation for increased cortical drive during stimulation with adjacent movie-patches. However, there are a number of counter-arguments against this explanation. First, cortical locations driven most strongly by the individual movies were separated by distances larger than the anatomical spread of direct thalamo-cortical projections^5,15,60,61. This was in accord with relatively smaller spatial extents of mapped receptive fields. Second, the activity at the distant location during stimulation with one single movie-patch was only minimal and reached significant levels about 50 ms later in comparison to directly stimulated locations. Third, and most decisively, the total drive to the recorded cortical area was constant across the two coherent and incoherent conditions. Only the order of the presentation being different, it is not possible to account for facilitatory interactions in a purely feed-forward scheme.

Rather, the dense network of horizontal connections linking distant neurons across several millimeters is a likely candidate for the observed long-range effects. It has been shown that unmyelinated intralaminar connections contribute to subthreshold responses evoked from distant stimuli placed outside of the classical receptive fields both with intracellular⁵ and combined extracellular recordings and VSDI in cat⁷. Furthermore, the selective intracortical connectivity pattern of these tangential connections linking neurons with similar feature selectivity is well-suited to mediate the specific enhancement of activity levels dependent on the stimulus coherence. However, we cannot exclude that feedback signals originating from higher visual areas with larger receptive field sizes than in primary visual cortex could add to these interactions^62,63. Back-propagating waves of activity have been shown to be initiated in further downstream cortical areas as early as ~100 ms after stimulus onset^64,65. Thus, these connections act fast^66,67 and are likely to mediate surround modulations spanning considerable distances in visual space, while lateral intra-laminar connectivity may account for modulations within shorter distances⁶⁸.

The relationship between the VSD imaging signal and spiking activity can indeed be complex. For instance, a close relationship between spike rate and the derivative of the VSD response rather than its magnitude has been proposed⁶⁹. This might especially apply to the rising phase of the membrane potential after stimulus onset^6,70. Combined VSD and Calcium-sensitive dye imaging suggests that the relationship between spiking activity and the amplitude of the VSD response depends also on stimulus intensity⁷¹. However, Chen et al.⁷² found that these relationships are well-captured by a power function with an exponent of ~4 (similarly as for the relationship between average membrane potential and spike rates in single V1 neurons^73,74), indicating that the threshold for observing significant spiking activity is about 30–40% of the maximal VSD response. Amplitudes evoked by our natural movies were on average well-within this range as compared to full-field moving gratings that we used as a standard control for maximal stimulation. Moreover, only pixels fitted at the center of the movie representations with highest amplitude entered our analysis, furthermore ensuring that subthreshold activity was excluded. Thus, we are confident that we describe the effect of long-range connectivity on postsynaptic spiking activity.

We observed that, compared to incoherent stimulation, stimulation with coherent pairs of natural stimulus patches led to stronger facilitatory effects. Since the total input analyzed across coherent and incoherent stimulation was identical across the recorded cortical area, these results cannot be solely explained by the local properties of movie-patches. Rather, this facilitation necessarily reflects the outcome of an integrative phenomenon sensitive to the content of both local movie-patches when presented simultaneously. Therefore, we suggest that the functional architecture of early visual cortical circuits may have empirically internalized the typical contextual relationships^21–23 found in dynamic natural visual scenes.

Author contributions

SO, DJ, PK: Conceived and designed the experiments, analyzed the data, wrote the manuscript, contributed materials and analysis tools. SO, PK: Stimulus recording and preparation. DJ: performed the experiments.

Competing interests

No competing interests were disclosed.

Grant information

This work has been financed by the Bundesministerium für Bildung und Forschung (BMBF), the Deutsche Forschungsgemeinschaft (DFG) SFB-874 (TP A2 Eysel, Jancke), the Bernstein Group for Computational Neuroscience Bochum, the German-Israeli Project Cooperation (JA 945/3-1, SL 185/1-1), the International Graduate School of Neuroscience (IGSN) Ruhr-University Bochum, Germany and the EU commission (FP7-ICT-270212, eSMCs).

Acknowledgements

We thank Markus Swierczek for contribution to data acquisition; Stefan Dobers and the mechanical shop of the Ruhr-University Bochum for excellent technical support; Cliodhna Quigley and Benedict Ng for comments on the manuscript; Agnieszka Grabska and Alper Açık for helpful discussions; Jörg Conradt, Gudrun Möller, Rodrigo Salazar for their valuable help in natural movie recording.

Faculty Opinions recommended

References

1. Callaway EM: Local circuits in primary visual cortex of the macaque monkey. Annu Rev Neurosci. 1998; 21: 47–74. PubMed Abstract | Publisher Full Text
2. Tucker TR, Katz LC: Spatiotemporal patterns of excitation and inhibition evoked by the horizontal network in layer ⅔ of ferret visual cortex. J Neurophysiol. 2003; 89(1): 488–500. PubMed Abstract | Publisher Full Text
3. Douglas RJ, Martin KA: Recurrent neuronal circuits in the neocortex. Curr Biol. 2007; 17(13): R496–500. PubMed Abstract | Publisher Full Text
4. Gilbert CD, Wiesel TN: Columnar specificity of intrinsic horizontal and corticocortical connections in cat visual cortex. J Neurosci. 1989; 9(7): 2432–2442. PubMed Abstract
5. Bringuier V, Chavane F, Glaeser L, et al.: Horizontal propagation of visual activity in the synaptic integration field of area 17 neurons. Science. 1999; 283(5402): 695–699. PubMed Abstract | Publisher Full Text
6. Jancke D, Chavane F, Naaman S, et al.: Imaging cortical correlates of illusion in early visual cortex. Nature. 2004; 428(6981): 423–426. PubMed Abstract | Publisher Full Text
7. Jancke D, Erlhagen W, Schoner G, et al.: Shorter latencies for motion trajectories than for flashes in population responses of cat primary visual cortex. J Physiol. 2004; 556(Pt 3): 971–982. PubMed Abstract | Publisher Full Text | Free Full Text
8. Hubel DH, Wiesel TN: Sequence regularity and geometry of orientation columns in the monkey striate cortex. J Comp Neurol. 1974; 158(3): 267–293. PubMed Abstract | Publisher Full Text
9. Blasdel GG, Salama G: Voltage-sensitive dyes reveal a modular organization in monkey striate cortex. Nature. 1986; 321(6070): 579–585. PubMed Abstract | Publisher Full Text
10. Bonhoeffer T, Grinvald A: Iso-orientation domains in cat visual cortex are arranged in pinwheel-like patterns. Nature. 1991; 353(6343): 429–431. PubMed Abstract | Publisher Full Text
11. Benucci A, Frazor RA, Carandini M: Standing waves and traveling waves distinguish two circuits in visual cortex. Neuron. 2007; 55(1): 103–117. PubMed Abstract | Publisher Full Text | Free Full Text
12. Mante V, Carandini M: Mapping of stimulus energy in primary visual cortex. J Neurophysiol. 2005; 94(1): 788–798. PubMed Abstract | Publisher Full Text
13. Onat S, Nortmann N, Rekauzke S, et al.: Independent encoding of grating motion across stationary feature maps in primary visual cortex visualized with voltage-sensitive dye imaging. Neuroimage. 2011; 55(4): 1763–1770. PubMed Abstract | Publisher Full Text
14. Bosking WH, Zhang Y, Schofield B, et al.: Orientation selectivity and the arrangement of horizontal connections in tree shrew striate cortex. J Neurosci. 1997; 17(6): 2112–2127. PubMed Abstract
15. Chavane F, Jancke D, Marre O, et al.: Lateral spread of orientation selectivity in V1 is controlled by intracortical cooperativity. Front Syst Neurosci. 2011; 5: 4. PubMed Abstract | Publisher Full Text | Free Full Text
16. Allman J, Miezin F, McGuinness E: Stimulus specific responses from beyond the classical receptive field: neurophysiological mechanisms for local-global comparisons in visual neurons. Annu Rev Neurosci. 1985; 8: 407–430. PubMed Abstract | Publisher Full Text
17. Sillito AM, Grieve KL, Jones HE, et al.: Visual cortical mechanisms detecting focal orientation discontinuities. Nature. 1995; 378(6556): 492–496. PubMed Abstract | Publisher Full Text
18. Polat U, Mizobe K, Pettet MW, et al.: Collinear stimuli regulate visual responses depending on cell’s contrast threshold. Nature. 1998; 391(6667): 580–584. PubMed Abstract | Publisher Full Text
19. Series P, Lorenceau J, Fregnac Y: The "silent" surround of V1 receptive fields: theory and experiments. J Physiol Paris. 2003; 97(4–6): 453–474. PubMed Abstract | Publisher Full Text
20. Sasaki Y: Processing local signals into global patterns. Curr Opin Neurobiol. 2007; 17(2): 132–139. PubMed Abstract | Publisher Full Text
21. Simoncelli EP, Olshausen BA: Natural image statistics and neural representation. Annu Rev Neurosci. 2001; 24: 1193–1216. PubMed Abstract | Publisher Full Text
22. Olshausen BA, Field DJ: How close are we to understanding v1? Neural Comput. 2005; 17(8): 1665–1699. PubMed Abstract | Publisher Full Text
23. Howe CQ, Beau Lotto R, Purves D: Comparison of Bayesian and empirical ranking approaches to visual perception. J Theor Biol. 2006; 241(4): 866–875. PubMed Abstract | Publisher Full Text
24. Vinje WE, Gallant JL: Sparse coding and decorrelation in primary visual cortex during natural vision. Science. 2000; 287(5456): 1273–1276. Publisher Full Text
25. Touryan J, Felsen G, Dan Y: Spatial structure of complex cell receptive fields measured with natural images. Neuron. 2005; 45(5): 781–791. PubMed Abstract | Publisher Full Text
26. Onat S, König P, Jancke D: Natural Scene Evoked Population Dynamics across Cat Primary Visual Cortex Captured with Voltage-Sensitive Dye Imaging. Cereb Cortex. 2011; 21(11): 2542–2554. PubMed Abstract | Publisher Full Text
27. Felsen G, Touryan J, Han F, et al.: Cortical sensitivity to visual features in natural scenes. PLoS Biol. 2005; 3(10): e342. PubMed Abstract | Publisher Full Text | Free Full Text
28. Vinje WE, Gallant JL: Natural stimulation of the nonclassical receptive field increases information transmission efficiency in V1. J Neurosci. 2002; 22(7): 2904–2915. PubMed Abstract
29. Haider B, Krause MR, Duque A, et al.: Synaptic and network mechanisms of sparse and reliable visual cortical activity during nonclassical receptive field stimulation. Neuron. 2010; 65(1): 107–121. PubMed Abstract | Publisher Full Text | Free Full Text
30. Brenner N, Bialek W, de Ruyter van Steveninck R: Adaptive rescaling maximizes information transmission. Neuron. 2000; 26(3): 695–702. PubMed Abstract | Publisher Full Text
31. Grinvald A, Lieke EE, Frostig RD, et al.: Cortical point-spread function and long-range lateral interactions revealed by real-time optical imaging of macaque monkey primary visual cortex. J Neurosci. 1994; 14(5 Pt 1): 2545–2568. PubMed Abstract
32. Ratzlaff EH, Grinvald A: A tandem-lens epifluorescence macroscope: hundred-fold brightness advantage for wide-field imaging. J Neurosci Methods. 1991; 36(2–3): 127–137. PubMed Abstract
33. Sharon D, Jancke D, Chavane F, et al.: Cortical response field dynamics in cat visual cortex. Cereb Cortex. 2007; 17(12): 2866–2877. PubMed Abstract | Publisher Full Text
34. Kisvarday ZF, Kim DS, Eysel UT, et al.: Relationship between lateral inhibitory connections and the topography of the orientation map in cat visual cortex. Eur J Neurosci. 1994; 6(10): 1619–1632. PubMed Abstract
35. Hubel DH, Wiesel TN: Receptive fields and functional architecture in two non-striate visual areas (18 and 19) of the cat. J Neurophysiol. 1965; 28: 229–289. PubMed Abstract
36. Rockland KS, Lund JS: Widespread periodic intrinsic connections in the tree shrew visual cortex. Science. 1982; 215(4539): 1532–1534. PubMed Abstract | Publisher Full Text
37. Gilbert CD, Wiesel TN: Clustered intrinsic connections in cat visual cortex. J Neurosci. 1983; 3(5): 1116–1133. PubMed Abstract
38. Kayser C, König P: Stimulus locking and feature selectivity prevail in complementary frequency ranges of V1 local field potentials. Eur J Neurosci. 2004; 19(2): 485–489. PubMed Abstract | Publisher Full Text
39. Belitski A, Gretton A, Magri C, et al.: Low-frequency local field potentials and spikes in primary visual cortex convey independent visual information. J Neurosci. 2008; 28(22): 5696–5709. PubMed Abstract | Publisher Full Text
40. Albright TD, Stoner GR: Contextual influences on visual processing. Annu Rev Neurosci. 2002; 25: 339–379. PubMed Abstract | Publisher Full Text
41. Coen-Cagli R, Dayan P, Schwartz O: Cortical Surround Interactions and Perceptual Salience via Natural Scene Statistics. PLoS Comput Biol. 2012; 8(3): e1002405. PubMed Abstract | Publisher Full Text | Free Full Text
42. Maffei L, Fiorentini A: The unresponsive regions of visual cortical receptive fields. Vision Res. 1976; 16(10): 1131–1139. PubMed Abstract | Publisher Full Text
43. Jones HE, Grieve KL, Wang W, et al.: Surround suppression in primate V1. J Neurophysiol. 2001; 86(4): 2011–2028. PubMed Abstract
44. Walker GA, Ohzawa I, Freeman RD: Asymmetric suppression outside the classical receptive field of the visual cortex. J Neurosci. 1999; 19(23): 10536–10553. PubMed Abstract
45. Toth LJ, Rao SC, Kim DS, et al.: Subthreshold facilitation and suppression in primary visual cortex revealed by intrinsic signal imaging. Proc Natl Acad Sci U S A. 1996; 93(18): 9869–9874. PubMed Abstract | Publisher Full Text | Free Full Text
46. Sengpiel F, Sen A, Blakemore C: Characteristics of surround inhibition in cat area 17. Exp Brain Res. 1997; 116(2): 216–228. PubMed Abstract
47. Kapadia MK, Westheimer G, Gilbert CD: Dynamics of spatial summation in primary visual cortex of alert monkeys. Proc Natl Acad Sci U S A. 1999; 96(21): 12073–12078. PubMed Abstract | Publisher Full Text | Free Full Text
48. Nelson JI, Frost BJ: Intracortical facilitation among co-oriented, co-axially aligned simple cells in cat striate cortex. Exp Brain Res. 1985; 61(1): 54–61. PubMed Abstract
49. Kapadia MK, Ito M, Gilbert CD, et al.: Improvement in visual sensitivity by changes in local context: parallel studies in human observers and in V1 of alert monkeys. Neuron. 1995; 15(4): 843–856. PubMed Abstract | Publisher Full Text
50. Levitt JB, Lund JS: Contrast dependence of contextual effects in primate visual cortex. Nature. 1997; 387(6628): 73–76. PubMed Abstract | Publisher Full Text
51. Chisum HJ, Mooser F, Fitzpatrick D: Emergent properties of layer 2/3 neurons reflect the collinear arrangement of horizontal connections in tree shrew visual cortex. J Neurosci. 2003; 23(7): 2947–2960. PubMed Abstract
52. Stemmler M, Usher M, Niebur E: Lateral interactions in primary visual cortex: a model bridging physiology and psychophysics. Science. 1995; 269(5232): 1877–1880. PubMed Abstract | Publisher Full Text
53. Gilad A, Meirovithz E, Leshem A, et al.: Collinear stimuli induce local and cross-areal coherence in the visual cortex of behaving monkeys. PLoS One. 2012; 7(11): e49391. PubMed Abstract | Publisher Full Text | Free Full Text
54. Blakemore C, Tobin EA: Lateral inhibition between orientation detectors in the cat’s visual cortex. Exp Brain Res. 1972; 15(4): 439–440. PubMed Abstract
55. Knierim JJ, van Essen DC: Neuronal responses to static texture patterns in area V1 of the alert macaque monkey. J Neurophysiol. 1992; 67(4): 961–980. PubMed Abstract
56. Kitano M, Niiyama K, Kasamatsu T, et al.: Retinotopic and nonretinotopic field potentials in cat visual cortex. Vis Neurosci. 1994; 11(5): 953–977. PubMed Abstract
57. Li CY, Li W: Extensive integration field beyond the classical receptive field of cat’s striate cortical neurons--classification and tuning properties. Vision Res. 1994; 34(18): 2337–2355. PubMed Abstract | Publisher Full Text
58. Lee SH, Blake R: Visual form created solely from temporal structure. Science. 1999; 284(5417): 1165–1168. PubMed Abstract | Publisher Full Text
59. Kayser C, Einhäuser W, König P: Temporal correlations of orientations in natural scenes. Neurocomputing. 2003; 52–54: 117–123. Publisher Full Text
60. Humphrey AL, Sur M, Uhlrich DJ, et al.: Termination patterns of individual X- and Y-cell axons in the visual cortex of the cat: projections to area 18, to the 17/18 border region, and to both areas 17 and 18. J Comp Neurol. 1985; 233(2): 190–212. PubMed Abstract | Publisher Full Text
61. Chapman B, Zahs KR, Stryker MP: Relation of cortical cell orientation selectivity to alignment of receptive fields of the geniculocortical afferents that arborize within a single orientation column in ferret visual cortex. J Neurosci. 1991; 11(5): 1347–1358. PubMed Abstract
62. Ayzenshtat I, Meirovithz E, Edelman H, et al.: Precise Spatiotemporal Patterns among Visual Cortical Areas and Their Relation to Visual Stimulus Processing. J Neurosci. 2010; 30(33): 11232–11245. PubMed Abstract | Publisher Full Text
63. Li W, Piëch V, Gilbert CD: Learning to link visual contours. Neuron. 2008; 57(3): 442–451. PubMed Abstract | Publisher Full Text | Free Full Text
64. Roland PE, Hanazawa A, Undeman C, et al.: Cortical feedback depolarization waves: a mechanism of top-down influence on early visual areas. Proc Natl Acad Sci U S A. 2006; 103(33): 12586–12591. PubMed Abstract | Publisher Full Text | Free Full Text
65. Xu W, Huang X, Takagaki K, et al.: Compression and Reflection of Visually Evoked Cortical Waves. Neuron. 2007; 55(1): 119–129. PubMed Abstract | Publisher Full Text | Free Full Text
66. Lamme VA, Roelfsema PR: The distinct modes of vision offered by feedforward and recurrent processing. Trends Neurosci. 2000; 23(11): 571–579. PubMed Abstract | Publisher Full Text
67. Hupé JM, James AC, Girard P, et al.: Feedback connections act on the early part of the responses in monkey visual cortex. J Neurophysiol. 2001; 85(1): 134–145. PubMed Abstract
68. Angelucci A, Levitt JB, Walton EJS, et al.: Circuits for Local and Global Signal Integration in Primary Visual Cortex. J Neurosci. 2002; 22(19): 8633–8646. PubMed Abstract
69. Eriksson D, Tompa T, Roland PE: Non-linear population firing rates and voltage sensitive dye signals in visual areas 17 and 18 to short duration stimuli. PLoS One. 2008; 3(7): e2673. PubMed Abstract | Publisher Full Text | Free Full Text
70. Sit YF, Chen Y, Geisler WS, et al.: Complex dynamics of V1 population responses explained by a simple gain-control model. Neuron. 2009; 64(6): 943–956. PubMed Abstract | Publisher Full Text | Free Full Text
71. Berger T, Borgdorff A, Crochet S, et al.: Combined voltage and calcium epifluorescence imaging in vitro and in vivo reveals subthreshold and suprathreshold dynamics of mouse barrel cortex. J Neurophysiol. 2007; 97(5): 3751–3762. PubMed Abstract | Publisher Full Text
72. Chen Y, Palmer CR, Seidemann E: The relationship between voltage-sensitive dye imaging signals and spiking activity of neural populations in primate V1. J Neurophysiol. 2012; 107(12): 3281–3295. PubMed Abstract | Publisher Full Text | Free Full Text
73. Anderson JS, Lampl I, Gillespie DC, et al.: The contribution of noise to contrast invariance of orientation tuning in cat visual cortex. Science. 2000; 290(5498): 1968–1972. PubMed Abstract | Publisher Full Text
74. Finn IM, Priebe NJ, Ferster D: The emergence of contrast-invariant orientation tuning in simple cells of cat visual cortex. Neuron. 2007; 54(1): 137–152. PubMed Abstract | Publisher Full Text | Free Full Text

Comments on this article Comments (0)

Version 2

VERSION 2 PUBLISHED 15 Feb 2013

Author details Author details

Competing interests

No competing interests were disclosed.

Grant information

This work has been financed by the Bundesministerium für Bildung und Forschung (BMBF), the Deutsche Forschungsgemeinschaft (DFG) SFB-874 (TP A2 Eysel, Jancke), the Bernstein Group for Computational Neuroscience Bochum, the German-Israeli Project Cooperation (JA 945/3-1, SL 185/1-1), the International Graduate
School of Neuroscience (IGSN) Ruhr-University Bochum, Germany and the EU commission (FP7-ICT-270212, eSMCs).
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Article Versions (2)

version 2

Published: 02 Aug 2013, 2:51

https://doi.org/10.12688/f1000research.2-51.v2

version 1

Published: 15 Feb 2013, 2:51

https://doi.org/10.12688/f1000research.2-51.v1

© 2013 Onat S et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Data associated with the article are available under the terms of the Creative Commons Zero "No rights reserved" data waiver (CC0 1.0 Public domain dedication).

Download

Export To

metrics

	Views	Downloads
F1000Research	-	-
PubMed Central Data from PMC are received and updated monthly.	-	-

Citations

SEE MORE DETAILS

CITE

how to cite this article

Onat S, Jancke D and König P. Cortical long-range interactions embed statistical knowledge of natural sensory input: a voltage-sensitive dye imaging study [version 2; peer review: 2 approved] F1000Research 2013, 2:51 (https://doi.org/10.12688/f1000research.2-51.v2)

NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.

track

receive updates on this article

Track an article to receive email alerts on any updates to this article.

Open Peer Review

Current Reviewer Status: ?

Key to Reviewer Statuses VIEW HIDE

ApprovedThe paper is scientifically sound in its current form and only minor, if any, improvements are suggested

Approved with reservations A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.

Not approvedFundamental flaws in the paper seriously undermine the findings and conclusions

Version 2

VERSION 2

PUBLISHED 02 Aug 2013

Views

Reviewer Report 04 Mar 2014

Diego Contreras, Department of Neuroscience, University of Pennsylvania School of Medicine, Philadelphia, PA, USA

Approved

https://doi.org/10.5256/f1000research.1959.r3974

I confirm that I have read this submission and believe that I have an ... Continue reading

CITE

Report a concern

Respond or Comment

Views

Reviewer Report 23 Jan 2014

Marcello Rosa, Faculty of Medicine, Monash University, Melbourne, Victoria, Australia

Approved

https://doi.org/10.5256/f1000research.1959.r3280

I confirm that I have read this submission and believe that I have an ... Continue reading

CITE

Report a concern

Respond or Comment

Version 1

VERSION 1

PUBLISHED 15 Feb 2013

Views

Reviewer Report 06 Mar 2013

Diego Contreras, Department of Neuroscience, University of Pennsylvania School of Medicine, Philadelphia, PA, USA

Approved

https://doi.org/10.5256/f1000research.1179.r811

What is the point of using “ecological” stimuli if the response is then reduced to 6 parameters? How is that different from using drifting gratings? The authors should either provide the comparison with simpler stimuli or better justify their choices.

What is the point of using “ecological” stimuli if the response is then reduced to 6 parameters? How is that different from using drifting gratings? The authors should either provide the comparison with simpler stimuli or better justify their choices. The main point here is that the larger summation of two stimuli from the same movie versus two stimuli from different movies may be simply due to objects that span the two masks generating a similarity in the orientation and spatial frequency in the two stimuli, which, in turn, would lead to very well-known orientation-specific effects of contextual stimuli.
How is presenting another stimulus “contextual”? the stimuli where either one or two stimuli of identical size. It would be easier to read if the authors used only one terminology, such as single, coherent and incoherent.
How was a 2D Gaussian fit to the response of one of the stimulus while presenting both? The response is merged, doesn't this affect the fitting procedure?
What was learned from the multi-unit recordings?
Figure 2: It is pretty but mostly useless without quantification, the authors should show values of dF/F or contour plots of the significant response (3 or 5 SDs above baseline noise) so that the movies can be properly compared.
It is remarkable that the responses to the same mask in movie 1 and 2 are so different, are they really statistically different? Is the VSD signal capable of discriminating the differences between the two movies? Looking at the movies it seems that they have similar spectrotemporal composition.
Where were the values presented on top of page 5 taken from? What area? What time?
Figure 4:What is “modelized”? Does figure 4A not represent data?

Minor Points:
Page 4, 2nd column: Dissonance?
What is “superadditive”? Does it mean supralinear? Very difficult to understand, mainly because the term is used in combinations such as “superadditive facilitation”

Competing Interests: No competing interests were disclosed.

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

CITE

Report a concern

Author Response 09 Aug 2013

Selim Onat, University Medical Center Hamburg-Eppendorf, Germany

09 Aug 2013

Author Response
Point by point response to Dr Contreras:
1. These are important questions (please also see our response to question 3 from Dr Rosa below). Clearly, there exist a remarkable number of studies
... Continue reading
Point by point response to Dr Contreras:
These are important questions (please also see our response to question 3 from Dr Rosa below). Clearly, there exist a remarkable number of studies (references in main text) about contextual effects using local grating patches of certain orientations and spatial frequencies.

In our work we follow instead the hypothesis that the low-level sensory regions (if not the whole nervous system) are wired to optimally process inputs that are most likely to be received from the sensory organs. Our recent report ²⁶suggests that the cortical state (i.e. both the dynamic range and mean activity) is qualitatively different when stimulated with gratings as compared to ecological stimuli. Thus, response behavior to natural input can deviate significantly from predictions based on simple parameterized stimuli, probably due to the extensive spatial information in natural images ^{22, 29}_.
These observations may place the burden of justification on the use of simple stimuli, and not of natural stimuli as used in the present report. Many intra-areal long-range facilitation phenomena in the early visual system have been shown to occur in response to local presentations of simple stimuli. The evidence showing that these are also functional under more natural stimulation conditions that are more complex and also more ecological is not clear. For instance, both the extent and the impact of long-range connections may critically depend on the correlation structure across neuronal populations (Lindén et al., 2011), which in turn emerges from the specific spatio-temporal correlations of visual input. Therefore, in this manuscript, we aimed to demonstrate that long-range interactions are also functional and effective under more naturalistic stimulation conditions, which were so far not investigated. We provide this evidence in this manuscript and we have added a paragraph to the discussion section (paragraph 4) of our manuscript.

Finally, the choice of a low number of parameters to characterize the response is data driven. We demonstrate that our analysis captures the larger part of variance of the response as obtained by voltage sensitive dye imaging (VSDI) and is therefore a valid procedure. A comparison of the dimensionality of stimulus and response space has to take into account the non-linearity of processing in the visual system.

Our use of “contextual” is inspired by usage of the terms classical receptive field and non-classical receptive field/context in the visual sciences: Here one visual stimulus elicits a reliable response at a certain location. The other stimulus does not elicit detectable activity at that location, but as our data demonstrate modulates the response to the former stimulus. The term “context” is intended to denote that dependence. That is, we consider this neighboring information as a contextual input as it doesn’t directly influence the processing at the distant center.

In all cases, including single stimulation conditions, there were two 2D Gaussians that were combined to fit the observed responses. The parameters were estimated simultaneously. Using two Gaussians in all stimulation conditions ensured that the activation at the distant locations did not influence the fit. Most of the time, the activity spots were clearly distant and well-isolated from each other, therefore possible cross-contributions were negligible. Furthermore the center position of each single Gaussian fit was constrained around the location of the peak responses at locations A and B. This helped avoiding complications where Gaussians could overfit noise. We now included this information in the manuscript (Methods, page 8, last paragraph).

Electrical recordings were performed to allow for rapid retinotopic hand mapping of the imaged area before VSDI. Electrical recordings parallel to VSDI were done to verify occurrence of evoked spiking activity at those cortical locations where high levels of the dye signal was observed, i.e. around the center representation of one of the local movie patches. Note that a single recording site was used. Thus, analysis of reciprocal interactions at two locations simultaneously, as resolved by VSDI, was not feasible. Analysis of the electrical recordings related to particular stimulus features will be subject of a following report.

The colorbar depicts x10^-4 dF/F values. This is now added to the legends of Figures 2 to 4. Pixels with an activity smaller than 1.5x10^-4 dF/F are not shown. This corresponds to the 75^th percentile of the activity distribution, as noted on the figure legend. Responses of each single movie are depicted with their own colormap so that the precise detailed structure of the evoked responses can be clearly seen for each single condition. Using a common color scale would basically eliminate the possibility to clearly see responses to movies that evoked weaker responses. Furthermore, the colorbar helps comparison of amplitudes across different movies and positions.

VSDI can clearly distinguish these differences. In Onat et al.¹³, we have shown that VSDI can distinguish diverse stimulus related components in response to drifting gratings including actual position, i.e. the retinotopic trajectory, of the grating stripes. In the current manuscript, in Figure 2, it is also possible to see that the same movie during double or single conditions leads to similar temporal activity profiles (as analyzed in Figure 5).

These are peak values computed across the entire stimulation duration and all recorded pixels.

Figure 4A (top row) presents the spatial activity profiles observed on the cortical surface that are reconstructed using the Gaussian parameters (mu, sigma, amp) averaged across ROIs and cats. Please notice that averaging the raw recorded data across cats and ROIs (A and B) is not possible. This is due to the large differences in the shape of the activity profiles across different ROIs and cats, therefore we reconstructed these activity profiles using the fitted parameters. Each individual data point that contributes to these spatial profiles are depicted in Figure 4B. We replaced the word “modelized” with “modeled”. You might view this analysis technique as a low dimensional parametric description of the experimental data.

Minor Points

We used the term dissonance to describe the perceptual quality of movie patches originating from different movies when shown simultaneously.

We used the term super-additive, to indicate a response that is more than the sum of the two individual components when these individual components are presented simultaneously. The meaning is essentially the same as supralinear, although in the literature super-additive is used more often.

Reference numbers refer to the reference list in the manuscript
Point by point response to Dr Contreras:
These are important questions (please also see our response to question 3 from Dr Rosa below). Clearly, there exist a remarkable number of studies (references in main text) about contextual effects using local grating patches of certain orientations and spatial frequencies.

In our work we follow instead the hypothesis that the low-level sensory regions (if not the whole nervous system) are wired to optimally process inputs that are most likely to be received from the sensory organs. Our recent report ²⁶suggests that the cortical state (i.e. both the dynamic range and mean activity) is qualitatively different when stimulated with gratings as compared to ecological stimuli. Thus, response behavior to natural input can deviate significantly from predictions based on simple parameterized stimuli, probably due to the extensive spatial information in natural images ^{22, 29}_.
These observations may place the burden of justification on the use of simple stimuli, and not of natural stimuli as used in the present report. Many intra-areal long-range facilitation phenomena in the early visual system have been shown to occur in response to local presentations of simple stimuli. The evidence showing that these are also functional under more natural stimulation conditions that are more complex and also more ecological is not clear. For instance, both the extent and the impact of long-range connections may critically depend on the correlation structure across neuronal populations (Lindén et al., 2011), which in turn emerges from the specific spatio-temporal correlations of visual input. Therefore, in this manuscript, we aimed to demonstrate that long-range interactions are also functional and effective under more naturalistic stimulation conditions, which were so far not investigated. We provide this evidence in this manuscript and we have added a paragraph to the discussion section (paragraph 4) of our manuscript.

Finally, the choice of a low number of parameters to characterize the response is data driven. We demonstrate that our analysis captures the larger part of variance of the response as obtained by voltage sensitive dye imaging (VSDI) and is therefore a valid procedure. A comparison of the dimensionality of stimulus and response space has to take into account the non-linearity of processing in the visual system.

Our use of “contextual” is inspired by usage of the terms classical receptive field and non-classical receptive field/context in the visual sciences: Here one visual stimulus elicits a reliable response at a certain location. The other stimulus does not elicit detectable activity at that location, but as our data demonstrate modulates the response to the former stimulus. The term “context” is intended to denote that dependence. That is, we consider this neighboring information as a contextual input as it doesn’t directly influence the processing at the distant center.

In all cases, including single stimulation conditions, there were two 2D Gaussians that were combined to fit the observed responses. The parameters were estimated simultaneously. Using two Gaussians in all stimulation conditions ensured that the activation at the distant locations did not influence the fit. Most of the time, the activity spots were clearly distant and well-isolated from each other, therefore possible cross-contributions were negligible. Furthermore the center position of each single Gaussian fit was constrained around the location of the peak responses at locations A and B. This helped avoiding complications where Gaussians could overfit noise. We now included this information in the manuscript (Methods, page 8, last paragraph).

Electrical recordings were performed to allow for rapid retinotopic hand mapping of the imaged area before VSDI. Electrical recordings parallel to VSDI were done to verify occurrence of evoked spiking activity at those cortical locations where high levels of the dye signal was observed, i.e. around the center representation of one of the local movie patches. Note that a single recording site was used. Thus, analysis of reciprocal interactions at two locations simultaneously, as resolved by VSDI, was not feasible. Analysis of the electrical recordings related to particular stimulus features will be subject of a following report.

The colorbar depicts x10^-4 dF/F values. This is now added to the legends of Figures 2 to 4. Pixels with an activity smaller than 1.5x10^-4 dF/F are not shown. This corresponds to the 75^th percentile of the activity distribution, as noted on the figure legend. Responses of each single movie are depicted with their own colormap so that the precise detailed structure of the evoked responses can be clearly seen for each single condition. Using a common color scale would basically eliminate the possibility to clearly see responses to movies that evoked weaker responses. Furthermore, the colorbar helps comparison of amplitudes across different movies and positions.

VSDI can clearly distinguish these differences. In Onat et al.¹³, we have shown that VSDI can distinguish diverse stimulus related components in response to drifting gratings including actual position, i.e. the retinotopic trajectory, of the grating stripes. In the current manuscript, in Figure 2, it is also possible to see that the same movie during double or single conditions leads to similar temporal activity profiles (as analyzed in Figure 5).

These are peak values computed across the entire stimulation duration and all recorded pixels.

Figure 4A (top row) presents the spatial activity profiles observed on the cortical surface that are reconstructed using the Gaussian parameters (mu, sigma, amp) averaged across ROIs and cats. Please notice that averaging the raw recorded data across cats and ROIs (A and B) is not possible. This is due to the large differences in the shape of the activity profiles across different ROIs and cats, therefore we reconstructed these activity profiles using the fitted parameters. Each individual data point that contributes to these spatial profiles are depicted in Figure 4B. We replaced the word “modelized” with “modeled”. You might view this analysis technique as a low dimensional parametric description of the experimental data.

Minor Points

We used the term dissonance to describe the perceptual quality of movie patches originating from different movies when shown simultaneously.

We used the term super-additive, to indicate a response that is more than the sum of the two individual components when these individual components are presented simultaneously. The meaning is essentially the same as supralinear, although in the literature super-additive is used more often.

Reference numbers refer to the reference list in the manuscript
Competing Interests: No competing interests were disclosed. Close
Report a concern
Respond or Comment

COMMENTS ON THIS REPORT

Author Response 09 Aug 2013

Selim Onat, University Medical Center Hamburg-Eppendorf, Germany

09 Aug 2013

Author Response
Point by point response to Dr Contreras:
1. These are important questions (please also see our response to question 3 from Dr Rosa below). Clearly, there exist a remarkable number of studies
... Continue reading
Point by point response to Dr Contreras:
These are important questions (please also see our response to question 3 from Dr Rosa below). Clearly, there exist a remarkable number of studies (references in main text) about contextual effects using local grating patches of certain orientations and spatial frequencies.

In our work we follow instead the hypothesis that the low-level sensory regions (if not the whole nervous system) are wired to optimally process inputs that are most likely to be received from the sensory organs. Our recent report ²⁶suggests that the cortical state (i.e. both the dynamic range and mean activity) is qualitatively different when stimulated with gratings as compared to ecological stimuli. Thus, response behavior to natural input can deviate significantly from predictions based on simple parameterized stimuli, probably due to the extensive spatial information in natural images ^{22, 29}_.
These observations may place the burden of justification on the use of simple stimuli, and not of natural stimuli as used in the present report. Many intra-areal long-range facilitation phenomena in the early visual system have been shown to occur in response to local presentations of simple stimuli. The evidence showing that these are also functional under more natural stimulation conditions that are more complex and also more ecological is not clear. For instance, both the extent and the impact of long-range connections may critically depend on the correlation structure across neuronal populations (Lindén et al., 2011), which in turn emerges from the specific spatio-temporal correlations of visual input. Therefore, in this manuscript, we aimed to demonstrate that long-range interactions are also functional and effective under more naturalistic stimulation conditions, which were so far not investigated. We provide this evidence in this manuscript and we have added a paragraph to the discussion section (paragraph 4) of our manuscript.

Finally, the choice of a low number of parameters to characterize the response is data driven. We demonstrate that our analysis captures the larger part of variance of the response as obtained by voltage sensitive dye imaging (VSDI) and is therefore a valid procedure. A comparison of the dimensionality of stimulus and response space has to take into account the non-linearity of processing in the visual system.

Our use of “contextual” is inspired by usage of the terms classical receptive field and non-classical receptive field/context in the visual sciences: Here one visual stimulus elicits a reliable response at a certain location. The other stimulus does not elicit detectable activity at that location, but as our data demonstrate modulates the response to the former stimulus. The term “context” is intended to denote that dependence. That is, we consider this neighboring information as a contextual input as it doesn’t directly influence the processing at the distant center.

In all cases, including single stimulation conditions, there were two 2D Gaussians that were combined to fit the observed responses. The parameters were estimated simultaneously. Using two Gaussians in all stimulation conditions ensured that the activation at the distant locations did not influence the fit. Most of the time, the activity spots were clearly distant and well-isolated from each other, therefore possible cross-contributions were negligible. Furthermore the center position of each single Gaussian fit was constrained around the location of the peak responses at locations A and B. This helped avoiding complications where Gaussians could overfit noise. We now included this information in the manuscript (Methods, page 8, last paragraph).

Electrical recordings were performed to allow for rapid retinotopic hand mapping of the imaged area before VSDI. Electrical recordings parallel to VSDI were done to verify occurrence of evoked spiking activity at those cortical locations where high levels of the dye signal was observed, i.e. around the center representation of one of the local movie patches. Note that a single recording site was used. Thus, analysis of reciprocal interactions at two locations simultaneously, as resolved by VSDI, was not feasible. Analysis of the electrical recordings related to particular stimulus features will be subject of a following report.

The colorbar depicts x10^-4 dF/F values. This is now added to the legends of Figures 2 to 4. Pixels with an activity smaller than 1.5x10^-4 dF/F are not shown. This corresponds to the 75^th percentile of the activity distribution, as noted on the figure legend. Responses of each single movie are depicted with their own colormap so that the precise detailed structure of the evoked responses can be clearly seen for each single condition. Using a common color scale would basically eliminate the possibility to clearly see responses to movies that evoked weaker responses. Furthermore, the colorbar helps comparison of amplitudes across different movies and positions.

VSDI can clearly distinguish these differences. In Onat et al.¹³, we have shown that VSDI can distinguish diverse stimulus related components in response to drifting gratings including actual position, i.e. the retinotopic trajectory, of the grating stripes. In the current manuscript, in Figure 2, it is also possible to see that the same movie during double or single conditions leads to similar temporal activity profiles (as analyzed in Figure 5).

These are peak values computed across the entire stimulation duration and all recorded pixels.

Figure 4A (top row) presents the spatial activity profiles observed on the cortical surface that are reconstructed using the Gaussian parameters (mu, sigma, amp) averaged across ROIs and cats. Please notice that averaging the raw recorded data across cats and ROIs (A and B) is not possible. This is due to the large differences in the shape of the activity profiles across different ROIs and cats, therefore we reconstructed these activity profiles using the fitted parameters. Each individual data point that contributes to these spatial profiles are depicted in Figure 4B. We replaced the word “modelized” with “modeled”. You might view this analysis technique as a low dimensional parametric description of the experimental data.

Minor Points

We used the term dissonance to describe the perceptual quality of movie patches originating from different movies when shown simultaneously.

We used the term super-additive, to indicate a response that is more than the sum of the two individual components when these individual components are presented simultaneously. The meaning is essentially the same as supralinear, although in the literature super-additive is used more often.

Reference numbers refer to the reference list in the manuscript
Point by point response to Dr Contreras:
These are important questions (please also see our response to question 3 from Dr Rosa below). Clearly, there exist a remarkable number of studies (references in main text) about contextual effects using local grating patches of certain orientations and spatial frequencies.

In our work we follow instead the hypothesis that the low-level sensory regions (if not the whole nervous system) are wired to optimally process inputs that are most likely to be received from the sensory organs. Our recent report ²⁶suggests that the cortical state (i.e. both the dynamic range and mean activity) is qualitatively different when stimulated with gratings as compared to ecological stimuli. Thus, response behavior to natural input can deviate significantly from predictions based on simple parameterized stimuli, probably due to the extensive spatial information in natural images ^{22, 29}_.
These observations may place the burden of justification on the use of simple stimuli, and not of natural stimuli as used in the present report. Many intra-areal long-range facilitation phenomena in the early visual system have been shown to occur in response to local presentations of simple stimuli. The evidence showing that these are also functional under more natural stimulation conditions that are more complex and also more ecological is not clear. For instance, both the extent and the impact of long-range connections may critically depend on the correlation structure across neuronal populations (Lindén et al., 2011), which in turn emerges from the specific spatio-temporal correlations of visual input. Therefore, in this manuscript, we aimed to demonstrate that long-range interactions are also functional and effective under more naturalistic stimulation conditions, which were so far not investigated. We provide this evidence in this manuscript and we have added a paragraph to the discussion section (paragraph 4) of our manuscript.

Finally, the choice of a low number of parameters to characterize the response is data driven. We demonstrate that our analysis captures the larger part of variance of the response as obtained by voltage sensitive dye imaging (VSDI) and is therefore a valid procedure. A comparison of the dimensionality of stimulus and response space has to take into account the non-linearity of processing in the visual system.

Our use of “contextual” is inspired by usage of the terms classical receptive field and non-classical receptive field/context in the visual sciences: Here one visual stimulus elicits a reliable response at a certain location. The other stimulus does not elicit detectable activity at that location, but as our data demonstrate modulates the response to the former stimulus. The term “context” is intended to denote that dependence. That is, we consider this neighboring information as a contextual input as it doesn’t directly influence the processing at the distant center.

In all cases, including single stimulation conditions, there were two 2D Gaussians that were combined to fit the observed responses. The parameters were estimated simultaneously. Using two Gaussians in all stimulation conditions ensured that the activation at the distant locations did not influence the fit. Most of the time, the activity spots were clearly distant and well-isolated from each other, therefore possible cross-contributions were negligible. Furthermore the center position of each single Gaussian fit was constrained around the location of the peak responses at locations A and B. This helped avoiding complications where Gaussians could overfit noise. We now included this information in the manuscript (Methods, page 8, last paragraph).

Electrical recordings were performed to allow for rapid retinotopic hand mapping of the imaged area before VSDI. Electrical recordings parallel to VSDI were done to verify occurrence of evoked spiking activity at those cortical locations where high levels of the dye signal was observed, i.e. around the center representation of one of the local movie patches. Note that a single recording site was used. Thus, analysis of reciprocal interactions at two locations simultaneously, as resolved by VSDI, was not feasible. Analysis of the electrical recordings related to particular stimulus features will be subject of a following report.

The colorbar depicts x10^-4 dF/F values. This is now added to the legends of Figures 2 to 4. Pixels with an activity smaller than 1.5x10^-4 dF/F are not shown. This corresponds to the 75^th percentile of the activity distribution, as noted on the figure legend. Responses of each single movie are depicted with their own colormap so that the precise detailed structure of the evoked responses can be clearly seen for each single condition. Using a common color scale would basically eliminate the possibility to clearly see responses to movies that evoked weaker responses. Furthermore, the colorbar helps comparison of amplitudes across different movies and positions.

VSDI can clearly distinguish these differences. In Onat et al.¹³, we have shown that VSDI can distinguish diverse stimulus related components in response to drifting gratings including actual position, i.e. the retinotopic trajectory, of the grating stripes. In the current manuscript, in Figure 2, it is also possible to see that the same movie during double or single conditions leads to similar temporal activity profiles (as analyzed in Figure 5).

These are peak values computed across the entire stimulation duration and all recorded pixels.

Figure 4A (top row) presents the spatial activity profiles observed on the cortical surface that are reconstructed using the Gaussian parameters (mu, sigma, amp) averaged across ROIs and cats. Please notice that averaging the raw recorded data across cats and ROIs (A and B) is not possible. This is due to the large differences in the shape of the activity profiles across different ROIs and cats, therefore we reconstructed these activity profiles using the fitted parameters. Each individual data point that contributes to these spatial profiles are depicted in Figure 4B. We replaced the word “modelized” with “modeled”. You might view this analysis technique as a low dimensional parametric description of the experimental data.

Minor Points

We used the term dissonance to describe the perceptual quality of movie patches originating from different movies when shown simultaneously.

We used the term super-additive, to indicate a response that is more than the sum of the two individual components when these individual components are presented simultaneously. The meaning is essentially the same as supralinear, although in the literature super-additive is used more often.

Reference numbers refer to the reference list in the manuscript
Competing Interests: No competing interests were disclosed. Close
Report a concern

Views

Reviewer Report 19 Feb 2013

Marcello Rosa, Faculty of Medicine, Monash University, Melbourne, Victoria, Australia

Nicholas S. C. Price, Department of Physiology, Monash University, Melbourne, Australia

Hsin-Hao Yu, Department of Physiology, Monash University, Melbourne, Australia

Approved

https://doi.org/10.5256/f1000research.1179.r784

The authors seem to only deal with linear addition of activation. We are not sure that this is a reasonable assumption, without more information about how optical activity scales with spiking rate. Thus, supra-linear summation may not be that surprising.

The authors seem to only deal with linear addition of activation. We are not sure that this is a reasonable assumption, without more information about how optical activity scales with spiking rate. Thus, supra-linear summation may not be that surprising. i.e. the presence of one stimulus patch might lift the optical activity to close to threshold, so a second patch then seems to create a lot of activation, but it's just because there was a "hidden" threshold.
Figure 5 is pretty complicated. We had a hard time understanding it. Maybe it needs to be broken into smaller, more digestible bits of information and have a clearer legend.
This paper will be more useful if they also present data using gratings as stimuli.

Competing Interests: No competing interests were disclosed.

We confirm that we have read this submission and believe that we have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

CITE

Report a concern

Author Response 02 Aug 2013

Selim Onat, University Medical Center Hamburg-Eppendorf, Germany

02 Aug 2013

Author Response
Point by point response to Dr Rosa:
1. This is an important point. The relationship between the VSD imaging signal and spiking activity can indeed be complex. For instance, a close relationship
... Continue reading
Point by point response to Dr Rosa:

This is an important point. The relationship between the VSD imaging signal and spiking activity can indeed be complex. For instance, a close relationship between spike rate and the derivative of the VSD response rather than its magnitude has been proposed ⁶⁹_. This might especially apply to the rising phase of the membrane potential after stimulus onset ^6,70. Combined VSD and Calcium-sensitive dye imaging suggests that the relationship between spiking activity and the amplitude of the VSD response depends also on stimulus intensity ⁷¹. However, Chen et al., ⁷² found that these relationships are well-captured by a power function with an exponent of ~4 [similarly as for the relationship between average membrane potential and spike rates in single V1 neurons ^73,74, indicating that the threshold for observing significant spiking activity is about 30–40% of the maximal VSD response. Amplitudes evoked by our natural movies were on average well-within this range as compared to full-field moving gratings that we used as a standard control for maximal stimulation. Moreover, we compared the fitted peak amplitudes and this ensures that only pixels with highest amplitude at the center of the movie representations entered our analysis. Therefore, we are confident that subthreshold activity was excluded and that we describe the effect of long-range connectivity on postsynaptic spiking activity. We added this information to Discussion (page 10, before the last paragraph)

Dividing the Figure 5 might make it more difficult to understand given that the data that is shown on the scatter plots are summarized on the top panels, which represents the average observations.

Interestingly, using LFP as a population measure of postsynaptic activity, a recent study showed that superimposed gratings (i.e. plaids) lead to responses that were not predicted by activity in response to their single components (Bartolo et al., 2011). Thus, a simple parametric manipulation (addition) of most simple stimuli like gratings can lead to drastic changes in neuronal population response behavior limiting explanatory power. If this is the case, the use of gratings may be a weak predictor with respect to complex stimulation, in particular with respect to natural scenes. Please also see above for our reply to Dr Contreras who had a similar concern.
Reference numbers refer to the reference list in the manuscript
Point by point response to Dr Rosa:

This is an important point. The relationship between the VSD imaging signal and spiking activity can indeed be complex. For instance, a close relationship between spike rate and the derivative of the VSD response rather than its magnitude has been proposed ⁶⁹_. This might especially apply to the rising phase of the membrane potential after stimulus onset ^6,70. Combined VSD and Calcium-sensitive dye imaging suggests that the relationship between spiking activity and the amplitude of the VSD response depends also on stimulus intensity ⁷¹. However, Chen et al., ⁷² found that these relationships are well-captured by a power function with an exponent of ~4 [similarly as for the relationship between average membrane potential and spike rates in single V1 neurons ^73,74, indicating that the threshold for observing significant spiking activity is about 30–40% of the maximal VSD response. Amplitudes evoked by our natural movies were on average well-within this range as compared to full-field moving gratings that we used as a standard control for maximal stimulation. Moreover, we compared the fitted peak amplitudes and this ensures that only pixels with highest amplitude at the center of the movie representations entered our analysis. Therefore, we are confident that subthreshold activity was excluded and that we describe the effect of long-range connectivity on postsynaptic spiking activity. We added this information to Discussion (page 10, before the last paragraph)

Dividing the Figure 5 might make it more difficult to understand given that the data that is shown on the scatter plots are summarized on the top panels, which represents the average observations.

Interestingly, using LFP as a population measure of postsynaptic activity, a recent study showed that superimposed gratings (i.e. plaids) lead to responses that were not predicted by activity in response to their single components (Bartolo et al., 2011). Thus, a simple parametric manipulation (addition) of most simple stimuli like gratings can lead to drastic changes in neuronal population response behavior limiting explanatory power. If this is the case, the use of gratings may be a weak predictor with respect to complex stimulation, in particular with respect to natural scenes. Please also see above for our reply to Dr Contreras who had a similar concern.
Reference numbers refer to the reference list in the manuscript
Competing Interests: No competing interests were disclosed. Close
Report a concern
Respond or Comment

COMMENTS ON THIS REPORT

Author Response 02 Aug 2013

Selim Onat, University Medical Center Hamburg-Eppendorf, Germany

02 Aug 2013

Author Response
Point by point response to Dr Rosa:
1. This is an important point. The relationship between the VSD imaging signal and spiking activity can indeed be complex. For instance, a close relationship
... Continue reading
Point by point response to Dr Rosa:

This is an important point. The relationship between the VSD imaging signal and spiking activity can indeed be complex. For instance, a close relationship between spike rate and the derivative of the VSD response rather than its magnitude has been proposed ⁶⁹_. This might especially apply to the rising phase of the membrane potential after stimulus onset ^6,70. Combined VSD and Calcium-sensitive dye imaging suggests that the relationship between spiking activity and the amplitude of the VSD response depends also on stimulus intensity ⁷¹. However, Chen et al., ⁷² found that these relationships are well-captured by a power function with an exponent of ~4 [similarly as for the relationship between average membrane potential and spike rates in single V1 neurons ^73,74, indicating that the threshold for observing significant spiking activity is about 30–40% of the maximal VSD response. Amplitudes evoked by our natural movies were on average well-within this range as compared to full-field moving gratings that we used as a standard control for maximal stimulation. Moreover, we compared the fitted peak amplitudes and this ensures that only pixels with highest amplitude at the center of the movie representations entered our analysis. Therefore, we are confident that subthreshold activity was excluded and that we describe the effect of long-range connectivity on postsynaptic spiking activity. We added this information to Discussion (page 10, before the last paragraph)

Dividing the Figure 5 might make it more difficult to understand given that the data that is shown on the scatter plots are summarized on the top panels, which represents the average observations.

Interestingly, using LFP as a population measure of postsynaptic activity, a recent study showed that superimposed gratings (i.e. plaids) lead to responses that were not predicted by activity in response to their single components (Bartolo et al., 2011). Thus, a simple parametric manipulation (addition) of most simple stimuli like gratings can lead to drastic changes in neuronal population response behavior limiting explanatory power. If this is the case, the use of gratings may be a weak predictor with respect to complex stimulation, in particular with respect to natural scenes. Please also see above for our reply to Dr Contreras who had a similar concern.
Reference numbers refer to the reference list in the manuscript
Point by point response to Dr Rosa:

This is an important point. The relationship between the VSD imaging signal and spiking activity can indeed be complex. For instance, a close relationship between spike rate and the derivative of the VSD response rather than its magnitude has been proposed ⁶⁹_. This might especially apply to the rising phase of the membrane potential after stimulus onset ^6,70. Combined VSD and Calcium-sensitive dye imaging suggests that the relationship between spiking activity and the amplitude of the VSD response depends also on stimulus intensity ⁷¹. However, Chen et al., ⁷² found that these relationships are well-captured by a power function with an exponent of ~4 [similarly as for the relationship between average membrane potential and spike rates in single V1 neurons ^73,74, indicating that the threshold for observing significant spiking activity is about 30–40% of the maximal VSD response. Amplitudes evoked by our natural movies were on average well-within this range as compared to full-field moving gratings that we used as a standard control for maximal stimulation. Moreover, we compared the fitted peak amplitudes and this ensures that only pixels with highest amplitude at the center of the movie representations entered our analysis. Therefore, we are confident that subthreshold activity was excluded and that we describe the effect of long-range connectivity on postsynaptic spiking activity. We added this information to Discussion (page 10, before the last paragraph)

Dividing the Figure 5 might make it more difficult to understand given that the data that is shown on the scatter plots are summarized on the top panels, which represents the average observations.

Interestingly, using LFP as a population measure of postsynaptic activity, a recent study showed that superimposed gratings (i.e. plaids) lead to responses that were not predicted by activity in response to their single components (Bartolo et al., 2011). Thus, a simple parametric manipulation (addition) of most simple stimuli like gratings can lead to drastic changes in neuronal population response behavior limiting explanatory power. If this is the case, the use of gratings may be a weak predictor with respect to complex stimulation, in particular with respect to natural scenes. Please also see above for our reply to Dr Contreras who had a similar concern.
Reference numbers refer to the reference list in the manuscript
Competing Interests: No competing interests were disclosed. Close
Report a concern

Comments on this article Comments (0)

Version 2

VERSION 2 PUBLISHED 15 Feb 2013

Open Peer Review

Reviewer Status

Reviewer Reports

	Invited Reviewers
	1	2
Version 2 (update) 02 Aug 13	read	read
Version 1 15 Feb 13	read	read

Marcello Rosa, Monash University, Melbourne, Victoria, Australia

Hsin-Hao Yu, Monash University, Melbourne, Australia

Nicholas S. C. Price, Monash University, Melbourne, Australia
Diego Contreras, University of Pennsylvania School of Medicine, Philadelphia, PA, USA

Comments on this article

All Comments(0)

Add a comment

Browse by related subjects

Back to all reports

Reviewer Report

16 Views

04 Mar 2014 | for Version 2

Diego Contreras, Department of Neuroscience, University of Pennsylvania School of Medicine, Philadelphia, PA, USA

16 Views Cite this report Responses(0)

Approved

Competing Interests

No competing interests were disclosed.

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

Respond to this report

Responses (0)

Back to all reports

Reviewer Report

15 Views

23 Jan 2014 | for Version 2

Marcello Rosa, Faculty of Medicine, Monash University, Melbourne, Victoria, Australia

15 Views Cite this report Responses(0)

Approved

Competing Interests

No competing interests were disclosed.

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

Respond to this report

Responses (0)

Back to all reports

Reviewer Report

23 Views

06 Mar 2013 | for Version 1

Diego Contreras, Department of Neuroscience, University of Pennsylvania School of Medicine, Philadelphia, PA, USA

23 Views Cite this report Responses(1)

Approved

What is the point of using “ecological” stimuli if the response is then reduced to 6 parameters? How is that different from using drifting gratings? The authors should either provide the comparison with simpler stimuli or better justify their choices. The main point here is that the larger summation of two stimuli from the same movie versus two stimuli from different movies may be simply due to objects that span the two masks generating a similarity in the orientation and spatial frequency in the two stimuli, which, in turn, would lead to very well-known orientation-specific effects of contextual stimuli.
How is presenting another stimulus “contextual”? the stimuli where either one or two stimuli of identical size. It would be easier to read if the authors used only one terminology, such as single, coherent and incoherent.
How was a 2D Gaussian fit to the response of one of the stimulus while presenting both? The response is merged, doesn't this affect the fitting procedure?
What was learned from the multi-unit recordings?
Figure 2: It is pretty but mostly useless without quantification, the authors should show values of dF/F or contour plots of the significant response (3 or 5 SDs above baseline noise) so that the movies can be properly compared.
It is remarkable that the responses to the same mask in movie 1 and 2 are so different, are they really statistically different? Is the VSD signal capable of discriminating the differences between the two movies? Looking at the movies it seems that they have similar spectrotemporal composition.
Where were the values presented on top of page 5 taken from? What area? What time?
Figure 4:What is “modelized”? Does figure 4A not represent data?

Minor Points:
Page 4, 2nd column: Dissonance?
What is “superadditive”? Does it mean supralinear? Very difficult to understand, mainly because the term is used in combinations such as “superadditive facilitation”

Competing Interests

No competing interests were disclosed.

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

Respond to this report

Responses (1)

Author Response

09 Aug 2013

Selim Onat, University Medical Center Hamburg-Eppendorf, Germany

Point by point response to Dr Contreras:

These are important questions (please also see our response to question 3 from Dr Rosa below). Clearly, there exist a remarkable number of studies (references in main text) about contextual effects using local grating patches of certain orientations and spatial frequencies.

In our work we follow instead the hypothesis that the low-level sensory regions (if not the whole nervous system) are wired to optimally process inputs that are most likely to be received from the sensory organs. Our recent report ²⁶suggests that the cortical state (i.e. both the dynamic range and mean activity) is qualitatively different when stimulated with gratings as compared to ecological stimuli. Thus, response behavior to natural input can deviate significantly from predictions based on simple parameterized stimuli, probably due to the extensive spatial information in natural images ^{22, 29}_.
These observations may place the burden of justification on the use of simple stimuli, and not of natural stimuli as used in the present report. Many intra-areal long-range facilitation phenomena in the early visual system have been shown to occur in response to local presentations of simple stimuli. The evidence showing that these are also functional under more natural stimulation conditions that are more complex and also more ecological is not clear. For instance, both the extent and the impact of long-range connections may critically depend on the correlation structure across neuronal populations (Lindén et al., 2011), which in turn emerges from the specific spatio-temporal correlations of visual input. Therefore, in this manuscript, we aimed to demonstrate that long-range interactions are also functional and effective under more naturalistic stimulation conditions, which were so far not investigated. We provide this evidence in this manuscript and we have added a paragraph to the discussion section (paragraph 4) of our manuscript.

Finally, the choice of a low number of parameters to characterize the response is data driven. We demonstrate that our analysis captures the larger part of variance of the response as obtained by voltage sensitive dye imaging (VSDI) and is therefore a valid procedure. A comparison of the dimensionality of stimulus and response space has to take into account the non-linearity of processing in the visual system.
Our use of “contextual” is inspired by usage of the terms classical receptive field and non-classical receptive field/context in the visual sciences: Here one visual stimulus elicits a reliable response at a certain location. The other stimulus does not elicit detectable activity at that location, but as our data demonstrate modulates the response to the former stimulus. The term “context” is intended to denote that dependence. That is, we consider this neighboring information as a contextual input as it doesn’t directly influence the processing at the distant center.
In all cases, including single stimulation conditions, there were two 2D Gaussians that were combined to fit the observed responses. The parameters were estimated simultaneously. Using two Gaussians in all stimulation conditions ensured that the activation at the distant locations did not influence the fit. Most of the time, the activity spots were clearly distant and well-isolated from each other, therefore possible cross-contributions were negligible. Furthermore the center position of each single Gaussian fit was constrained around the location of the peak responses at locations A and B. This helped avoiding complications where Gaussians could overfit noise. We now included this information in the manuscript (Methods, page 8, last paragraph).
Electrical recordings were performed to allow for rapid retinotopic hand mapping of the imaged area before VSDI. Electrical recordings parallel to VSDI were done to verify occurrence of evoked spiking activity at those cortical locations where high levels of the dye signal was observed, i.e. around the center representation of one of the local movie patches. Note that a single recording site was used. Thus, analysis of reciprocal interactions at two locations simultaneously, as resolved by VSDI, was not feasible. Analysis of the electrical recordings related to particular stimulus features will be subject of a following report.
The colorbar depicts x10^-4 dF/F values. This is now added to the legends of Figures 2 to 4. Pixels with an activity smaller than 1.5x10^-4 dF/F are not shown. This corresponds to the 75^th percentile of the activity distribution, as noted on the figure legend. Responses of each single movie are depicted with their own colormap so that the precise detailed structure of the evoked responses can be clearly seen for each single condition. Using a common color scale would basically eliminate the possibility to clearly see responses to movies that evoked weaker responses. Furthermore, the colorbar helps comparison of amplitudes across different movies and positions.
VSDI can clearly distinguish these differences. In Onat et al.¹³, we have shown that VSDI can distinguish diverse stimulus related components in response to drifting gratings including actual position, i.e. the retinotopic trajectory, of the grating stripes. In the current manuscript, in Figure 2, it is also possible to see that the same movie during double or single conditions leads to similar temporal activity profiles (as analyzed in Figure 5).
These are peak values computed across the entire stimulation duration and all recorded pixels.
Figure 4A (top row) presents the spatial activity profiles observed on the cortical surface that are reconstructed using the Gaussian parameters (mu, sigma, amp) averaged across ROIs and cats. Please notice that averaging the raw recorded data across cats and ROIs (A and B) is not possible. This is due to the large differences in the shape of the activity profiles across different ROIs and cats, therefore we reconstructed these activity profiles using the fitted parameters. Each individual data point that contributes to these spatial profiles are depicted in Figure 4B. We replaced the word “modelized” with “modeled”. You might view this analysis technique as a low dimensional parametric description of the experimental data.

Minor Points
We used the term dissonance to describe the perceptual quality of movie patches originating from different movies when shown simultaneously.
We used the term super-additive, to indicate a response that is more than the sum of the two individual components when these individual components are presented simultaneously. The meaning is essentially the same as supralinear, although in the literature super-additive is used more often.

Reference numbers refer to the reference list in the manuscript

View more View less

Competing Interests

No competing interests were disclosed.

Back to all reports

Reviewer Report

16 Views

19 Feb 2013 | for Version 1

Marcello Rosa, Faculty of Medicine, Monash University, Melbourne, Victoria, Australia

Nicholas S. C. Price, Department of Physiology, Monash University, Melbourne, Australia

Hsin-Hao Yu, Department of Physiology, Monash University, Melbourne, Australia

16 Views Cite this report Responses(1)

Approved

The authors seem to only deal with linear addition of activation. We are not sure that this is a reasonable assumption, without more information about how optical activity scales with spiking rate. Thus, supra-linear summation may not be that surprising. i.e. the presence of one stimulus patch might lift the optical activity to close to threshold, so a second patch then seems to create a lot of activation, but it's just because there was a "hidden" threshold.
Figure 5 is pretty complicated. We had a hard time understanding it. Maybe it needs to be broken into smaller, more digestible bits of information and have a clearer legend.
This paper will be more useful if they also present data using gratings as stimuli.

Competing Interests

No competing interests were disclosed.

We confirm that we have read this submission and believe that we have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

Respond to this report

Responses (1)

Author Response

02 Aug 2013

Selim Onat, University Medical Center Hamburg-Eppendorf, Germany

Point by point response to Dr Rosa:

This is an important point. The relationship between the VSD imaging signal and spiking activity can indeed be complex. For instance, a close relationship between spike rate and the derivative of the VSD response rather than its magnitude has been proposed ⁶⁹_. This might especially apply to the rising phase of the membrane potential after stimulus onset ^6,70. Combined VSD and Calcium-sensitive dye imaging suggests that the relationship between spiking activity and the amplitude of the VSD response depends also on stimulus intensity ⁷¹. However, Chen et al., ⁷² found that these relationships are well-captured by a power function with an exponent of ~4 [similarly as for the relationship between average membrane potential and spike rates in single V1 neurons ^73,74, indicating that the threshold for observing significant spiking activity is about 30–40% of the maximal VSD response. Amplitudes evoked by our natural movies were on average well-within this range as compared to full-field moving gratings that we used as a standard control for maximal stimulation. Moreover, we compared the fitted peak amplitudes and this ensures that only pixels with highest amplitude at the center of the movie representations entered our analysis. Therefore, we are confident that subthreshold activity was excluded and that we describe the effect of long-range connectivity on postsynaptic spiking activity. We added this information to Discussion (page 10, before the last paragraph)
Dividing the Figure 5 might make it more difficult to understand given that the data that is shown on the scatter plots are summarized on the top panels, which represents the average observations.
Interestingly, using LFP as a population measure of postsynaptic activity, a recent study showed that superimposed gratings (i.e. plaids) lead to responses that were not predicted by activity in response to their single components (Bartolo et al., 2011). Thus, a simple parametric manipulation (addition) of most simple stimuli like gratings can lead to drastic changes in neuronal population response behavior limiting explanatory power. If this is the case, the use of gratings may be a weak predictor with respect to complex stimulation, in particular with respect to natural scenes. Please also see above for our reply to Dr Contreras who had a similar concern.

Reference numbers refer to the reference list in the manuscript

View more View less

Competing Interests

No competing interests were disclosed.

Alongside their report, reviewers assign a status to the article:

Approved - the paper is scientifically sound in its current form and only minor, if any, improvements are suggested

Approved with reservations - A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.

Not approved - fundamental flaws in the paper seriously undermine the findings and conclusions

[1] 1. Callaway EM: Local circuits in primary visual cortex of the macaque monkey. Annu Rev Neurosci. 1998; 21: 47–74. PubMed Abstract | Publisher Full Text

[2] 2. Tucker TR, Katz LC: Spatiotemporal patterns of excitation and inhibition evoked by the horizontal network in layer ⅔ of ferret visual cortex. J Neurophysiol. 2003; 89(1): 488–500. PubMed Abstract | Publisher Full Text

[3] 3. Douglas RJ, Martin KA: Recurrent neuronal circuits in the neocortex. Curr Biol. 2007; 17(13): R496–500. PubMed Abstract | Publisher Full Text

[4] 4. Gilbert CD, Wiesel TN: Columnar specificity of intrinsic horizontal and corticocortical connections in cat visual cortex. J Neurosci. 1989; 9(7): 2432–2442. PubMed Abstract

[5] 5. Bringuier V, Chavane F, Glaeser L, et al.: Horizontal propagation of visual activity in the synaptic integration field of area 17 neurons. Science. 1999; 283(5402): 695–699. PubMed Abstract | Publisher Full Text

[6] 6. Jancke D, Chavane F, Naaman S, et al.: Imaging cortical correlates of illusion in early visual cortex. Nature. 2004; 428(6981): 423–426. PubMed Abstract | Publisher Full Text

[7] 7. Jancke D, Erlhagen W, Schoner G, et al.: Shorter latencies for motion trajectories than for flashes in population responses of cat primary visual cortex. J Physiol. 2004; 556(Pt 3): 971–982. PubMed Abstract | Publisher Full Text | Free Full Text

[8] 8. Hubel DH, Wiesel TN: Sequence regularity and geometry of orientation columns in the monkey striate cortex. J Comp Neurol. 1974; 158(3): 267–293. PubMed Abstract | Publisher Full Text

[9] 9. Blasdel GG, Salama G: Voltage-sensitive dyes reveal a modular organization in monkey striate cortex. Nature. 1986; 321(6070): 579–585. PubMed Abstract | Publisher Full Text

[10] 10. Bonhoeffer T, Grinvald A: Iso-orientation domains in cat visual cortex are arranged in pinwheel-like patterns. Nature. 1991; 353(6343): 429–431. PubMed Abstract | Publisher Full Text

[11] 11. Benucci A, Frazor RA, Carandini M: Standing waves and traveling waves distinguish two circuits in visual cortex. Neuron. 2007; 55(1): 103–117. PubMed Abstract | Publisher Full Text | Free Full Text

[12] 12. Mante V, Carandini M: Mapping of stimulus energy in primary visual cortex. J Neurophysiol. 2005; 94(1): 788–798. PubMed Abstract | Publisher Full Text

[13] 13. Onat S, Nortmann N, Rekauzke S, et al.: Independent encoding of grating motion across stationary feature maps in primary visual cortex visualized with voltage-sensitive dye imaging. Neuroimage. 2011; 55(4): 1763–1770. PubMed Abstract | Publisher Full Text

[14] 14. Bosking WH, Zhang Y, Schofield B, et al.: Orientation selectivity and the arrangement of horizontal connections in tree shrew striate cortex. J Neurosci. 1997; 17(6): 2112–2127. PubMed Abstract

[15] 15. Chavane F, Jancke D, Marre O, et al.: Lateral spread of orientation selectivity in V1 is controlled by intracortical cooperativity. Front Syst Neurosci. 2011; 5: 4. PubMed Abstract | Publisher Full Text | Free Full Text

[16] 16. Allman J, Miezin F, McGuinness E: Stimulus specific responses from beyond the classical receptive field: neurophysiological mechanisms for local-global comparisons in visual neurons. Annu Rev Neurosci. 1985; 8: 407–430. PubMed Abstract | Publisher Full Text

[17] 17. Sillito AM, Grieve KL, Jones HE, et al.: Visual cortical mechanisms detecting focal orientation discontinuities. Nature. 1995; 378(6556): 492–496. PubMed Abstract | Publisher Full Text

[18] 18. Polat U, Mizobe K, Pettet MW, et al.: Collinear stimuli regulate visual responses depending on cell’s contrast threshold. Nature. 1998; 391(6667): 580–584. PubMed Abstract | Publisher Full Text

[19] 19. Series P, Lorenceau J, Fregnac Y: The "silent" surround of V1 receptive fields: theory and experiments. J Physiol Paris. 2003; 97(4–6): 453–474. PubMed Abstract | Publisher Full Text

[20] 20. Sasaki Y: Processing local signals into global patterns. Curr Opin Neurobiol. 2007; 17(2): 132–139. PubMed Abstract | Publisher Full Text

[21] 21. Simoncelli EP, Olshausen BA: Natural image statistics and neural representation. Annu Rev Neurosci. 2001; 24: 1193–1216. PubMed Abstract | Publisher Full Text

[22] 22. Olshausen BA, Field DJ: How close are we to understanding v1? Neural Comput. 2005; 17(8): 1665–1699. PubMed Abstract | Publisher Full Text

[23] 23. Howe CQ, Beau Lotto R, Purves D: Comparison of Bayesian and empirical ranking approaches to visual perception. J Theor Biol. 2006; 241(4): 866–875. PubMed Abstract | Publisher Full Text

[24] 24. Vinje WE, Gallant JL: Sparse coding and decorrelation in primary visual cortex during natural vision. Science. 2000; 287(5456): 1273–1276. Publisher Full Text

[25] 25. Touryan J, Felsen G, Dan Y: Spatial structure of complex cell receptive fields measured with natural images. Neuron. 2005; 45(5): 781–791. PubMed Abstract | Publisher Full Text

[26] 26. Onat S, König P, Jancke D: Natural Scene Evoked Population Dynamics across Cat Primary Visual Cortex Captured with Voltage-Sensitive Dye Imaging. Cereb Cortex. 2011; 21(11): 2542–2554. PubMed Abstract | Publisher Full Text

[27] 27. Felsen G, Touryan J, Han F, et al.: Cortical sensitivity to visual features in natural scenes. PLoS Biol. 2005; 3(10): e342. PubMed Abstract | Publisher Full Text | Free Full Text

[28] 28. Vinje WE, Gallant JL: Natural stimulation of the nonclassical receptive field increases information transmission efficiency in V1. J Neurosci. 2002; 22(7): 2904–2915. PubMed Abstract

[29] 29. Haider B, Krause MR, Duque A, et al.: Synaptic and network mechanisms of sparse and reliable visual cortical activity during nonclassical receptive field stimulation. Neuron. 2010; 65(1): 107–121. PubMed Abstract | Publisher Full Text | Free Full Text

[30] 30. Brenner N, Bialek W, de Ruyter van Steveninck R: Adaptive rescaling maximizes information transmission. Neuron. 2000; 26(3): 695–702. PubMed Abstract | Publisher Full Text

[31] 31. Grinvald A, Lieke EE, Frostig RD, et al.: Cortical point-spread function and long-range lateral interactions revealed by real-time optical imaging of macaque monkey primary visual cortex. J Neurosci. 1994; 14(5 Pt 1): 2545–2568. PubMed Abstract

[32] 32. Ratzlaff EH, Grinvald A: A tandem-lens epifluorescence macroscope: hundred-fold brightness advantage for wide-field imaging. J Neurosci Methods. 1991; 36(2–3): 127–137. PubMed Abstract

[33] 33. Sharon D, Jancke D, Chavane F, et al.: Cortical response field dynamics in cat visual cortex. Cereb Cortex. 2007; 17(12): 2866–2877. PubMed Abstract | Publisher Full Text

[34] 34. Kisvarday ZF, Kim DS, Eysel UT, et al.: Relationship between lateral inhibitory connections and the topography of the orientation map in cat visual cortex. Eur J Neurosci. 1994; 6(10): 1619–1632. PubMed Abstract

[35] 35. Hubel DH, Wiesel TN: Receptive fields and functional architecture in two non-striate visual areas (18 and 19) of the cat. J Neurophysiol. 1965; 28: 229–289. PubMed Abstract

[36] 36. Rockland KS, Lund JS: Widespread periodic intrinsic connections in the tree shrew visual cortex. Science. 1982; 215(4539): 1532–1534. PubMed Abstract | Publisher Full Text

[37] 37. Gilbert CD, Wiesel TN: Clustered intrinsic connections in cat visual cortex. J Neurosci. 1983; 3(5): 1116–1133. PubMed Abstract

[38] 38. Kayser C, König P: Stimulus locking and feature selectivity prevail in complementary frequency ranges of V1 local field potentials. Eur J Neurosci. 2004; 19(2): 485–489. PubMed Abstract | Publisher Full Text

[39] 39. Belitski A, Gretton A, Magri C, et al.: Low-frequency local field potentials and spikes in primary visual cortex convey independent visual information. J Neurosci. 2008; 28(22): 5696–5709. PubMed Abstract | Publisher Full Text

[40] 40. Albright TD, Stoner GR: Contextual influences on visual processing. Annu Rev Neurosci. 2002; 25: 339–379. PubMed Abstract | Publisher Full Text

[41] 41. Coen-Cagli R, Dayan P, Schwartz O: Cortical Surround Interactions and Perceptual Salience via Natural Scene Statistics. PLoS Comput Biol. 2012; 8(3): e1002405. PubMed Abstract | Publisher Full Text | Free Full Text

[42] 42. Maffei L, Fiorentini A: The unresponsive regions of visual cortical receptive fields. Vision Res. 1976; 16(10): 1131–1139. PubMed Abstract | Publisher Full Text

[43] 43. Jones HE, Grieve KL, Wang W, et al.: Surround suppression in primate V1. J Neurophysiol. 2001; 86(4): 2011–2028. PubMed Abstract

[44] 44. Walker GA, Ohzawa I, Freeman RD: Asymmetric suppression outside the classical receptive field of the visual cortex. J Neurosci. 1999; 19(23): 10536–10553. PubMed Abstract

[45] 45. Toth LJ, Rao SC, Kim DS, et al.: Subthreshold facilitation and suppression in primary visual cortex revealed by intrinsic signal imaging. Proc Natl Acad Sci U S A. 1996; 93(18): 9869–9874. PubMed Abstract | Publisher Full Text | Free Full Text

[46] 46. Sengpiel F, Sen A, Blakemore C: Characteristics of surround inhibition in cat area 17. Exp Brain Res. 1997; 116(2): 216–228. PubMed Abstract

[47] 47. Kapadia MK, Westheimer G, Gilbert CD: Dynamics of spatial summation in primary visual cortex of alert monkeys. Proc Natl Acad Sci U S A. 1999; 96(21): 12073–12078. PubMed Abstract | Publisher Full Text | Free Full Text

[48] 48. Nelson JI, Frost BJ: Intracortical facilitation among co-oriented, co-axially aligned simple cells in cat striate cortex. Exp Brain Res. 1985; 61(1): 54–61. PubMed Abstract

[49] 49. Kapadia MK, Ito M, Gilbert CD, et al.: Improvement in visual sensitivity by changes in local context: parallel studies in human observers and in V1 of alert monkeys. Neuron. 1995; 15(4): 843–856. PubMed Abstract | Publisher Full Text

[50] 50. Levitt JB, Lund JS: Contrast dependence of contextual effects in primate visual cortex. Nature. 1997; 387(6628): 73–76. PubMed Abstract | Publisher Full Text

[51] 51. Chisum HJ, Mooser F, Fitzpatrick D: Emergent properties of layer 2/3 neurons reflect the collinear arrangement of horizontal connections in tree shrew visual cortex. J Neurosci. 2003; 23(7): 2947–2960. PubMed Abstract

[52] 52. Stemmler M, Usher M, Niebur E: Lateral interactions in primary visual cortex: a model bridging physiology and psychophysics. Science. 1995; 269(5232): 1877–1880. PubMed Abstract | Publisher Full Text

[53] 53. Gilad A, Meirovithz E, Leshem A, et al.: Collinear stimuli induce local and cross-areal coherence in the visual cortex of behaving monkeys. PLoS One. 2012; 7(11): e49391. PubMed Abstract | Publisher Full Text | Free Full Text

[54] 54. Blakemore C, Tobin EA: Lateral inhibition between orientation detectors in the cat’s visual cortex. Exp Brain Res. 1972; 15(4): 439–440. PubMed Abstract

[55] 55. Knierim JJ, van Essen DC: Neuronal responses to static texture patterns in area V1 of the alert macaque monkey. J Neurophysiol. 1992; 67(4): 961–980. PubMed Abstract

[56] 56. Kitano M, Niiyama K, Kasamatsu T, et al.: Retinotopic and nonretinotopic field potentials in cat visual cortex. Vis Neurosci. 1994; 11(5): 953–977. PubMed Abstract

[57] 57. Li CY, Li W: Extensive integration field beyond the classical receptive field of cat’s striate cortical neurons--classification and tuning properties. Vision Res. 1994; 34(18): 2337–2355. PubMed Abstract | Publisher Full Text

[58] 58. Lee SH, Blake R: Visual form created solely from temporal structure. Science. 1999; 284(5417): 1165–1168. PubMed Abstract | Publisher Full Text

[59] 59. Kayser C, Einhäuser W, König P: Temporal correlations of orientations in natural scenes. Neurocomputing. 2003; 52–54: 117–123. Publisher Full Text

[60] 60. Humphrey AL, Sur M, Uhlrich DJ, et al.: Termination patterns of individual X- and Y-cell axons in the visual cortex of the cat: projections to area 18, to the 17/18 border region, and to both areas 17 and 18. J Comp Neurol. 1985; 233(2): 190–212. PubMed Abstract | Publisher Full Text

[61] 61. Chapman B, Zahs KR, Stryker MP: Relation of cortical cell orientation selectivity to alignment of receptive fields of the geniculocortical afferents that arborize within a single orientation column in ferret visual cortex. J Neurosci. 1991; 11(5): 1347–1358. PubMed Abstract

[62] 62. Ayzenshtat I, Meirovithz E, Edelman H, et al.: Precise Spatiotemporal Patterns among Visual Cortical Areas and Their Relation to Visual Stimulus Processing. J Neurosci. 2010; 30(33): 11232–11245. PubMed Abstract | Publisher Full Text

[63] 63. Li W, Piëch V, Gilbert CD: Learning to link visual contours. Neuron. 2008; 57(3): 442–451. PubMed Abstract | Publisher Full Text | Free Full Text

[64] 64. Roland PE, Hanazawa A, Undeman C, et al.: Cortical feedback depolarization waves: a mechanism of top-down influence on early visual areas. Proc Natl Acad Sci U S A. 2006; 103(33): 12586–12591. PubMed Abstract | Publisher Full Text | Free Full Text

[65] 65. Xu W, Huang X, Takagaki K, et al.: Compression and Reflection of Visually Evoked Cortical Waves. Neuron. 2007; 55(1): 119–129. PubMed Abstract | Publisher Full Text | Free Full Text

[66] 66. Lamme VA, Roelfsema PR: The distinct modes of vision offered by feedforward and recurrent processing. Trends Neurosci. 2000; 23(11): 571–579. PubMed Abstract | Publisher Full Text

[67] 67. Hupé JM, James AC, Girard P, et al.: Feedback connections act on the early part of the responses in monkey visual cortex. J Neurophysiol. 2001; 85(1): 134–145. PubMed Abstract

[68] 68. Angelucci A, Levitt JB, Walton EJS, et al.: Circuits for Local and Global Signal Integration in Primary Visual Cortex. J Neurosci. 2002; 22(19): 8633–8646. PubMed Abstract

[69] 69. Eriksson D, Tompa T, Roland PE: Non-linear population firing rates and voltage sensitive dye signals in visual areas 17 and 18 to short duration stimuli. PLoS One. 2008; 3(7): e2673. PubMed Abstract | Publisher Full Text | Free Full Text

[70] 70. Sit YF, Chen Y, Geisler WS, et al.: Complex dynamics of V1 population responses explained by a simple gain-control model. Neuron. 2009; 64(6): 943–956. PubMed Abstract | Publisher Full Text | Free Full Text

[71] 71. Berger T, Borgdorff A, Crochet S, et al.: Combined voltage and calcium epifluorescence imaging in vitro and in vivo reveals subthreshold and suprathreshold dynamics of mouse barrel cortex. J Neurophysiol. 2007; 97(5): 3751–3762. PubMed Abstract | Publisher Full Text

[72] 72. Chen Y, Palmer CR, Seidemann E: The relationship between voltage-sensitive dye imaging signals and spiking activity of neural populations in primate V1. J Neurophysiol. 2012; 107(12): 3281–3295. PubMed Abstract | Publisher Full Text | Free Full Text

[73] 73. Anderson JS, Lampl I, Gillespie DC, et al.: The contribution of noise to contrast invariance of orientation tuning in cat visual cortex. Science. 2000; 290(5498): 1968–1972. PubMed Abstract | Publisher Full Text

[74] 74. Finn IM, Priebe NJ, Ferster D: The emergence of contrast-invariant orientation tuning in simple cells of cat visual cortex. Neuron. 2007; 54(1): 137–152. PubMed Abstract | Publisher Full Text | Free Full Text

Cortical long-range interactions embed statistical knowledge of natural sensory input: a voltage-sensitive dye imaging study

Abstract

Updated Changes from Version 1

Introduction

Materials and methods

Stimulus conditions

Figure 1. Acquisition and presentation of natural movies.

Experimental setup

Data acquisition and preprocessing

Model fitting and statistical evaluation

Results

Effect of contextual stimulation on activity amplitudes

Figure 2. Dynamics of cortical activity evoked by locally presented natural stimuli.

Figure 3. Observed and modeled spatial profiles of activity patterns.

Figure 4. Distant natural stimulus exerts facilitatory effects on local activity levels.

Effect of contextual stimulation on spatial extent of activity

Effect of contextual stimulation on time-course of activity levels

Figure 5. Time-course of contextual interactions.

Discussion

Author contributions

Competing interests

Grant information

Acknowledgements

References

Comments on this article Comments (0)

Open Peer Review

Comments on this article Comments (0)

Open Peer Review

Reviewer Status

Reviewer Reports

Comments on this article

Browse by related subjects

Competing Interests Policy

Stay Updated