Elsevier

Neuroscience

Volume 330, 25 August 2016, Pages 109-120
Neuroscience

Dynamics of population coding for object views following object discrimination training

https://doi.org/10.1016/j.neuroscience.2016.05.039Get rights and content

Highlights

  • Responses in a population of monkey inferotemporal cells to prior experienced object views have been investigated.

  • High-response similarity to larger separated object views was observed later after object image presentation.

  • The results demonstrate the dynamics of population coding for object views following object discrimination.

Abstract

We have previously demonstrated that inferotemporal neurons respond to objects viewed from a range of angles, even without any prior experience in learning the associations among the views. Several models have been proposed to explain object recognition across disparate views. However, direct neuronal evidence is rare. In the present study, we focused on the response similarity of a population of inferotemporal cells to object views, following different prior experiences. Two monkeys were subjected to a task in which object discrimination across views was required. We found significantly higher neural response similarity to 30° separated views, 190 ms after object image presentation, than without any prior discrimination experience across views. The time period over which the similarity was significant began and endured similarly for 60° separated views at 190–850 ms. For 90° separated views, the time period over which the similarity was significant was shorter and started later, at 230–550 ms. The results demonstrate the dynamics of cell population activity and suggest a possible explanation for object recognition across disparate views.

Introduction

Our visual system can discriminate one object from others despite changes in viewing angle. Rotation in depth is one of the most difficult and challenging problems since features can appear drastically different from one view to another (Tarr and Bülthoff, 1995, Logothetis and Sheinberg, 1996, Ullman, 1996, Edelman, 1998, Biederman, 2000, Hayward, 2003, Kersten and Yuille, 2003, Kersten et al., 2004, Palmeri and Gauthier, 2004, Pinto et al., 2008, Poggio and Ullman, 2013). For a 3D object, it is always the case that features on the surface of an object that are closer to the observer are visible but those on farther surfaces are not. Nearby views largely share common visible object features, whereas widely separated views do not. Our previous study (Wang et al., 2005) showed that once monkeys experienced discrimination of objects from several individual views, they were able to recognize the objects over a certain range of viewpoints, but views with large angular differences required additional learning of the association among views. The association of nearby views developed intrinsically owing to their common features, whereas the underlying mechanism was different for the association of widely separated views.

The inferotemporal cortex (IT) is the final stage of the ventral cortical pathway representing information on object shape. It has been demonstrated to be critical for object recognition and discrimination (for a review, see Mishkin et al., 1983). The representation of objects in IT has been studied extensively. IT neurons usually have large receptive fields and show response tolerance to some object attributes (for a review, see Tanaka, 1996). However, reports on view invariance remain controversial. Familiarity with the targeted objects has a significant impact on the formation of stimulus selectivity. IT neurons respond tolerantly to familiar natural objects or faces over almost all viewing angles (Perrett et al., 1991, Booth and Rolls, 1998, Freiwald and Tsao, 2010, Eifuku et al., 2011), whereas for unfamiliar artificial objects, the viewing angles over which the neurons respond tolerantly are much narrower (Yamane et al., 2008, Okamura et al., 2014). For wireframe objects, the response was found to be view-dependent although with extensive prior training (Logothetis et al., 1994, Logothetis et al., 1995). In our previous experiment, even though we extensively trained the association between views of 3D artificial objects across 90° shifts in viewing angle, we failed to find any inferotemporal cell showing perfect view-invariant responses (Okamura et al., 2014). Association learning across views of the same objects broadened the cell tuning across views, but did not lead to full view invariance. All cells had tuning widths less than 60°, even following extensive long-term training. The present study asked how fully-invariant object recognition is completed, if indeed it is, in IT. We clarified the processing of view-invariant object discrimination in IT by using population activity, in addition to analysis at the single-cell level.

Section snippets

Animals

Two male macaque monkeys (Macaca fuscata) were used in the current study. We conducted a preparatory surgery to implant a titanium head holder on the skull under aseptic conditions. Each monkey was trained to sit quietly in a chair with its head position fixed by the holder. A CRT display was placed 50 cm in front of the animal. The monkeys were trained to use a lever for their responses. Eye position was measured by an infra-red system (http://staff.aist.go.jp/k.matsuda/eye/). Monkeys were

Results

We recorded single-unit activities from a total of 213 inferotemporal cells. Among the 213 cells, 117 showed significant responses to at least one of the object images. The results described hereafter are all derived from the 117 cells, 43 of which were from Monkey K and 74 cells from Monkey H. In Monkey K, 9 and 11 cells demonstrated significant responses respectively to object set A and B which were used for the prior experience in the object task, whereas 12 and 11 cells responded to set C

Discussion

In the present study, we compared the population responses of IT neurons to object images with different prior long-term experiences. We used correlations between the responses of a cell population to a pair of images to evaluate the neural distance between their representations in IT. With the prior experience of the object task, the neural distance between views of the same object were significantly smaller than views of different objects. Namely, the response similarity for a cell population

Acknowledgements

This research was supported in part by a Grant-in-Aid for Scientific Research (23500521) to G.W.

References (37)

  • S. Edelman

    Representation is representation of similarities

    Behav Brain Sci

    (1998)
  • S. Eifuku et al.

    Neural representations of personally familiar and unfamiliar faces in the anterior inferior temporal cortex of monkeys

    PLoS One

    (2011)
  • D.J. Freedman et al.

    A comparison of primate prefrontal and inferior temporal cortices during visual categorization

    J Neurosci

    (2003)
  • D.J. Freedman et al.

    Experience-dependent sharpening of visual shape selectivity in inferior temporal cortex

    Cereb Cortex

    (2006)
  • W.A. Freiwald et al.

    Functional compartmentalization and viewpoint generalization within the macaque face-processing system

    Science

    (2010)
  • P. Hayward

    Neuropharmacological key to cortical plasticity found

    Lancet Neurol

    (2003)
  • W.G. Hayward et al.

    Viewpoint dependence and object discriminability

    Psychol Sci

    (2000)
  • D. Kersten et al.

    Object perception as Bayesian inference

    Annu Rev Psychol

    (2004)
  • Cited by (5)

    • Object discrimination performance and dynamics evaluated by inferotemporal cell population activity

      2021, IBRO Neuroscience Reports
      Citation Excerpt :

      After saturation of the behavioral performance, the electrophysiological activities of single cells were recorded from the inferotemporal cortex. We pooled and re-analyzed the data previously obtained from the monkeys’ inferotemporal cortex (Okamura et al., 2014; Yamaguchi et al., 2016; Zhao et al., 2018; Okamura et al., 2018). In total, 1032 individual cells responding to images with prior training experiences were pooled, including 223, 241, 198, 203, and 167 cells from five macaque monkeys, respectively.

    • Long-term Object Discrimination at Several Viewpoints Develops Neural Substrates of View-invariant Object Recognition in Inferotemporal Cortex

      2018, Neuroscience
      Citation Excerpt :

      Training broadened the range of response tolerance to viewing angle, but no complete view invariance could be confirmed at the single-cell level. Analysis of cell population activity demonstrated significantly higher neural response similarity between views of the same objects than between views of different objects after discrimination training across views (Yamaguchi et al., 2016). Previously, we stopped discrimination training across views after saturation by replacing the task with a simple image exposure task for electrode recording.

    • Change in Event-Related Potential accompanying View-invariant Object Discrimination Learning

      2022, IEEJ Transactions on Electronics, Information and Systems
    View full text