Introduction

The brain rapidly distinguishes between new and old stimuli, which is adaptive when animals must process large amounts of sensory information (Ranganath & Rainer, 2003). Although novelty detection is universal, novelty evokes widely different behaviors across individuals. The chronic tendency to avoid novel events and situations characterizes those with behavioral inhibition or an “inhibited temperament” (Kagan, Reznick, Snidman, Gibbons, & Johnson, 1988b). Inhibited temperament is defined as the biologically-based predisposition to respond to novel object, places, and objects with wariness or avoidance behaviors. Individuals with an inhibited temperament are shy, cautious, and restrained, whereas those who are uninhibited are more often outgoing, adventurous, and spontaneous. Approximately 15% of infants are born with an inhibited temperament (Kagan, Reznick, & Snidman, 1988a), and the trait is relatively stable across development (Kagan, Reznick, & Snidman, 1987; Goodwin, Fergusson, & Horwood, 2004; Windle & Windle, 2006; Kagan et al., 1987). Inhibited temperament has robust physiological correlates including a high heart rate with low variability (Kagan et al., 1988a) and greater right frontal EEG asymmetry (Calkins, Fox, & Marshall, 1996; Fox, Henderson, Rubin, Calkins, & Schmidt, 2001). Children with an inhibited temperament are at increased risk for anxiety disorders (Kagan, Snidman, Zentner, & Peterson, 1999; Schwartz, Snidman, & Kagan, 1999), with the risk highest for those children that continue to be inhibited throughout adolescence (Chronis-Tuscano et al., 2009; Essex, Klein, Slattery, Goldsmith, & Kalin, 2010; Hirshfeld et al., 1992). Identifying the functional brain differences underlying an inhibited temperament will enhance our understanding of these behavioral biases and how they modulate the risk for anxiety disorders.

The amygdala has been regarded as an important structure in the development of an inhibited temperament (Kagan et al., 1988b). The amygdala reliably responds to all novelty, including novel faces (Schwartz et al., 2003b; Wright et al. 2003; Zald, 2003) and scenes (Blackford, Buckholtz, Avery, & Zald, 2010). Both timing and magnitude of amygdalar blood-oxygenation-level-dependent (BOLD) responses differ by temperament, with those possessing an inhibited temperament producing both faster (Blackford, Avery, Shelton, & Zald, 2009) and larger (Schwartz, Wright, Shin, Kagan, & Rauch, 2003a) responses to novel faces. In inhibited individuals, amygdala BOLD response is sustained over time when viewing newly familiarized faces; that is the amygdala fails to show the typical pattern of habituation seen in individuals with an uninhibited temperament (Blackford, Avery, Cowan, Shelton, & Zald, in press). While these studies provide initial support for the amygdala as a neural substrate of inhibited temperament, previous studies focused on only one brain region, and one origin of novelty. But, the term novel can refer to an event that is unfamiliar, or one that is unexpected.

Expectation plays a critical role in information processing, with past and current information constantly used to predict future events (Bubic, von Cramon, & Schubotz, 2010). Differences in expectancies engage the amygdala (Belova, Paton, Morrison, & Salzman, 2007; Bermpohl, Pascual-Leone, Amedi et al., 2006; Johansen, Tarpley, LeDoux, & Blair, 2010; Onoda et al., 2008). Given the key role of the response to novelty in inhibited temperament, one would predict that these individuals would also be affected by variation in expectancy. However, differences in expectancy could produce either increased amygdala BOLD response to the unexpected events or increased amygdala BOLD response to the expected events in those with an inhibited temperament. Based on the definition of inhibited temperament and the previous findings of increased amygdala BOLD response to novel or newly familiar faces (Blackford et al., 2009; Blackford et al., in press; Schwartz et al., 2003a), one might predict increased amygdala BOLD response to the unexpected, or novel, events. This finding would provide converging evidence for the theory that a hyper-excitable amygdala underlies an inhibited temperament. However, another prediction is equally plausible. Prior knowledge of an upcoming negative event may increase vigilance for that negative event. Given the conceptualization of inhibited temperament, we would predict that these individuals to demonstrate enhanced vigilance relative to those with an uninhibited temperament. A biological basis of increased vigilance may be an increased signal-to-noise ratio in neural structures specialized for the processing of threatening stimuli (Foote, Bloom, & Astonjones, 1983). Specifically, under conditions of vigilance, neural responses may be potentiated (Whalen, 1998), resulting in a larger amygdala BOLD response to the expected negative event. Our goals for this study were to examine temperament differences in expectancy effects and to search for additional neural correlates of inhibited temperament.

Methods

Subjects

Forty-two adults, with either an inhibited (n = 20; 12 females, eight males) or an uninhibited (n = 22; 12 females, ten males) temperament, participated in the study. Subjects had an average age of 21 years (SD = 0.9), were predominantly right-handed (93%), and represented a variety of ethnic groups (64% Caucasian, non-Hispanic, 12% Caucasian, Hispanic, 14% African-American, 5% Asian-American, 2% other). Inhibited and uninhibited subjects did not differ on age, gender, handedness, or ethnicity (see Table 1). Five subjects were removed from the analysis due to image quality (see Methods), resulting in a final analytic sample size of 37 subjects (18 inhibited, 19 uninhibited). The Vanderbilt University Institutional Review Board approved the study and we obtained written informed consent after providing subjects with a complete description of the study.

Table 1 Subject characteristics

Subject selection

Subjects were recruited for a larger study of temperament by placing advertisements seeking persons who were “especially shy or outgoing as a child”. Interested individuals completed two screening questionnaires measuring childhood temperament and adult temperament. Childhood temperament was assessed using the Retrospective Self-Report of Inhibition (RSRI), a 30-item measure that asks about childhood behaviors on a 1–5 Likert scale (1 = uninhibited, 5 = inhibited). The companion measure, the Current Self-Report of Inhibition (CSRI), was used to assess adult temperament (31 items; 1–5 Likert scale). The RSRI and CSRI have good reliability and construct validity in unselected samples (Cronbach's alpha RSRI = .79, ASRI = .77; Reznick, Hegeman, Kaufman, Woods, & Jacobs, 1992) and excellent reliability in this sample (Cronbach’s alpha RSRI = .97, ASRI = .97). We used normative guidelines (Reznick et al., 1992) to define cut-offs for extreme (top and bottom 15%) temperament scores on both of the temperament questionnaires. The use of two extreme groups is based on the original description of inhibited temperament and is commonly used in studies of this temperament trait (Blackford et al., 2009; Guyer et al., 2006; Perez-Edgar et al., 2007; Schwartz et al., 2003a; Bar-Haim et al., 2009; Beaton et al., 2008). In addition, the extreme groups approach (Nebert, 2000) provides two homogenous and distinguishable behavioral profiles, which maximizes the likelihood of discovering differences in underlying brain function. Furthermore, we specifically recruit individuals with a stable pattern of inhibition (or lack thereof) because individuals who were inhibited as both children and still as adults are a more homogenous group than those that were only inhibited as children; and individuals who are still inhibited as adults are at greater risk for anxiety disorders compared to those who are no longer inhibited (Chronis-Tuscano et al., 2009; Essex et al., 2010; Hirshfeld et al., 1992).

For this study, we enrolled subjects who: were between 18 and 30 years of age; had extreme retrospective and current temperament scores; passed an MRI safety screen; and were free of psychoactive medications, substance abuse during the past six months, major medical illness, and history of brain trauma. Subjects were not excluded for psychiatric illness because inhibited temperament is associated with increased rates of anxiety. Psychiatric status was assessed by a trained clinical interviewer using the Structured Clinical Interview for DSM-IV (Spitzer, Williams, Gibbon, & First, 1992). Consistent with other studies (Blackford et al., 2009; Schwartz et al., 1999), inhibited subjects had higher rates of current internalizing disorders compared to the uninhibited group. Of the inhibited subjects, six met criteria for at least one anxiety disorder (5 Social Anxiety Disorder, 3 Generalized Anxiety Disorder, 1 Specific Phobia, 1 Anxiety Not Otherwise Specified [NOS]) and two met criteria for Dysthymia (1 comorbid with anxiety). Of the uninhibited subjects, one met criteria for Post Traumatic Stress Disorder and one had comorbid Depression NOS and Anxiety NOS.

Experimental design

To study temperament differences in expectancy effects, we experimentally manipulated expectation between-subjects within each of the two temperament groups (see Fig. 1). All subjects had previously seen a series of human faces displaying a neutral facial expression during a prior task. For half of the subjects, an expectation was created by stating that they would be seeing “some pictures of people making a scared face” followed by an example picture. The other half of the subjects were told that they would be seeing “some more pictures of faces” and did not see an example, providing a baseline, no expectation, condition. We chose a between-subjects design to maximize the effect of the expectancy manipulation because once the fear faces were shown, future expectations would be changed, especially for the no expectation condition. Assignment to an expectation condition was stratified by temperament and gender. Thus, the design was completely between-groups with Temperament (Inhibited/Uninhibited) and Expectation (Expected/Not Expected) as factors (see Fig. 1).

Fig. 1
figure 1

fMRI task design. We used a block design with four blocks of 12 fear faces per run. a Temperament Group and Expectancy Condition were both between-subjects factors. b Half of each Temperament Group saw the Expected instructions, which read “In this block of trials I am going to show you some pictures of people making a scared face. In a minute, I’ll show you an example.” and were followed by an example picture. The other half of each Temperament Group saw the Not Expected instructions, which read “In this block of trials I am going to show you some more pictures of faces.” and were followed by a blank screen

fMRI task

In the fMRI task, subjects passively viewed images of humans displaying a scared facial expression or “fear face.” We chose fear faces for our stimuli because facial expressions of fear reliably produce the strongest amygdala activations (Sergerie, Chochol, & Armony, 2008; Zald, 2003) and are more unfamiliar than other facial expressions (Somerville & Whalen, 2006). The fear faces were presented in four blocks during a single 98-s run. We used a block design to increase our power to detect activation differences (Birn, Cox, & Bandettini, 2002). Similarly, we chose a short session to maximize amygdala activation since the amygdala habituates rapidly with repeated exposure (Breiter et al., 1996; Fischer et al., 2003; Wright et al., 2001). Within the run, there were four 12-s image blocks, each preceded and followed by a 10-s fixation cross (see Fig. 1). Within each block, 12 faces were presented for 0.5 s each followed by a 0.5-s interstimulus interval. Faces were randomly arranged within each block for each subject and each face was only presented once.

Stimuli

The stimuli were human facial expressions of fear (24 females, 24 males) selected from the Karolinska Directed Emotional Faces database (Goeleven, De Raedt, Leyman, & Verschuere, 2008; Lundqvist, Flykt, & Öhman, 1998). Human faces may be especially salient for individuals with inhibited temperament. Previous studies report temperament differences in amygdala response to faces despite intact facial recognition memory (Blackford et al., 2009; Blackford et al., in press). The stimuli were presented in the scanner using Eprime software (Version 1.1, Psychology Software Tools, Pittsburgh, PA). Images were projected onto a screen behind the subject’s head and were viewed on a mirror in the scanner.

Data collection

fMRI data acquisition

Anatomical and EPI images were collected on 3 Tesla Phillips Achieva magnet (Philips Healthcare, Inc., Best, The Netherlands). High resolution T1-weighted anatomical images were collected (256 mm FOV, 170 slices, 1 mm, 0 mm gap). EPI images were acquired using a sequence optimized for the amygdala: 2 s TR, 22 ms TE; 90° flip angle; 1.8 SENSE; 240 mm FOV; 3 × 3 mm in plane resolution using an 80 x 80 matrix (reconstructed to 128 x 128), and higher-order shimming to limit susceptibility artifacts. Each volume comprised 36 2.5 mm (.25 gap) axial oblique slices (tilted 15˚ anterior higher than posterior relative to the intercommisural plane) which provided complete anterior-posterior coverage and inferior-superior coverage from the bottom of the temporal lobe to the top of the most dorsal part of the cingulate gyrus.

fMRI data preprocessing

MRI data were pre-processed using SPM5 (http://www.fil.ion.ucl.ac.uk/) and MATLAB (Version 7.1, The MathWorks, Inc, Natick, MA). Data were corrected for motion (aligned to the mean), coregistered to the structural image, normalized into standard stereotactic space (MNI T1 template), resampled to 3 × 3 × 3 mm voxels, and high pass filtered (128 s). Data were smoothed with a 6-mm FWHM Gaussian kernel to account for individual differences in brain anatomy.

For each subject, motion was assessed and EPI images were visually inspected for artifacts, signal dropout, and coverage of the amygdala. Five subjects had artifacts in the EPI images, caused by scanner issues, and were removed from subsequent analyses (2 Inhibited-Expected, 2 Uninhibited-Expected, 1 Uninhibited-Not Expected).

Data analysis

fMRI data modeling

Using SPM5, a general linear model was estimated for each subject with the fear face blocks as the regressor (Friston et al., 1994). Next, contrast images were created for fear faces minus baseline (fixation cross) for each subject. Contrast images were entered into a full factorial model in SPM5 with two between-subjects factors: Temperament (Inhibited/Uninhibited) and Expectation (Expected/Not Expected).

Amygdala region of interest

The primary analysis focused on the amygdala, based on our a priori hypothesis that the amygdala is a key brain region mediating an inhibited temperament. The bilateral amygdala region of interest was defined using anatomical guidelines specified by the AAL templates (Tzourio-Mazoyer et al., 2002) implemented in WFU Pick Atlas (Version 2.4; Maldjian, Laurienti, Burdette, & Kraft, 2003). SPM5 was used to test for the interaction of Temperament X Expectation within the amygdala search region. To control for Type I error due to multiple comparisons, we used a cluster-based threshold. Based on simulations performed with AlphaSim (http://afni.nimh.nih.gov/pub/dist/doc/manual/AlphaSim.pdf), a voxel p-value of .05 and a cluster size of 11 provided a family-wise corrected p < .05 for each amygdala.

Whole-brain analysis

To determine whether there were temperament group differences in expectancy effects on BOLD response in other areas of the brain, we performed an exploratory whole-brain analysis. As with the amygdala analysis, SPM5 was used to test for the interaction of Temperament X Expectation. For the whole-brain analysis, a voxel p value < .005 and a contiguous cluster size of 23 voxels provided a family-wise corrected p < .05.

Percent signal change

For significant clusters in both the amygdala and whole-brain analysis, percent signal change values were extracted for each subject using MarsBar (Brett, Anton, Valabregue, & Poline, 2002). SPSS (Version 17; SPSS, Inc; Chicago, IL) was used to perform post-hoc analyses (ANOVA and t-tests) and to confirm the SPM findings.

Results

Amygdala results

To determine if the expectancy effects differed between temperament groups, we compared amygdala BOLD signal during viewing of fear faces in persons with an inhibited or an uninhibited temperament who either were or were not expecting to see fear faces. The effect of expectation on amygdala BOLD signal was different for the two temperament groups in both the left and right amygdalae (group x condition interaction, p < .05 corrected; see Fig. 2 and Supplemental Fig. 1). Among subjects with an inhibited temperament who expected to see fear faces, BOLD signal in the left amygdala was significant larger (p < .05) compared to those who were not expecting to see fear faces, but was not significantly different in the right amygdala. By contrast, subjects with an uninhibited temperament who were expecting to see fear faces had smaller amygdalar BOLD signal (p < .05) than uninhibited subjects who were not expecting to see fear faces, in both the left and right amygdalae. There were no main effects of temperament or expectation.

Fig. 2
figure 2

Amygdala BOLD response is modulated by both temperament and expectation. a For the interaction of Temperament X Expectation, there were significant clusters (p < .05, corrected) in both the left and right amygdala. Activation maps show t-values, threshold at p < .05 with cluster size > 11 voxels and masked to show the amygdala region of interest. The color bar represents t values. b, c Bar graphs show percent signal change values for the left (b) and right (c) amygdala clusters. Error bars represent standard error of the mean. Post-hoc analysis of variance confirmed the interaction for both the left (p = .003) and right (p = .02) amygdala. Asterisks indicate significant differences (p < .05) in post-hoc t tests within each temperament group

Whole-brain results

Expectancy effects also differed by temperament group in other brain regions, including dorsal anterior cingulate cortex, dorsomedial prefrontal cortex, inferior frontal gyrus, middle temporal gyrus, and cerebellum (Table 2). For the test of temperament by expectancy, the largest cluster of significant voxels was mainly in an area of the dorsal anterior cingulate cortex (dACC, p < .05 corrected, Fig. 3) which included pregenual ACC, supragenual ACC, and also dorsomedial prefrontal cortex. Unlike the pattern in the amygdala, among inhibited subjects BOLD response in the dACC was significantly smaller (p < .05) for those expecting to see fear faces compared to those who were not. Among uninhibited subjects, however, dACC BOLD signal in the Expected group was significantly greater (p < .05) than in the Not Expected group. The interaction of temperament by expectancy in the inferior frontal gyrus and middle temporal gyrus followed the same pattern. In the cerebellum, the findings were similar to those found in the amygdala, with larger BOLD signals among inhibited subjects, in the Expected group, and among uninhibited subjects, in the Not Expected group. There were no main effects of temperament or expectation for any of these brain regions.

Table 2 Brain regions that differed by temperament group and expectation
Fig. 3
figure 3

Dorsal anterior cingulate cortex BOLD response is modulated by temperament group and expectation. For the interaction of Temperament X Expectation, there were significant clusters (p < .05, corrected) in the dorsal anterior cingulate cortex. Activation maps show t values, threshold at p < .005 with cluster size > 23 voxels. The color bar represents t values. Bar graphs show percent signal change values and error bars represent standard error of the mean. Post-hoc analysis of variance confirmed the interaction (p < .001). Asterisks indicate significant differences (p < .05) in post-hoc t tests within each temperament group

Discussion

The level of expectation for viewing fear faces modulated amygdala and dorsal anterior cingulate cortex (dACC) activation, but the direction of the effects differed in the two temperament groups. In the inhibited temperament group, the amygdalar BOLD response was greater when the subjects expected to see fear faces. This finding provides initial evidence that expectancies critically modulate brain response in people with inhibited temperament by increasing amygdala response to aversive stimuli. We propose that for persons with inhibited temperament, knowing that an aversive event is imminent may increase vigilance for that event, resulting in an enhanced ability to detect potentially threatening stimuli in the environment. Previous studies support the view that emotional context can modulate physiological response, with startle responses augmented when the emotional valence of the stimulus matches the emotional context, but reduced when they do not match (Lang, Bradley, & Cuthbert, 1990). In contrast, among people with uninhibited temperament, amygdala activation was stronger when the fear faces were not expected, consistent with prior reports of increased amygdala activation to novel faces (Schwartz et al., 2003b), unexpected images (Bermpohl et al., 2006; Onoda et al., 2008), or unexpected sounds (Koelsch, Fritz, & Schlaug, 2008).

This study also provides evidence for the dACC as a brain structure that may mediate differences in inhibited temperament. In inhibited subjects, dACC BOLD signal decreased upon exposure to fear faces, whereas for uninhibited subjects, BOLD signal in this region increased when viewing the same stimuli. The dACC is a brain region implicated in cognitive, inhibitory control over emotional processes (Phillips, Drevets, Rauch, & Lane, 2003). In typical subjects, dACC is engaged by a variety of tasks including anticipation of viewing aversive images (Herwig, Abler, Walter, & Erk, 2007; Nitschke, Sarinopoulos, Mackiewicz, Schaefer, & Davidson, 2006; Onoda et al., 2008), anticipation of pain (Straube, Schmidt, Weiss, Mentzel, & Miltner, 2009), anticipation of performing a cognitive task (Murtha, Chertkow, Beauregard, Dixon, & Evans, 1996), aversive conditioning (Buchel, Morris, Dolan, & Friston, 1998; Buchel, Dolan, Armony, & Friston, 1999), and viewing expressions of fear (Milad et al., 2007). Hypoactivity in the dACC is associated with fearful temperament in children (Perlman & Pelphrey, in press), social anxiety disorder (Lorberbaum et al., 2004), and generalized anxiety disorder (Palm, Elliott, McKie, Deakin, & Anderson, 2010). Increased BOLD signal in the dACC is often inversely correlated with BOLD response in the amygdala (Etkin, Egner, Peraza, Kandel, & Hirsch, 2006; Hariri, Mattay, Tessitore, Fera, & Weinberger, 2003; Sarinopoulos et al., 2010). This inverse relationship between the amygdala and dACC is consistent with anatomical projections between the two regions (Bracht, 2009; Carmichael & Price, 1995; Ghashghaei, Hilgetag, & Barbas, 2007; Morecraft et al., 2007; Porrino, Crane, & Goldmanrakic, 1981; Stein et al., 2007), however, see Beckmann, Johansen-Berg, and Rushworth (2009). The strength of the dACC – amygdala relationship is reduced in persons with trait anxiety (Kienast et al., 2008) and high neuroticism (Cremers et al., 2010). We propose that weak inhibitory control by the dACC may contribute to the increased amygdala activation seen in persons with inhibited temperament. Future studies should study the neural substrates of emotion regulation in inhibited temperament.

These data suggest that expectations impact BOLD signal in the amygdala and dACC when viewing fear faces, and that the effect of expectation is moderated by temperament. However, several critical questions remain. First, we do not know whether the patterns in BOLD response reflect preparatory states or states created by the viewing of the fear faces. Other studies have successfully used a slow event-related design to separately measure BOLD response during anticipation and during viewing (Sarinopoulos et al., 2010). Future studies should measure BOLD response during both a preparatory period and during picture viewing to tease apart the underlying process. Second, while the comparison of two extreme groups provides important information about temperament differences, this approach cannot provide evidence about which of the extreme groups has a dysfunctional response. Findings from other expectancy studies with unselected samples have found a similar pattern to our uninhibited group, suggesting that responses in the inhibited group are dysfunctional, but future studies should include an average temperament group for direct comparison. Third, because neuroimaging analyses are based on contrasts (fear faces minus fixation) it is not possible to determine which part of the contrast is driving the outcome measure. For example, a lower value of percent signal change could represent either small BOLD signal to both fixation and fear face conditions or large BOLD signal to both conditions. Neuroimaging researchers typically assume that response to the fixation cross is similar across groups and conditions, but that is unlikely to be the case. It will be important to develop new experimental designs that test the effects of different baseline conditions in order to better characterize group differences in brain responses.

This study had several limitations. Although the overall study sample size was moderate, sample sizes within each of the four groups were more modest. Also, while we assume that subjects in the baseline condition were expecting to see neutral faces, we did not explicitly set this expectation, leaving open the possibility for other interpretations. However, our findings in the uninhibited group are consistent with expectancy studies directly comparing aversive to neutral images (Bermpohl et al., 2006; Onoda et al., 2008; Sarinopoulos et al., 2010).

A seminal feature of both an inhibited temperament and social anxiety is excessive worry over future social events. Here, the prior expectation of viewing aversive social stimuli increased BOLD responses in the amygdalae among inhibited subjects only, and decreased responses in the dorsal anterior cingulate cortex. Dysfunction in the amygdala-dACC circuit may be one mechanism by which an inhibited temperament confers risk for anxiety disorders.