Do emotion-induced blindness and the attentional blink share underlying mechanisms? An event-related potential study of emotionally-arousing words

MacLeod, Jeffrey; Stewart, Brandie M.; Newman, Aaron J.; Arnell, Karen M.

doi:10.3758/s13415-017-0499-7

Do emotion-induced blindness and the attentional blink share underlying mechanisms? An event-related potential study of emotionally-arousing words

Published: 06 March 2017

Volume 17, pages 592–611, (2017)
Cite this article

Download PDF

Cognitive, Affective, & Behavioral Neuroscience Aims and scope Submit manuscript

Do emotion-induced blindness and the attentional blink share underlying mechanisms? An event-related potential study of emotionally-arousing words

Download PDF

Jeffrey MacLeod¹,
Brandie M. Stewart¹,
Aaron J. Newman¹ &
…
Karen M. Arnell²

4059 Accesses
22 Citations
1 Altmetric
Explore all metrics

Abstract

When two targets are presented within approximately 500 ms of each other in the context of rapid serial visual presentation (RSVP), participants’ ability to report the second target is reduced compared to when the targets are presented further apart in time. This phenomenon is known as the attentional blink (AB). The AB is increased in magnitude when the first target is emotionally arousing. Emotionally arousing stimuli can also capture attention and create an AB-like effect even when these stimuli are presented as to-be-ignored distractor items in a single-target RSVP task. This phenomenon is known as emotion-induced blindness (EIB). The phenomenological similarity in the behavioral results associated with the AB with an emotional T1 and EIB suggest that these effects may result from similar underlying mechanisms – a hypothesis that we tested using event-related electrical brain potentials (ERPs). Behavioral results replicated those reported previously, demonstrating an enhanced AB following an emotionally arousing target and a clear EIB effect. In both paradigms highly arousing taboo/sexual words resulted in an increased early posterior negativity (EPN) component that has been suggested to represent early semantic activation and selection for further processing in working memory. In both paradigms taboo/sexual words also produced an increased late positive potential (LPP) component that has been suggested to represent consolidation of a stimulus in working memory. Therefore, ERP results provide evidence that the EIB and emotion-enhanced AB effects share a common underlying mechanism.

Emotion-induced blindness reflects competition at early and late processing stages: An ERP study

Article 05 June 2014

Briana L. Kennedy, Jennifer Rawding, … James E. Hoffman

More than a feeling: The emotional attentional blink relies on non-emotional “pop out,” but is weak compared to the attentional blink

Article 14 March 2023

Lindsay A. Santacroce, Apurva L. Swami & Benjamin J. Tamber-Rosenau

Dissociating different temporal stages of emotional word processing by feature-based attention

Article Open access 06 October 2023

Sebastian Schindler, Ria Vormbrock, … Thomas Straube

Introduction

Humans receive a variety of sensory inputs at any one time, and typically do not have the resources to process all inputs with equally. Attention is used to select the most important inputs for further processing and awareness. If attentional selection exists to prioritize the processing of important sensory inputs, stimuli that are relevant to the welfare of an individual, such as threat or sexual stimuli, should be efficiently selected for further processing by the attentional system. Indeed, human behavioral research using a variety of cognitive paradigms and participant groups has shown that emotionally charged stimuli are processed preferentially by the attentional system (e.g., Aquino & Arnell, 2007; Frischen, Eastwood, & Smilek, 2008; MacKay, Shafto, Taylor, Marian, Abrams, & Dyer, 2004; Vogt, De Houwer, Koster, Van Damme, & Crombez, 2008). An examination of attention to emotional words in rapid serial visual presentation (RSVP) is the focus of the current work.

In RSVP, stimuli are presented one-at-a-time at a central location with approximately 100 msec separating each item. This method of presentation causes each item to mask the previously presented item. Participants are typically able to identify a single target within an RSVP stream with a high level of accuracy. However, if participants are asked to identify two targets within a single RSVP stream, performance on the second target (T2) is detected with reduced accuracy when it appears less than 500 ms after the first target (T1; Raymond, Shapiro, & Arnell, 1992). This phenomenon is called the attentional blink (AB; Raymond et al., 1992).

Emotionally arousing stimuli have consistently been found to modulate the AB. When emotionally arousing words are presented in the T1 position, they result in a larger and more prolonged AB compared to non-arousing T1 words (Mathewson, Arnell, & Mansfield, 2008). Emotionally arousing words presented as T2 are less susceptible to the AB than non-arousing T2 words (Anderson, 2005; Keil & Ihssen, 2004; Milders et al., 2006). However, emotional T2 words are more susceptible to the AB when preceded by an emotional T1 word (Schwabe & Wolf, 2010; Schwabe et al., 2011). Together these results suggest that emotional stimuli receive additional attentional resources at the expense of neutral stimuli in the AB paradigm.

Emotionally arousing stimuli have also been presented as to-be-ignored distractor items in RSVP tasks (e.g., Most, Chun, Widders, & Zald, 2005; Arnell, Killman, & Fijavz, 2007). In these tasks, the emotional distractor appears prior to the target item, just as T1 would precede T2 in a typical AB task – but unlike in the AB task, these distractor items are not relevant to any task the participant is instructed to perform (see Fig. 1b for a depiction). In this emotional-distractor paradigm, emotionally arousing distractors have been shown to capture attention at the expense of accuracy for closely trailing target items; similar to T1 capturing attention at the expense of T2 in the AB paradigm (Arnell et al., 2007; Mathewson et al., 2008; Most et al., 2005; Most, Smith, Cooter, Levy, & Zald, 2007). This effect has been termed emotion-induced blindness (EIB; Most et al., 2005) or the emotional attentional blink (McHugo, Olatunji, & Zald, 2013). For example, Arnell and colleagues (2007) showed that relative to emotionally-neutral words, a sexual/taboo distractor word (e.g., orgasm, bitch) captured attention and resulted in an EIB for neutral word targets presented soon after the emotional word. Higher arousal ratings and greater surprise recognition memory for the emotional distractor word were found to predict the accuracy for subsequent targets presented at short lags, and memory fully mediated the relationship between arousal rating and target accuracy. Arnell et al. (2007) concluded that arousing distractor words were encoded into memory at the expense of targets presented at short lags following the distractor.

The AB and EIB: Similar mechanisms?

Mathewson et al. (2008) showed that similar patterns of behavioral data result from AB tasks with emotionally arousing T1 words, and EIB tasks with emotionally arousing distractors. Both with the AB and EIB, higher arousal ratings and greater surprise recognition memory for the emotional words were found to predict reduced accuracy for subsequent targets presented at short lags, and memory fully mediated the relationship between arousal rating and target accuracy. Mathewson et al. (2008) used the same words in the two tasks and used a correlational analysis to show that the words that reduced T2 accuracy when presented as T1 in an AB task were highly similar to the words that reduced target accuracy when presented as to-be-ignored distractor words in the EIB task. As discussed by Mathewson and colleagues (2008), these results suggest that the same mechanism may underlie the effect of emotionally arousing words in both tasks.

An important distinction between the standard emotional AB and EIB is the task-relevance of the emotional word. In both tasks, the emotional nature of the word is task-irrelevant. However, the word itself is task-relevant in the emotional AB and task-irrelevant in the EIB task. Although the similarities in the behavioral results presented above suggest that the same mechanisms may underlie the emotional AB and EIB, further investigation of this issue is warranted given the differences between these two tasks. The goal of the current experiments was to compare the AB and EIB effects using event-related brain potentials (ERPs) to determine whether common neural patterns underlie these two effects.

Electrophysiological investigations of the AB

The Central Interference Theory proposes two processing stages for detecting and remembering stimuli presented in RSVP (e.g., Chun & Potter, 1995; Jolicoeur, 1999; Jolicoeur & Dell’Acqua, 1998). According to this theory, there is an initial high-capacity processing stage in which perceptual and semantic representations of an object are formed. In this initial stage, features of stimuli, including conceptual representations relevant to target detection are processed. This initial stage is followed by a limited capacity serial stage in which selected stimuli receive sustained attention and are consolidated into working memory (Potter, 1993; Potter & Lombardi, 1990). The AB has been proposed to occur when T2 is presented while T1 is still being consolidated into working memory. As such, T2 receives stage 1 processing but not stage 2 consolidation into WM.

Electrophysiological investigations of brain activity during AB tasks have provided support for these two stages in the AB. Examinations of ERP components following the presentation of T2 stimuli in a typical AB task have demonstrated that N1/P1 components – reflecting earlier perceptual analysis – appear intact during the AB (Vogel, Luck, & Shapiro, 1998), as does the N400 component reflecting semantic analysis (Luck, Vogel, & Shapiro, 1996; but see Batterink, Karns, Yamada, & Neville, 2010). However, both the N2 and P3 components are present when T2 can be consciously accessed, but are suppressed during the AB (Kranczioch, Debener, Maye, & Engel, 2007; Sergent, Baillet, & Dehaene, 2005; Vogel et al., 1998; Vogel & Luck, 2002). The N2 is thought to index a stage responsible for selecting a relevant object among distractors (Kennedy, Rawding, Most, & Hoffman, 2014; Woodman, Arita, & Luck, 2009), and P3 is thought to index the consolidation of the object in working memory (Donchin, 1981; Kranczioch et al., 2007; Sergent et al., 2005; Vogel et al., 1998; Vogel & Luck, 2002). Additionally, there is evidence that a tradeoff exists between T1 and T2 processing (Kranczioch et al., 2007; Sergent et al., 2005; Shapiro, Schmitz, Martens, Hommel, & Schnitzler, 2006) as there is a positive correlation between the amplitude of the T1 M300 component (MEG equivalent of the P3) and the magnitude of the AB (Shapiro et al., 2006), and a negative correlation between P3 amplitudes of the first and second RSVP targets (Kranczioch et al., 2007). Indeed, it has been demonstrated that the requirement to attend to T1 primarily affects the N2 and P3 ERP components that are involved in conscious access to T2 (Sergent et al., 2005). Therefore, the N2 and P3 components have been suggested to index the capacity-limited stage in the AB.

Event-related brain potentials following emotionally arousing stimuli

Two ERP components are typically reported to differentiate emotionally arousing stimuli from neutral stimuli (see Schupp, Flaisch, Stockburger, & Junghöfer, 2006, for a review), and interestingly these appear to be the same components whose modulation has been shown to predict the AB as discussed above. The first is the early posterior negativity (EPN; e.g., Junghöfer, Bradley, Elbert, & Lang, 2001; Schacht & Sommer, 2009a). The EPN is a negativity over temporo-occipital sites that peaks at approximately 200–300 ms for picture stimuli (e.g., Junghöfer et al., 2001; Schupp et al., 2007), and at approximately 450 ms for word stimuli (Schacht & Sommer, 2009a). The EPN has been observed to be larger for more emotionally arousing stimuli, as compared to less arousing stimuli of a similar valence (Junghöfer et al., 2001; Schupp, Junghöfer, Weike, & Hamm, 2004). The EPN reliably occurs in response to emotionally arousing words, even when they are presented while participants are engaged in an interference task (Kissler et al., 2009). The EPN has been suggested to index an automatic allocation of visual attention to a stimulus of motivational significance such as stimuli that are emotionally arousing or task relevant so these stimuli can be selected for enhanced encoding (Kissler et al., 2009; Schupp et al., 2006). The timing, scalp distribution, and eliciting conditions of the EPN suggest that it may be the same component referred to outside of the emotion literature as N2; indeed, Schupp et al. (2003a, b) defined the EPN as a modulation of the N2. For the purpose of the current paper, we adopt the hypothesis that the N2 and EPN are two labels for the same component, reflecting the same neurocognitive process, and thus use these labels interchangeably.

The second ERP component that is modulated by emotional stimuli is the late positive potential (LPP). The LPP is a positive increase in amplitude that occurs at centro-parietal areas beginning around 300-540 ms and lasting for several hundred milliseconds (Kissler et al., 2009; Schupp et al., 2006). The LPP is considered part of the P3 family, which includes components that reflect the encoding into working memory of stimuli that are task relevant or otherwise capture attention (e.g., Schupp et. al., 2006). The LPP has been variously referred to as the P3, P3b, LPP, and late positive complex in different studies (Kissler et al., 2009; Schupp et al., 2006). The timing and length of the LPP are task and stimulus-dependent (e.g., Schacht & Sommer, 2009a, 2009b; Schupp, Ohman, Junghöfer, et al., 2004); the LPP tends of begin around 540 ms for emotionally arousing word stimuli (Schacht & Sommer, 2009a). Similar to the EPN, the LPP is enhanced in response to highly emotionally arousing stimuli, such as erotica or pictures of mutilation, as compared to stimuli of the same valence but lower arousal (Schupp et al., 2000, 2004, 2006).

ERPs following emotion words in the AB and EIB

Given the ERP patterns associated with processing emotionally arousing stimuli, it may be expected that emotionally arousing T1 words presented during an AB task would result in enhancement of both the EPN and LPP. Also, given the tradeoff between T1 and T2 stage 2 processing suggested to exist for short-lag AB trials (as described above, Kranczioch et al., 2007; Sergent et al., 2005; Shapiro et al., 2006), an enhancement of the EPN and LPP components following a T1 stimulus should result in an increase in AB magnitude. Behavioral data demonstrating an increase in AB magnitude following emotionally arousing T1 stimuli are consistent with this prediction, but no research to date has used ERP techniques to examine these predictions in the AB paradigm.

A single study (Kennedy, Rawding, Most, & Hoffman, 2014) collected ERPs during an EIB task in order to examine the mechanisms underlying EIB. In this study, emotionally arousing distractor images or emotionally neutral distractor images were presented prior to single target items within an RSVP stream of filler images. Negative emotional distractor images elicited enhanced N2 (EPN) and P3b (LPP) components relative to emotionally neutral images. Additionally, the amplitude of these components following the distractor stimulus was inversely related to their amplitude following the target stimulus. Finally, the P3b component associated with observing the emotional distractor image was observed to be larger for error trials in which the subsequent target was missed, than for correct trials. This pattern provides electrophysiological evidence of a distractor-target processing tradeoff similar to the T1-T2 processing tradeoff observed in the context of the AB (Kranczioch et al., 2007; Sergent et al., 2005; Shapiro et al., 2006), suggesting that the AB and EIB have similar underlying mechanisms in that they are both a result of central interference in a limited capacity system.

The current study

The current study asked participants to complete either an AB task with emotionally arousing T1 words (Experiment 1) or an EIB task with an emotionally arousing distractor words (Experiment 2). ERPs time-locked to T1 or the emotional distractor words were collected as participants completed these tasks. To our knowledge, no research has used electrophysiological recording to examine the impact of emotionally arousing T1 words on the in the AB paradigm. Additionally, no research has used ERPs to examine the brain activity during both AB and EIB tasks with closely matched stimuli and procedures in order to allow a direct comparison of the mechanisms underlying these two tasks.

The behavioral methodology in the current study replicates that of Mathewson et al. (2008). For each of the RSVP tasks, words from six emotion categories were used as T1 or the distractor words (sexual/taboo, positive, negative (sadness), threat, anxiety, neutral). The T1-T2 (AB task) and distractor-target (EIB task) lags were either three items or eight items. Following the RSVP tasks, participants completed an unexpected recognition memory test and provided ratings of valence and arousal for each of the emotion words.

As described above, previous research has observed quite similar patterns of behavioral data resulting from the two tasks used in the current study (Mathewson et al., 2008). It was expected that we would replicate the behavioral results of Mathewson et al. (2008) in this study. Given the similarities in behavioral results resulting from these tasks, and the suggestion that both EIB and the AB are a result of central interference in a limited capacity system (Kennedy et al., 2014; Kranczioch et al., 2007; Sergent et al., 2005; Shapiro et al., 2006), we expected that emotionally arousing T1 words in the AB task (Experiment 1) would evoke ERP effects similar to those resulting from emotionally arousing distractor words in the EIB task (Experiment 2). More specifically, based on the results of Kennedy et al. (2014), we expected that words from the sexual/taboo category would evoke the largest EPN and LPP components in both tasks, and that these components would demonstrate a tradeoff with T2/target accuracy. It was also expected that, in both tasks, amplitude of the EPN and LPP components resulting from T1/emotional distractors would be positively correlated with word arousal ratings and post-task memory scores, and negatively correlated with T2/target accuracy at lag 3, but not lag 8.

Experiment 1

Method

Participants

Thirty-one Brock University undergraduate students (22 female) with a mean age of 19.5 years participated in this study for course credit or a small monetary payment. They were tested individually in a single session lasting about 4 h. All reported normal or corrected-to-normal vision, English as a first language, and no history of neurological problems. Both experiments reported here received clearance from the Brock University Research Ethics Board and were run in accordance with the approved protocol that conforms to the principles of the Declaration of Helsinki, including obtaining written informed consent from all participants. Two participants were removed from all analyses for their failure to show any clear ERP components for the RSVP task.

Design and stimuli

The AB task used a 6 (emotion condition for T1) X 2 (T1-T2 lag) within-participants design. In each RSVP stream there were two targets. On each trial T1 was a word from one of six emotion categories: positive (e.g., fun, happy, winner), negative words that were sadness-related (e.g., weep, dreary, gloom), threat-related (e.g., scream, cancer, stabbed), words that were either sexual and/or taboo (e.g., fuck, pussy, shit), anxiety-related (e.g., worry, humiliated, disappoint), or emotionally neutral (e.g., jacket, vote, chew). There were 26 words in each emotion category. The T1 emotional word lists were the same as those used by Arnell et al. (2007) and Mathewson et al. (2008) and were adapted originally from stimuli used by Anderson (2005) and McKenna and Sharma (1995). The T1 word was always presented in red, and was presented in capital letters on half of the trials and lowercase letters on the other half of the trials. Each T1 word appeared in uppercase once at each lag, and in lowercase once at each lag. T2 was one of ten words that spelled a color name (blue, green, yellow, orange, white, silver, pink, purple, brown, black) but was always presented in black font, in capitals. Distractor stimuli for the RSVP task were created a priori to be 60 neutral valence and low arousal words from four to seven letters in length. For presentation, distractor words were chosen randomly without replacement for each trial and presented in black, capital letters. All words were presented in 18-point bold Courier New font. The letters subtended approximately 1.4° of visual angle in height and 3.6°–7.2° in width at an unfixed binocular viewing distance of approximately 40 cm.

Procedure

The experiment consisted of three parts: (1) the RSVP task, (2) a surprise recognition memory task where participants were asked to check-off any T1 words that they recalled seeing in the RSVP task, and (3) a ratings task where participants rated the arousal and valence of each of the T1 words.

For the RSVP task, participants were instructed to report whether the red T1 word was presented in uppercase or lowercase letters, and the identity of the T2 color word. Participants were not told that some of the T1 words would be emotionally charged. They were shown the ten color words, and informed that T2 would always be from this set, and only these responses would be allowed. Approximately five practice RSVP trials preceded the experimental trials. See Fig. 1a for a graphical depiction of a trial.

Each trial began with the presentation of a black fixation cross in the center of the screen for 500 ms, followed by a 500-ms blank interval before the start of the RSVP stream. Eighteen words, including T1, T2, and 16 distractors, were presented on each trial using RSVP, where each stimulus is presented one-at-a-time in the same spatial location. Each word was presented in the center of a uniform gray screen for 117 ms with no inter-stimulus interval between words. T1 was presented in stream position 5 or 8, and each position was used equally often for each combination of emotion condition and lag. The identity of the T1 word was chosen randomly within each emotion condition with the constraint that each word was shown once in each block of 156 trials, and that each word was presented twice in the lag 3 condition and twice in the lag 8 condition. The identity of the T2 color word was chosen randomly by the computer with the constraint that each word was used once every 10 trials. T2 was presented either three or eight words after the T1 word, corresponding to 351 or 936 ms of separation. The levels of the emotion and lag factors varied randomly for each participant, with the constraint that each possible combination of the factors occurred twice every 24 trials. Each participant performed 624 trials in a single session.

One second after the end of each stream, a sentence appeared on the screen prompting participants to press one of two keys indicating whether the T1 was presented in upper or lower case letters. Immediately after their T1 response a second sentence appeared prompting them to press the labeled key matching the color identity of T2. Accuracy was stressed and responses were not speeded. Participants were asked to minimize their physical movements while viewing the stream, and to refrain from blinking until they saw the first sentence prompting their response after the stream. Two seconds after their button press the fixation cross for the next trial appeared. ERPs were recorded during the RSVP task and were time-locked to the onset of the T1 stimulus.

Within 3 min of completing the RSVP task, participants were given a surprise recognition memory test. A piece of paper contained a list of all 156 T1 emotion words plus three word foils from each of the six emotion categories that had not been presented during the RSVP task. All words were presented in alphabetical order. Participants were told that some of the words on the list were presented as red words in the RSVP streams, and that they should check off any words that they remember seeing from the RSVP task. Participants were allowed to go through the list in any order with no time constraints, and could check as many or as few items as they wanted. ERPs were not recorded during the memory test. The number of participants, out of 29, who checked off each word on the memory checklist was calculated separately for each of the target words and each of the 18 memory foil words to create a memory score for each word. These memory scores were then used to calculate the average memory score for each of the emotion conditions and in the correlations.^{Footnote 1}

Following the completion of the memory task, participants then received the ratings task. On each trial one of the 156 words that had been presented as T1 was presented at the center of the computer screen for one second. After one second the word remained on the screen, but the prompt “Valence?” was then added just below. The word and prompt stayed on the screen until the participant gave the word a valence rating. The prompt then changed to “Arousal?” and the prompt and the word remained on the screen until the participant gave the word an arousal rating. A 7-point Likert scale was used for both valence and arousal ratings. The valence scale was anchored by “unpleasant” for the 1 response and “pleasant” for the 7 response, with 4 being “neutral.” The arousal scale was anchored by “low” for the 1 response and “high” for the 7 response. Participants used the numbered keys from 1 to 7 to make their response. Participants were asked to make the valence and arousal ratings independently, and to try to use the whole scale. Participants were encouraged to take their time and provide accurate ratings based on their own personal views about the word. The 156 T1 words were each presented once in random order. ERPs were recorded during the ratings task and were time-locked to the onset of the emotion word on each trial. The Ratings task ERP results are not a focus of the current study and will not be discussed further. Mean valence ratings and mean arousal ratings were calculated for each word by averaging ratings for each word across participants. An absolute valence extremity score was also calculated for each word by subtracting the valence rating for each word from “4,” which is the midpoint on the valence scale. In this manner the absolute difference reflects how valent the stimulus was without regard for the direction, as one might expect highly positive and highly negative words to have greater impact.

Apparatus and ERP recordings

A desktop PC with 17-in. color monitor, running E-Prime (Psychology Software Tools, Pittsburgh, PA) was used to present stimuli and record behavioral responses. Neuroscan software running on a desktop PC was used to acquire and analyze electroencephalographic (EEG) data recordings from 64 sites (cap by Electrocap International) referenced to linked earlobes. Electro-ocular (EOG) recordings were taken by affixing electrodes to the outer canthi of each eye and the top and bottom of the orbit of both eyes. Signals were amplified with a band-pass of 0.15–30 Hz, and digitized at a rate of 500 Hz. ERPs were time-locked to the onset of T1. Epochs were created that began 200 ms prior to T1 presentation and ended 1000 ms after T1 presentation. Eye blink artifacts were corrected using an algorithm implemented in Neuroscan’s SCAN software. This algorithm creates a model of the subject-specific blink response by first identifying the maximum deviation in the VEOG channel across all the data (i.e., the largest blink artifact), and then defining as blinks all other events that exceed 10% of this maximum in the VEOG channel. Transmission coefficient of the blinks is then estimated based on the covariance of the averaged potentials of the ocular channel with the EEG channels, and this is then used to subtract the blink artifact form each channel, on each trial in which a blink was detected. If this correction appeared insufficient for a given trial based on visual inspection then the trial was removed by hand prior to averaging. Trials with incorrect T1 responses were also removed.

Results

Behavioral analyses

Mean differences in RSVP performance

Figure 2 shows the mean T2 accuracy (percent correct responses) separately for each T1 emotion condition as a function of the lag between T1 and T2. In both experiments, T2 accuracy was calculated for T1-correct trials only; however, the same data patterns were observed for all analyses when T2 accuracy was not made conditional on T1 accuracy. A repeated measures analysis of variance (ANOVA) was performed on T2 accuracy rates with T1 emotion condition and lag as factors. The ANOVA revealed a significant main effect of lag [F(1,28) = 56.25, p < .001, partial η² = 0.67] where T2 accuracy was reduced at lag 3 as compared to lag 8. There was also a significant main effect of emotion condition [F(5,140) = 17.79, p < .001, partial η² = 0.39]. Importantly, there was a significant interaction between lag and T1 emotion condition [F(5,140) = 13.97, p < .001, partial η² = 0.33], which resulted from the particularly poor T2 accuracy at lag 3 when taboo/sexual words were presented as T1.

Based on paired samples t-tests comparing T2 accuracy at lag 3 versus lag 8, a reliable AB was observed for all T1 emotion conditions [all ps < .001]. Simple-effects analyses using one-way ANOVAs showed no significant difference in T2 accuracy rates as a function of T1 emotion conditions at lag 8 [F(5,140) = 1.32, p > .26; partial η² = 0.05]. However, a significant difference in T2 accuracy was found at lag 3 [F(5,140) = 22.21, p < .001; partial η² = 0.44]. Pairwise comparisons with Bonferroni correction showed that at lag 3 T2 accuracy in the taboo/sexual condition was significantly lower than T2 accuracy in each of the other emotion conditions [all ps < .001]. There were no differences in T2 accuracy between any other conditions [all ps > .91].

T1 accuracy was 92.3% (S.E. = 1.2) overall, and did not vary with lag [F(1,28) = 1.63, p > .21, partial η² = 0.06], and there was no significant interaction between lag and emotion condition, [F < 1,]. However, T1 accuracy did vary slightly, but significantly, across emotion conditions [F(5,140) = 3.13, p = .01, partial η² = 0.10]. T1 accuracy was between 92% and 93% for each of the emotion conditions except for the taboo/sexual condition where it was 90.9%. Bonferroni-corrected pairwise comparisons showed significant differences in T1 accuracy only between the taboo/sexual condition and the neutral condition, and the taboo/sexual condition and the anxiety condition [ps < .05].

Memory for T1 words by emotion condition

Table 1 shows the average memory score for the words in each emotion condition. A one-way ANOVA showed a significant effect of emotion condition on memory for the target words [F(5,148) = 54.04, p < .001]. Bonferroni-corrected pairwise comparisons showed that memory for taboo/sexual words was significantly higher than memory for words in each of the other emotion conditions [all ps < .001]. Memory for threat words was also significantly higher than memory for neutral or sad words (ps < .05).^{Footnote 2}

Table 1 Mean number of participants in Experiment 1 reporting the T1 words as remembered on the surprise recognition memory test as a function of T1 emotion category

Full size table

Relationships amongst behavioral measures

The mean accuracy of T2 at each lag, collapsed across participants, was calculated separately for each T1 emotion word. Correlational analyses were conducted to examine the relationships between arousal ratings, valence ratings, valence extremity scores, word memory scores, and T2 accuracy on trials with a specific emotion word presented as T1. For example, the mean arousal, valence, valence extremity, and memory scores for the word orgasm were examined with respect to T2 accuracy on short and long lag trials where orgasm was presented as T1 (see Table 2). Replicating the pattern found by Mathewson and colleagues (2008), T2 accuracy at lag 3, but not lag 8, was negatively related to arousal ratings and memory for T1, and memory for T1 and T1 arousal ratings were positively correlated. Therefore, T2 accuracy was lower at lag 3, when T1 was rated as highly arousing and well remembered. However, these relationships were not observed when T2 was presented at lag 8.

Table 2 Zero-order correlations between behavioral and ERP measures for emotional words in Experiment

Full size table

Mediation analysis

If highly arousing words are encoded into memory more often than less arousing words, and this encoding is at the expense of accuracy for T2s that were presented soon after T1, then the relationship between arousal and T2 accuracy at lag 3 should be mediated by memory for the words. A simultaneous regression with arousal rating and memory as predictors of T2 accuracy at lag 3 showed that while both arousal and memory were significant predictors of T2 accuracy when entered alone, only memory was a significant predictor of T2 accuracy at lag 3 when both arousal and memory were entered into the model [semipartial r = .01, p > .77 for arousal and semipartial r = -.52, p < .001 for memory]. These results support a fully mediated model (Baron & Kenny, 1986) where T1 arousal influences encoding of T1 into memory at the expense of accuracy for closely trailing T2s (see Fig. 3).

ERP analyses

Effects of T1 word type

Electrode Pz was used for all figures and analyses, as activation related to the LPP component was of particular interest, and the P3 family is often maximal in centro-parietal areas (e.g., Donchin, 1981; Schupp et al., 2006). In previous research, the early posterior negativity (EPN) component has been observed over lateralized temporo-occipital sites (Schupp et al., 2006). However, examination of topographical maps of the data in the present study, shown in Fig. 4, revealed that the EPN was present over a large area of the scalp, centered over centro-parietal electrodes. As a result, Pz was deemed a suitable site for measurement of both the EPN and LPP in the current study.

Only T1-correct trials were included in the averages. Figure 5a shows the grand average waveforms from the RSVP task separately for each emotion condition, time-locked to the presentation of the T1 emotion word. Figure 5b shows grand average difference waves from five emotion conditions once the average waves from neutral trials were subtracted out.^{Footnote 3}

The first goal was to look for potential amplitude differences in the EPN and LPP for different emotion conditions. However, accurate estimation of individual components for each participant is difficult with ERPs collected during RSVP due to the noise that results from each item in the stimulus stream producing ERPs that overlap in time. One solution is to examine the difference waves by subtracting out activation from the neutral condition – as shown in Figs. 4 and 5b. However, we were interested in statistically comparing ERPs to the neutral condition, which would not be possible with neutral waves subtracted out. Furthermore, we were also interested in looking for potential differences amongst emotion conditions outside the EPN and LPP windows. Therefore, each participant’s average post-T1 waveform for each of the emotion conditions was divided into 10-ms intervals, and each of the intervals from 0 to 1,000 ms post-T1 were used to compare the amplitude across emotion conditions in a series of t-tests. To account for experiment-wise alpha inflation, an alpha level of .005 was used as the significance cut-off and at least two consecutive 10-ms intervals were required to have a p-value less than the .005 level. Paired t-tests revealed no significant amplitude differences at any time points between waveforms from the neutral, threat, sadness, positive and anxiety conditions (each compared to each other).^{Footnote 4} In contrast to the null effects for other emotion categories, there were several differences between the waveform for the taboo/sexual condition and other conditions.

T-tests revealed a significantly increased negativity for taboo/sexual words relative to all other emotion categories from about 400 to 450 ms (see Fig. 5b). This was confirmed with an analysis of items means,^{Footnote 5} p < .01. In the subsequent discussion, we will refer to this effect as the EPN. The amplitude of the taboo/sexual waveform was also significantly more positive than the neutral waveform from 520 to 600 ms post-T1 (see Fig. 5b). This was also confirmed with an analysis of item means, p < .01. In subsequent discussion, we will refer to this effect as the LPP. Although both the EPN and LPP appear later than has been observed in ERP studies using picture stimuli (see timing suggested by Schupp et al., 2006), this timing is expected for word stimuli, and is similar to that observed for word stimuli in previous research (Schacht & Sommer, 2009b). No other significant differences in amplitude between the taboo/sexual waveform and any other waveforms were noted. Figure 5b also shows an enhanced positivity for sexual/taboo words just prior to 800 ms. However, this difference was not significant with the alpha correction used to control Type I error rates. Given its late duration, this positivity likely represents modulation of T2 processing when T2 was presented at short lags. This is discussed below and in Experiment 2 where there is further analysis and discussion.

T1-locked ERPs were compared for trials where T2 was correct and trials where T2 was incorrect, separately for each lag (see Fig. 6).^{Footnote 6} At long lags there was expected to be no difference in T1 ERPs as a function of accurate T2 detection, as T1 processing would be over by the time T2 was presented. However, a tradeoff was expected between T1 resources and T2 performance at short lags, such that T1 EPN and LPP amplitudes were expected to be enhanced for T2-incorrect trials relative to T2-correct trials. As expected, paired sample-tests, corrected as above, showed no differences in T1-locked ERPs at any time point when comparing T2-correct and -incorrect trials at lag 8. However, at lag 3, the LPP from 500 to 600 ms post-T1 was significantly larger (more positive) on T2-incorrect trials than on T2-correct trials, suggesting that trials with a large LPP to T1 resulted in poorer T2 performance; i.e., the expected tradeoff. Interestingly, there were no significant amplitude differences for T2-correct and -incorrect short-lag trials during the T1 EPN time window.

At lag 3, T2-correct trials did show a significantly greater negativity than T2-incorrect trials from 650 to 760 ms post-T1, presumably reflecting the EPN that was present for T2 when it was detected, but not when missed due to an AB effect. Similarly, at lag 3 T2-correct trials showed a significantly greater positivity than T2-incorrect trials from 870 to 940 ms post-T1, presumably reflecting the LPP that was present for T2 on T2-correct trials, but attenuated on T2-incorrect trials.

Relationships between behavioral and ERP measures

The mean T1-EPN was estimated individually for each word by taking the mean area of the component with the largest negative peak between 350 and 500 ms post-T1 in the grand average waveform averaged across participants for each word. The mean T1-LPP was estimated individually for each word by taking the mean area of the component with the largest positive peak between 500 and 600 ms post-T1 in the grand average waveform averaged across participants for each word (see above for justification for these time windows). Although mean area was used as a measure of component amplitude for all correlations, the same pattern of significant relationships was also observed when component amplitude was estimated using peak amplitude or when using the summed area of the component. Correlations between the EPN and LPP individually for each T1 word and behavioral measures for those words are provided in Table 2. It is noteworthy that the amplitude of both of the ERP components was significantly positively associated with memory and arousal ratings for T1 words, and negatively associated with T2 accuracy at lag 3 but not lag 8.^{Footnote 7}

Predicting T2 accuracy

In the above analyses, arousal ratings, memory performance, and LPP and EPN amplitudes each predicted T2 accuracy at lag 3, but not at lag 8. To see how much total variability in T2 accuracy could be explained, these predictors were entered as simultaneous predictors of lag 3 T2 accuracy in a multiple regression. Results showed that, as a group, these predictors explained almost half (45.5%) of the variability in T2 accuracy at lag 3 (R = 0.675, F(4,146) = 30.39, p < .001). Both memory performance (semipartial r = -.48, p < .001) and LPP amplitude (semipartial r = -.16, p = .009) contributed significant unique variability over and above that contributed by the other predictors. When the same regression was performed on T2 accuracy at lag 8, a non-significant 2.3% of the variability in T2 accuracy was explained (R = .153, F < 1, p = .47), and there were no significant unique predictors.

Discussion

The AB was equal in size for all emotion conditions except the taboo/sexual condition, where a larger AB was observed. These results replicate those of Mathewson et al. (2008) when using emotion words as T1s. Also replicating Mathewson et al. (2008), we found that memory was better for the taboo/sexual words than for other emotion words and enhanced memory for a T1 word was associated with lower accuracy when T2 was presented in the lag 3 position. We further replicated Mathewson et al.’s (2008) finding that higher word arousal ratings predicted enhanced memory for the word and lower accuracy for subsequent targets, and that the relationship between arousal and T2 accuracy was mediated by memory performance (Fig. 3). This pattern suggests that arousing T1s are better encoded into memory than other T1s, and that this increase in encoding reduces accuracy for subsequent T2s.

As expected, taboo/sexual T1s showed an enhanced EPN relative to all other emotion conditions. Taboo/sexual words were also the only word type to show an enhanced LPP when compared to emotionally neutral words. There was also a tradeoff between the amplitude of the LPP to T1 and the LPP to T2, where the T1 LPP was larger for incorrect T2 trials compared to correct T2 trials at short lags, and the LPP (and EPN) to T2 were greater on correct T2 short lag trials. Correlational analyses demonstrated that EPN and LPP amplitudes increased with arousal ratings, were associated with better memory for T1 words, and were negatively related to accuracy for T2s at short the lag, but not at long lags. This pattern suggests that: (1) there was increased activation for arousing T1s, (2) this increased activation enhanced their encoding into memory, and/or resulted from their encoding into memory, and (3) this enhanced encoding came at the expense of T2 encoding at short lags.

Experiment 1 is the first study to provide evidence that arousing T1 words in the AB achieve enhanced processing in stages associated with attentional selection of a target among distractors (N2/EPN; Kennedy et al., 2014; Woodman et al., 2009) and consolidation in working memory (LPP; Sergent et al., 2005; Vogel & Luck, 2002; Vogel et al., 1998). This is despite the fact that the semantics of the T1 words were irrelevant for the lowercase/uppercase font decision that was the T1 task. Furthermore, degree of enhancement of T1 word processing at these stages, operationalized as EPN and LPP amplitude associated with the word, was related to the magnitude of the AB that followed the T1 word and how well those T1 words were remembered.

Experiment 2

As described in the Introduction, arousing words have also been shown to capture participants’ attention and set off an EIB effect when they are presented as to-be-ignored distractors in an RSVP stream. Experiment 2 examined whether similar ERP results to those in Experiment 1 are found when emotion words are presented as to-be-ignored distractors but all other experimental factors remain the same.