The impact of skim reading and navigation when reading hyperlinks on the web

Gemma Fitzsimmons; Lewis T. Jayes; Mark J. Weal; Denis Drieghe

doi:10.1371/journal.pone.0239134

Abstract

It has been shown that readers spend a great deal of time skim reading on the Web and that this type of reading can affect lexical processing of words. Across two experiments, we utilised eye tracking methodology to explore how hyperlinks and navigating webpages affect reading behaviour. In Experiment 1, participants read static Webpages either for comprehension or whilst skim reading, while in Experiment 2, participants additionally read through a navigable Web environment. Embedded target words were either hyperlinks or not and were either high-frequency or low-frequency words. Results from Experiment 1 show that while readers lexically process both linked and unlinked words when reading for comprehension, readers only fully lexically process linked words when skim reading, as was evidenced by a frequency effect that was absent for the unlinked words. They did fully lexically process both linked and unlinked words when reading for comprehension. In Experiment 2, which allowed for navigating, readers only fully lexically processed linked words compared to unlinked words, regardless of whether they were skim reading or reading for comprehension. We suggest that readers engage in an efficient reading strategy where they attempt to minimise comprehension loss while maintaining a high reading speed. Readers use hyperlinks as markers to suggest important information and use them to navigate through the text in an efficient and effective way. The task of reading on the Web causes readers to lexically process words in a markedly different way from typical reading experiments.

Citation: Fitzsimmons G, Jayes LT, Weal MJ, Drieghe D (2020) The impact of skim reading and navigation when reading hyperlinks on the web. PLoS ONE 15(9): e0239134. https://doi.org/10.1371/journal.pone.0239134

Editor: Veronica Whitford, University of New Brunswick Fredericton, CANADA

Received: March 19, 2020; Accepted: August 31, 2020; Published: September 17, 2020

Copyright: © 2020 Fitzsimmons et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: The data underlying the results presented in the experiments in this manuscript are available from the UK Data Service. The DOI is: 10.5255/UKDA-SN-854153.

Funding: GF was funded by an EPSRC grant for the Doctoral Training Centre in Web Science: EP/G036926/1. This work formed a part of a PhD completed in the Web Science DTC. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: The authors have declared that no competing interests exist.

Introduction

When we investigate the real-world task of reading online text (termed hypertext [1]), we need to take into consideration the way reading on the Web differs from the reading for comprehension task that is traditionally used in reading research. Typically, reading research uses trials that contain a single, stand-alone sentence (for a review, see [2]) to explore lexical processing. The reader is asked to read the sentence for comprehension and there will often be a comprehension question following the sentence. If the participant has a low accuracy for answering these usually rather easy questions, the experimenter knows that the participant was not fully engaged with the task. This process is of course different from everyday reading. Additionally, while many experiments have examined reading paragraphs, in the vast majority of reading experiments, participants only read a single line of text and as such, do not have to integrate information across multiple sentences. As useful as this experimental design is for exploring factors that affect lexical processing within a controlled setting, this is not a common reading behaviour that people engage in in everyday life. Reading on the Web is clearly a very different task. Primarily, there is so much information on the Web it is less likely that anyone will be able to read all available information for comprehension due to time constraints. Moreover, often not all text is equally important to the reader and/or their task. To make the task of reading on the Web more manageable the reader may skim read to try and gain as much important information as possible in the most efficient way. Skim reading is a commonplace strategy whereby readers employ some form of rapid, selective reading strategy, often omitting words [3–5]. Another difference with typical lab-based reading experiments is the presence of hyperlinks. Hyperlinks refer to words that enable users to navigate from one webpage to another, when clicked. hyperlinks can have a two-fold impact on readers. Firstly, hyperlinks are salient words in the text and can signal to the reader where important information may lie in the text. Secondly, hyperlinks serve as a tool for navigation between Webpages. A decision needs to be made about which hyperlinks to click on to navigate to other Webpages.

In this paper, we conducted two experiments exploring task differences between reading for comprehension versus skim reading as well as the impact of hyperlinks, by specifically looking at how these tasks and text elements affect lexical processing. Experiment 1 displayed Web pages, but the user could not click or navigate. In Experiment 2 a functional, clickable environment was utilised, where the impact of clicking and navigating on reading and comprehension of the text was investigated. The combination of these two experiments allows us to dissociate the unique impact of hyperlinks as a salient text item that potentially indicates important information from the impact caused by the hyperlink being a navigational tool as well. Unlike previous research, the current studies investigate the effect of navigation and skim reading on lexical access during reading. The use of eye tracking allows for a temporally sensitive measure of the degree of processing readers engage in during reading words on the Web, and how navigation and skim reading affect this, providing novel insight into the depth of processing of words during the real-world task of reading online text.

Skim reading

When reading outside of the laboratory, people may ‘skim’ through text and not fully process all aspects of the text that has been presented to them. Research literature suggests that reading on the Web is more likely to involve skim reading [3, 4]. Liu suggests that screen-based reading behaviour is characterised by ‘more time spent browsing and scanning, keyword spotting, one-time reading, non-linear reading, and reading more selectively, while less time is spent on in-depth reading, and concentrated reading’ [3, pp.700].

Previous research has directly compared reading for comprehension and skim reading. In an early experiment, Just and Carpenter explored skim reading and compared eye movements to when the participants were engaged in reading for comprehension [6, 7]. They found that skim readers were about two and a half times faster than normal readers. Furthermore, their eye movement analysis showed that the skim readers fixated on fewer words than normal readers. When examining gaze durations on words, Just and Carpenter also found them to be shorter for skim readers, who spent on average 100ms less time on each fixation (around one–third of the average fixation time during normal reading) [6]. However, even with this reduction in fixation times the skim readers still showed effects of word frequency (low frequency words had longer fixation times compared to high frequency words, [6–8]) and word length (longer words had longer fixation times compared to shorter words [9–12]) This was similar to those seen in normal readers, but the sizes of these two effects were much smaller. Clearly, eye movement behaviour is affected by the task of skim reading versus reading for comprehension.

There is a great deal of evidence suggesting that during skim reading some comprehension is lost [6, 13–17]. One of the causes for this loss in comprehension could be that readers can often solve comprehension problems by re-reading the text that has caused the issue. There is, however, very little re-reading when skim reading, perhaps due to the self-imposed time constraints that are caused by the nature of skim reading. However, loss in comprehension is not consistent across all of the text being read. There appears to be a difference between information regarded as important or unimportant. The important information does not receive the same loss of comprehension that is observed for the unimportant information [5, 15, 18]. To explain these findings, it has been suggested that the reader engages in an adaptive strategy in order to gain as much information from the text as possible, in a reduced time.

Clicking, navigating and decision making

One of the main differences between reading plain text and hypertext is the fact that hypertext is non-linear and has no strict route through the information. Conklin suggests that a reader could easily become ‘lost in hyperspace’ when trying to navigate through a website due to the mass interlinking of webpages [19]. McKnight, Dillon and Richardson suggested that the unknown scope of the hypertext document could lead to incorrect assumptions about the scope of the documents’ content and result in a poor reading strategy [20]. In linear text it is much easier to see the scope of the document and browse through the content. Dillon, Richardson and McKnight argue that if the user does not know how the information is organised, it makes it more difficult to find specific information [21]. In comparison, paper-based documents, such as books, tend to have a convention for how the information is organised, such as index pages and contents pages, which catalogue the location of topics and convey the overall organisation of the whole text.

It is not just the issue of getting lost in a hypertext environment that we need to consider; there is also the issue of the large amount of choices and decisions that need to be made. Elm and Woods suggest that users may be overwhelmed and disorientated by the sheer amount of choice offered by a complex, large network of information [22]. The users may not understand the structure of the system and what potentially exists in the hypertext document. McDonald and Stevenson argue that although a large linear text can also be confusing for a reader, there are typically a number of discourse cues such as page numbers, contents listings and headings that the reader can take advantage of [23]. Non-linear hypertext lacks a lot of these types of cues. The same text presented as a linear document may cause no issues to the reader, but in its hypertext format, it might lead to navigational problems where the reader is confused by the non-linear structure.

There is an on-going debate about whether in-text hyperlinks hinder reading. Carr suggests that hyperlinks within the text are a distraction and hinder comprehension of the text. He argues that having to evaluate hyperlinks and navigating a path through them is a demanding task that substantially increases readers' cognitive load and thereby weakens their ability to comprehend and retain what they are reading ([24], pp.126).

Carr’s argument is based on research investigating the cognitive load of hypertext on users [e.g. 25], which suggests that comprehension increased when participants read plain text compared to when they read hyperlinked text. However, their study is somewhat limited in being able to generalise to other forms of hypertext, including reading webpages. The text used by Miall and Dobson [25] was a piece of literary fiction that had been converted to hypertext and hyperlinks were added to it. The text had not originally been created to be displayed in a hypertext format, making the hyperlinked document quite artificial. This artificial hypertext document may be the reason for the increase in cognitive load, in turn making it difficult to generalise these results to reading on the Web. This being said, other research does corroborate with Carr’s suggestion that extra cognitive demands are associated with having to make decisions about whether to follow hyperlinks [24].

Some researchers have explored working memory and the concept of cognitive load and its impact on reading hypertext. DeStefano and LeFevre conducted a review of cognitive load in relation to reading hypertext [26]. They argued that the extra task demands of reading hypertext causes an increased cognitive load to the readers in comparison to linear text. Because the readers must make decisions about which hyperlinks to follow, additional cognitive demands are placed on working memory. Recently, Scharinger, Kammerer and Gerjets measured both the EEG and pupil size of readers engaging in a task that closely simulated hypertext reading and link selection [27]. They found evidence of increased load on executive functions when the reader had to perform hyperlink-like selection.

It is not just the decision of whether or not to click a hyperlink that could increase cognitive load. The reader’s decision to follow a hyperlink and explore different content could interrupt on-going comprehension processes. Comprehension involves the creation and development of situation models, which are complex mental representations that the reader instantiates in order to integrate statements from the text they are reading into their knowledge [28]. For example, Dee-Lucas and Larkin found that hyperlinks in text distract users by interrupting information processing [29]. While reading, users may stop to click on hyperlinks in the middle of text content, thus interrupting their cognitive processing and leaving the reader with a fragmented representation of the text content. Because of the nonlinear nature of hypertext, when a reader is reading text on one topic on a webpage, if they choose to click a hyperlink, it takes them to another webpage. This new webpage may contain content that is unrelated to the content they have just come from on the previous webpage. This could cause disruption to the reader’s development of a situation model and result in the readers’ comprehension of the text being reduced.

These suggestions of disruption caused by hyperlinks were questioned in a previous set of experiments [30]. Most importantly, in an environment resembling a Wikipedia page, we demonstrated that, at least when reading for comprehension, the use of blue hyperlinks does not have a negative influence on reading. This was shown by a lack of an effect of a word being a hyperlink compared to when it was not a hyperlink on eye movement behaviour (specifically, early fixation measures or skipping probability). However, one single effect did demonstrate that readers treat hyperlinked words differently to non-hyperlinked text; readers were more likely to re-read text when encountering a low frequency, hyperlinked word [30]. Specifically, when encountering a low-frequency word, we observed an increase in re-reading of the preceding text but more so when the word was also a hyperlink compared to when it was not. This finding suggests that hyperlinks highlight important information and suggest additional content which, for more difficult concepts, invites rereading of the preceding text. However, while interesting, the task used in this study was not entirely typical of reading on the Web, as readers read for comprehension only, and on static, experimentally manipulated Wikipedia pages. The experiments reported here aimed to build upon this finding by testing a more typical environment for reading on the Web. In Experiment 1, this is done through the introduction of the task of skim reading, to explore the impact of hyperlinks within this task on both eye movements and comprehension within a static Webpage environment. In Experiment 2, readers are also allowed to navigate through their environment, by clicking links, providing an environment matching typical reading on the Web.

Through the addition of these task elements, we have explored whether the impact of hyperlinks on lexical processing differed from our original findings when participants are engaged in a reading behavior more typical of Web browsing. Furthermore, by exploring skimming and navigation separately, we could dissociate the effects of these two manipulations on reading within a controlled environment. While Experiment 1 is not typical of reading on the Web, due to the lack of navigation, it does allow for a baseline to be set of how skim reading affects lexical processing of text within static Webpages. This allows us to draw comparisons with Experiment 2 to disentangle the unique effects of reading for comprehension and skim reading when navigating through a dynamic Web environment.

Our predictions regarding the degree of lexical processing are made on the basis that the frequency effect is traditionally considered a reliable indices of lexical processing. The frequency effect is a robust effect within psycholinguistic literature [6–8], and has previously been used to assess skim reading [4, 5, 29]. Furthermore, it has been taken as an indices of the depth of lexical processing during tasks such as visual searches of word lists (e.g. reduced depth of lexical processing compared to reading for comprehension as indicated by a lack of word frequency effect [31]) and proofreading (e.g. larger frequency effect when proofreading due to increased levels of lexical processing [32]). Following in this tradition, we investigated depth of lexical processing using the frequency effect. From previous research we predicted that readers would read faster when asked to skim read, but would have reduced comprehension [6]. We predicted that we would observe shorter fixation times and more word skipping in the skim reading condition. However, we also predicted that because the linked target words are salient, they might attract the attention of the reader in the skim reading condition resulting in less skipping of linked words. We also included a word frequency manipulation in this experiment in order to explore whether common lexical effects are present in hyperlinked text and to investigate if they are modulated by the word being hyperlinked. Our previous research [30] suggested that, if reading behaviour is unchanged by introducing skim reading and the ability to click links, we will observe a frequency effect, whereby low frequency words are less likely to be skipped and have longer fixation time. Whether the target word is linked or not, however, will modulate the effect in re-reading, such that low-frequency hyperlinks elicit more re-reading.

Experiment One

Method

Participants.

Thirty-two native English speakers (2 male, 30 female) with an average age of 20.00 years (range– 18–31) participated in exchange for course credits or payment (£9) and were members of the University of Southampton community. All had normal or corrected-to-normal vision and no known reading disabilities. None of the participants took part in Experiment 2. All samples reported in this paper are typical of eye movement and reading studies.

Apparatus.

Eye movements were measured with an SR-Research Eyelink 1000 eye tracker operating at 1000 Hz (1 sample every millisecond). Participants viewed the stimuli binocularly, but only the right eye was tracked. Words were presented in 14pt mono-spaced Courier font. The participant’s eye was 73 cm from the display; at this distance three characters equalled about 1° of visual angle.

Materials and design.

The stimuli in Experiment 1 consisted of forty edited Wikipedia articles (example stimuli available: https://goo.gl/JLvvMD) taken from Experiment Three of Fitzsimmons et al. [30]. One-hundred and sixty target words were embedded in sentences (one target word per sentence) and four sentences were inserted into each Wikipedia article. The text was created by taking existing Wikipedia articles on neutral topics and inserting four experimental sentences into the existing text. The experimental sentences were designed to be semantically consistent with the text already present, so as not to stand out from the existing text. The rest of the text on each screen was identical to the source material on Wikipedia, including additional words that were linked, for additional naturalness. This decision was made so that the articles were as close to a natural Web environment as possible, while gaining the additional control experimental sentences. The Wikipedia articles were ten to twelve lines long. The target words were nouns and the location of the target words were scattered across the sentences, but they were never on the start or end of a line. All these design decisions were made to align with the traditional eye movements and reading methodology. The target words within these articles were either displayed in blue or black to denote if the word was a hyperlink or not, respectively (see Fig 1).

Download:

Fig 1. Example Wikipedia stimulus with examples of high and low frequency words in linked and unlinked form.

Note. Wikipedia branding removed from example for copyright purposes–full version of stimuli can be seen here: https://goo.gl/JLvvMD.

https://doi.org/10.1371/journal.pone.0239134.g001

In total there were 8 conditions in a 2 (Task Type: Comprehension, Skimming) x 2 (Word Type: Linked, Unlinked) x 2 (Word Frequency: High, Low) within participants design. At a target word level, the target words within these articles were either displayed in blue or black to denote if the word was a hyperlink or not. There was also a word frequency manipulation where the frequency of the target word was either high or low frequency. The word frequencies were taken from the Hyperspace Analogue to Language (HAL) corpus [33]. The frequency norms were used to extract both high and low frequency words to create the experimental stimuli. The high frequency words had an average log transformed HAL frequency of 9.94 and the low frequency words had an average log transformed HAL frequency of 5.81. There was a significant difference between the high and low word frequency stimuli, t(159) = 29.66, p < .001. All target words were 4–7 characters in length with an average of 5.60 characters and the high/low frequency pairs were matched on word length. The various versions of each stimulus were presented according to a Latin square design, meaning every participant saw only one version of each edited Wikipedia article.

Procedure.

Before any of the experiments in this article took place, ethics approval was applied for, peer-reviewed and granted by the University of Southampton Psychology Department Ethics Committee. Ethics approval was sought and approved for all experiments within this article. Participants were given an information sheet and a verbal description of the experimental procedure and informed that they would be reading passages on a monitor while their eyes were being tracked. The text on the screen gave the instructions to read either for comprehension or to skim read. This was blocked such that the first twenty stimuli were to be read for comprehension and the second twenty to be skim read.

When the skim reading portion of the experiment began the participants were instructed to ‘skim read as you would naturally, as if you are reading a large text book that you need to read quickly’. Participants were told there was no time limit, and they simply had to skim read naturally. We did not counterbalance the Task Type. Participants were not told they were going to be skim reading until just before that half of the experiment was due to begin, so as not to influence the first part of the experiment which was to be read for comprehension. We worried if participants were first asked to skim read, it may become difficult to slow down and read “normally”. This was also suggested by participants in a pilot study. Participants during piloting of the study indicated they found it much easier to adapt to skim reading after typical reading behaviour than vice versa. Both experiments took 60–90 minutes, with breaks, which are not atypical of other eye movement and reading studies, ensuring tiredness had a reduced influence on our results.

The participants’ head was stabilised in a head/chin rest to reduce head movements that could adversely affect the quality of the calibration of the eye tracker. At the beginning of each trial the participant had to look at a fixation point on the screen. When the eye tracker registered a stable fixation on the fixation point, the stimulus was displayed ensuring that the first fixation fell at the beginning of the text. When participants finished reading they confirmed they had finished by pressing a button on the response box in front of them.

The participants were informed that they were to respond to comprehension questions presented after each trial when four comprehension questions were presented to the participants, one at a time. The comprehension questions were designed to be simple and only ever required a yes or no response. They asked about information across the whole webpage (not just the target sentences), ensuring readers were reading the entire passages of text. Participants responded to the questions by pressing the appropriate button on a response box. They were designed to ensure readers were reading the text and understanding the text, as such, they ensured task validity in our reading task. After the questions the next trial would appear.

Results

Trials where there was tracking loss were removed prior to the analysis. Fixations shorter than 80ms that were within one character of the previous or following fixation were merged and all fixations shorter than 80ms or longer than 800ms were removed to eliminate outliers, resulting in the removal of 5.43% of the total dataset [33, see also: 34]. Finally, when calculating the eye movement measures, data that were more than 2.5 standard deviations from the mean for a participant within a specific condition were removed (<1% of dataset). Data loss affected all conditions similarly.

For the local target word analyses an interest area was drawn around each target word. The interest area is the size of the target word including the space preceding it. The local analyses below are conducted using the fixations that landed on the target word, within the interest area drawn around it.

We focused our analysis on three key eye movement measures: Skipping probability, single fixation duration and go-past times (means shown in Table 1). Skipping probability is the probability that a target word does not receive a direct fixation during the first-pass. Single fixation duration is the duration of the fixation if the reader made exactly one first-pass fixation on the target word. Go-past time is the time between first fixating the word and moving past it to the right (including any time fixating previous content via regressions that originate from the target word). In this experiment when the target word was fixated, in 93.91% of the cases it received a single fixation. Therefore, we limited the fixation duration analyses to when there was a single fixation on the target word.

Download:

Table 1. Means of eye movement measures for Experiment 1.

Standard deviation in parentheses.

https://doi.org/10.1371/journal.pone.0239134.t001

We ran Linear Mixed Models (LMMs) using the lme4 package (Version 1.1–12) in R [35] to explore the impact of three variables. Logistic General Linear Mixed Models were used for the skipping probability measure. The three independent variables were included as fixed factors: Task Type (Comprehension, Skimming), Word Type (Linked, Unlinked) and Word Frequency (High, Low). Participants and items were included as random effects variables. A maximal random model was initially specified for the random factors [36]. If a model did not converge, the random effect structure was pruned first by removing the interactions between the slopes, then correlations in the random structure and finally by successively removing the slopes for the random effects explaining the least variance until the maximal converging model was identified. Additionally, the interactions between Word Frequency and Task Type, Word Frequency and Word Type, and the three-way interaction between Word Frequency, Word Type and Task Type were removed from the skipping probability model, as comparisons showed they did not contribute towards the fit of the model. The go-past time model excluded the interaction between Word Frequency and Word Type, and the three-way interaction between Word Frequency, Word Type and Task Type for the same reason. All the patterns observed in the models were identical whether they were run on log-transformed or untransformed fixation durations, allowing us to present the data run on the untransformed fixation durations in order to increase transparency. The only exception was for go-past times measures where the fixation times were log transformed. This was due to the data needing to be normalised because it was skewed and resulted in qualitatively different models for log transformed versus untransformed go-past times. All fixed effects estimates are shown in Table 2 and were calculated using successive differences contrasts so that the intercept corresponds to the grand mean. Absolute values of t equal to or bigger than 1.96 were interpreted as significant because for high degrees of freedom as is typically the case in LMMs, the t statistic approximates the z statistic.

Download:

Table 2. Fixed effect estimates for skipping probability percentage of the target word and the fixation times on the target word in ms for Experiment 1.

https://doi.org/10.1371/journal.pone.0239134.t002

Comprehension.

The comprehension question accuracy was consistently high, regardless of whether the reader was reading for comprehension of skim reading, with 89% of questions correctly answered (Reading for comprehension: 91%; Skim reading: 86%).

Word skipping.

There was a main effect of Word Frequency in skipping probability such that the high frequency words were skipped significantly more often than the low frequency words (see Table 1). There was also a main effect of Task Type, where there was more skipping when the reader was skim reading compared to when they were reading for comprehension. This replicates the research conducted by Just and Carpenter [6] who found similar results in skim reading.

In skipping probability, as well as the significant main effects of Word Frequency and Task Type, there was also a main effect of Word Type. When the target word was unlinked it was more likely to be skipped compared to when it is linked. This effect of Word Type was qualified by an interaction with Task Type (see Fig 2). Subsequent contrasts showed that there was no difference in skipping probability when the target word was linked or unlinked during comprehension reading (z = 1.46, SE = 0.09, p = .150), but there was a significant difference in the skim reading condition. Linked words were significantly less likely to be skipped compared to unlinked words in the skim reading condition (z = 7.54, SE = 0.09, p < .001). This suggests that when the readers are skim reading they are attempting to fixate the linked words more than the unlinked words and avoid skipping them.

Download:

Fig 2. Two-way interaction between Word Frequency and Task Type in Experiment 1.

Means and standard error bars for skipping probability.

https://doi.org/10.1371/journal.pone.0239134.g002

Fixation duration measures.

There was a main effect of Word Frequency in single fixation duration. The low frequency words had significantly longer fixation durations than the high frequency words. This replicates previous research where low frequency words are fixated for longer because they are more difficult to process than highly frequent words [9]. There was also a main effect of Task Type in single fixation durations where there were shorter fixation durations when the participant was skim reading. This replicates the research conducted by Just and Carpenter [6] who found similar results in skim reading.

For single fixation duration, these main effects were qualified by multiple two-way interactions, and these were qualified by a three-way interaction between Word Frequency, Word Type and Task Type (see Fig 3). To explore this three-way interaction, additional contrasts were conducted.

Download:

Fig 3. Three-way interaction between Word Frequency, Word Type and Task Type in Experiment 1.

Means and standard error bars for single fixation durations.

https://doi.org/10.1371/journal.pone.0239134.g003

As Fig 3 clearly shows, the three-way interaction was caused by the fact that, when participants skim read the passages, a frequency effect only emerged for linked target words (b = 8.71, SE = 3.77, t = 3.03), and not for unlinked target words (b = -2.12, SE = 3.37, t = -0.63). In contrast, when participants read for comprehension, there was a significant frequency effect both for linked target words (b = 9.48, SE = 3.23, t = 2.94) and unlinked target words (b = 14.80, SE = 3.32, t = 4.46).

Although the majority of fixations on the target word were single fixations, when the target word was fixated 14.11% of target words had regressions to previous interest areas. We also explore go-past times, however, we need to point out that the re-reading analysis will not have a high amount of statistical power, as re-reading was obviously rather rare.

Go-past times. In go-past times the main effects of Word Frequency and Task Type were qualified by a two-way interaction (see Fig 4) whereby the frequency effect is present when reading for comprehension (b = 0.13, SE = 0.03, t = 4.60), but is missing in skim reading (b = -0.01, SE = 0.03, t = -0.46).

Download:

Fig 4. Two-way interaction between word frequency and Task Type in Experiment 1.

Means and standard error bars for log-transformed go-past time.

https://doi.org/10.1371/journal.pone.0239134.g004

Finally, we measured trial duration (See Table 3). This is a global measure, so we only included the independent variable of Task Type (Comprehension, Skimming) as a fixed factor. The intercept was allowed to vary for the items and participants variable. We found a main effect of Task Type where the trial duration (See Table 4) was significantly longer when it was read for comprehension compared to when it was skim read, validating our manipulation of skim reading.

Download:

Table 3. Means and Standard deviations for trial duration in Experiment 1.

Standard deviation in parentheses.

https://doi.org/10.1371/journal.pone.0239134.t003

Download:

Table 4. Fixed effect estimates for trial duration in seconds for Experiment 1.

https://doi.org/10.1371/journal.pone.0239134.t004

Discussion

Experiment 1 demonstrated that skim reading has a pronounced influence on lexical processing. We observed that fixations were shorter and skipping rates were higher, on average, for the skim reading condition compared to the reading for comprehension condition, replicating the findings of Just and Carpenter [6]. However, the most interesting finding here is in relation to the impact that hyperlinks have on reading behaviour. Fitzsimmons et al. [30] found that, when reading for comprehension, hyperlinks had only a limited impact upon reading behaviour, restricted to increased re-reading of the low frequency, hyperlinked words. We replicated the results of Fitzsimmons et al., finding that hyperlinks are not a hindrance to reading when reading for comprehension in early eye movement measures, but we did not observe increased re-reading induced by a low-frequency hyperlink, although re-reading was rare in this experiment.

When the reader was skim reading, however, we observed several notable differences in eye movement behaviour compared to reading for comprehension. Firstly, readers were less likely to skip hyperlinks when skim reading. Secondly, during skim reading, we found a frequency effect for single fixation duration for hyperlinked words, but not for unlinked words. When taken as a proxy for depth of lexical processing, this lack of frequency effect clearly indicates that unlinked words are not as fully processed when readers are engaged in skim reading. This effect of skim reading reducing lexical processing, critically, is not present in the case of hyperlinks.

Regardless of whether a reader skim reads or reads for comprehension, the frequency effect was observed for hyperlinks. This suggests that even when reading quickly, hyperlinks are still important signals to the reader that they should be fully processed in order to engage in an efficient skimming pattern. This finding suggests that readers could be engaging in a speed-comprehension trade-off that is optimal for the task at hand. Participants may have wanted to read quickly while still retaining as much comprehension as possible. They may have been using the links as anchor points or signals throughout the text if the links denote the most important information. Typographical signals have been shown to result in the reader paying more attention to the signalled content [37] and often in improved memory for the signalled text [37–40]. It has also been previously shown that hyperlinks can assist in helping learners retain information [41], perhaps because they are working as a typographical signal. From these findings, we suggest that participants used the hyperlinks as markers for the presence of important information and used them in a strategy to skim read through the text in the most efficient way possible.

Experiment Two

Hyperlinks are visually salient and important navigational tools during reading of hypertext, as they represent a link to other content on the Web. Experiment 1 further displayed the importance of hyperlinks in signalling a unit of important text within a passage. This supports previous evidence showing hyperlinks highlight important information, thus affecting skim reading behaviour [30, 42]. However, we also need to consider how the reader reads the text when it contains clickable hyperlinks (i.e. when reading hypertext).

In Experiment 1, the reader could not click and navigate the environment. While a clear limitation of Experiment 1, as the readers were not strictly reading hypertext, this was implemented to maintain experimental control by simplifying the experience as much as possible in order to explore the impact of hyperlinks during skim reading, without yet introducing the added complexity of navigation and clicking. In Experiment 2, we run a similar manipulation to that seen in Experiment 1, where the task was manipulated (reading for comprehension or skim reading). The target words within the text were also manipulated to be either high or low word frequency and were displayed either in blue (linked) or black (unlinked). Additionally, in Experiment 2 we allow the links to clicked, which also means that if the page was re-visited, the link would be made purple as if the links have been visited, consistent with how they would normally look on the Web. The reader chooses the next trial by clicking on the hyperlink they wish to go to, simulating a realistic Web environment. We predicted that we will find mostly the same effects as in Experiment 1. However, by allowing the reader to navigate the links we might expect to observe inflated fixation durations for the linked words in total reading times where the reader may spend longer on the linked words to evaluate which link to click to navigate to another page. As such, Experiment 2 clearly builds upon Experiment 1 by having the novel inclusion of navigation through hypertext, allowing us to investigate the unique effect of navigating on lexical processing.