Predictive structure building in language comprehension: a large sample study on incremental licensing and parallelism

Fujita, Hiroki

doi:10.1007/s10339-023-01130-8


Short Communication
Open access
Published: 16 March 2023

Volume 24, pages 301–311, (2023)

Cognitive Processing


Hiroki Fujita ORCID: orcid.org/0000-0001-7649-9707


Abstract

In online language comprehension, the parser incrementally builds hierarchical syntactic structures. The predictive nature of this structure-building process has been the subject of extensive debate. A previous study observed that when a wh-phrase indicates parallelism between the upcoming wh-clause and a preceding clause (e.g., John told some stories, but we couldn’t remember which stories…), the parser predictively constructs the wh-clause. This observation demonstrates predictive structure building. However, the study also suggests that the parser does not make a prediction when the wh-phrase indicates that parallelism does not hold (e.g., John told some stories … with which stories…), a potential limit to the prediction of syntactic structures. Crucially, these findings are controversial because the study did not observe processing difficulty when disambiguating input indicated that the predicted continuation was inconsistent with the globally grammatical structure (garden-path effects). The controversial results may be due to a lack of statistical power. Therefore, the present study conducted a large-scale replication study (324 participants and 24 sets of materials). The results revealed that the parser predicts the clausal structure, irrespective of the type of wh-phrase. There was also evidence of garden-path effects, supporting the finding that the parser makes a prediction. These observations suggest that the prediction algorithm inherent in the human parser is more powerful than assumed by the previous study and that the parser attempts to construct globally grammatical structures during revision.


Introduction

During online language comprehension, the parser analyses each word and incrementally constructs hierarchical syntactic structures. Research has argued and observed that online language comprehension is a predictive process; the comprehender generates hypotheses about incoming elements during sentence processing (e.g., Crocker 1996; Fujita and Cunnings 2022, in press; Gibson 1991; Gorrell 1995; Ito et al. 2016, 2020; Kamide and Mitchell 1999; Kimball 1973; Lau et al. 2006; Omaki et al. 2015; Weinberg 1993). The present study investigates and discusses the mechanism underlying this predictive process focusing on sentence parsing.

There is evidence that structure building is a predictive process (e.g., Aoshima et al. 2004, 2009; Kush et al. 2017; Phillips 2006; Staub and Clifton 2006; Yoshida et al. 2013). Crucial to the present study is Yoshida et al. (2013). Yoshida et al. investigated whether the parser predictively constructs a clausal structure that follows a wh-phrase, as in the substring below.

(1)
John told some stories, but we couldn’t remember which stories…

In (1), the wh-phrase “which stories” indicates that a clause (TP) follows. This wh-clause must contain a subject noun phrase (NP), a verb phrase (VP) and the wh-phrase’s base position (t), with appropriate lexical content, as in we couldn’t remember [_NPt which stories] [_TP [_NP you] [_VP heard t]] (Chomsky 1977; Ross 1969). Crucially, in (1), the wh-clause’s entire content is recoverable from the first clause because the two clauses can be parallel in syntactic structure and lexical content (i.e., John told some stories, but we couldn’t remember which stories John told). This recoverability does not hold if a preposition accompanies the wh-phrase and undermines the parallelism, as below.^{Footnote 1}

(2)
John told some stories, but we couldn’t remember with which stories…

Yoshida et al. (2013) investigated whether the parser predictively constructs the clausal structure and recovers its content from the first clause by utilising connectivity effects (Merchant 2005; Stjepanović, 2008; Truswell 2014) and online reflexive resolution (Sturt 2003).^{Footnote 2} Connectivity refers to a phenomenon where a fronted phrase appears to occupy a lower position. Online reflexive resolution is a process where the parser resolves a reflexive by searching for its antecedent during sentence processing. For example, consider the following sentences.

(3)
John/Mary told some stories, but we couldn’t remember
a.
which stories about himself Tom became impressed with.
b.
with which stories about himself Tom became impressed.

These sentences are akin to the substrings in (1/2) but have an overt clausal continuation after the wh-phrase. Also, the wh-phrase in (3a/b) contains a reflexive (“himself”), which depends referentially on a c-commanding NP in its binding domain (Binding Principle A; Chomsky 1981). C-command refers to a structural relation between nodes. The present study posits that x c-commands y if and only if x does not dominate y and x’s parent dominates y (Reinhart 1976). The binding domain for x is the minimal NP or TP containing x, x’s governor and a subject, of which x is not a part (Chomsky 1981, 1986). According to these definitions, the reflexive in (3a/b) corefers with the wh-clause subject NP (“Tom”) because this NP c-commands it in its binding domain due to connectivity effects (e.g., [_CP [_NPt which stories about himself] [_TP Tom became impressed with t]]). However, in (3a), if the parser predicts the clausal structure upon encountering the wh-phrase and recovers its content from the first clause, the first clause subject NP must serve as the antecedent (until the wh-clause subject NP appears, e.g., [_TP [_NPk John/Mary]] … [_CP [_NPt wh [_NPk himself]] [_TP [_NPk] [_VP t]]]). In (3a/b), the first clause subject NP either matches (“John”) or mismatches (“Mary”) the reflexive’s gender. It is known that the parser searches for a structurally licensed antecedent immediately after encountering a reflexive and that processing difficulty ensues when the two NPs disagree in gender (gender mismatch effects; e.g., Sturt 2003). Research often utilises reading times as an index of processing difficulty, assuming that reading times become longer as processing difficulty increases. Therefore, in (3a), if the parser predicts the wh-clause and recovers its content from the first clause, reading times at the reflexive should be longer in the gender-mismatch (“Mary…himself”) than gender-match (“John…himself”) conditions. In (3b), Yoshida et al. expected gender mismatch effects to be absent because the prepositional wh-phrase undermines parallelism between the two clauses, thereby preventing the recovery of the wh-clause’s entire content from the first clause. In a self-paced reading task, Yoshida et al. confirmed these hypotheses; they observed gender mismatch effects at the reflexive in (3a) but not in (3b).
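The c-command definition adopted above can be made concrete with a small sketch. The tree below is a deliberately simplified, hypothetical rendering of the wh-clause in (3a) after connectivity (the wh-phrase is placed in its base position inside the VP); the class and function names are illustrative assumptions, not part of the original study.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass(eq=False)
class Node:
    """A node in a toy constituency tree (labels are illustrative only)."""
    label: str
    children: List["Node"] = field(default_factory=list)
    parent: Optional["Node"] = None

    def add(self, *kids: "Node") -> "Node":
        # Attach children and record their parent so dominance can be traced upwards.
        for kid in kids:
            kid.parent = self
            self.children.append(kid)
        return self


def dominates(x: Node, y: Node) -> bool:
    """True if x properly dominates y, i.e. y is a descendant of x."""
    node = y.parent
    while node is not None:
        if node is x:
            return True
        node = node.parent
    return False


def c_commands(x: Node, y: Node) -> bool:
    """x c-commands y iff x does not dominate y and x's parent dominates y.

    A node is assumed not to c-command itself (an assumption of this sketch).
    """
    if x is y or x.parent is None:
        return False
    return not dominates(x, y) and dominates(x.parent, y)


# Simplified structure: [TP [NP Tom] [VP became impressed with [NP which stories about [NP himself]]]]
subject = Node("NP: Tom")
reflexive = Node("NP: himself")
wh_phrase = Node("NP: which stories about").add(reflexive)
vp = Node("VP").add(Node("V: became impressed with"), wh_phrase)
tp = Node("TP").add(subject, vp)
```

In this toy tree, `c_commands(subject, reflexive)` holds while `c_commands(reflexive, subject)` does not, which mirrors the asymmetry that licenses “Tom” as the antecedent of the reflexive.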

Yoshida et al.’s (2013) results have several implications for sentence parsing theories. One is that the parser predictively constructs a large amount of syntactic structure. This implication follows from the fact that the predicted representation comprises a clause ([_CP wh [_TP [_NP] [_VP]]]). Another is that the parser preferentially reuses material in the left context. At the wh-phrase in (3a), it is not impermissible to construct the wh-clause without recovering its content from the first clause. Yoshida et al. argue that the parser recycles material because it prefers to maximise parallelism between the clauses ([_TP John told some stories] … [_TP John told which stories]; see also Carlson 2001; Frazier et al. 1984; Hall and Yoshida 2021; Kim et al. 2020; Knoeferle and Crocker 2009).

Another implication, which is crucial for the present study, is that the parser does not predict the clausal structure when the wh-phrase indicates that parallelism between the two clauses does not hold. As noted earlier, Yoshida et al. (2013) hypothesised that gender mismatch effects would be absent in (3b) because the prepositional wh-phrase prevents the recovery of the wh-clause’s entire content from the first clause. However, this unrecoverability does not prevent the parser from constructing the clausal continuation and positing joint reference between the subject NPs of the first clause and the wh-clause at the wh-phrase (i.e., it is possible to analyse (3b) as [_TP [_NPk] [_VP V1]] … [_CP [_PPt] [_TP [_NPk] [_VP V2 t]]], V1 ≠ V2). Given that the reflexive in (3b) corefers with the wh-clause subject NP, the absence of gender mismatch effects might indicate that the parser does not predictively construct the clausal structure. In other words, Yoshida et al.’s observations might point to a potential limit to the prediction of syntactic structure. If this interpretation holds, we must assume that, in some circumstances, the parser predicts a clausal structure only when parallelism provides a cue for it (i.e., prediction is a parallelism-driven process). This potential limitation is theoretically crucial, given that the clausal structure following the wh-phrase is necessary for the sentence’s well-formedness and that there is evidence and argument that the parser predictively constructs obligatory structures during sentence processing (incremental licensing; Abney 1986; Aoshima et al. 2004; Crocker 1996; Frazier and Clifton 1996; Gibson 1991; Gibson et al. 1994; Gorrell 1995; Pritchett 1988, 1991, 1992; Weinberg 1999). That is, according to the incremental licensing theory, the parser should predictively construct the clausal structure upon encountering the wh-phrase in both (3a) and (3b), and Yoshida et al.’s observations in (3b) might contradict this theory.

Alternatively, the absence of gender mismatch effects in (3b) may indicate that the parser predicts the wh-clause but posits disjoint reference with the first clause subject NP (e.g., no prediction of lexical content; [_NPk John/Mary] … [_CP [_PPt wh] [_TP [_NPi] [_VP t]]]). This interpretation is compatible with the incremental licensing theory, and if it is valid, we must explain why the parser avoids coreference at the wh-phrase.

As described above, Yoshida et al.’s (2013) results have significant implications for sentence parsing theories. However, there is one concern about their results: in (3a), their participants did not show processing difficulty at the wh-clause subject NP in the gender-match condition. The appearance of this NP indicates that the material recovered from the first clause is incompatible with the globally grammatical structure. Crucially, there is substantial evidence that reading times increase when an input word does not fit into the current structure (e.g., Clifton 1993; Cunnings and Fujita 2021; Frazier and Rayner 1982; Fujita 2021b; Fujita and Cunnings 2020, 2021a, 2021b; Slattery et al. 2013; Sturt et al. 1999; Tabor and Hutchins 2004). This garden-path effect (Frazier and Rayner 1982) is assumed to result from the parser’s difficulty integrating disambiguating input into the current structure and its attempt to construct the globally grammatical structure (revision). Given these studies, we can expect some processing difficulty at the wh-clause subject NP in (3a). Thus, the absence of garden-path effects at the wh-clause subject NP potentially contradicts the finding that the parser predicts the clausal structure. If gender mismatch effects observed in Yoshida et al. are an experimental artefact, and if their observations at the disambiguating region represent the underlying mechanism of sentence parsing, we must assume that the human parser is not powerful enough to predict such a large amount of syntactic structure as a TP, at least in some circumstances.

There are, however, other accounts of why Yoshida et al. (2013) did not observe garden-path effects. One is that the parser does not initiate the revision process or halts it promptly upon disambiguation because the predicted continuation is tolerable after disambiguation. In psycholinguistics, some take this line of approach. For example, the Good-Enough approach views language comprehension as a heuristic rather than an algorithmic process and presupposes that the comprehender employs fast and frugal heuristic procedures even when these procedures do not conform to the principles of grammar (e.g., Christianson et al. 2001; Ferreira and Patson 2007; Slattery et al. 2013). One consequence of this presupposition is that the comprehender creates representations incompatible with globally grammatical structures. Following the Good-Enough approach, we could interpret the absence of garden-path effects in (3a) as indicating that the predicted continuation is good enough to comprehend the sentence. If sentence processing follows simple heuristic procedures, we need to specify under what circumstances the parser omits the revision process and what structure it constructs (e.g., where does disambiguating input attach?).

Alternatively, the contrasting findings may be due to a lack of statistical power. Yoshida et al. (2013) conducted a self-paced reading experiment with 40 participants and 24 sets of materials. These numbers are typical in sentence processing research. However, given that experimental materials tested in Yoshida et al. are structurally complex, precise estimates of the effect may require high statistical power. Therefore, the present study conducted a high-power replication of Yoshida et al. using a lexicality maze task with 324 participants and 24 sets of materials.

Methods

Participants

The experiment, conducted online, involved 324 native English speakers recruited via Prolific (https://www.prolific.co). These participants were over 18 years old, grew up and lived in the UK and were British citizens.

Design and materials

The experiment contained 24 sets of experimental materials from Yoshida et al. (2013), as in (4a–d) below.

(4a)
Wh-NP, Gender match.

Janet’s grandfather told some stories at the family reunion, but we couldn’t remember which stories about himself from the party the brother was so very impressed with.

(4b)
Wh-NP, Gender mismatch.

Justin’s grandmother told some stories at the family reunion, but we couldn’t remember which stories about himself from the party the brother was so very impressed with.

(4c)
Wh-PP, Gender match.

Janet’s grandfather told some stories at the family reunion, but we couldn’t remember with which stories about himself from the party the brother was so very impressed.

(4d)
Wh-PP, Gender mismatch.

Justin’s grandmother told some stories at the family reunion, but we couldn’t remember with which stories about himself from the party the brother was so very impressed.

In (4c/d), a preposition accompanies the wh-phrase, but not in (4a/b). In (4a/b), the wh-clause’s entire content is recoverable from the first clause until the wh-clause subject NP appears. The first clause subject NP matches the reflexive’s gender in (4a/c) and mismatches in (4b/d).

If structure building is a predictive process, three hypotheses are conceivable. One is that the parser predicts the wh-clause only when parallelism provides a cue for the clausal structure. If this hypothesis is correct, gender mismatch effects at the reflexive should only occur in the wh-NP conditions, with longer reading times in (4b) than (4a). Besides, this reading time pattern should reverse in direction at the disambiguating region (“brother”) due to garden-path effects (e.g., Frazier and Rayner 1982). Similar results should be obtained if the parser predicts the clausal structure in both wh-NP and wh-PP conditions but posits joint reference between the subject NPs of the first clause and the wh-clause only in the wh-NP conditions. If the parser predictively constructs the wh-clause and assumes coreference irrespective of the presence or absence of the preposition, gender mismatch effects and garden-path effects should occur in both wh-NP and wh-PP conditions. Thus, what is crucial is whether a significant main effect of gender or a significant wh-type by gender interaction appears at the reflexive and disambiguating regions.

Procedure

The present study employed a lexicality maze task (Forster et al. 2009; Witzel et al. 2012). In this task, participants read each sentence word by word, with each word presented with a pseudoword, and needed to press a button corresponding to the correct word (see Fig. 1). Thus, the data obtained from the maze task include reading times and judgement reaction times. For expository purposes, the present study refers to this measure as reading times. When participants chose a pseudoword, the trial was immediately terminated, and the next trial began. The lexicality maze task was administered in PCIbex Farm (Zehr and Schwarz 2018), and the experimental file was created using code available online (Boyce et al. 2020; Fujita 2021a). The experiment began with four practice trials, followed by 24 experimental sentences and 72 fillers presented in a pseudorandomised order.
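The trial logic described above can be sketched as follows. This is only a schematic illustration, not the PCIbex implementation used in the experiment: the function name, the pseudowords and the simulated participants are hypothetical, and timing and left/right positioning are omitted.

```python
from typing import Callable, List, Tuple


def run_maze_trial(
    pairs: List[Tuple[str, str]],
    choose: Callable[[str, str], str],
) -> Tuple[int, bool]:
    """Present (word, pseudoword) pairs in order and stop at the first error.

    Returns the number of correct choices and whether the sentence was completed.
    """
    correct = 0
    for word, pseudoword in pairs:
        if choose(word, pseudoword) != word:
            # Choosing the pseudoword terminates the trial immediately.
            return correct, False
        correct += 1
    return correct, True


# Hypothetical word/pseudoword pairs for part of a sentence.
pairs = [("which", "flirp"), ("stories", "blamers"), ("about", "snorf"), ("himself", "trunderf")]


def accurate(word: str, pseudoword: str) -> str:
    """Simulated participant who always selects the real word."""
    return word


def errs_on_about(word: str, pseudoword: str) -> str:
    """Simulated participant who selects the pseudoword at 'about'."""
    return pseudoword if word == "about" else word
```

With these simulated participants, the accurate reader completes all four positions, whereas the second reader's trial ends after two correct choices.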

Data analysis

The dependent variable was log-transformed reading times at four regions. These regions were the reflexive (“himself”), post-reflexive (“from”), disambiguating (“brother”) and post-disambiguating (“was”) regions. Before data analysis, reading times shorter than 300 ms or longer than 7000 ms were excluded.^{Footnote 3} These outliers represented less than 0.01% of the data. For data analysis, linear mixed-effects models with full variance–covariance matrices for the random effects (the maximal model; Barr et al. 2013) were fitted separately for each region using the lme4 package (Bates et al. 2015) in R (R Core Team 2020). The fixed effects were sum-coded (0.5/–0.5) main effects of wh-type (wh-NP/wh-PP), gender (match/mismatch) and their interactions. When the maximal model did not converge, random effect correlations were initially removed, and then, random effects with the least variance were iteratively removed until the model converged. To interpret the results, p values were estimated from the t distribution (Baayen 2008), and those below 0.05 were interpreted as statistically significant. Data and analysis code are available at https://osf.io/rh4xz.
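For illustration, the exclusion, transformation and coding steps described above might look like the following sketch. It is not the analysis code deposited at OSF (which uses R and lme4); the field names, the example reading times and the choice of which level receives +0.5 are assumptions made here for exposition.

```python
import math
from typing import Dict, List

MIN_RT_MS = 300   # lower cut-off reported in the text
MAX_RT_MS = 7000  # upper cut-off reported in the text


def sum_code(level: str, positive_level: str) -> float:
    """Code a two-level factor as 0.5 / -0.5 (which level is positive is an assumption)."""
    return 0.5 if level == positive_level else -0.5


def prepare(trials: List[Dict]) -> List[Dict]:
    """Drop out-of-range reading times, log-transform the rest and add coded predictors."""
    rows = []
    for trial in trials:
        if trial["rt"] < MIN_RT_MS or trial["rt"] > MAX_RT_MS:
            continue  # treated as an outlier and excluded
        wh = sum_code(trial["wh_type"], "wh-NP")
        gender = sum_code(trial["gender"], "match")
        rows.append({
            "log_rt": math.log(trial["rt"]),
            "wh": wh,
            "gender": gender,
            "interaction": wh * gender,  # product of the two coded main effects
        })
    return rows


# Hypothetical reading times (ms) at one region.
trials = [
    {"rt": 250, "wh_type": "wh-NP", "gender": "match"},
    {"rt": 812, "wh_type": "wh-NP", "gender": "mismatch"},
    {"rt": 9000, "wh_type": "wh-PP", "gender": "match"},
    {"rt": 1024, "wh_type": "wh-PP", "gender": "mismatch"},
]
rows = prepare(trials)
```

With ±0.5 coding, each main-effect estimate corresponds to the difference between the two levels of that factor averaged over the other factor, which is why the main effect of gender can be read as an overall match/mismatch difference.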

Results

Table 1 summarises statistical analyses, and Fig. 2 visualises reading times at the (post-)disambiguating and (post-)reflexive regions.

Table 1 A summary of statistical analyses at the (post-)reflexive and (post-)disambiguating regions


Reflexive region

Analysis revealed a significant main effect of gender, with longer reading times in the gender-mismatch than gender-match conditions. The wh-type by gender interaction was not statistically significant.

Post-reflexive region

No effects were statistically significant.

Disambiguating region

There was a significant main effect of gender, with longer reading times in the gender-match than gender-mismatch conditions (i.e., garden-path effects).

Post-disambiguating region

Analysis showed a significant wh-type by gender interaction. As a follow-up analysis, an additional model with nested contrasts was fitted to examine the effect of gender within each level of wh-type. This analysis revealed garden-path effects in the wh-NP conditions (Estimate = 0.019, SE = 0.01, t = 2.19, p = 0.029) but not in the wh-PP conditions (Estimate = –0.006, SE = 0.01, t = –0.71, p = 0.479).
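As a rough check on how t values of this size relate to the 0.05 threshold, the sketch below converts a t value into a two-tailed p value. It assumes that, with a sample this large, the t distribution can be approximated by the standard normal; this approximation and the function name are assumptions of the sketch, not a description of the analysis code.

```python
import math


def two_tailed_p(t_value: float) -> float:
    """Two-tailed p value for a t statistic under a standard normal approximation.

    erfc(|t| / sqrt(2)) equals 2 * (1 - Phi(|t|)), where Phi is the normal CDF.
    """
    return math.erfc(abs(t_value) / math.sqrt(2))


p_wh_np = two_tailed_p(2.19)   # t value reported for the wh-NP contrast
p_wh_pp = two_tailed_p(-0.71)  # t value reported for the wh-PP contrast
```

Under this approximation, a t value of 2.19 falls below the 0.05 threshold, whereas a t value of –0.71 does not.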

Discussion and conclusion

The present study conducted a large-scale replication of Yoshida et al. (2013) using a lexicality maze task to explore the predictive structure-building process. The experiment revealed gender mismatch effects at the reflexive, which crucially did not interact with wh-type. The absence of the wh-type by gender interaction suggests that, at the wh-phrase, the parser constructs the entire wh-clause with its subject NP coindexed with the first clause subject NP, irrespective of the type of wh-phrase. This finding is partially inconsistent with Yoshida et al., who observed gender mismatch effects only in the wh-NP conditions. Additionally, analysis revealed garden-path effects at the disambiguating region in both wh-NP and wh-PP conditions, supporting the evidence that the parser predictively constructs the wh-clause in these conditions. The presence of garden-path effects also indicates that the parser attempts to construct the globally grammatical structure upon disambiguation, a finding against one possible reflex of Good-Enough language comprehension. Analysis also revealed that garden-path effects are present at the post-disambiguating region only in the wh-NP condition. This observation suggests two loci of garden-path effects in this condition (i.e., the wh-clause subject NP and verb).

The finding that the parser predictively constructs the clausal structure in both wh-NP and wh-PP conditions suggests that the predictive mechanism of sentence parsing is more powerful than assumed by Yoshida et al. (i.e., parallelism-driven prediction). What mechanism underlies the predictive parsing process?

Language comprehension theories often assume left-corner parsing (a class of top-down parsing algorithms; see Aho and Ullman 1972; Grune and Jacobs 2008; Johnson-Laird 1983) as the underlying mechanism of sentence parsing. However, this algorithm is not powerful enough to project the entire clausal structure at the wh-phrase (see Yoshida et al. 2013, pp. 290–291). As noted in the Introduction, prediction in both wh-NP and wh-PP conditions is explicable if we assume that the human parser constructs obligatory structures immediately and incrementally (recall that the clausal structure after the wh-phrase is necessary for the sentence’s well-formedness). The incremental licensing theory views online sentence processing as a process of immediate incremental satisfaction of grammatical constraints (e.g., Abney 1986; Aoshima et al. 2004; Crocker 1996; Frazier and Clifton 1996; Gibson 1991; Gibson et al. 1994; Gorrell 1995; Pritchett 1988, 1991, 1992; Weinberg 1999). In the wh-NP and wh-PP conditions, the appearance of the wh-phrase indicates the beginning of a TP, which must contain a subject NP, a VP and the wh-phrase’s base position. Therefore, the incremental licensing parser should construct the entire wh-clause upon encountering the wh-phrase, which explains the prediction process observed in the present study.

Another issue to discuss in relation to the predictive mechanism is why the parser posits coreference between the subject NPs of the first clause and the wh-clause. As noted in the Introduction, Yoshida et al. (2013) argue that the parser maximises parallelism between the two clauses, leading to joint reference in the wh-NP conditions. In the wh-PP conditions, parallelism does not hold. However, the two clauses still share some similarities in syntactic structure and lexical content. For example, the incremental licensing parser recognises that these clauses consist of similar, though not identical, syntactic structures and share a lexical item (e.g., “stories”) that follows each other’s VP (e.g., [_CP [_TP [_NP Janet’s grandfather] [_VP told [_NP some stories]]]] … [_CP [_TP [_NP] [_VP [_PP with which stories]]]]). Also, the wh-phrase (“which stories”) referentially relates to the NP (“some stories”) in the first clause (assuming footnote 1). These similarities may lead the parser to conceive of the two clauses as related and expect them to be as homogeneous as possible, resulting in joint reference between the two subject NPs ([_TP [_NPi] [_VP V1]] … [_CP [_PPt wh] [_TP [_NPi] [_VP V2 t]]], V1 ≠ V2). This hypothesis explains why the parser does not assume coreference with the possessive NP. Recall that the present study tested the experimental materials used in Yoshida et al. In these materials, the first clause subject NP had a possessive NP that always differed from it in gender (e.g., “Janet/Justin’s grandfather/grandmother”). The results suggested that this gender manipulation did not affect reading times at the reflexive in the wh-PP conditions. We can explain this observation by assuming that the parser expects the maximised similarity between the two clauses in the case of the prepositional wh-phrase.

Alternatively, the parser may favour joint reference because integrating a new referent into the current structure incurs processing costs. Some research in the literature has proposed analogous concepts (e.g., Altmann and Steedman 1988; Gibson 1998, 2000). Gibson (1998), for example, argues that intervening elements that introduce a new referent increase memory costs, resulting in processing difficulty. In the wh-NP and wh-PP conditions, the wh-clause subject NP intervenes between the landing site and the base position of the wh-phrase. Therefore, Gibson’s hypothesis predicts increased memory costs when the embedded subject NP introduces a new referent. Given that the parser often disfavours costly analyses during sentence processing (e.g., De Vincenzi 1991; Fodor and Inoue 2000; Frazier 1979), the postulation of joint reference between the subject NPs may be (partly) due to the avoidance of a new referent, especially in the wh-PP conditions.^{Footnote 4}

To summarise the discussion thus far, the parser predictively constructs the entire wh-clause upon encountering the wh-phrase to satisfy grammatical constraints immediately and incrementally. The parser then expects the conjoined strings to be maximally similar and/or attempts to avoid a new referent, leading to joint reference between the two subject NPs. Thus, the parser recovers the wh-clause’s entire content (the subject NP and VP) from the first clause in the wh-NP conditions, whereas, in the wh-PP conditions, it recycles only the subject NP. This hypothesis is compatible with what we observed at the post-disambiguating region (i.e., the wh-clause verb). Recall that in the wh-NP conditions only, the post-disambiguating region showed garden-path effects. This finding is explicable if we assume that the parser needs to revise the wh-clause subject NP and VP in the wh-NP conditions but only the wh-clause subject NP in the wh-PP conditions (see Figs. 3 and 4).
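The account summarised above can be expressed as a toy sketch. It is a schematic restatement of the hypothesis under simplifying assumptions (a clause is reduced to a subject and a verb, and all names are hypothetical); it is not a parsing model.

```python
from typing import Dict, List, Optional

Clause = Dict[str, Optional[str]]


def predict_wh_clause(first_clause: Clause, wh_type: str) -> Clause:
    """Content the parser is hypothesised to posit for the wh-clause at the wh-phrase."""
    # The subject is recycled in both conditions (joint reference).
    predicted: Clause = {"subject": first_clause["subject"], "verb": None}
    if wh_type == "wh-NP":
        # Full parallelism: the verb phrase is recovered as well.
        predicted["verb"] = first_clause["verb"]
    return predicted


def revision_sites(predicted: Clause, actual: Clause) -> List[str]:
    """Slots where predicted content conflicts with the input and must be revised."""
    return [
        slot
        for slot in ("subject", "verb")
        if predicted[slot] is not None and predicted[slot] != actual[slot]
    ]


first_clause = {"subject": "Janet's grandfather", "verb": "told"}
actual_wh_clause = {"subject": "the brother", "verb": "was impressed"}

wh_np_sites = revision_sites(predict_wh_clause(first_clause, "wh-NP"), actual_wh_clause)
wh_pp_sites = revision_sites(predict_wh_clause(first_clause, "wh-PP"), actual_wh_clause)
```

On these assumptions, the wh-NP conditions yield two revision sites (subject and verb) and the wh-PP conditions yield one (subject only), in line with garden-path effects appearing at the post-disambiguating region only in the wh-NP conditions.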

Lastly, I discuss the possibility that different observations between Yoshida et al. (2013) and the present study are due to differences in the tasks employed. As mentioned in the Introduction, Yoshida et al. measured reading times using self-paced reading, whereas the present study utilised a lexicality maze task. One unique feature of maze tasks that may have influenced the results is that they compel an incremental analysis of each word due to the forced choice between two candidates (e.g., Forster et al. 2009), which may prevent readers from strategically creating underspecified representations while they read. In other words, data from self-paced reading tasks may often reflect strategic underspecification rather than the intrinsic nature of the human parser when they indicate processing patterns incompatible with the principles of grammar (which can sometimes lead to the misapprehension that underspecification is a general property of sentence processing). Note that I do not intend to argue that maze tasks are superior to self-paced reading tasks. My point is that self-paced reading may be a more advantageous tool for investigating behaviouristic aspects of language comprehension (e.g., underspecification) as observed in a natural reading setting, whereas maze tasks may be a more appropriate choice for inquiry into the mechanism underlying sentence processing. Under this hypothesis, it is conceivable that different observations between Yoshida et al. and the present study, such as the presence or absence of garden-path effects, result from task-specific strategies.

In conclusion, the present study suggests/corroborates the following:

1. The human parser predictively constructs hierarchical syntactic structures during online sentence processing.
2. This predictive structure building results from the parser’s attempt to satisfy grammatical constraints at the earliest opportunity.
3. The parser attempts to maximise the similarity between the conjoined strings, which results in the recovery of the entire or partial content of the wh-clause from the first clause.
4. The parser may also posit joint reference between the subject NPs of the clauses to avoid processing costs incurred by integrating a new referent into the current structure.
5. When disambiguating input indicates that the predictively built structure is globally ungrammatical, the parser conducts revision to construct the globally grammatical structure.

Data availability

Data and analysis code are available at https://osf.io/rh4xz.

Notes

1. In the sentences in question, such as (1/2), I assume that the parser analyses the NP headed by a noun in the wh-phrase as coextensive with the one in the first clause.
2. Yoshida et al. (2013) based their study on a linguistic phenomenon called sluicing (e.g., Chung et al. 1995; Frazier and Clifton 1998; Merchant 2001; Ross 1969; Yoshida et al. 2014), which is a subclass of ellipsis that refers to the omission of linguistic expressions (e.g., Fiengo and May 1994; Lobeck 1995; Phillips and Parker 2014; Ross 1969; Sag 1976; Williams 1977). In (1), for example, the wh-clause’s content recovered from the first clause can be elliptical. The crux of the present study is whether the parser predictively constructs the clausal structure, recognises parallelism and reuses material from the left context. Whether the parser favours ellipsis is of little importance for the present study. Therefore, I do not discuss issues about ellipsis.
3. The cut-off thresholds used in the present study are longer than those adopted in many language comprehension studies. I analysed the data using different cut-offs (between 200 and 300 ms and between 5000 ms and 10000 ms), but all analyses showed similar results.
4. This hypothesis may also explain part of the online cataphora resolution process (e.g., Ackerman et al. 2015; Giskes and Kush 2021; Kazanina et al. 2007; Kush and Dillon 2021; van Gompel and Liversedge 2003). Consider the following sentence as an example of a cataphor.
(5) Before he left, John had lunch.
The sentence in (5) contains a pronoun. The parser can resolve this pronoun by either searching for its antecedent in the right context (cataphora) or taking an extrasentential antecedent. There is evidence that the parser prefers cataphora (e.g., van Gompel and Liversedge 2003), which may be attributable to its attempt to avoid a new referent.

References

Abney SP (1986) Licensing and parsing. North East Linguist Soc 17(1):1–15
Ackerman L, Kazanina N, Yoshida M (2015) Does the cataphoric dependency formation help the parser resolve local ambiguity? [Poster]. In: The 28th Annual CUNY Conference on Human Sentence Processing, University of Southern California
Aho AV, Ullman JD (1972) The theory of parsing, translation, and compiling. Prentice-Hall
Altmann G, Steedman M (1988) Interaction with context during human sentence processing. Cognition 30(3):191–238. https://doi.org/10.1016/0010-0277(88)90020-0
Aoshima S, Phillips C, Weinberg A (2004) Processing filler-gap dependencies in a head-final language. J Mem Lang 51(1):23–54. https://doi.org/10.1016/j.jml.2004.03.001
Aoshima S, Yoshida M, Phillips C (2009) Incremental processing of coreference and binding in Japanese. Syntax 12(2):93–134. https://doi.org/10.1111/j.1467-9612.2009.00123.x
Baayen RH (2008) Analyzing linguistic data: a practical introduction to statistics using R. Cambridge University Press. https://doi.org/10.1017/CBO9780511801686
Barr DJ, Levy R, Scheepers C, Tily HJ (2013) Random effects structure for confirmatory hypothesis testing: keep it maximal. J Mem Lang 68(3):255–278. https://doi.org/10.1016/j.jml.2012.11.001
Bates D, Mächler M, Bolker B, Walker S (2015) Fitting linear mixed-effects models using lme4. J Stat Softw 67(1):1–48. https://doi.org/10.18637/jss.v067.i01
Boyce V, Futrell R, Levy RP (2020) Maze made easy: better and easier measurement of incremental processing difficulty. J Mem Lang 111:104082. https://doi.org/10.1016/j.jml.2019.104082
Carlson K (2001) The effects of parallelism and prosody in the processing of gapping structures. Lang Speech 44(1):1–26. https://doi.org/10.1177/00238309010440010101
Chomsky N (1977) On wh-movement. In: Culicover PW, Wasow T, Akmajian A (eds) Formal syntax. Academic Press, pp 71–132
Chomsky N (1981) Lectures on government and binding: the Pisa lectures. Foris
Chomsky N (1986) Barriers. MIT Press
Christianson K, Hollingworth A, Halliwell JF, Ferreira F (2001) Thematic roles assigned along the garden path linger. Cogn Psychol 42(4):368–407. https://doi.org/10.1006/cogp.2001.0752
Chung S, Ladusaw WA, McCloskey J (1995) Sluicing and logical form. Nat Lang Seman 3(3):239–282. https://doi.org/10.1007/BF01248819
Clifton C (1993) Thematic roles in sentence parsing. Can J Exp Psychol 47(2):222–246. https://doi.org/10.1037/h0078817
Crocker MW (1996) Computational psycholinguistics: an interdisciplinary approach to the study of language. Springer
Cunnings I, Fujita H (2021) Quantifying individual differences in native and nonnative sentence processing. Appl Psycholinguist 42(3):579–599. https://doi.org/10.1017/S0142716420000648
De Vincenzi M (1991) Filler-gap dependencies in a null subject language: referential and nonreferential WHs. J Psycholinguist Res 20(3):197–213. https://doi.org/10.1007/BF01067215
Ferreira F, Patson ND (2007) The ‘good enough’ approach to language comprehension. Lang Linguist Compass 1(1–2):71–83. https://doi.org/10.1111/j.1749-818X.2007.00007.x
Fiengo R, May R (1994) Indices and identity. MIT Press
Fodor JD, Inoue A (2000) Garden path re-analysis: attach (anyway) and revision as last resort. In: de Vincenzi M, Lombardo V (eds) Cross-linguistic perspectives on language processing. Springer, pp 21–61. https://doi.org/10.1007/978-94-011-3949-6_2
Forster KI, Guerrera C, Elliot L (2009) The maze task: Measuring forced incremental sentence processing time. Behav Res Methods 41(1):163–171. https://doi.org/10.3758/BRM.41.1.163
Frazier L, Clifton C Jr (1996) Construal. MIT Press
Frazier L, Clifton C (1998) Comprehension of sluiced sentences. Lang Cognit Process 13(4):499–520. https://doi.org/10.1080/016909698386474
Frazier L, Rayner K (1982) Making and correcting errors during sentence comprehension: eye movements in the analysis of structurally ambiguous sentences. Cogn Psychol 14(2):178–210. https://doi.org/10.1016/0010-0285(82)90008-1
Frazier L, Taft L, Roeper T, Clifton C, Ehrlich K (1984) Parallel structure: a source of facilitation in sentence comprehension. Mem Cognit 12(5):421–430. https://doi.org/10.3758/BF03198303
Frazier L (1979) On comprehending sentences: Syntactic parsing strategies [PhD Thesis, University of Connecticut]. https://opencommons.uconn.edu/dissertations/AAI7914150/
Fujita H (2021b) On the parsing of garden-path sentences. Lang Cognit Neurosci 36(10):1234–1245. https://doi.org/10.1080/23273798.2021.1922727
Fujita H, Cunnings I (2020) Reanalysis and lingering misinterpretation of linguistic dependencies in native and non-native sentence comprehension. J Mem Lang 115:104154. https://doi.org/10.1016/j.jml.2020.104154
Fujita H, Cunnings I (2021a) Lingering misinterpretation in native and nonnative sentence processing: evidence from structural priming. Appl Psycholinguist 42(2):475–504. https://doi.org/10.1017/S0142716420000351
Fujita H, Cunnings I (2021b) Reanalysis processes in non-native sentence comprehension. Biling Lang Cognit 24(4):628–641. https://doi.org/10.1017/S1366728921000195
Fujita H, Cunnings I (2022) Interference and filler-gap dependency formation in native and non-native language comprehension. J Exp Psychol Learn Mem Cogn 48(5):702–716. https://doi.org/10.1037/xlm0001134
Fujita H, Cunnings I (in press) Interference in quantifier float and subject-verb agreement. Lang Cognit Neurosci
Fujita H (2021a) An R Package for Creating Experimental Files in IbexFarm. https://doi.org/10.17605/OSF.IO/7RVX6
Gibson E (1998) Linguistic complexity: locality of syntactic dependencies. Cognition 68(1):1–76. https://doi.org/10.1016/S0010-0277(98)00034-1
Gibson E (2000) The dependency locality theory: a distance-based theory of linguistic complexity. Image, language, brain: papers from the first mind articulation project symposium. The MIT Press, pp 94–126
Gibson E, Hickok G, Schütze CT (1994) Processing empty categories: a parallel approach. J Psycholinguist Res 23(5):381–405. https://doi.org/10.1007/BF02143946
Gibson E (1991) A computational theory of human linguistic processing: memory limitations and processing breakdown [PhD Thesis]. Carnegie Mellon University
Giskes A, Kush D (2021) Processing cataphors: Active antecedent search is persistent. Mem Cognit 49(7):1370–1386. https://doi.org/10.3758/s13421-021-01176-z
Gorrell P (1995) Syntax and parsing. Cambridge University Press. https://doi.org/10.1017/CBO9780511627682
Grune D, Jacobs CJH (2008) Parsing techniques. Springer
Hall K, Yoshida M (2021) Coreference and parallelism. Lang Cognit Neurosci 36(3):296–319. https://doi.org/10.1080/23273798.2020.1827154
Ito A, Corley M, Pickering MJ, Martin AE, Nieuwland MS (2016) Predicting form and meaning: evidence from brain potentials. J Mem Lang 86:157–171. https://doi.org/10.1016/j.jml.2015.10.007
Ito A, Gambi C, Pickering MJ, Fuellenbach K, Husband EM (2020) Prediction of phonological and gender information: an event-related potential study in Italian. Neuropsychologia 136:107291. https://doi.org/10.1016/j.neuropsychologia.2019.107291
Johnson-Laird PN (1983) Mental models. Cambridge University Press
Kamide Y, Mitchell DC (1999) Incremental pre-head attachment in Japanese parsing. Lang Cognit Process 14(5–6):631–662. https://doi.org/10.1080/016909699386211
Kazanina N, Lau EF, Lieberman M, Yoshida M, Phillips C (2007) The effect of syntactic constraints on the processing of backwards anaphora. J Mem Lang 56(3):384–409. https://doi.org/10.1016/j.jml.2006.09.003
Kim N, Carlson K, Dickey M, Yoshida M (2020) Processing gapping: parallelism and grammatical constraints. Q J Exp Psychol 73(5):781–798. https://doi.org/10.1177/1747021820903461
Kimball JP (1973) Seven principles of surface structure parsing in natural language. Cognition 2(1):15–47. https://doi.org/10.1016/0010-0277(72)90028-5
Knoeferle P, Crocker MW (2009) Constituent order and semantic parallelism in online comprehension: eye-tracking evidence from German. Q J Exp Psychol 62(12):2338–2371. https://doi.org/10.1080/17470210902790070
Kush D, Dillon B (2021) Principle B constrains the processing of cataphora: evidence for syntactic and discourse predictions. J Mem Lang 120:104254. https://doi.org/10.1016/j.jml.2021.104254
Kush D, Lidz J, Phillips C (2017) Looking forwards and backwards: the real-time processing of Strong and Weak Crossover. Glossa A J Gen Linguist 2(1):70. https://doi.org/10.5334/gjgl.280
Lau E, Stroud C, Plesch S, Phillips C (2006) The role of structural prediction in rapid syntactic analysis. Brain Lang 98(1):74–88. https://doi.org/10.1016/j.bandl.2006.02.003
Lobeck A (1995) Ellipsis: functional heads, licensing, and identification. Oxford University Press
Merchant J (2001) The syntax of silence: sluicing, islands, and the theory of ellipsis. Oxford University Press
Merchant J (2005) Fragments and ellipsis. Linguist Philos 27(6):661–738. https://doi.org/10.1007/s10988-005-7378-3
Omaki A, Lau EF, Davidson White I, Dakan ML, Apple A, Phillips C (2015) Hyper-active gap filling. Front Psychol. https://doi.org/10.3389/fpsyg.2015.00384
Phillips C (2006) The real-time status of island phenomena. Language 82(4):795–823. https://doi.org/10.1353/lan.2006.0217
Phillips C, Parker D (2014) The psycholinguistics of ellipsis. Lingua 151:78–95. https://doi.org/10.1016/j.lingua.2013.10.003
Pritchett BL (1988) Garden path phenomena and the grammatical basis of language processing. Language 64(3):539. https://doi.org/10.2307/414532
Pritchett BL (1991) Subjacency in a principle-based parser. In: Berwick RC, Abney SP, Tenny C (eds) Principle-based parsing: computation and psycholinguistics. Springer, pp 301–345. https://doi.org/10.1007/978-94-011-3474-3_12
Pritchett BL (1992) Grammatical competence and parsing performance. University of Chicago Press
R Core Team (2020) R: A language and environment for statistical computing. R Foundation for Statistical Computing. https://www.R-project.org/
Reinhart T (1976) The syntactic domain of anaphora [PhD Thesis, Massachusetts Institute of Technology]. http://hdl.handle.net/1721.1/16400
Ross JR (1969) Guess who? In: Binnick RI, Davison A, Green GM, Morgan JL (eds) Proceedings from the annual meeting of the Chicago Linguistic Society. Chicago Linguistic Society, pp 252–286
Sag IA (1976) Deletion and Logical Form [PhD Thesis]. Massachusetts Institute of Technology
Slattery TJ, Sturt P, Christianson K, Yoshida M, Ferreira F (2013) Lingering misinterpretations of garden path sentences arise from competing syntactic representations. J Mem Lang 69(2):104–120. https://doi.org/10.1016/j.jml.2013.04.001
Staub A, Clifton C (2006) Syntactic prediction in language comprehension: evidence from either or. J Exp Psychol Learn Mem Cognit 32(2):425–436. https://doi.org/10.1037/0278-7393.32.2.425
Stjepanović S (2008) P-Stranding under sluicing in a non-p-stranding language? Linguist Inq 39(1):179–190
Sturt P (2003) The time-course of the application of binding constraints in reference resolution. J Mem Lang 48(3):542–562. https://doi.org/10.1016/S0749-596X(02)00536-3
Sturt P, Pickering MJ, Crocker MW (1999) Structural change and reanalysis difficulty in language comprehension. J Mem Lang 40(1):136–150. https://doi.org/10.1006/jmla.1998.2606
Tabor W, Hutchins S (2004) Evidence for self-organized sentence processing: digging-in effects. J Exp Psychol Learn Mem Cogn 30(2):431–450. https://doi.org/10.1037/0278-7393.30.2.431
Truswell R (2014) Binding theory. In: Carnie A, Sato Y, Siddiqi D (eds) The Routledge handbook of syntax. Routledge, pp 214–238
van Gompel RPG, Liversedge SP (2003) The influence of morphological information on cataphoric pronoun assignment. J Exp Psychol Learn Mem Cogn 29(1):128–139. https://doi.org/10.1037/0278-7393.29.1.128
Weinberg A (1993) Parameters in the theory of sentence processing: minimal commitment theory goes east. J Psycholinguist Res 22(3):339–364. https://doi.org/10.1007/BF01068016
Weinberg A (1999) A minimalist theory of human sentence processing. In: Epstein SD, Hornstein N (eds) Working minimalism. The MIT Press, pp 282–315. https://doi.org/10.7551/mitpress/7305.003.0013
Williams ES (1977) Discourse and logical form. Linguist Inq 8(1):101–139
Witzel N, Witzel J, Forster K (2012) Comparisons of online reading paradigms: eye tracking, moving-window, and maze. J Psycholinguist Res 41(2):105–128. https://doi.org/10.1007/s10936-011-9179-x
Yoshida M, Dickey MW, Sturt P (2013) Predictive processing of syntactic structure: sluicing and ellipsis in real-time sentence processing. Lang Cognit Process 28(3):272–302. https://doi.org/10.1080/01690965.2011.622905
Yoshida M, Nakao C, Ortega-Santos I (2014) The syntax of ellipsis and related phenomena. In: Carnie A, Sato Y, Siddiqi D (eds) The Routledge handbook of syntax. Routledge, p 192
Zehr J, Schwarz F (2018) PennController for Internet Based Experiments (IBEX). https://doi.org/10.17605/OSF.IO/MD832

Funding

Open Access funding enabled and organised by Projekt DEAL. This work was supported by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) – Project number 500540359. PI: Hiroki Fujita.

Author information

Authors and Affiliations

Department of Linguistics, University of Potsdam, Karl-Liebknecht-Straße 24–25, 14476, Potsdam, Germany
Hiroki Fujita

Corresponding author

Correspondence to Hiroki Fujita.

Ethics declarations

Conflict of interest

The author declares no competing interests.

Ethical approval

This study was performed in accordance with the ethical standards as laid down in the 1964 Declaration of Helsinki.

Consent to participate

Informed consent was obtained from all participants in the study.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Editors: Pia Knoeferle (Humboldt University Berlin)/Aine Ito (National University of Singapore); Reviewers: two researchers who prefer to remain anonymous.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Fujita, H. Predictive structure building in language comprehension: a large sample study on incremental licensing and parallelism. Cogn Process 24, 301–311 (2023). https://doi.org/10.1007/s10339-023-01130-8

Received: 22 September 2022
Accepted: 15 February 2023
Published: 16 March 2023
Issue Date: May 2023
DOI: https://doi.org/10.1007/s10339-023-01130-8
