Counterfactual thinking requires thinking about false possibilities, that is, what could have happened in a different situation. Imagine a situation in which a friend, Eva, studied psychology but, what if Eva had chosen to study art instead of psychology?

Counterfactuals such as

  1. (1)

    “If Eva had studied art, she would have worked at the Louvre Museum”

are conditional expressions that establish a hypothetical relation between cause and effect. Many studies have shown that, as the mental model theory proposes (Johnson-Laird & Byrne, 1991), understanding a counterfactual makes us think about two situations. One of them is the situation that corresponds to the counterfactual conjecture (“Eva studied art and worked at the Louvre Museum”; (A, B)) and the other, corresponding to the negation of both terms, refers to the real or presupposed fact (“Eva did not study art and did not work at the Louvre Museum”; (Not-A, Not-B)) (Byrne, 2016; Byrne, 2017).

[Conjectured] Eva studied art (A) Worked at the L.M. (B)

[Presupposed-fact] Eva did not study art (¬A) Did not work at the L.M. (¬B)

However, the mental model theory also establishes that we do not just have to consider both possibilities, but also need to keep track of their epistemic status by codifying it as labels (see [conjectured] and [presupposed-fact] labels above) (Johnson-Laird & Byrne, 2002). This means recognising which one of them is the real possibility and which one the conjectured possibility. Keeping track of the epistemic status is a difficult aspect of thinking counterfactually, as demonstrated by studies with children (Gómez-Sánchez et al., 2020) as well as with adults (Ruiz-Ballesteros & Moreno-Ríos, 2017).

Counterfactuals such as (1) require negating the antecedent in order to create the counterfactual possibility: “Eva did not study art” and, as far as we are aware, it is not known how people construct it. Is it possible that people would consider “Eva studied psychology” instead of “Eva did not study art”. In both cases they are counterfactual possibilities (presupposed facts), one with an explicit negation and the other with an alternative.

Thus, the main aim of this paper is to establish, how during counterfactual thinking people construct that negation of the conjectured possibility to create the presupposed fact. Do they construct the negation of the antecedent using propositional symbols (“she did not study art”) or do they represent an alternative fact (e.g. “she studied psychology”)? Do children construct it in the same way as adults? Does it make a difference having a concrete (“she studied art”) or an abstract (“she studied a career”) situation? These questions arise from results of previous studies showing developmental differences in counterfactual reasoning, construction of the negation and abstract thinking. In this paper we try to disentangle these questions focusing on the role of alternatives and negation.

First, we briefly summarize relevant findings on the development of counterfactual reasoning and how people understand counterfactuals. Following this summary, we review the effect of negation found in some recent studies with inferences (Byrne, 2017; Espino & Byrne, 2018; Moreno-Ríos & Byrne, 2018), and how reasoning with negation changes during development (Markovits, 2013; Markovits & Lortie-Forgues, 2011). We then outline the task that allows us to test our predictions about the effect of each kind of situation (concrete and abstract) on creating counterfactual possibilities. Furthermore, we examine the ability to reason with counterfactuals in adults and children, as well as their ability to distinguish real and conjectured possibilities. We report the results of two experiments, one with adults (Study 1) and another with children (Study 2). Finally, we discuss the findings in relation to previous literature.

The Development of Counterfactual Thinking

Previous results may question that children construct the counterfactual possibility (the negation of the suppositional possibility) in the same way as adults (e.g. Rafetseder et al., 2013). There is no agreement on when counterfactual thinking is fully developed in children, with some research finding this ability in school children (Gómez-Sánchez et al., 2020; McCormack et al., 2018; Nyhout et al., 2019) and others suggesting that even pre-schoolers are able to reason counterfactually in the same way as adults do (Guajardo et al., 2009; Nyhout & Ganea, 2019; Roldán-Tapia et al., 2017). However, there is no clear explanation about the differences found between studies that find evidence for counterfactual thinking in early childhood and those that report it in later childhood. A number of factors could be responsible for these differences, such as demands on executive functions (Beck et al., 2009; Beck & Riggs, 2014), the structure, clarity and difficulty of the task or whether the task implies physical rather than agents as causes (McCormack et al., 2018; Nyhout et al., 2019; Nyhout & Ganea, 2019). Differences found in counterfactual thinking abilities could also be related to differences in the conceptualisation of counterfactual thinking, such as how broad or narrow the author’s view of counterfactuals is (e.g. whether it includes hypothetical future, timeless conditionals or just alternatives to past events) (Beck & Riggs, 2014; Buchsbaum et al., 2012; Rafetseder et al., 2010; Weisberg & Gopnik, 2013) or a misunderstanding in what the task is measuring (Rafetseder et al., 2010, 2013). Another factor that has been found to cause children’s difficulty in reasoning with counterfactuals could relate to the ability to codify correctly the epistemic status, that is keeping track of which possibility is real and which one is conjectured (Gómez-Sánchez et al., 2020). Due to the high working memory load, they could lose the mental footnotes about which situation is the real one and which the conjectured one. Consequently, they could still access the two models but not their mental footnotes. Although to a lesser degree, in some demanding tasks even adults lose track of the epistemic status in counterfactual reasoning (Ruiz-Ballesteros & Moreno-Ríos, 2017).

The development of counterfactual thinking could be also related to the consideration of alternatives (Moreno-Rios & García-Madruga, 2002; Rasga et al., 2016). The ability to consider alternatives as well as working memory capacity increase with age (Barrouillet et al., 2009; Santamaría et al., 2013). Counterfactual reasoning requires the consideration of false alternatives, having to temporarily inhibit our knowledge and imagine an alternative situation was true (Byrne, 2016; Rafetseder et al., 2010). Therefore, due to developmental issues we expect a developmental trend in children’s ability to think counterfactually, and particularly, in their ability to keep track of epistemic status. As we explained before, the mental footnotes are easily forgotten and these are essential for answering the epistemic question correctly (i.e., what is real and what conjectured). Nevertheless, when the mental footnotes are forgotten, the mental model remains, enabling a correct answer to the inferential question.

As we have seen, to create the counterfactual possibility from (1) it is necessary to negate the antecedent (“she did not study art”). Hence, the construction of negation in counterfactuals could be another key element in the explanation of the difficulty children have with counterfactuals.

Reasoning with Negation

Developmental differences need to be understood within the context of the challenges that arise for all people when reasoning with negation. People usually represent a sentence such as “there is not a circle” as “not-circle”, that is, the representation of negation may include symbolic annotations to capture negation such as “not” (e.g., Byrne & Johnson-Laird, 2009; Johnson-Laird & Byrne, 2002; Khemlani et al., 2012, 2014; Moreno-Ríos & Byrne, 2018). However, in a situation in which there is an alternative such as a triangle, the negation of “there is not a circle” leads people to think about the “triangle”. That is, when there is a potential alternative, it is easier to represent the negation by recovering that alternative. This happens in binary contexts such as “there is not a light figure”. In these cases it is easier to represent the negation by thinking of the alternative (“dark figure”) than by thinking of the negation in an abstract way (“not light figure”) as it only has one potential alternative: its antonym (Espino & Byrne, 2018; Mayo et al., 2004).

Similarly, Espino and Byrne (2018) found an inferential effect in binary contexts that they called “inference-to-alternates”: a tendency to draw affirmative conclusions that refer to an alternate even from a negative minor premise. In our study, we predict a similar effect but related to the recovery of the cause responsible for the effect (see Study 1). In more complex situations, where the negation refers to more than one possibility (e.g. circle, triangle and square) people can use the symbolic annotations to economise the process (Espino & Byrne, 2018; Orenes et al., 2014).

This brings us on to the developmental differences in the ability to use negations. Previous studies have shown that schoolchildren use concrete cases for negating the antecedent and that the ability to create more abstract negations develops in adolescence and adulthood (Markovits, 2013; Markovits & Lortie-Forgues, 2011). Thus, schoolchildren would find difficult to create abstract negation (not-circles), thinking instead about concrete categories (triangles). However, adolescents and adults can construct the negation of the antecedent in an abstract way, thinking in not-A abstract cases (not-circles) (Markovits & Lortie-Forgues, 2011). In this paper we test the effect of such concreteness in the construction of counterfactual possibilities from the negation of the supposition.

Frequently, negation is not explicit but included in the context. That is what happens in the case of counterexamples.

Counterexamples and Negation

A conditional such as “If Daniella makes a noise, Charles is awake” (If A, then B) expresses a relation between a cause (A; making a noise) and an effect (B; being awake). However, this causal relation can be different if there are counterexamples for it. There are two kinds of counterexamples (see, Cummins, 1995; De Neys & Everaerts, 2008):

  1. a)

    Alternatives (Not-A, B) which are causes, different from the original one (e.g., an alarm clock ringing), that are capable of producing the same effect cited in the relation (Charles is awake). That is, Charles may be awake (B) even if Daniella does not make a noise (not-A).

  2. b)

    Disablers (A, Not-B) which prevent the effect from occurring (Charles being awake) despite the presence of the cause (Daniella makes a noise), because something breaks the causal relation (e.g., Charles wears earplugs). That is, Daniella making a noise (A) does not cause Charles to be awake (Not-B).

As we have seen, counterfactual conditionals such as “If Daniella had made a noise, Charles would have been awake” tell us that something different to what is said (“Daniella made noise and Charles was awake”; AB) actually happened (“Daniella did not make noise and Charles was not awake”; Not-A Not-B).

The mental model theory suggests that these possibilities are represented as a mental model of reality: an iconic representation of a possibility as the image of a girl making a noise and a boy awakes. As we have mentioned previously, the mental model theory also establishes that there is a codification of their epistemic status as mental footnotes. However, people tend to easily forget these mental footnotes which causes frequent errors in deduction (see Johnson-Laird, 2006; Johnson-Laird & Byrne, 2002). These mental models can also contain abstract features such as negations (the girl not making a noise or “it is false that the girl made a noise”), and obligation or belief (Bucciarelli & Johnson-Laird, 2005; Vargas et al., 2011). Thinking counterfactually requires negating the antecedent to create the counterfactual possibility (presupposed fact: “Daniella did not make a noise”), but how do people think about negation in counterfactual thinking? the main purpose of this paper is to shed light on this issue.

As we have seen in the previous section, negation shows differences depending on different aspects such as the age of the child, the kind of content (e.g. concrete or abstract) or the nature of the counterexamples. Therefore, in a counterfactual conditional, if the counterexample is an alternative then the negation will correspond to an element from the story (the alternative). However, if the counterexample is a disabler then the negation will not correspond to any element from the story. Consider the next two stories to explain how negation induced by a counterfactual differs with both types of counterexample:

Alternative story:

  1. (2)

    A child was awake because his alarm clock had just sounded. His sister went into his bedroom to take a toy. Later, the police officer said: “If his sister had entered silently, the child would have been awake”

Disabler story:

  1. (3)

    A girl was on the beach playing with a bucket. Her bucket was broken and had a hole in the bottom. Later, the police officer said: “If the girl had poured water into her bucket, the bucket would have been empty”.

In alternatives stories such as (2), the negation of the antecedent “if his sister had entered silently” can be made either by a) constructing a syntactic negation using propositional symbols such as “not” (“the sister did not enter silently”) or b) discarding the mentioned action and taking the alternative action explicitly mentioned in the story (“the alarm clock sounded”). However, in disabler stories such as (3) the negation of the antecedent (“the girl did not pour water into her bucket”) has no alternative explicit negation to recover from the story. Hence, for disablers the negation has to be made in a more abstract way, being forced to use the symbolic annotations (“she did not pour water”).

Nevertheless, even with the presence of an explicit negation in alternative stories, it is possible to have a concrete (e.g., an alarm clock) or an abstract alternative (e.g., having trouble sleeping) (see the materials section for more information). We compare these two types of alternatives in order to test our main aim: whether children, but not adults, recover as cause of the effect (why the child is awake) a concrete alternative (the alarm clock) more than an abstract one (trouble sleeping; Markovits & Lortie-Forgues, 2011). However, since there is no explicit alternative that can be used to negate the antecedent in disabler stories, these stories will be used as a control. Consequently, introducing a concrete (e.g. pick white stones up) or an abstract action (e.g. be occupied) in disabler stories should not have any effect since those actions do not correspond to the negation of the antecedent. That is, in disabler stories the concreteness of the cause remains constant (the hole in the bucket) unlike alternative stories.

One of the most common difficulties in comparing cognitive abilities between children and adults is that results of the studies are usually obtained with different tasks, and it is not clear to what extent differences found can be due to the tasks (Royzman et al., 2003). Therefore, our experiments used the same task to examine the ability to reason counterfactually and to keep track of the epistemic status. This task also allows us to test our main aim: how people construct the counterfactual possibility. More specifically, whether people construct the negation of the antecedent in counterfactuals either using an alternative (e.g. “the alarm clock sounded”) or using the abstract negation by itself with symbolic annotations (e.g. “the sister did not enter silently”). Moreover, we also test whether the concreteness of the situation (concrete vs. abstract) has an effect depending on the kind of counterexample (alternative vs. disabler). In general, we predict that adults, as well as children, will negate the antecedent by recovering the alternative or the disabler instead of using symbolic annotations (“not”). Moreover, we predict an effect of concreteness for children with alternatives resulting in the recovery of the concrete alternative (“alarm clock”) more than the abstract one (“trouble sleeping”) to create the counterfactual situation (“his sister did not enter silently”).

Study 1 - Adults

We test how adults construct counterfactual possibilities depending on the kind of counterexample (alternative vs. disabler) and the concreteness (concrete vs. abstract) of the counterexample. In Study 2 we examine whether there are differences between adults’ and children’s construction of counterfactual possibilities. Espino and Byrne (2018) showed that people tend to make affirmative inferences from negative premises in binary context (e.g., to act vs. not to act). The effect was called “inference-to-alternatives”. We predict a similar effect with the negation of causal antecedents in counterfactual conditionals. The comprehension of counterfactuals requires a person to consider the negation of the antecedents. If people represent negation in stories by thinking about an alternative possibility instead of negating the antecedent, then we expect participants asked about the cause to explain the consequence, to create the counterfactual possibility by recovering the alternative possibility. This means that participants will recover the alternative possibility to explain the consequence (from here “predicted cause”; e.g. the alternative or the disabler provided in the story) instead of the syntactic negation of the antecedent using propositional symbols (“not”) for all counterexamples types.

In the study we present alternative and disabler stories about causal events that lead to a consequence. Participants have to identify the causal event for that consequence. Alternatives stories, unlike disabler stories, provide an alternative situation that corresponds to the syntactic negation of the antecedent which causes the same consequence. However, disablers do not provide an explicit situation different from “the girl did not pour water” that corresponds to it (see Figures 1a and 1b). As concreteness only impacts on alternatives but not on disablers, recovering a concrete negation should be easier than recovering an abstract negation in alternative stories but not in control disablers stories. Nevertheless, due to adults’ ability in thinking with abstract negation we do not predict significant differences in contrast to our expectations in relation to children.

Fig. 1
figure 1figure 1

a Causes in the alternative stories that can lead to the response, taking as example the alternative story about the child who is awake: “If his sister had entered silently, the child would have been…”b Causes in the control disabler stories that can lead to the response, taking as example the disabler story about the empty bucket “If the girl had poured water into her bucket, the bucket would have been …”.

Therefore, we summarize our prediction as follow:

  1. 1-

    If the concreteness has an impact on how people think counterfactually and also has a relation with the kind of counterexamples (alternatives and disablers), then we would expect more responses referring to the predicted cause with concrete situations than with abstract situations in alternative stories.

  2. 2-

    Nevertheless, we do not expect to find differences in disabler stories (that act as a control for the alternative stories), because the manipulation of the concreteness does not correspond to an alternative cause (see the materials section and Figures 1a and 1b).

  3. 3-

    Furthermore, we test adults’ ability to reason with counterfactual conditionals as well as their ability to keep track of the epistemic status. Based on previous results, and the mental model theory, we predict that adults will show a good ability to make inferences from counterfactual conditionals, as well as a good ability to distinguish the epistemic status.

Method

Participants

The sample consisted of fifty-five adults aged between 18 and 27 years (Mage= 21.25 years; SD= 2.29). Fifty-two were women and three were men. All participants were native Spanish speakers and were recruited in colleges or universities in Granada.

The sample size was computed based on the epistemic responses of adults from Gómez-Sánchez et al. (2020) previous results. Using G*power, to obtain the same effect (partial eta squared .04, with a power .90) we needed at least, 44 participants. We added one fourth more (11), predicting the loss of participants in one of the analyses that requires complete coherence in their responses to compare frequencies.

Participants read a consent form complying with the University Research Ethics Committee guidelines. The procedure and the task for this study, as well as for Study 2 with children, were also approved by the same committee.

Materials

Nine stories were created (one of them as a practice trial), based on the materials in Gómez-Sánchez et al. (2020)Footnote 1 and Rafetseder et al. (2013) to test inferential accuracy, epistemic status and identification of the cause of the consequent in counterfactual reasoning. The stories were adapted to test the hypotheses in this study.

Two kinds of story were used depending on the kind of counterexample (alternative or disabler). In the disabler stories such as (3), the presence of the action mentioned in the antecedent (pouring water into a bucket) does not produce any consequence (the bucket stays empty). To infer that the correct answer is empty people have to remember that the bucket had a hole. In the second kind of story, alternative stories such as (2), the action mentioned in the consequent happens (the child is awake), even though the antecedent does not happen (the sister does not make noise). This happens because a different antecedent (e.g. the alarm clock) causes the same effect (the child is awake).

In addition, we manipulated the concreteness of the action (concrete vs. abstract) in order to test whether providing a concrete action improves accuracy in the alternative condition, without showing differences in the control disabler condition. The difference between both is shown in the example below.

The participants were presented with a questionnaire which started with a short introduction, asking participants to take the role of an investigator, using the information provided by a police officer, and having to infer what happened in each story. After completing a practice trial where one story is presented, participants were presented with the 8 experimental stories (4 alternatives and 4 disablers). Each one consisted of three tasks (see an example below):

  1. 1.

    The ‘inferential accuracy’ task, tests whether when participants are given an antecedent, they conclude the consequent determined by the story should be accepted (e.g. The child was awake; the bucket was empty).

  2. 2.

    The ‘epistemic status’ task, tests whether they can differentiate real and conjectured situations.

  3. 3.

    The ‘causal question’ task, tests which cause participants tend to report as being responsible for the outcome mentioned in the story (consequent).

The following is an example of an alternative story (translated from the original Spanish). The manipulation of the concreteness can be seen in bold for the concrete situation and in brackets for the (abstract situation).

The police officer saw through a window of the room that a child was awake because his alarm clock had just sounded (he had trouble sleeping). His sister went into his bedroom to take a toy. Later, the police officer said: ‘If his sister had entered silently…’

  1. 1.

    Inferential Question: Would the child have been … awake(correct) or asleep?

  2. 2.

    Epistemic Status Question: Remember, the police officer said: ‘If his sister had entered silently …’ According to this evidence, did the police officer see that his sister went in silently? Yes / No(correct)

  3. 3.

    Causal question: The police officer believes the child was awake because…

The following is an example of a disabler story (translated from the original Spanish). The manipulation of the concreteness can again be seen in bold for the concrete situation and in brackets for the (abstract situation). In this kind of story, as the concreteness is not related to the information needed to answer the questions and, consequently, the concreteness of the cause remains constant, we do not expect differences.

The police officer saw that a girl was on the beach playing with a bucket. Her bucket was broken and had a hole in the bottom. The girl was picking white stones up (occupied). Later, the police officer said: ‘If the girl had poured water into her bucket…’

  1. 1.

    Inferential Question: Would the bucket have been…full or empty(correct)?

  2. 2.

    Epistemic Status question: Remember, the police officer said: ‘If the girl had poured water into her bucket…’ According to this evidence, did the police officer see the girl pour water into the bucket? Yes / No(correct)

  3. 3.

    Causal question: The police officer believes the bucket was empty because…

Procedure and Design

The participants were tested in small groups (4 people) in a quiet room in a session that lasted 15 minutes. The experimenter read the stories and the conditional statement out loud and asked the participants the inferential accuracy question, the epistemic status question and, finally, the causal question, in that order. Participants had to mark their response on an answer sheet. In the first (inferential) question, they had to choose between the alternatives proposed (e.g. awake or asleep), in the second (epistemic) question they had to think about what actually happened, answering yes or no, and in the third one (causal question) they had to write down what caused the result (e.g. why was the child awake).

Responses to the causal question were coded according to which cause was reported as responsible for the consequent. Hence, responses were classified as syntactic negation of the antecedent (she did not enter silently), predicted cause (the alternative or the disabler), or other causes not presented in the story, such as inventions.

We employed an experimental design with Counterexample (alternative vs. disabler) and Concreteness (concrete vs. abstract) as within-participants variables in all the questions. Concreteness in stories was counterbalanced, this way there were two versions of each alternative/disabler story: one concrete and one abstract. The presentation order of the eight stories was also randomised. As a result, four different workbooks with the mentioned factors were constructed, with every participant receiving one. The dependent variables were inferential accuracy, epistemic status accuracy and frequency of the predicted cause.

Results

Inferential Response

As can be seen in Table 1, adults showed a correct counterfactual understanding, with a mean of 92% correct responses. Moreover, there was no difference between alternatives and disablers (91% vs. 93%) nor between concrete and abstract negation (both 92%).

Table 1 Percentages of correct responses in Inferential response and Epistemic status, means and standard deviations in brackets for Counterexample (alternative, disabler) and Concreteness (concrete, abstract) in Study 1

We carried out a Wilcoxon rank-test in Counterexample (alternative vs. disabler) and Concreteness (concrete vs. abstract) as independent variables and Inferential accuracy as dependent variable. The results showed no differences depending on the kind of counterexample (Z = .52, p = .60) nor on the concreteness (Z = .22, p = .826).

Epistemic Status

As can be seen in Table 1, the epistemic status difficulty (77% correct responses) contrasted with the inferential accuracy previously referred to (92% correct responses). A second Wilcoxon rank-test analysis was carried out to examine performance on the epistemic status question. As in the inferential response, there was no difference between alternatives and disablers (76% vs. 78%, Z = .41, p = .685) nor between concrete and abstract negation (80% vs. 74%, Z = 1.39, p = .165).

Causal Question

We categorised the responses to the causal questions as referring to the syntactic negation of the antecedent (e.g. the girl did not pour water; the sister did not enter silently, etc.), referring to the actual cause referred to in the story (e.g. the bucket had a hole, the alarm clock sounded, he had trouble sleeping, etc.), and finally other different causes not present in the story, such as inventions. We excluded participants who responded inconsistently across the two scenarios they were given for each condition. The number of remaining participants in the alternative concrete condition was 30 (25 selected the predicted cause and 5 the syntactic negation of the antecedent), 35 in the alternative abstract condition (27 predicted cause, 7 negation of the antecedent and 1 other causes), 40 in the disabler concrete condition (34 predicted cause and 6 negation of the antecedent) and finally 37 in the disabler abstract condition (31 predicted cause and 6 negation of the antecedent).

As can be seen in Table 2, the predicted cause was the most designated cause (70%) regardless of its concreteness in both types of counterexamples.

Table 2 Percentages of causes of the consequent reported (negation of the antecedent, predicted cause and others), means and standard deviations in brackets for Counterexample (alternative, disabler) and Concreteness (concrete, abstract) in Study 1

Four chi-square tests were performed to examine whether adults provided as cause of the consequent (“why the boy was awake”; “why the bucket was empty”) either negations of the antecedent (“the sister did not enter silently”; “the girl did not pour water”) or predicted causes (“the bucket had a hole”; “the alarm clock sounded”, etc.). The chi-square tests showed adults more frequently referred to the predicted cause as the cause of the consequent than the syntactic negation of the antecedent: alternative stories with a concrete situation (X2 (1, 30) = 13.33, p <.001), alternative stories with an abstract situation (X2 (2, 35) = 31.77, p <.001), disabler stories with a concrete situation (X2 (1, 40) = 19.60, p <.001), and disabler stories with an abstract situation (X2 (1, 37) = 16.89, p <.001).

We also carried out a third Wilcoxon rank-test with Counterexample (alternative vs. disabler) and Concreteness (concrete vs. abstract), with the frequency of the predicted cause (e.g. the bucket had a hole; alarm clock; trouble sleeping) as dependent variable. The results showed an effect of Counterexample with participants identifying more predicted causes in disabler than in alternative stories (73% vs. 66%; Z = 2.03, p = .042, r = .22). However, we did not find effect of Concreteness (Z = .144, p = .885).

Discussion

The results show that adults have good counterfactual thinking abilities (92% accuracy). As was expected, their ability to answer questions about epistemic status was not as good (75% accuracy). Consistent with previous findings, it seems that people can lose track of the footnotes (e.g. Gómez-Sánchez et al., 2020; Ruiz-Ballesteros & Moreno-Ríos, 2017). Accuracy decreases because these mental footnotes are required to distinguish real and conjectured possibilities. However, when people make an inference given the antecedent, they accept the consequent without problem, because they do not need to recover their footnotes (which situation is the real one and which the conjectured one).

The most novel result was the one that concerns how adults construct the negation of the antecedent in order to think with counterfactuals. As far as we know it is something that has not been studied before. Results support our hypothesis in adults, in that they more frequently report the predicted cause (the alternative or the disabler) as the cause of the consequent than the syntactic negation of the antecedent (using “not”). The effect is found with both kinds of counterexamples. It means that adults construct the negation of the antecedent by thinking of the alternative and affirmative situation (e.g. alarm clock, hole in the bucket) instead of by recovering the syntactic negation of the antecedent (she did not enter silently; she did not pour water).

Study 2 – Children

In the second study we tested children with the same task as adults. The aim of this study was to establish a developmental view of the findings with adults in Study 1. Hence, we evaluated the development of counterfactual thinking, as well as of the ability to keep track of epistemic status. Moreover, we tested how children construct the counterfactual possibility, as we did with adults. We therefore expect that children will recover the predicted cause (the alternative or the disabler) as the cause of the consequent more than the syntactic negation of the antecedent using propositional symbols (“not”).

Considering the increase with age in the ability to create alternatives (Barrouillet et al., 2009; Santamaría et al., 2013), as well as in working memory efficiency (Beck et al., 2009; Drayton et al., 2011; Ferguson & Cane, 2015; Gathercole et al., 2004), we predict a developmental trend in the ability to think counterfactually, as well as in the ability to distinguish the epistemic status (real and conjectured situation). The mental model theory maintains that one source of error when inferring is the easy loss of labels in the mental models, particularly in load conditions of working memory (Johnson-Laird, 1983; Johnson-Laird & Byrne, 2002). The limitation in younger children’s working memory span could led them to use an economical representation of counterfactuals by omitting the epistemic labels. Therefore, we predict more errors detecting the epistemic status than thinking counterfactually because children lose the mental footnotes about which situation is the real one and which the conjectured one.

Furthermore, if children represent negation by thinking about another possibility instead of negating the antecedent as adults do (Espino & Byrne, 2018; Study 1), and they are not able to think about negation in an abstract way, as some studies propose, then we predict the same effect we found with adults: to recover as the cause of the consequent to a lesser extent the syntactic negation of the antecedent compared to the predicted cause.

Finally, by manipulating concreteness (concrete vs. abstract situation) we predict that children will report the abstract situation as the cause of the consequent to a lesser extent than the concrete situation in alternatives stories but not in control disabler stories. This prediction is based on the findings about the construction of negation using concrete cases in schoolchildren and the development of the ability to create abstract negation in adolescence and adulthood (Markovits, 2013; Markovits & Lortie-Forgues, 2011). When we ask about the cause of an event in alternative stories (e.g. the boy was awake because…), children should find it easier to recover a concrete and imaginable situation or cause (the alarm clock) rather than an abstract one (trouble sleeping; see Figure 1a) to explain why the boy was awake, due to their poor ability to think about abstract negation. However, we do not predict any differences with disablers because, as we said before, they act as control.

Method

Participants

One hundred and forty-three children aged between 8 and 12 years (Mage= 10.33; SD 1.15) from three schools in Granada participated in this study. There were 53 girls with a mean age of 10.44 years (SD= 1.19) and 90 boys with a mean age of 10.26 years (SD= 1.13). They were organised into their two year groups: 2nd (73 children; Mage= 9.35; Age range: 8.00-10.22) and 3rd (70 children; Mage= 11.35; Age range: 10.34-12.95). All participants spoke Spanish as their first language.

The sample size was computed using G*power based on the responses of children from Gómez-Sánchez et al. (2020) previous results. We used the same strategy as in Study 1 but looking at differences in the interaction Conditional (even if, if) with Age-group in the epistemic measure found in Gómez-Sánchez et al. (2020). In this case the minimum sample size required (partial eta squared .02 to get a power of .90) was 108 participants. For the same reason than in Experiment 1, and predicting that children could be less coherent than adults in their responses, in order to make the test of frequencies only in participants with complete coherent responses we increased the sample in a third (35).

They participated in the study only if their parents gave written consent, complying with the ethical protocol from the University Ethics Committee for this study.

Materials

We used the same counterfactual reasoning task employed in Study 1(adults).

Procedure and Design

We manipulated the same factors within-participants as in Study 1 with adults: Counterexample (disabler vs. alternative), Concreteness (concrete vs. abstract) and Cause (negation vs. predicted cause). We added a fourth between-participants variable: Year group (2 vs. 3).

The task was carried following the same procedure as in Study 1.

Results

Inferential Response

We carried out a Wilcoxon test with counterexample (alternative vs. disabler) and concreteness (concrete vs. abstract), and a Mann Whitney U test with year group (2nd vs. 3rd) with inferential accuracy as dependent variable. Consistent with our hypothesis, the results showed an effect of year group (78% vs. 87%; Z = 2.77, p = .006, r = .23), revealing a developmental trend in the ability to reason with counterfactual conditionals. They also showed an effect of counterexamples with more correct responses in disabler than in alternative stories (85% vs. 79%; Z = 2.87, p = .004, r = .17). We did not find an effect in concreteness (Z = .63, p = .526) (Table 3).

Table 3 Percentages of correct responses in Inferential response and Epistemic status, means and standard deviations in brackets by year group, counterexample (alternative, disabler) and concreteness (concrete, abstract) in Study 2
Table 4 Percentages of causes of the consequent reported (negation of the antecedent, predicted cause and others), means and standard deviations in brackets by counterexample (alternative, disabler), concreteness (concrete, abstract) and year group (2nd vs 3rd) in Study 2

Epistemic Status

For the epistemic status question, we carried out a Wilcoxon rank-test for counterexample (alternative vs. disabler) and concreteness (concrete vs. abstract), as well as a Mann Whitney U test in order to compare year group (2nd vs 3rd), using Epistemic status as dependent variable.

The results showed an effect of year group (60% vs. 70%; Z = 2.52, p = .012, r = .21), showing a developmental trend in the ability to distinguish real and conjectured situations. However, we did not find any other effects (counterexample: Z = 1.16, p = .248; concreteness: Z = 1.44, p = .151).

Causal Question

We categorised responses and participants as in Study 1. The number of remaining participants in the alternative concrete condition was 90 (81 in the predicted cause, 6 in the syntactic negation of the antecedent and 3 in the other causes), 56 in the alternative abstract condition (47 predicted cause, 7 negation of the antecedent and 2 other causes), 99 in the disabler concrete condition (92 predicted cause, 4 negation of the antecedent and 3 other causes) and finally 105 in the disabler abstract condition (98 predicted cause, 5 negation of the antecedent and 2 other causes). The total number of excluded participants was similar in both year groups: in the case of alternatives, 70 from the 2nd year group and 69 for the 3rd year group; in the case of disablers 44 in the 2nd year group and 37 in the 3rd year group.

A chi-square test was carried out to test whether children provided as cause of the consequent (why the boy was awake; why the bucket was empty) either more negations of the antecedent (“the sister did not enter silently”; “the girl did not pour water”) or predicted causes (“the bucket had a hole”; “the alarm clock sounded”; “the child had trouble sleeping”, etc.). The chi-square test showed children more frequently suggested the predicted cause as the cause of the consequent than the negation of the antecedent in all cases: alternative stories with a concrete situation (X2 (3, 90) = 120.36, p <.001), alternative stories with an abstract situation (X2 (3, 56) = 131.99, p <.001), disabler stories with a concrete situation (X2 (3, 99) = 148.61, p <.001), and disabler stories with an abstract situation (X2 (3, 105) = 166.85, p <.001).

We also carried out a third Wilcoxon rank-test analysis using counterexample (alternative vs. disabler) and concreteness (concrete vs. abstract), and Mann Whitney U test for year group (2nd vs. 3rd), with the frequency of the predicted cause (e.g. the bucket had a hole; alarm clock; trouble sleeping) as dependent variable.

Results showed an effect of counterexample, giving more predicted responses with disablers than with alternatives (80% vs. 66%; Z = 5.27, p < .001, r = .31). There was a main effect of concreteness with more predicted causes when the situation was concrete (alarm clock) compared to abstract (trouble sleeping; 76% vs. 70%; Z = 2.29, p = .022, r = .14). However, we did not find an effect of year group (70% vs. 75%; Z = 1.49, p = .137). As expected, there was no effect of concreteness in disablers (Z = .64, p = .52) but there was in alternatives, giving more predicted causes with concrete situations than with abstract (73% vs. 58%, Z = 3.56, p < .001, r = .21) (Table 4).

In order to compare the performance of children and adults, we carried out an additional analysis using the Mann Whitney U test. We must be cautious with the interpretation of this analysis because, although we used the same task, the experiments were made at different time and places.

In the first question (inferential response), the analysis showed more correct responses in adults than in children (Z = 3.71, p < .001, r = .26). The same occurred in the second question on epistemic status (Z = 3.73, p < .001, r = .27), but there was no difference in the third one (Z = .06, p = .953). Moreover, as was expected and as has been seen previously, children found it easier to recover concrete causes in alternatives (73% vs. 58%, Z = 3.56, p < .001, r = .21) but not in disablers (Z = .64, p = .52). Adults did not show such difference in either condition (alternatives: Z = .19, p = .853; disablers: Z = .47, p = .637).

General Discussion

In the present studies, we examined the development of counterfactual reasoning and the ability to distinguish real and conjectured situations, as well as the ability to keep track of them (epistemic status). Counterfactual reasoning requires negating the antecedent to create the counterfactual possibility, however as far as we are aware it is not known how adults and children think about negation in order to negate the antecedent, and this was our most important aim.

As expected, we found high accuracy (92%) in adults’ ability to make inferences from counterfactual statements. As predicted, children’s performance showed a developmental trend in this ability (78% in 8-10 years old children and 87% in 10-12 years old children), that could be due to an increase in the ability to consider alternatives and increases in working memory capacity. This finding is consistent with results from earlier studies that indicate that 6 to 7 year-olds can indeed reason counterfactually (McCormack et al., 2018; Nyhout et al., 2019; Rafetseder & Perner, 2018), but revealing counterfactual thinking as a developmental ability, that improves until adolescence (Gómez-Sánchez et al., 2020; Rafetseder et al., 2013, 2021).

Regarding the epistemic status, we found that adults showed more difficulty than in the inferential task with a mean of 77% correct responses. This finding is consistent with previous results (Gómez-Sánchez et al., 2020; Ruiz-Ballesteros & Moreno-Ríos, 2017; Thompson & Byrne, 2002). According to the mental model theory, responding correctly to the epistemic status question requires not only having a complete representation of counterfactuals that includes the conjectured and the presupposed model, but also keeping in mind their labels in order to keep track of their epistemic status. These labels or mental footnotes are easily forgotten (Johnson-Laird, 1983; Johnson-Laird & Byrne, 2002), causing a decrease in the number of correct responses. Children’s performance on the epistemic status questions also showed a developmental trend (60% vs. 70%) and as with the adult group, their performance was worse than on the inferential task. In a previous study, Gómez-Sánchez et al. (2020), using similar stories (with some important differences regarding the variables manipulated here, such as concreteness and counterexample availability), also found higher scores in the inference responses than in the epistemic status and a developmental improvement with age in the epistemic responses. This ability could be crucial for achieving correct counterfactual reasoning (Gómez-Sánchez et al., 2020).

Finally, in respect of the causal question, we found the expected effect. More predicted causes were reported (e.g. the bucket had a hole, the child had trouble sleeping, etc.) as being the cause of the consequent than the syntactic negation of the antecedent using propositional symbols (e.g. the girl did not pour water; the sister did not enter silently, etc.). We observed this effect in both studies, which implies that adults and children construct the counterfactual possibility in the same way. These results are consistent with previous research, showing that people construct negation (“there is not a light figure”) by recovering another possibility (“a dark figure”) instead of by thinking on the syntactic negation (“not light figure”) (Espino & Byrne, 2018; Khemlani et al., 2012; Mayo et al., 2004; Moreno-Ríos & Byrne, 2018). The inference-to-alternative effect (Espino & Byrne) was obtained only with binary possibilities but not with non-binary possibilities (there is a red figure). In our study, the alternative stories contain two possible causes. As with the binary categories, the negation of one, seems to lead people to think of the other. It is possible that effect could disappear with more possible causes.

Something that was not expected was observing more predicted causes in disablers than in alternatives stories, especially as we found this effect in both studies. Although both kinds of stories are not comparable as they are different, it could be related to the salience of the cause: in alternatives the cause is something (e.g. an alarm clock; trouble sleeping) that does not get as much attention as in the disabler stories, where something unusual happened (e.g. a bucket with a hole). It could also be related to the structure of the counterexample. In disabler stories there is no competing alternative that catches attention, whereas in the alternative stories an explicit alternative is mentioned. However, more research is needed in order to find what causes this result.

We also found that children, unlike adults, gave the predicted cause more with concrete situations than with abstract situations, which could be explained by their poor ability in thinking with abstract concepts. Moreover, as hypothesized, the manipulation of concreteness had an impact on alternative stories but not in control disabler stories. This happens because alternative stories provide an explicit situation that corresponds to the negation of the antecedent, and the manipulation of its concreteness affected the recovery of the cause, as expected. In other words, when we asked children why the child was awake they reported the predicted response with concrete situations (“because the alarm clock sounded”) more than with abstract situations (“because he had trouble sleeping”). However, this effect was not found in disabler stories because they acted as control, with the concreteness of the cause remaining constant (compare Figures 1a and 1b). This result is in accordance with the developmental differences in their ability to negate, as schoolchildren find difficulty in thinking abstractly (Markovits, 2013; Markovits & Lortie-Forgues, 2011).

The findings of the present studies bring to light several difficulties that could cause children’s difficulty in thinking with counterfactuals, such as keeping in mind which is the real situation and which the conjectured one, and having to negate the antecedent in an abstract way. However, the most novel result is the one related to how adults and children construct negation. More research is needed in order to know to what extent these difficulties are responsible for their performance in counterfactual thinking and whether the construction of negation of the antecedent can be generalised to other kind of contents.