Introduction

People have a remarkable capacity to make sense of the sparse, noisy, and ambiguous stream of data that makes up everyday experience. We infer causal relationships to explain events that occur close in time—such as between pressing an unmarked button on a hotel TV remote and seeing the TV turn on—but also between events much further apart in time—such as eating fast food and having an upset stomach some hours later. Often this involves positing the existence of unobserved (or latent) causes as well as the (inherently unobservable) causal relationships. For instance, for the example above, instead of concluding the food caused the symptoms, we may think that both deciding to eat fast food and having an upset stomach are due to stress at work or school. Similarly, sitting on a train and observing that several people are repeatedly picking up their phones at similar times, we may infer that they are reacting to the same messages in a group chat or news notifications.

Coming up with an appropriate generative model of the external environment is valuable for any cognitive agent, allowing for accurate prediction and effective control in pursuit of goals, as well as subserving explanation and communication. Where latent variables actually exist, identifying them also tends to result in a better and more compact representation than attempting to do without them (e.g., Gershman & Niv, 2010). Previous studies on causal reasoning have shown that adults and children, even as young as 10 months, can use covariation information to learn about hidden causes (Kushnir et al., 2003, 2010; Saxe et al., 2005; Lucas et al., 2014; Rottman et al., 2011). In particular, Lucas et al. (2014) showed that adults can correctly infer the presence of one or several hidden elements, as well as the functional form of their causal mechanism, to explain the behavior of a black-box machine.

The possibility of common hidden causes, or latent confounders—i.e., causally relevant variables that have not been or cannot be observed—creates a challenge for the discovery of causal relationships. This is because dependencies between any two events or variables can always be explained by the idea that both are being influenced by some unobserved third variable. As a simple example, an observed correlation between X and Y is consistent with a variety of causal models: Perhaps X causally influences Y, perhaps Y causally influences X; or perhaps some unobserved variable H causally influences both X and Y. These are also not mutually exclusive possibilities; it could also be the case that there is a bidirectional causal influence with X influencing Y even as Y influences X. Worse, X and Y could be constituents of some larger feedback loop involving multiple hidden variables.

Causal graphical models (CGMs, also known as causal Bayesian networks; Pearl, 1995, 2009) can help us formalize and better understand this problem. CGMs have become a dominant tool for causal inference both in data science and as a framework for modeling causal cognition (e.g., Koller & Friedman, 2009; Griffiths & Tenenbaum, 2005; Gopnik & Tenenbaum, 2007; Tenenbaum & Griffiths, 2001; Bramley et al., 2017). A CGM represents causal relationships between random variables using directed edges and parameters encoding the causal mechanisms connecting causes with their effects. This formalism has the constraint that the resulting graph must be acyclic, i.e., there can be no path from any node in the graph back to itself, meaning there is no natural way to represent feedback loops. As in the example involving X and Y above, when learning a CGM from observational contingency data, there is a strict upper bound on structural identifiability. Causal structures that have different interventional semantics can be Markov equivalent, meaning they imply identical (conditional) independencies in the absence of interventions and so cannot be distinguished from observational data alone (e.g., Pearl, 2009; Peters et al., 2017; Heinze-Deml et al., 2018).

The gold standard for uniquely determining causal structure is to perform experiments, manipulating causal variables and observing what else changes (Pearl, 2009; Cartwright, 2007). However, interventions are not always practical or even possible; in many settings, they may be unethical or prohibitively costly. While people are capable intuitive experimenters (Gopnik et al., 2007; Cook et al., 2011; Kushnir & Gopnik, 2005; Lagnado & Sloman, 2004; Steyvers et al., 2003), they may also make use of the richness of observational data to tackle the challenging problem of inferring causal structure (e.g., Rothe et al., 2018). Given the abundance of observational data, ignoring such information would severely limit people’s ability to make sense of the world around them. Critically, temporal information can provide cues and constraints on causal structure that are missed when only considering “static” contingency information, which may be particularly important for tackling the challenging problem of inferring causal structures involving hidden causes.

People do not generally encounter contingency data directly, but rather events occurring over time, often without the additional information that would allow them to build a contingency table (e.g., about what would constitute an “independent trial”). Temporal order and delay between events have been linked to learning since the early days of psychology, featuring in basic accounts of animal and human learning and conditioning (e.g., Grice, 1948; Michotte, 1946). The connection between order and causality has been noted since the earliest work on causality, for instance by Hume, as causes are assumed to precede their effects (Hume, 1740). Recent research on the role of time in human causal structure learning has shown that people readily use temporal information to infer causal relationships among observed events, making use of both temporal order (Rottman & Keil, 2012; Bramley et al., 2014) and delay information (Bramley et al., 2018; Gong et al., 2022). Underlining the importance of temporal information, people have also been found to make inferences that align with the temporal order of events, even if this temporal information is at odds with covariation evidence (Lagnado & Sloman, 2006; Rottman & Keil, 2012). Regarding more nuanced temporal delay information, it is well established that longer delays between two events lead to weaker judgments of causality, other things being equal (Grice, 1948; Shanks et al., 1989). Explanations for this observation consider working memory capacity constraints (Ahn et al., 1995; Einhorn & Hogarth, 1986), but also the normative rationale that longer delays imply more events may have occurred in the meantime that could explain the effect (Buehner & May, 2003; Lagnado & Speekenbrink, 2014). Shorter delays are not invariably seen as more causal, however: People are also able to adapt their expectations to specific domains. As in the introductory examples, we expect the delay between pressing the power button on a TV controller and seeing the device turn on to be short but between eating fast food and developing symptoms of an upset stomach to be much longer (Garcia et al., 1966; Buehner & McGregor, 2006a). Indeed, violations of the expectations about causal delays have been shown to reduce judgments of causal strength (Greville & Buehner, 2010). Research focusing on the problem of identifying the structure of fully observed and acyclic causal systems suggests that people use order information to rule out incompatible causal structures, but are also able to use the duration and variability of causal delays to make more fine-grained judgments (Bramley et al., 2018).

In this work, we build on prior research into how people infer the presence of hidden causes as well as on work that studies the role of temporal information in shaping people’s causal inferences. Specifically, we study how people use temporal information to infer the causal structure giving rise to observed sequences of events when this can involve hidden causes and causal cycles (see Fig. 1). Across two experiments covering different domains, we compare people’s causal judgments to Bayesian structure learning models that use order or delay information.

Fig. 1 Causal structure between events under each hypothesis. Events are denoted as red and blue vertical bars for X and Y events, respectively. Hidden cause events are denoted by dotted vertical gray bars.

Formal Framework

Our approach follows the tradition of rational analysis (Anderson, 1991) and computational-level models (Marr, 1982). That is, we derive normative predictions under different assumptions about how people may construe the learning problem and compare these predictions to human judgments. We take causal structure learning to be a probabilistic inverse problem, a common perspective that has been successful in explaining a wide range of phenomena (e.g., Griffiths & Tenenbaum, 2005; Gopnik & Tenenbaum, 2007). More precisely, we represent structure learning as a problem of Bayesian inference over a hypothesis space of possible causal generative models. This begins with the learner’s prior beliefs about the set of possible causal structures S and their parameters \(\boldsymbol{\theta}_{s}\) for all \(s \in S\), represented as \(p(s)\) and \(p(\boldsymbol{\theta}_{s} \mid s)\), respectively. Given data \(\mathcal{D}\), a learner updates their beliefs about causal structures via Bayes’ theorem:

$$p(s \mid \mathcal{D}) \propto p(\mathcal{D} \mid s) p(s).$$
(1)

Here, \(p(\mathcal{D} \mid s)\) is the marginal likelihood of structure s, having integrated out our uncertainty about the parameters and potential hidden causes. However, this marginal is typically not available in closed form, as computing it involves integrating over the unknown parameters, even for structures without hidden causes:

$$p(\mathcal{D} \mid s) = \int p(\mathcal{D}\mid \boldsymbol{\theta}_{s}, s) p(\boldsymbol{\theta}_{s} \mid s) d \boldsymbol{\theta}_{s}.$$
(2)

Evaluating the marginal likelihood for structures with hidden causes involves even more uncertainty, since the likelihood \(p(\mathcal{D} \mid \boldsymbol{\theta}_{s}, s)\) given the parameters depends on the values of the hidden causes \(\mathbf{h}\), which must also be marginalized over:

$$p(\mathcal{D}\mid s) = \iint p(\mathcal{D}\mid \mathbf{h}, \boldsymbol{\theta}_{s}, s) p(\mathbf{h} \mid \boldsymbol{\theta}_{s}, s) p(\boldsymbol{\theta}_{s} \mid s) d \mathbf{h} d \boldsymbol{\theta}_{s}.$$
(3)

The integrals in these expressions can usually not be solved in closed form, so they require the use of an approximation scheme; a simple Monte Carlo illustration is sketched below. We next discuss existing proposals for modeling time in causal relationships, before presenting our approach.
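The sketch referenced above is a naive but concrete illustration of Eqs. 1–3 (our illustration, not the approximation scheme actually used for the delay model, which is described in Appendix 4): the marginal likelihood of each structure is estimated by simple Monte Carlo, drawing parameters (and hidden-cause events, where needed) from their priors, and the resulting estimates are combined with a structure prior to form a posterior.

```python
import numpy as np
from scipy.special import logsumexp

def log_marginal_likelihood_mc(log_lik_fn, sample_prior_fn, n_samples=10_000, seed=0):
    """Monte Carlo estimate of log p(D | s): average the likelihood over
    parameter (and hidden-cause) draws from their prior, as in Eqs. 2-3."""
    rng = np.random.default_rng(seed)
    log_liks = np.array([log_lik_fn(sample_prior_fn(rng)) for _ in range(n_samples)])
    return logsumexp(log_liks) - np.log(n_samples)

def structure_posterior(log_marginals, log_prior):
    """Posterior over candidate structures (Eq. 1): normalized prior times marginal likelihood."""
    log_post = np.asarray(log_marginals) + np.asarray(log_prior)
    return np.exp(log_post - logsumexp(log_post))
```

In practice, more efficient estimators are needed when hidden events make naive prior draws a poor match to the data; the sketch is only meant to make the target quantities concrete.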

Existing Modeling Approaches

CGMs serve as interpretable and compact static representations of the causal relationships between random variables; however, the temporal dynamics between individual events are not typically represented explicitly. The idea of taking temporal information into account for discovering causal relationships is not new; consider, e.g., Granger causality (Granger, 1969), dynamic causal modeling (Friston et al., 2003), and advances in the machine learning literature (e.g., Didelez, 2008; Pamfil et al., 2020; Löwe et al., 2020; Malinsky & Spirtes, 2019; Strobl, 2019; Mastakouri et al., 2021).

A popular variant of graphical models for representing dynamic relationships or systems that evolve over time is given by dynamic Bayesian networks (DBNs; Dean & Kanazawa, 1989), where—as for static CGMs—edges can be endowed with causal semantics. However, one downside is that DBNs typically require a discretization of continuous time into discrete steps. Continuous time Bayesian networks (CTBNs; Nodelman et al., 2002) extend DBNs to represent structured stochastic processes in continuous time, thereby mitigating the problem of choosing a discretization of time in DBNs. However, both standard DBNs and standard CTBNs implicitly assume that delays between events are memoryless, following exponential distributions (Murphy, 2012). This assumption does not hold for many real-world phenomena, such as incubation periods expressing the delay between exposure to a virus and the onset of symptoms.

A related approach that relaxes the DAG representation of CGMs is to introduce an undirected edge to capture both common hidden causes and cyclic relationships, as in so-called “chain graphs” (Lauritzen & Richardson, 2002). However, this amounts to declaring the distinction between causal cycles and hidden causes unidentifiable, even though these structures respond differently to interventions: While intervening on one of the observed variables might cause the other variable to activate in a cycle, such interventions would have no effect in a common hidden cause (CHC) structure, as there is no direct causal connection between the observed variables. Overall, despite many advancements, challenges remain for inferring causality from temporal as well as atemporal data (e.g., Glymour et al., 2019).

A different class of models that has been applied to human causal cognition is based on point processes, representing causes as influencing the rate with which other events occur in continuous time (Pacer et al., 2012; Pacer & Griffiths, 2015). Recent modeling work has also addressed the question of how people infer causality between specific event instances: Stephan et al. (2020) focus on singular causes rather than causal structure over multiple observations, using a modeling framework related to the delay model we discuss below. A different line of work has focused on the question of how people infer causal structure between continuous causes in continuous time, as opposed to events in continuous time (Davis et al., 2020). Moreover, while the present work focuses on studying human behavior at the computational level, prior research by Fernando (2013) presents an approach to learning causal structure between events at an implementational (i.e., neural) level.

Events in Continuous Time

We are interested in the causal relationships between the onsets of events, which can be treated as points on the real line. In particular, we assume that two events never occur at exactly the same moment and that effects never precede their causes. Figure 1 shows an example of how four events imply different sets of cause–effect delays under five different causal structures.

Order Model

We first describe an inference model that is sensitive only to order information and disregards delays between events. For this, we generalize previous work (Rottman & Keil, 2012; Bramley et al., 2014, 2018) to accommodate hidden causes and cycles. Our order model has likelihoods that depend only on the order of events in a given sequence, so it is agnostic about the durations of inter-event delays (see Fig. 2). We construct generative order models for each causal structure, compactly representing each as a probabilistic finite state machine (PFSM; Vidal et al., 2005) (see Appendix 3). Depending on the causal structure, different sequences of events are possible.

Fig. 2 Parsing of events on the real line for an order representation, where colored vertical bars represent point events. In this case X and Y are assumed independent (as represented by the directed causal edges between events), but the sequence XYXY is possible under all structures except \(Y \rightarrow X\).

The model in which X and Y are independent imposes no restrictions on transitions between events: X might be followed by another occurrence of X or by a Y, and vice versa. For \(X\rightarrow Y\) (that is, X causes Y), we assume there is only one possible transition for each state: cause X is invariably followed by its effect Y, and X does not reoccur until its previous activation (and causal chain to Y) has run its course, meaning that Y is always followed by X. Essentially, for the order model (but not the delay model, as discussed below), we assume causes are blocked from re-occurrence while they are still involved in producing their effects. An alternative could be to assume that multiple causal influences are able to travel between a cause and its effect simultaneously. We return to this assumption in the discussion. The structure \(Y\rightarrow X\) has the mirrored semantics of \(X\rightarrow Y\). A common hidden cause can produce activations of X and Y in succession but, unlike the independent structure, implies that the same variable will never activate more than twice in a row, following the blocking assumption described above. For causal cycles, the order-only model assumes either observable event might initialize the observation sequence, but after this the observables activate in turn.

Following prior work (Bramley et al., 2018), we complete each order structure by setting the transition probability for a state k to \(\frac{1}{\text{outdegree}(k)}\), where the outdegree is defined as the number of outgoing edges from a state, implementing the principle of the Bayesian Ockham’s razor (e.g., Myung & Pitt, 1997). For example, we assume there is always a \(\frac{1}{2}\) chance of X occurring as the next event in the independent structure (see Appendix 3).
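To make this concrete, the sketch below scores an event sequence under PFSMs with uniform transition probabilities out of each state. Only the independent and the two directed machines are spelled out, following the verbal description above (and assuming, for the directed structures, that sequences begin with the cause); the machines for the common hidden cause and the cycle are built analogously and are given in Appendix 3, which we do not reproduce here.

```python
import numpy as np

# Each machine maps a state to the events allowed next; the transition
# probability out of a state is 1 / outdegree(state), as described above.
INDEPENDENT = {"start": {"X": "any", "Y": "any"},
               "any":   {"X": "any", "Y": "any"}}
X_CAUSES_Y = {"start":   {"X": "after_x"},   # assumes the sequence starts with the cause
              "after_x": {"Y": "after_y"},   # X is invariably followed by Y ...
              "after_y": {"X": "after_x"}}   # ... and Y by the next X
Y_CAUSES_X = {"start":   {"Y": "after_y"},
              "after_y": {"X": "after_x"},
              "after_x": {"Y": "after_y"}}

def order_log_likelihood(sequence, machine):
    """log p(sequence | machine) for a probabilistic finite state machine."""
    state, logp = "start", 0.0
    for event in sequence:
        allowed = machine[state]
        if event not in allowed:       # sequence impossible under this structure
            return -np.inf
        logp -= np.log(len(allowed))   # uniform choice among outgoing transitions
        state = allowed[event]
    return logp

for name, machine in [("indep", INDEPENDENT), ("X->Y", X_CAUSES_Y), ("Y->X", Y_CAUSES_X)]:
    print(name, order_log_likelihood("XYXY", machine))
```

For the sequence XYXY this reproduces the pattern in Fig. 2: the sequence is impossible under \(Y \rightarrow X\), receives probability 1 under \(X \rightarrow Y\), and probability \((1/2)^4\) under the independent structure.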

Delay Model

For our generative model of events and their delays, we use a variant of a dynamic Bayesian network (DBN; Dean & Kanazawa, 1989) representation, as a CGM in which nodes denote when components of the causal system activate. Edges correspond to parameterized delay distributions controlling the intervals between activations of a cause and its effects. Root causes are assumed to cause their own recurrence with their own set of parameters controlling their inter-event distributions. Here we restrict our attention to causal relationships in which each effect event can have at most one cause event, and effect events appear in the order their cause events occurred in. For instance, for an \(X \rightarrow Y\) structure, the sequence \(x^{(1)} \succ x^{(2)} \succ y^{(1)} \succ y^{(2)}\) (where \(\succ\) denotes precedence with respect to the temporal order) would be consistent with the delay model, whereas the sequence \(x^{(1)} \succ x^{(2)} \succ y^{(2)} \succ y^{(1)}\) would not be consistent. This assumption is weaker than the assumption for the order model, as the delay model does not assume that causes are blocked until their effects have occurred, but only assumes that the problem of causal attribution is resolved via temporal precedence; we discuss this point further in the general discussion. To generate data from such a model, one samples root cause activations, then samples causal delays to their effects, unrolling the graph into a tree of event timings. Inference then amounts to “reverse engineering” the causal structure most likely to have given rise to the set of timings observed, taking into account prior beliefs about their plausibility.

Following prior work, we use the gamma distribution to model delays (e.g., Bramley et al., 2018; Gong et al., 2022; Stephan et al., 2020). Gamma distributions have positive support, i.e., \((0, \infty)\), and can capture beliefs about the expectation and variance of delay distributions. Gamma distributions are typically defined by a shape \(k\) and a scale \(\theta\) (or alternatively, by a shape \(\alpha\) and rate \(\beta\)). As opposed to the exponential distribution (which is a special case of the gamma distribution), the gamma distribution allows for modeling non-memoryless delay distributions. That is, gamma distributions can capture expectations about when an effect will happen after observing a cause as well as how much variability there is around this expectation. For easier interpretability, we express gamma distributions using a standard reparametrization in terms of their mean \(\mu = k\theta\) and variance \(\sigma^{2} = k\theta^{2}\). For additional background on the gamma distribution, see Appendix 4.
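A small helper makes this reparametrization concrete (a sketch with arbitrary example values): it converts a desired mean and variance into the shape and scale parameters and samples delays from the resulting distribution.

```python
from scipy import stats

def gamma_from_mean_var(mu, var):
    """Convert mean/variance to the shape k and scale theta of a gamma distribution,
    using mu = k * theta and sigma^2 = k * theta^2."""
    theta = var / mu
    k = mu / theta  # equivalently mu**2 / var
    return k, theta

# Example: a cause-effect delay expected at ~0.5 s with little variability.
k, theta = gamma_from_mean_var(mu=0.5, var=0.01)
delays = stats.gamma.rvs(a=k, scale=theta, size=5, random_state=0)
print(k, theta, delays.round(2))
```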

We display static summary graphs along with example event sequences in Fig. 1. For our computational analysis, we index events of each type separately by their i-th occurrence. In the unrolled graphical representation, edges thus represent parameterized gamma delays between occurrences of events (represented as nodes). For instance, if \(x^{(i)}\) is connected to \(y^{(i)}\) by an edge under a particular structure hypothesis, \(x^{(i)}\) occurred at time \(x^{(i)} = 1\,\text{s}\), and we observe a delay of 0.4 s, then the value of \(y^{(i)}\) is \(y^{(i)} = x^{(i)} + 0.4\,\text{s} = 1.4\,\text{s}\).

For the independent structure, \(x^{(i)}\) causes \(x^{(i+1)}\) and \(y^{(i)}\) causes \(y^{(i+1)}\), such that there is no inherent dependence between occurrences of X and the next occurrence of Y. Under \(X\rightarrow Y\), each \(x^{(i)}\) is caused by \(x^{(i-1)}\), while each \(y^{(i)}\) is caused by \(x^{(i)}\) and thus is independent of \(y^{(i-1)}\) conditional on \(x^{(i)}\). For the common hidden cause structure, the occurrences of \(x^{(i)}\) and \(y^{(i)}\) are taken to be caused by activations of a hidden cause \(h^{(i)}\), which was itself caused by \(h^{(i-1)}\). We further assume causal delays from \(h^{(i)}\) to \(x^{(i)}\) and from \(h^{(i)}\) to \(y^{(i)}\) have tied parameters, such that occurrences of X neither systematically succeed nor precede occurrences of Y. Coupled with our assumptions about delay distributions, this means that common hidden causes entail effects that are close to one another in time, but occur in arbitrary orders. This assumption of tied parameters may be justified in real-world settings whenever the observed variables have a shared causal mechanism, e.g., if both observed variables are instances of the same “type” of variable (as is the case for our cover stories below), but see the discussion about the possibility of relaxing this assumption. Lastly, the causal cycle resembles \(X\rightarrow Y\) (or \(Y \rightarrow X\)) but distinguishes itself by two characteristics, which arise from a shared cause-effect mechanism: (1) delays from X to Y and from Y to X are symmetric; hence, the parameters are tied; and (2) a sequence of observations might begin with either X or Y.
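For illustration, the following sketch forward-samples an event sequence from the \(X \rightarrow Y\) delay model (our sketch; the default parameter values mirror the Experiment 1 stimuli described below and are otherwise arbitrary).

```python
import numpy as np

def sample_x_causes_y(n_events, mu_a=2.0, var_a=0.4, mu_b=0.5, var_b=0.01, seed=0):
    """Forward-sample an event sequence from the X -> Y delay model:
    X recurs with gamma(mu_a, var_a) inter-event delays, and each x(i)
    produces its effect y(i) after an independent gamma(mu_b, var_b) delay."""
    rng = np.random.default_rng(seed)
    def gamma_delays(mu, var, size):
        k, theta = mu**2 / var, var / mu   # mean/variance -> shape/scale
        return rng.gamma(shape=k, scale=theta, size=size)
    x_times = np.cumsum(gamma_delays(mu_a, var_a, n_events))  # self-caused recurrence of X
    y_times = x_times + gamma_delays(mu_b, var_b, n_events)   # each effect follows its cause
    return sorted([(t, "X") for t in x_times] + [(t, "Y") for t in y_times])

for t, label in sample_x_causes_y(4):
    print(f"{t:5.2f} s  {label}")
```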

Overview of Experiments

We ran two experiments in which participants watched a series of short videos. Participants were allocated to different cover stories, but videos always showed two colored circles on a gray background, corresponding to the two observable components X and Y of some causal scenario of unknown structure. Activations were visualized by the corresponding component flashing briefly. Participants had to watch the videos and make a forced choice from the set of five candidate causal structures.

Experiment 1

Methods

Participants

Fifty adults (18 female, mean age 34.30 years, SD = 10.70) participated in the experiment in return for a base payment of £1.50 and performance-related bonuses of up to £1.20, resulting in an average compensation of £11.46/h. Participants took, on average, 16.25 (SD = 9.09) minutes to complete the task.

The experiment was conducted online with participants recruited via Prolific Academic (www.prolific.co). The experiment was programmed in Scala.js and ran as a standard client-side experiment. In order to ensure high data quality, participants were required to have a 99% completion rate on previous studies on Prolific, as well as between 100 and 10,000 previous submissions. The study was pre-registered on OSF (https://osf.io/e5d23).

The sample size for our first experiment was based on a power analysis using G*Power (Faul et al., 2009). We consider binomial tests for each of the five conditions in which the stimuli were sampled from the generative structure (that is, excluding the prior elicitation condition). We are interested in detecting deviations from the expected proportion of chance responses (0.2 for 5 response options) for the ground-truth category relative to all other categories. For a power of 0.95, a Bonferroni-adjusted alpha level of \(\frac{0.05}{5} = 0.01\) (for running five separate tests), a proportion of 0.2 under the null hypothesis, and a proportion (to be interpreted as an effect size) of 0.5 under the alternative hypothesis, we obtain a required total sample size of 44. Including a safety margin, we thus obtain a total sample size of 50 participants.
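The logic of this power analysis can be reconstructed with a short script (our sketch for a one-sided exact binomial test; G*Power's exact result may differ slightly depending on sidedness and how the discrete critical value is handled).

```python
from scipy import stats

def binomial_power(n, p0=0.2, p1=0.5, alpha=0.01):
    """Power of a one-sided exact binomial test of H0: p = p0 against p = p1."""
    # smallest critical count c such that P(X >= c | p0) <= alpha
    c = next(k for k in range(n + 1) if stats.binom.sf(k - 1, n, p0) <= alpha)
    return stats.binom.sf(c - 1, n, p1)  # P(X >= c | p1)

n = 5
while binomial_power(n) < 0.95:  # smallest n reaching 95% power
    n += 1
print(n, round(binomial_power(n), 3))
```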

Design and Procedure

Participants were asked to identify the causal relationships between different species of bioluminescent bacteria and were told they would be paid £0.20 for each trial in which they identified this correctly. Concretely, their task was to identify the causal structure giving rise to observed sequences of events. We constructed video stimuli showing two bacteria, labeled X and Y, which would occasionally light up, and presented five possible causal hypotheses in the form of graphical diagrams relating the bacteria to one another and to a potential hidden cause (see Fig. 3 for a screenshot of the interface).

Fig. 3 Experiment interface, showing the stimulus display (top) and the structure judgment, including a confidence slider, for the bacteria cover story (bottom). This frame shows a Y event.

Participants were first trained on how to interpret each diagram (see Appendix 1 for the training participants underwent). Participants were also told that in addition to the observed X and Y bacteria, a hidden bacterium might also be present, and that this might influence the occurrence of illuminations of the observed bacteria. Participants were also instructed that some causal structures may never be the correct answer and some might be correct for several of the trials. After completing instructions and two comprehension checks (see Appendix 1), each participant completed six trials in randomized order. Five of these involved watching 35-s videos containing between 12 and 15 occurrences of each of bacterium X and bacterium Y lighting up. In reality, each of the five causal structures was used to generate the data for exactly one of these trials. One additional trial (occurring randomly in the sequence of trials) did not include any observed data. Participants were instead instructed: “This recording must have gone missing. Even though there is no video recording available, please give your best guess about which structure it might have been anyway.” This trial served to probe participants’ prior expectations about the plausibility of the different structures without seeing any data.

At the end of each trial, participants made a forced-choice judgment selecting which of the five causal structures generated the data and provided a confidence in this judgment using a slider ranging from a left pole of 0 (“completely uncertain”) to a right pole of 100 (“completely certain”), with increments of 1 and without a starting value to minimize anchoring effects, as shown in Fig. 3.

The left/right position of the X and Y objects was randomized for each video (with the labels remaining in place), and the colors of the lights were randomly drawn for each trial to make different trials more distinguishable and to reinforce the idea that different causal structures could govern different pairs of bacteria. Colors were sampled such that each pair had maximally dissimilar hues in a hue/saturation/value color space, but equal saturation and value (i.e., brightness). Events took the form of bacteria lighting up: This was displayed by having the colored circle representing the bacterium become maximally visible (by assigning maximal opacity) and then decay exponentially back to its baseline opacity, fading into the gray background at a rate of 25% per video frame (with a frame duration of 50 ms). This provides a visual presentation that is consistent with the semantics of point events while ensuring participants could easily perceive and distinguish between the events.

Stimuli

All video stimuli were generated by sampling from one of the five causal structures under consideration. Sampled delays were generated from distributions that provided adequate evidence for the true generative structure, while leaving some uncertainty. In particular, we set the parameters such that recurrence of each cause occurred with a longer, more variable delay than the delays between causes and effects, ensuring that there would be little chance of a cause recurring while its effects were still underway or of effect events overtaking each other. Specifically, for the independent structure, we set \(\mu_{a} = 2.0\,\text{s}, \sigma_{a}^{2} = 0.4\,\text{s}^2\) and \(\mu_{b} = 2.0\,\text{s}, \sigma_{b}^{2} = 0.4\,\text{s}^2\). For \(X \rightarrow Y\), we set \(\mu_{a} = 2.0\,\text{s}, \sigma_{a}^{2} = 0.4\,\text{s}^2\) and \(\mu_{b} = 0.5\,\text{s}, \sigma_{b}^{2} = 0.01\,\text{s}^2\) (for \(X \leftarrow Y\), swapping a and b). For the common hidden cause structure, we set \(\mu_{a} = 2.0\,\text{s}, \sigma_{a}^{2} = 0.4\,\text{s}^2\) and \(\mu_{b} = 0.5\,\text{s}, \sigma_{b}^{2} = 0.01\,\text{s}^2\). For the causal cycle, we set \(\mu_{a} = 1.0\,\text{s}, \sigma_{a}^{2} = 0.01\,\text{s}^2\) and \(\mu_{b} = 1.0\,\text{s}, \sigma_{b}^{2} = 0.01\,\text{s}^2\), which leads the self-delays of the X's and Y's to be the same as for all other structures in expectation, while providing reliable cues. All stimulus parameterization settings are also reported in Appendix Table 2 as an overview. For each causal structure, we generated four stimulus sequences using the same generative model and selected one uniformly at random for each participant to average over idiosyncrasies of particular sampled sequences. All stimuli are visualized in Fig. 4 and the exact timings of the events are included in the pre-registration materials (see https://osf.io/e5d23).

Fig. 4 All stimulus sequences presented to participants, where X events are denoted by red lines and Y events by blue lines.

Results

Overall, participants recovered the correct structure 58% of the time (compared to a chance-level accuracy of 20%). However, accuracy depended on the ground-truth causal structure. Figure 5 displays confusion matrices showing how often participants recovered the true generative structure in each condition. For the independent and the directed causal structures (\(X\rightarrow Y\) and \(Y\rightarrow X\)), participants most frequently identified the ground truth. For the common hidden cause (CHC), people more often judged the data to be generated by a causal cycle, and for the causal cycle, people tended to favor one of the directed structures.

Fig. 5 Confusion matrices containing people's judgment frequencies relative to the ground truth for each experimental condition.

Results for the prior elicitation condition are presented in Fig. 6. As indicated by deviations from the gray horizontal line, participants descriptively favored independent causal relationships and the common hidden cause over other possibilities.

Fig. 6 Prior expectations (guess frequencies on trials without evidence) across all experiments (N = 250). Horizontal gray lines correspond to chance-level responding.

Experiment 1 had a cover story of bioluminescent bacteria in a Petri dish, and it may be that the findings reported above express people’s idiosyncratic expectations about the behavior of such bacteria. To assess the domain-generality of people’s causal inferences from temporal data, we repeat (and internally replicate) the task under a range of different cover stories, as we report below.

Experiment 2

The aim of our second experiment was to replicate the findings from Experiment 1, and to understand (1) whether Experiment 1's results reflect domain-general expectations or specific expectations about our bacterium cover story; and (2) in the event that participants' beliefs vary across domains, how they vary and to what extent their elicited priors are consistent with their inferences. We included a replication of the original cover story and three new between-participant conditions with different cover stories. The experiment was pre-registered on OSF (https://osf.io/jq9bd).

Methods

Participants

Two-hundred adults (113 female, 1 other; mean age 33.49 years, SD = 11.59), recruited via Prolific Academic (www.prolific.co), took part in Experiment 2 in return for a base payment of £1.30 and performance-related bonuses of up to £1.20, resulting in an overall average compensation of £10.49/h. Participants took 13.71 min on average (SD = 6.30) to complete the task. The criteria used to select participants were the same as for Experiment 1. The power calculation for determining the required sample size follows the rationale for Experiment 1, as 50 participants were allocated to each cover story condition.

Design and Procedure

Participants were randomly assigned to one of the four cover story conditions. These cover stories were selected to cover a wide spectrum of everyday inference domains and potentially different inductive biases, based on prior work that focused on capturing people’s beliefs across a range of different domains (Yeung & Griffiths, 2015; Kushnir et al., 2010). In particular, the stories were selected to fulfill three desiderata: First, events and specifically their onsets can happen very quickly. Second, the causal “type” of the observed variables is the same (as opposed to, e.g., different medical symptoms following an infection). Finally, there are plausible mechanisms for all five causal structures in the setting described by the cover story.

Biological :

Bioluminescent bacteria, where events are illuminations of either bacterium. Identical to Experiment 1, serving as an internal replication.

Medical :

Recordings of micro-seizures in animals' brains, where events correspond to local seizures on either side of the brain.

Mechanical :

Mechanical devices, where an event corresponds to one of the two knobs being pushed out (following Kushnir et al., 2010).

Behavioral :

Sleep recordings of two people, where events correspond to brief waking-up events of either person.

In all other ways, Experiment 2 was identical to Experiment 1.

Stimuli

Stimulus selection and sampling were identical to Experiment 1.

Results

Experiment 2 replicated the qualitative pattern observed in Experiment 1. As shown in Fig. 5, participants in the biological cover story again identified independent and directed causal structures correctly but tended to mistake the hidden cause for a cycle and the cycle for a unidirectional structure. However, there was some variation on this pattern in the three new cover stories, as we discuss below.

Results for the prior elicitation condition are presented in Fig. 6. We observe that the replication condition matches the qualitative pattern observed in Experiment 1. Meanwhile, there are qualitative differences in response patterns across cover story conditions, with participants descriptively reporting independent causes and a common hidden cause most often.

Inferential Statistics

Before turning to model-based analyses, we provide frequentist inferential statistics for both experiments. Overall, participants performed above chance (i.e., 20%), aggregating over the five causal structures (exact binomial test, p < 0.001). Running separate binomial tests per ground-truth structure with a Bonferroni-adjusted significance level of \(\alpha = \frac{0.05}{5} = 0.01\) indicates better than chance-level performance for all conditions (independent: p < 0.001; \(X \rightarrow Y\): p < 0.001; \(X \leftarrow Y\): p < 0.001; \(X \leftarrow H \rightarrow Y\): p < 0.001) except when the true structure was a cycle (p = 0.048).
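For reference, a test of this kind can be run as follows (a sketch with hypothetical counts and a one-sided alternative; the actual per-condition frequencies are those shown in Fig. 5).

```python
from scipy import stats

# Hypothetical counts for one ground-truth condition: number of correct
# identifications out of 250 judgments.
correct, total = 160, 250
result = stats.binomtest(correct, total, p=0.2, alternative="greater")
print(result.pvalue, result.pvalue < 0.05 / 5)  # Bonferroni-adjusted alpha = 0.01
```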

In order to test whether people's judgments differed between cover story conditions and ground-truth structures, we fit a multinomial logistic regression with the inferred causal structure as our criterion and the ground-truth causal structure as well as cover story as dummy- (or one-hot-) coded predictors. Likelihood-ratio tests for the two predictors reveal significant main effects of both ground-truth causal structure (\(\chi^2(16) = 1657\); p < 0.001) and cover story (\(\chi^2(12) = 29.913\); p = 0.003). Qualitatively, the main distinctive pattern here is that people in the sleep condition had a greater propensity to infer the CHC structure when the CHC was the ground truth, relative to the other conditions. We conjecture that this effect is the result of there being a more salient and intuitive common hidden cause—noises waking both sleepers—in this condition.

Focusing on the prior elicitation trials over our cover stories in aggregate, we see that judgments deviated systematically from a random guesser null-hypothesis over the five candidate causal structures (\(\chi^2(4) = 95.12\); p < 0.001). Meanwhile, the null-hypothesis that prior judgments are independent of the cover story could not be rejected (\(\chi^2(12) = 19.168\); p = 0.085).

Model-Based Analyses

Having demonstrated that judgments are sensitive to and broadly aligned with the ground-truth causal structure, we now turn to our formal modeling framework to better understand whether participants’ inferences are consistent with one or other of the Bayesian accounts we have outlined. That is, we will use model comparison to estimate to what extent participants’ judgments are sensitive to delay information or just event order.

This requires (1) defining a prior over structures and priors over causal delay distributions; (2) computing marginal structure likelihoods and combining these with the structure prior; and (3) converting the resulting posterior into choice probabilities. We lay out each of these steps below.

Priors

Rational models of inference under uncertainty must generally ascribe a set of prior beliefs or inductive biases to the agents being studied. In a classical rational analysis, these priors are assumed to be matched to the statistics of the environment to which agents are adapted (Anderson, 1991), but, as is the case in many models of causal learning, it is unclear how one would estimate these statistics directly. An alternative is to estimate participants' priors empirically and see whether participants' inferences from data are consistent with these priors. We compare such a prior over causal structures (henceforth the elicited prior) to a uniform baseline, as well as to a flexible prior that is fitted to participants' judgments.

For the flexible prior, we fit a categorical prior distribution over our five causal structures along with the rest of our model. This requires four free parameters, with the fifth element determined by the requirement that the prior sums to 1. For the elicited prior, we compute the relative choice frequencies of the causal structures in stimulus-free prior-elicitation trials across all participants. We assume that guesses in the elicitation condition represent samples obtained via probability-matching from the prior and that learners roughly share this prior, so that the choice frequencies can be taken as a reflection of it. In order to allow for some deviations from probability-matching choice behavior, either towards more random responding or harder maximization, we pass the elicited (log) prior through a softmax with a weight parameter \(\omega \in [0, \infty)\), which we also fit to the training folds in our cross-validation procedure (as explained below).
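A sketch of one way to implement this transformation is given below, using a temperature convention in which large values of ω flatten the elicited prior towards uniform and small values sharpen it towards maximization on the modal structure, with ω = 1 recovering probability matching (this convention matches the description of the fitted values reported later; the counts are hypothetical).

```python
import numpy as np

def temper_elicited_prior(counts, omega):
    """Structure prior from prior-elicitation choice counts, passing the elicited
    log prior through a softmax with temperature omega: omega = 1 recovers
    probability matching, large omega flattens towards a uniform prior, and
    omega -> 0 approaches maximization on the modal structure."""
    log_p = np.log(np.asarray(counts, dtype=float) / np.sum(counts))
    w = log_p / omega
    return np.exp(w - np.logaddexp.reduce(w))

# Hypothetical guess counts over [independent, X->Y, Y->X, CHC, cycle]
counts = [18, 6, 6, 14, 6]
for omega in (1.0, 5.0, 100.0):
    print(omega, temper_elicited_prior(counts, omega).round(3))
```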

Calculating Marginal Likelihoods—Including Conservatism

We assumed uninformative or weakly informative priors for all delay model parameters. For details on approximate inference for the delay model, see Appendix 4. We confirmed the stability of our marginal likelihood estimates by running the full procedure five times, which yielded the same results at the numerical precision we report.

The order and delay models we introduced assign structure likelihoods to any sequence of X and Y events (for the delay model, after marginalizing over potential hidden causes and parameters). However, these model predictions follow from the, arguably unrealistic, assumption that participants have perfect and arbitrarily precise perception and memory of temporal order and delays. We thus accommodate the possibility that participants have more uncertainty in their likelihoods than that assumed by our ideal-observer models. This is achieved by passing our order and delay model likelihoods through a softmax function with a fitted temperature \(\lambda\). This allows for varying levels of perceptual uncertainty, possible memory failures, and conservatism with regard to belief change, conceptually following prior work (e.g., Edwards, 1968; Bramley et al., 2014). Intuitively, as \(\lambda \rightarrow 0\) we obtain stronger likelihoods, as \(\lambda \rightarrow \infty\) we approach uniform (non-diagnostic) likelihoods, and, because the softmax is applied to log probabilities, \(\lambda = 1\) recovers the original likelihoods.

Mapping Posterior Probabilities to Decisions

Participants do not provide posterior distributions directly but make forced-choice judgments about which of the five structure diagrams produced each observed sequence. Our decision model thus corresponds to a multinomial distribution over the five causal structures, where the probability of selecting each structure is given by the respective posterior structure probability, that is, the (normalized) product of structure prior and structure likelihood, as given in Eq. 1.
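Putting the pieces together, a sketch of the resulting choice model (our illustration of the computation; the structure prior may be uniform, elicited, or fitted, and λ tempers the likelihoods as described above):

```python
import numpy as np

def choice_probabilities(log_prior, log_lik, lam=1.0):
    """Multinomial choice probabilities over the five candidate structures:
    normalized product of the structure prior and the lambda-tempered
    structure likelihood (Eq. 1)."""
    w = np.asarray(log_prior, dtype=float) + np.asarray(log_lik, dtype=float) / lam
    return np.exp(w - np.logaddexp.reduce(w))

# Example with a uniform prior and hypothetical log structure likelihoods.
log_prior = np.log(np.full(5, 0.2))
log_lik = np.array([-12.0, -9.5, -15.0, -10.5, -14.0])
print(choice_probabilities(log_prior, log_lik, lam=4.0).round(3))
```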

Cross-validation

We assess the predictive performance of the order and delay model variants on the judgments of all 250 participants. Excluding the prior elicitation trial, each participant made five judgments based on event sequences, meaning there are 1250 choices, each made from the same set of five options. We evaluate the models' ability to predict participants' judgments for each of the ground-truth structures using softmax temperature parameters fit to data from the other four ground-truth structures. For the flexible prior models, we fit the four free prior parameters as well as the likelihood softmax temperature. For the elicited prior models, we fit a prior softmax temperature and the likelihood softmax temperature. In each case, we evaluate model performance in terms of the average cross-validated log-likelihood for the left-out ground-truth structure. This provides a rigorous test of generalization, as predictions are made for an unseen experimental condition, while avoiding potential issues with commonly used information criteria for model selection (e.g., Arlot & Celisse, 2010; Vehtari et al., 2017). As we have four different cover story conditions and do not posit systematic relationships between domains, we perform cross-validation on each cover story individually and subsequently aggregate the results. We also assess to what extent computing the elicited prior for each cover story separately or in aggregate aids in predicting people's judgments.
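Schematically, the cross-validation for the simplest model variant (uniform structure prior, one free temperature) looks as follows; this is only a simplified sketch, as the actual analysis additionally handles the per-cover-story splits, the prior parameters, and per-stimulus likelihoods.

```python
import numpy as np
from scipy.optimize import minimize_scalar

def loo_structure_cv(judgments, log_liks, n_structures=5):
    """Simplified leave-one-ground-truth-structure-out cross-validation for a
    model with a uniform structure prior and a single free temperature lam.

    judgments: list of (ground_truth_index, chosen_index) pairs
    log_liks:  dict mapping ground_truth_index -> length-5 array of log structure likelihoods
    """
    def neg_log_lik(lam, trials):
        total = 0.0
        for truth, choice in trials:
            w = log_liks[truth] / lam
            total -= (w - np.logaddexp.reduce(w))[choice]
        return total

    score = 0.0
    for held_out in range(n_structures):
        train = [t for t in judgments if t[0] != held_out]
        test = [t for t in judgments if t[0] == held_out]
        fit = minimize_scalar(neg_log_lik, args=(train,), bounds=(1e-3, 1e3), method="bounded")
        score += neg_log_lik(fit.x, test)
    return score  # summed cross-validated negative log-likelihood
```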

Model Comparison Results and Discussion

Table 1 presents aggregate (summed) cross-validation negative log-likelihood values for the different models. Overall, the delay likelihood with a uniform prior over causal structures gave the best predictions (as quantified by the lowest negative log-likelihood), outperforming all baselines and all other combinations of priors with order or delay likelihoods. In particular, incorporating elicited priors did not lead to better cross-validated predictions, neither when using the average elicited prior across cover stories nor with elicited priors computed separately by cover story. We note that the delay likelihood with elicited priors resulted in the same minimal negative log-likelihood, which can be explained by a very large value for the prior softmax temperature, which essentially transforms the elicited prior into a uniform prior. These results are in line with the idea that people use fine-grained delay information to infer causal structure rather than relying just on the order in which the observed events occurred. Figure 7 presents posterior probabilities per ground-truth causal structure when fitting the order and delay likelihood softmax temperature on all data simultaneously. The fitted values are λ = 4.02 and λ = 3.94 for order and delay model predictions, respectively. We discuss the finding that elicited priors did not aid in predictions in the general discussion.

Table 1 Model comparison results
Fig. 7 Posterior probabilities of the computational models for the stimulus sequences displayed to participants when fitting aggregate data comprising all judgments from Experiment 1 and Experiment 2.

Regarding model recovery, for the event sequences studied here, the delay model always recovered the ground-truth structure in our simulations (across five random runs), with the softmax transformation providing a slightly softened version of the original model predictions. The order model deviated systematically, as can also be seen in Fig. 7. This structure mis-identification by the order model is not surprising, as the Bayesian Ockham's razor penalizes structures whose predictions are less specific to the observed data. Specifically, a cyclic (alternating) sequence is also consistent with a unidirectional structure, where the directionality is determined by which variable activates first, but the latter is less flexible. The cyclic structure spreads its likelihood across sequences beginning with X or Y, leading unidirectional chains to be inferred instead under an order-only account. We observe a similar trend towards misidentifying causal cycles in the human judgments and elaborate on this further in the general discussion.

Individual Differences

Even though our aggregate results indicate that, overall, participants were best accounted for by the delay model, we may expect that people differ in their use of delay statistics over order heuristics, as well as in how reliably they choose structures that are best supported by the evidence, as captured by the softmax temperature described above. To better understand these individual differences, we used a probabilistic mixture model to identify different clusters of participants and to see which models (delay or order) and parameters (temperature) best explain the behavior in each cluster.

Specifically, we computed (hard) cluster assignments via expectation maximization (Dempster et al., 1977) conditional on numbers and types of clusters, and chose the number of order and delay clusters using the Bayesian information criterion (BIC) to penalize the complexity and flexibility inherent in larger numbers of clusters. The number of clusters was allowed to vary between 0 and 7 for order and delay clusters each, resulting in an overall number of clusters between 1 and 14. We re-ran the procedure 30 times with different initial parameters to mitigate getting stuck in local minima.
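The cluster-selection criterion can be sketched as follows (the BIC helper is runnable; the selection loop is only schematic, since it presupposes a fitted mixture, and fit_hard_em is a hypothetical function name, not part of our actual pipeline).

```python
import numpy as np

def bic(neg_log_lik, n_params, n_obs):
    """Bayesian information criterion (lower is better): penalizes the extra
    parameters that come with additional clusters."""
    return 2.0 * neg_log_lik + n_params * np.log(n_obs)

# Schematic selection loop; fit_hard_em would run hard-assignment EM for a given
# number of order and delay clusters and return the optimized negative
# log-likelihood together with the number of fitted parameters.
#
# candidates = [(n_o, n_d) for n_o in range(8) for n_d in range(8) if n_o + n_d >= 1]
# scores = {(n_o, n_d): bic(*fit_hard_em(n_o, n_d), n_obs=1250)  # 1250 judgments
#           for n_o, n_d in candidates}
# best = min(scores, key=scores.get)
```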

We find the lowest BIC for four order and four delay clusters (BIC = 2693.76). We present posterior model predictions for each cluster in Fig. 8 as well as confusion matrices of human judgments in each cluster. As presented in Appendix Fig. 16, the minimum is relatively flat, so we do not make strong claims about the exact number of clusters that would describe a general population. Looking at judgments per cluster in Fig. 8, we find several qualitative patterns: First, many participants are best described by a strictly or nearly normative delay account (47 and 10 participants in clusters 5 and 8, respectively). Overall, 150 out of 250 participants are better described by delay model predictions than an order-only account. Second, several participants tended to infer a unidirectional relationship when the ground truth was a cycle, typically treating the first event to occur in the sequence as the cause. These participants are allocated to both order and delay clusters (in particular clusters 1, 2, 3, 6, and 7). We discuss this effect in the “First-Mover Effects” section. A third pattern is that several participants inferred cycles when the ground truth was given by a common hidden cause (clusters 1, 2, 6, and 7), as discussed in the “Hidden Causes or Causal Cycles” section. Lastly, there are a number of participants who infer common hidden causes when events are independent (clusters 2 and 6, in particular). This may be attributable to a simplicity preference, in which people preferentially posit fewer explanatory variables (Lombrozo, 2007).

Fig. 8 Posterior model predictions and confusion matrices of people's judgments for all clusters.

Overall, this analysis indicates that a large proportion of people make inferences that are well-aligned with our normative expectations, and exploit both delay and order information to identify causal structure and hidden causes. At the same time, we have observed phenomena that raise the question of how one may enrich models of causal reasoning with temporal dynamics to capture the breadth of inferences people make; we discuss this further below.

General Discussion

The present results expand our understanding of how people exploit temporal information to identify causal structure. Our findings demonstrate that people use temporal cues, including both order and delays, to make generally appropriate inferences about structures that include hidden causes and cyclic relationships. We found that people make inferences that are broadly consistent with Bayesian inference from an uninformative prior based on inter-event intervals, doing so similarly across four domains. Furthermore, we considered a range of different assumptions that might drive these inductive inferences. Using a comparison of several probabilistic models, we found that people's structure judgments show a sensitivity not just to the order in which events occur, but also to the distribution of inter-event delays, extending results from previous research (Bramley et al., 2018). We did not find that people's elicited priors help in predicting their judgments. Furthermore, there were some notable differences between model predictions and participants' judgments, with implications for (1) ways people might make systematic errors in real-world causal inference problems; and (2) different assumptions people make about causal structures with temporal dynamics that are not captured by current accounts.

First-Mover Effects

In almost all combinations of cover story and structure, participants inferred the true generative structure more often than any alternative, with one conspicuous exception: participants more often judged causal cycles to be unidirectional causal links (X causes Y or Y causes X). What could explain this finding? One key consideration may be whether participants interpret the beginning of the video clips as an observed “window” onto an ongoing stream of events or as the actual beginning of the causal system dynamics. One possibility is the existence of a “first-mover effect,” i.e., that people view the first event they notice in an alternating sequence (with symmetric delays) as the “root cause” of the series of events, rather than inferring a cycle. We tested whether, when participants mis-identified a causal cycle, they were most likely to mistake it for a directed model in which the cause is the first variable to activate. This turned out to be the case 96% of the time, a significant deviation from chance (a \(\chi^2\) test of independence indicates \(\chi^2(1) = 142.235\); p < 0.001). Meanwhile, this inference error would also be partly consistent with an order-only account, as the likelihood for the cycle is spread across either variable starting the sequence. From this perspective, people may interpret the alternating sequence as a repeated activation of the cause on its own (i.e., the variable starting the observed sequence), which triggers the effect events. Alternatively, people may have inferred that there is a cycle, but simply reported the variable that activated first, thereby starting the cycle.

Future work may probe these potential biases further, for instance assessing real-world situations in which a participant observing an ongoing symmetric cyclic process views one component as the cause simply because it was the first to occur once they started observing the process. It would further be useful to investigate to what extent this finding can be explained by delay accounts with a first-mover bias or by an order-only account, for instance by altering the stimulus delay parameters or providing a “fade-in” period, in which the start of the video clips slowly increases in luminance, or otherwise making it clear that learners are observing a causal system that is already “running” when they start observing it. Additionally, it could be tested whether the effect is more pronounced when only little data is available and diminishes with longer stimulus sequences. This departure from model predictions warrants further investigation and raises the question of inductive bias, that is, to what extent human judgments depend on background knowledge and domain-specific expectations about the causal system under consideration.

Hidden Causes or Causal Cycles

Another pattern that emerged in some clusters was a tendency to choose causal cycles when the ground truth was a common hidden cause. One explanation for this observation is that some participants, rather than treating the events as a single continuous stream, may partition events—especially when these events occur in bursts separated by longer delays—into sets of distinct episodes. In this scenario, there are cases where Y follows X after a very short delay—suggesting an \(X \rightarrow Y\) edge—as well as cases where X follows Y after a very short delay—suggesting a \(Y \rightarrow X\) edge. In combination, these edges imply a causal cycle. Exploring this possibility is an interesting prospect for the future, but doing so would require an account of how and when people partition continuous time sequences into discrete events, and potentially more complex causal models.

Seeing Structure in Independent Event Sequences

Participants sometimes inferred a relationship between X and Y when the two variables were independent, particularly viewing CHC and cycles as plausible explanations in the independent conditions. What explains this finding of people seeing structure where there is none? One possibility is that people might infer a CHC when there are several instances of X and Y occurring in close temporal proximity, perhaps to the point of perceiving simultaneous events. Alternatively, if events are farther apart in time, people may infer a causal cycle, following the speculation above that people partition sets of events into episodes. Additionally, a prior belief about a CHC with an unreliable delay distribution from hidden to observed events could also lead to patterns that resemble independent sequences. We did not observe distinctive differences in inferences for the different instances of independent sequences presented to participants, and a systematic investigation of this mis-identification would require more instances of stimulus sequences, possibly with explicit manipulations. From the modeling side, exploring this further could be aided by relaxing the assumptions that causes invariably lead to their effects and that there are no outside influences beyond those represented in the causal structures, as we also discuss below. More broadly, previous work has established that people sometimes see structure where there is none (Williams & Griffiths, 2013; Griffiths & Tenenbaum, 2007; Blanco, 2017), and explanations from this line of research may be fruitful directions for future theoretical work.

Hidden Causes with Tied Parameters

Our delay model assumed that the delay distributions mapping from hidden causes to observed variables have tied parameters, encoding the belief that they share a common causal mechanism. For example, if two bioluminescent bacteria are responding to the same environmental stimulus, e.g., a shock, it is plausible that the mechanisms driving their responses are similar. Consequently, we might expect that the distribution of shock-to-response delays across these bacteria resembles (or matches) the distribution of delays for a single bacterium responding to multiple shock events over time; this is naturally formalized by coupling the parameters of the delay distributions. In contrast, if we have a bacterium and a mechanical sensor that are responding to the same shock event, there is little reason to infer a similar mechanism is at work, and it may thus be more reasonable to assume distinct, uncoupled delay distributions. Between these two extremes, one might assume a weak coupling between the parameters, e.g., implemented via a hierarchical model (Lucas & Griffiths, 2010).

While the CHC with tied parameters leads to an expectation of observed events being clustered together in time (with no systematic temporal precedence of one event type over another), allowing for untied delay distributions creates identifiability issues. For instance, a CHC may look like a unidirectional structure between the observations when the delay distributions differ in their expectation and have sufficiently low variance to follow an alternating order. Generally, untied parameters mean that the CHC structure receives an additional set of parameters and this increased complexity is penalized due to the Bayesian Ockham’s razor (Myung & Pitt, 1997).

As a supplementary analysis, presented in Appendix 7, we assessed a delay model that did not assume tied parameters for the CHC structure. As expected, untied parameters lead to lower posterior probability for the CHC when the CHC is the ground truth, but do not change the results for the other structures at the numerical precision we report. Importantly, this mis-identification does not seem to capture people’s deviations from our predictions, and leads to lower scores in the global cross-validated model comparison as compared to our original results with tied CHC parameters. However, exploring this assumption with manipulated context information may be a valuable direction for future work, in particular in relation to prior expectations about weak coupling between parameters, as could be formalized using a hierarchical model.

Representing Inductive Biases

A general question in human causal learning concerns the role of domain-specific expectations, e.g., about what structures are plausible in particular domains, and finer details such as how quickly or reliably a cause might bring about its effect. In our experiments, neither elicited priors nor fitted priors over causal structures helped in predicting people’s judgments when combined with delay model likelihoods (Table 1). This is consistent with participants having weak and idiosyncratic beliefs about plausible causal structures, but in light of our sample size and the observation that there were systematic preferences in our prior elicitation condition, it was nonetheless surprising that elicited priors did not help in predicting people’s judgments.

How can we explain this mismatch, which, at first glance, could be taken to suggest that—contra many previous accounts of causal reasoning—people do not integrate their prior beliefs and evidence in a probabilistically coherent way? One possibility is that our assumption that people would probability-match in the elicitation condition, i.e., make discrete choices with probability proportional to their degrees of subjective belief (e.g., Costello & Watts, 2014; Acerbi et al., 2014), was unwarranted—perhaps participants' judgments reflected a policy more like maximization, and their real prior beliefs were nearly uniform. An alternative possibility is that our within-participant design led to systematic effects that were orthogonal to participants' actual prior beliefs: That is, contra our instructions but consistent with our actual design (and common practice), participants may have expected to see each causal structure at least once. Additionally, the influence of context information on people's inference may come to bear only when people have stronger mechanistic intuitions about the causal relationships, and the cover stories we considered may not be associated with sufficiently strong prior beliefs. Future work, including between-subjects designs for prior elicitation and manipulations drawing on the literature on prior elicitation, also in statistics (e.g., Stefan et al., 2022; Barrera-Causil et al., 2019), and work on the role of context information in causal cognition (Buehner & May, 2002; Buehner & McGregor, 2006b; Buehner & May, 2003; Griffiths & Tenenbaum, 2009; Schlottmann, 1999; Schlottmann et al., 2002) may help resolve this issue and shed light on causal cognition more generally. While our analysis and the previous discussion largely focused on structure-level biases, people might plausibly expect different causal mechanisms—and, by extension, cover stories—to entail different delay distributions. Exploring people's inductive biases about delay distributions in causal mechanisms and how they inform structural inferences is an important direction for future work.

Towards Richer Causal Models

For the sake of simplicity, all of our experiments dealt with event types that had at most one cause, e.g., a bacterium might illuminate due to a hidden cause or the activity of the other bacterium, but not both. Additionally, in the present study, our generative model was stochastic in terms of when causes brought about their effects, but influences were deterministic in their causal strengths. That is, the occurrence of a cause invariably led to the occurrence of its effect. In the real world, however, causes can sometimes fail, thereby introducing additional complexity and ambiguity into the learning task. We also make the assumption that there are no additional outside causes beyond the ones posited by the respective causal structure that can trigger effect events (except for the common hidden cause that invariably affects both X and Y ). Studying human learning in the face of stochastic causal relationships and variable delays would thus be an interesting direction for future research, but would also severely complicate the inference problem, which is already challenging, as we are marginalizing over all unknown quantities.

Furthermore, we dealt with a strictly observational setting and future work may probe how people intervene on dynamic causal systems (e.g., Gong et al., 2022) when the set of causal structures may include hidden causes, following a rich literature on interventions in static human causal learning (Steyvers et al., 2003; Coenen et al., 2015; Bramley et al., 2015).

More generally, future work may benefit from including conditions with alternative judgment elicitation paradigms, such as free-form drawing of nodes and edges to describe causal relationships, as well as qualitative, natural language, descriptions.

Conclusions

We have presented the first study investigating the role of temporal information in how people discover hidden causes and causal cycles. We conducted two novel experiments that covered different domains and tested several computational models. We took an ideal-observer perspective to examine how people go from observed sequences of events to inferred structures. We found participants were broadly consistent with a Bayesian account and were able to use order and timing information to identify causal structure, including the presence of a common hidden cause where appropriate. Meanwhile, several groups of participants showed systematic patterns of judgments that deviated from all of our models, which suggests directions for future experiments as well as opportunities for enriching the causal models we attribute to human learners. These findings expand our understanding of how people learn about causal structure from an ongoing stream of observed events with temporal dynamics and open up several potentially fruitful avenues for future research.