Introduction

Since Heider and Simmel's pioneering demonstration of 1944, it had been acknowledged that simple geometrical shapes, despite their inanimate character, can express aliveness, emotions and intentional attributes by virtue of their motion.

The impression of aliveness and meaningfulness of these events is surprisingly vivid, immediate and compelling (see Scholl & Tremoulet, 2000 and Scholl & Gao, 2013, for reviews), and many studies demonstrate they are strongly related to low-level visual processing, despite the seemingly higher-level impressions they prompt. According to several authors, we perceive animacy and intentionality automatically, in the same manner that we perceive spatial depth or optical-geometrical illusions (Scholl & Gao, 2013).

The emergence of animacy in moving shapes has been often related to minimal kinematic cues like self-propelling motion in the absence of an external cause, thus implying the perceptual evidence of a hidden energy source (Premack, 1990; Dasser, Ulbaek, & Premack, 1989; Markson & Spelke, 2006; Csibra, 2008), and other violations of the conservation energy principle, such as increased acceleration and/or heading (Tremoulet & Feldman, 2000), the C-shaped or S-shaped path of the trajectory (Stewart, 1984; Blythe, Miller, & Todd, 1996, 1999; Gelman et al., 1995), speed and direction changes (Szego & Rutherford, 2007; Visch & Tan, 2009; Träuble, Pauen, & Poulin-Dubois, 2014), or non-rigid, by rhythmic expansion-contraction, motion (Michotte, 1946/1963; Schlottmann & Ray, 2010), and by mimicking the motion of naturally occurring stimuli (Schultz & Bülthoff, 2013).

Even if such minimal kinetic variations alone do induce impressions of animacy and goal-directedness, another stream of research has focused on the role of spatiotemporal contingencies between two or more moving objects in generating animacy, as in the Heider & Simmel's cartoon-like demonstration (1944) and in the Michotte's trigger effect (1946/1963), including more recent works (Bassili, 1976, Dittrich & Lea, 1994; Schlottmann & Surian, 1999; Schlottmann et al. 2002; Pavlova, 2012; Schultz, Friston, O'Doherty, Wolpert, & Frith, 2005). Some authors hypothesized that “displays that contain two or more moving objects appear often more compellingly animate than those depicting only one object, perhaps because each object provides a more rich and variable “context” for the other" (Heberlein 2008, p. 366). Consequently whenever a simple supporting context like another small geometrical object is added into the same display, animacy ratings significantly increase (Tremoulet & Feldman, 2006).

In most of these configurations, moreover, beside animacy, viewers tend also to attribute various emotional reactions and intentional states to the moving objects, such as goal-directedness or chasing, avoiding, fighting, playing and other kinds of social interactions (e.g. Kanizsa & Vicario, 1968; Asch, 1952; Opfer, 2002; Csibra, 2008; Dasser et al., 1989; Gao, Newman, & Scholl, 2009; Rochat et al., 2004; Tremoulet & Feldman, 2006; Gao, McCarthy, & Scholl, 2010; Gao & Scholl, 2011; Gao, New, & Scholl, 2017; Dittrich & Lea, 1994; Blythe et al., 1999; Barrett et al., 2005; Schlottmann, Ray, Mitchell, & Demetriou, 2006; Parovel & Guidi, 2015).

To emphasize the role of relational and contextual cues in the emergence of animacy and mental states, some authors refer to the intentionality hypothesis, which claims that the environment where an action is observed may influence the observer's perception of animacy (Baker et al., 2009; Csibra, 2008; Gao & Scholl, 2011; for reviews see Tremoulet & Feldman, 2006; Gao & Scholl, 2013).

However, there still coexist diverging views in the recent literature. (1) Intentionality itself, that is the appearance of psychological attitudes in the moving object, has been related to goal-directedness, i.e., moving towards something (Gao & Scholl, 2011; Blythe, Miller & Todd, 1996), even if the goal is out of the screen (Tremoulet & Feldman, 2000), (2) while other authors have suggested instead that goal-directedness can be an effective cue in improving animacy, but not intentionality. They define intentionality “in the sense of higher-level orders of intentionality”, that is the presence of mental states (Heberlein, 2008; Opfer, 2002). In the "intentional reaction" (Kanizsa & Vicario, 1968; see Schlottmann et al., 2006), for instance, a square is described as "seeing" an approaching square and escaping-away from it. Note that, in this case, the object shows a form of awareness even if it is moving away, and not towards, something. Moreover (3), according to other authors, animacy itself depends on intentionality, and so there cannot be animacy without intentionality (Dittrich & Lea, 1994; Tremoulet and Feldman, 2000), while, according to others (4), animacy and intentionality are separate constructs (Premack, 1990; Gao & Scholl, 2013). An intermediate position (5) has been proposed by Visch and Tan (2009), stating that animacy could be a basic perceptual property that can be integrated with more complex impressions like emotions and intentions. In this sense, intentionality and goal-directedness could be conceived as "related properties" of animacy (Gao & Scholl, 2011) that may be triggered by specific spatiotemporal contexts.

One study specifically centred on the role of context has been reported by Tremoulet and Feldman in 2006. Their displays showed a single figure (rectangular or round) moving on a screen and changing both speed and trajectory simultaneously, while a static object (dot-foil or rectangular paddle) was placed in different positions defining several behavioural conditions of the target, like moving toward a prey, or away from a predator, or being an obstacle. Tremoulet and Feldman found a small but significant effect of the context on animacy ratings, particularly in the goal/prey conditions, and a strong effect of speed change, which replicated previous results from the same group (Tremoulet & Feldman, 2000). In one of their experiments they also adopted dynamic contexts, to create a temporal contingency between the paddle and the target's change in velocity, but they did not get a significant increment in animacy ratings, even if the goal condition was rated higher than the irrelevant condition. They suggest that a key factor in the perception of animacy is the attribution of an intention to the object, an intention than can be triggered by speed increase and change in direction alone, but that can also be specified by a supporting context, leading the observer to infer the cause of its motion (see Gelman, Durgin & Kaufman, 1995).

This hypothesis is also supported by a long tradition of research in this topic in experimental phenomenology, according to which we identify causal and social relations on the basis of the kinetic structure of the global configuration, i.e. the way in which two moving objects interact (Michotte, 1946/1963; see Asch, 1952; Bozzi, 1969, 1990; Wagemans, Van Lier, & Scholl, 2006; Costall, 1991; Thinès, Costall, & Butterworth, 2015).

The aim of the present research was to explore further the role of different spatiotemporal contexts on the perception of animacy and other related properties—such as emotional states and intentionality—in two-dimensional moving objects. In particular, we focussed on simple basic interactions between the target and a second object acting as a context (similar to Michotte’s causal displays), and planned to compare displays where the context is neutral with displays where the nature of the context is specified as an animate agent or as a lifeless object. We expected that both the type of context and the relative direction of the target’s motion with respect to the same context, i.e. approaching vs avoiding, may influence the observer’s perception of animacy and intentionality.

Experiment

The experiment was designed to investigate if, and how, different kinds of context differ from each other in influencing the impression of animacy elicited by a moving object. The context consisted of a static or moving object, having the same shape of the target object, which could show an animate-like or a physical-like trajectory.Footnote 1 We also contrasted the opposite relative directions between the target object and the contextual object, with the goal to compare approaching vs avoiding displays, keeping unchanged the other stimulus conditions. In simpler words, the contextual object was placed before or after the path of the target object. Finally, we also varied the type of motion of the target on the screen introducing animacy cues in its trajectory by making it move in a way reminiscent of a caterpillar, as did Michotte (1946/1963) in his demonstrations on selfpropelled animal locomotion. In this way we could investigate the relative weight of external (i.e., context) and of intrinsic (i.e., motion type) factors in the perception of animacy.

Most of the studies of perceived animacy reported in the literature used an explicit rating task to measure the perceived animacy. We decided to complement rating tasks with 2-alternative forced-choice (2AFC) tasks, in which participants had to choose between two stimuli the one in which the target object seemed more alive. Requiring very spontaneous and intuitive responses, in fact, the forced choice method allows us to obtain results based more on immediate perception than those obtained by a scale that asks for explicit judgements about the impression of animacy (see Parovel & Guidi, 2015, for another application of this method in a related investigation).

Finally, to determine the effect of different contexts on the emergence of properties related to animacy, such as emotions and intentions, we also recorded the phenomenological reports of participants at the end of the experimental session.

Methods

Participants

Participants were 60 university students (42 females) with a mean age of 28 years (SD = 3.6). All participants reported normal or corrected-to-normal vision. All gave their written informed consent in accordance with the Declaration of Helsinki, and were debriefed at the end of the experiment about the purpose of the study.

Design

The stimuli were designed starting from two basic configurations in which a small black square, initially static on the screen, briefly moved horizontally at constant speed, either from left to right or vice versa, and then halted (Fig. 1a,b). Six variations were then devised by introducing a second grey square that provided a context for the motion of the black one. In the static context conditions, the grey square did not move, and it was placed either to the left of the initial position of the black square, so that the latter moved away from it (Fig. 1c), or to the right of the final position of the square at the end its path, so that the black square moved toward the context element (Fig. 1d). In the physical context, the grey square moved downward vertically from an initial position at the top left of the screen or at the top right of it, so that it seemed to fall either next to the starting position of the black square (that thus moved away from it; Fig. 1e), or next to the final position of it (so that the black square moved toward the grey one; Fig. 1f), bouncing twice when it reached the same horizontal position of the black square. Finally, in the animated context conditions, the grey square moved in a way that suggested it was animated, changing its shape during motion, alternating phases of contraction to phases of expansions, thus resembing a caterpillar moving along a linear path. In one of these conditions (Fig. 1g) the grey square seemed to follow the black one, which in turn seemed to move to escape from it. In the other condition (Fig. 1h) the grey square moved toward the black square, which in turn moved toward it, as if it wanted to approach it. Eight further configurations identical to the previous ones were then produced by introducing animacy cues in the motion of the black square, making it move while its shape changed alternating phases of contraction and expansion (examples for the static context condition only are depicted in Fig. 1i–j). Overall 16 stimuli were produced corresponding to all the possible combinations of three factors: type of motion of the black square (two levels), context (four levels) and direction of motion of the square (two levels).

Fig. 1
figure 1

Schematic depictions of the stimuli used in the experiment. In each panel are represented the relative positions and directions of motion of the target (black square) and of the context element (grey square) at different consecutive time-points during an animation. a, b No context element; cd, ij context square is static; gh, ef context is dynamic, and the context moves, respectively, in an animate fashion (contracting and expanding while moving) and falling down from above simulating (physical context). In ah the black square moves linearly, while in ij it moves in an animate fashion (contracting and expanding while moving). The full set of displays are available as supplementary material

The experiment consisted of 72 trials, divided into three blocks. The first two blocks comprised 28 trials each, corresponding to the all the possible pairwise comparisons of the eight stimuli in which the type of motion of the black square was the same. Approximately half the participants first saw in the first block the stimuli in which the square moved linearly, and the other half first saw the stimuli in which it moved like a caterpillar. In the third block, all 16 stimuli were shown individually, and participants gave free descriptions and rated each of them for the level of animacy displayed by the black square. The order of trials was randomized within each block.

Stimuli

The stimuli were short animations realized in Adobe Flash Professional CS 5.5 and presented as Quicktime™ displays. They had variable durations depending on the experimental condition (from 3000 ms to 4000 ms), and consisted in either one or two black and grey squares (side = 0.5°) inside a white rectangle (width = 12.2°, height = 8°) placed at the centre of the screen against a blank background (see Fig. 1).

At the onset of the stimuli, the black square was placed on the horizontal midline of the rectangle, at 3° from its left side in most conditions (in the animated contexts it was placed at 0.75° from left side of the border, or in the centre of the rectangle, respectively, in the approach and in the escape conditions). Shortly after the onset (from 660 ms to 1160 ms after, depending on the condition) the square started to move horizontally toward the left at constant speed (3.72°/s), halting after a displacement of 6.25°. The grey square (i.e., the “context” of the black one) was either absent, static or dynamic, depending on the experimental conditions. When it was dynamic, it started from a 0.75–3° distance from the white rectangle’s left side on the same horizontal line as the black one (and it would then move horizontally on this line), or from a 0.75° distance from the rectangle’s top (moving then down up to its midline) with a speed of 3–3.72°/s. Its (final or static) position was always next to the black square’s initial or final position (distance = 0.02°). At the onset, and at the end of the stimuli, both squares stood still for 660 ms; in the experimental condition in which both moved, the black square started with a 500 ms delay after the grey one. The full set of displays are available as supplementary materials.

The stimuli were presented on a LED ultrabook monitor, with a screen measuring 13.3 inches diagonally, and participants viewed them from approximately 60 cm, with unrestricted head and eye movements.

Procedure

The experiment consisted of two parts, the first devoted to 2AFC tasks, and the second to descriptions and ratings.

At the beginning of the first part, participants were instructed about the nature of the task. They were told that, on each trial, they would see two simple animations, one after the other, and that their task was to choose the one in which the black square appeared more animated, i.e., gave the more vivid impression that it were alive. Two blocks of trials were run, each devoted to a type of movement of the black square (i.e., linear or caterpillar-like). At the beginning of each trial, the screen remained blank for 500 ms, and then the first display was played. After the first display stopped, the squares were removed and the white rectangle remained empty on the screen for other 1500 ms, before the second display was played. After the second movie stopped, the screen turned blank again, and instructions were shown to the participants: they were told to indicate the movie in which the black square had seemed more animate to them, and to press the left key if they considered the first movie to be more animate, and the right key if they thought the second movie was more animate. After completing a block of trials, participants were given the opportunity to take a break if they desired.

In the second part, participants were shown the displays individually and they were then asked to describe them freely. They were asked to give spontaneous descriptions and to specify, where applicable, the type of interaction between the two elements. The descriptions of participants were recorded on paper by the experimenter to be further analyzed. After eliciting the description of the stimulus, participants were asked to rate the degree of animacy they perceived in the black square, on a 7-point Likert-scale, where “1” indicates no impression of animacy, “7” a very clear impression of animacy.

Stimuli presentation and data recording during the first part of the experiment were controlled by PsychoPy2 v1.80.01 software running on an Acer Aspire S3.

Results

Forced choice data

Paired comparisons data were analyzed using the Bradley-Terry-Luce scaling method (Bradley & Terry, 1952; Luce, 1959) to derive estimates of the perceived animacy of the target in the different configurations on a ratio scale level. Given that participants only compared in 2AFC tasks stimuli in which the type of motion of the target was the same, separate analyses were conducted to estimate the perceived animacy of the target moving linearly and of the one moving in a caterpillar-like fashion. The validity of the assumptions concerning the consistency of participants’ judgements underlying these probabilistic choice models was tested in the following way. We first counted the number of violations of the Weak Stochastic Transitivity assumption, required to place stimuli on an ordinal scale (Tversky, 1969).Footnote 2 No violations of the WST were observed, either in the model for the linear target, or in the one for the caterpillar target, so we did not conduct formal statistical tests. The more stringent assumption of Strong Stochastic Transitivity (SST),Footnote 3 needed to derive estimates on a ratio level scale, was checked by comparing the likelihood of the fitted choice models to the likelihood of the corresponding saturated binomial models that fit the data perfectly, using likelihood ratio tests. In both cases, the tests was not significant [linear movement: G 2(21) = 23.2; P = 0.33; caterpillar movement: G 2(21) = 20.5; P = 0.49], indicating a good fit of the model, and showing that violations of SST were random rather than systematic, and thus that the model assumption was respected. Finally, we tested for stimulus equality by comparing with likelihood ratio tests the fitted models to models in which the parameters were set as if the probability of any choice among two stimuli was 0.5. For both types of motion this test was significant, showing that at least two stimuli were considered different in animacy from each other [linear movement: G 2(7) = 192.5; P < 0.0001; caterpillar movement: G 2(7) = 269.9; P < 0.0001].

The perceived animacy in the different conditions is plotted in Fig. 2, with error bars representing 95% confidence intervals. As the plots show, first of all, both in the case of the linear target, and in the one of the caterpillar, the perceived animacy in the control and in the static context conditions was approximately the same, and significantly lower than in the dynamic contexts. Moreover, in the control, and in the static context conditions, there was no difference in animacy as a function of the direction of motion. Conversely, in the dynamic context conditions, the perceived animacy was higher when the target moved away from the contextual element than when it was heading toward it.

Fig. 2
figure 2

Perceived animacy scale values estimated from the paired comparison data, as a function of context type and direction of motion. The estimates are normalized so that, for each plot, the values sum to 1. Values are on a ratio-scale. a Plot of animacy values derived from paired comparisons of stimuli with a linear target. b Corresponding values relative to the caterpillar target. Error bars 95% confidence intervals of the scale values. Dotted line Location on the scale where all the stimuli would lie if, for each pair of stimuli, the probability of choosing one over the other as more animate were 0.5 (i.e., if there were no difference in the perceived animacy)

Further tests were conducted to assess whether there were differences in the animacy perceived by males and females. Following Duineveld et al. (2000), we compared with likelihood ratio tests, a model in which the parameters (i.e., the scale values) were restricted to be equal across groups to a more complex model that allowed different parameters for each group. Neither in the case of the linear movement of the target [G 2(7) = 6.28; P = 0.51] nor in the one of the caterpillar movement the test was significant [G 2(7) = 10; P = 0.19], showing that perception of animacy was not different in males and females.

Likert scale ratings

The ratings of animacy were analysed in a 3-way ANOVA, including type of movement of the target (linear vs caterpillar), context (four levels: no context, static context, physical context, animate context), and direction of motion with respect to the context (toward vs away) as within-subject factors. The results show significant effects of the type of movement [F(1,44) = 15.49, P < 0.001] and of the context [F(3,132) = 26.73; P < 0.001]. The three-way interaction was also significant [F(3,132) = 2.95; P < 0.05]. No other factors or interactions among factors were significant.

The average animacy ratings are plotted in the top row of Fig. 3, as a function of context (a) and type of movement (b). As can be seen, the pattern of means first of all shows that ratings were higher in the dynamic contexts than in the static one, where, in turn, ratings were higher than in the control no-context condition, consistent with the results of the paired comparisons. Secondly, the ratings of the animacy of the target were higher when it moved in an animated fashion (caterpillar) than when it moved linearly (difference = 0.64).

Fig. 3
figure 3

Plots of the average ratings of the target’s animacy in the different conditions. Top row Average ratings by context (a) and type of motion of the target (b). Bottom row 2-way interaction plots of context and type of motion for the target moving away from the context (c) and moving toward it (d). Error bars 95% confidence intervals of the means adjusted to support inferences about the differences between the means. Dotted line Midpoint of the scale

We followed up the significant 3-way interaction conducting two sets of separated 2-way ANOVAs, splitting data in one case by type of motion (including context and direction as factors), and in the other by direction of motion (including context and type of motion as factors). In the first set of analyses, we always found a significant effect of context only. As shown in Fig. 3 (c–d), in fact, for both types of movement, the direction of motion did not seem to influence the average rating of the target’s animacy. The ANOVAs on the data relative to the target moving away from and toward the context square, instead, revealed both significant main effects of context and type of motion and, for the target moving away from the context element, also a significant interaction [F(3,132) = 3.52; p < .05]. Further paired sample t-tests showed that only in the physical context, and for a target moving away from the context, animacy ratings in the caterpillar motion condition were not significantly higher than ratings in the linear motion condition.

Free reports

To analyse free reports, we first applied thematic coding to each description given by participants of the stimuli. Two ad hoc categorical variables were devised: one (animacy) encoded whether the target was described as an inanimate object or as a living creature, and, in the latter case, also whether the description entailed animacy on an automatic (e.g., “walking”), instinctive (e.g., “flight”) or mental level (e.g., a “person”, who “wants to reach something”). The second variable encoded instead the emotional valence of the description of the behavior of the target square, on three levels: negative, neutral/no emotion, and positive. We then computed the frequencies of occurrence of the different categories in the overall set of descriptions, and used diverging stacked-frequencies plots to visualize the results. These plots are a variation on the stacked bar chart commonly used to display the frequency of responses in surveys, where each stack represents the frequencies of replies to a question (with replies usually sorted, say from “strongly disagree” to “strongly agree”). In the diverging stacked bar plot version, the replies (i.e., the response categories) are positioned horizontally so that negative responses are stacked to the left of a vertical baseline and positive responses are stacked to the right of this baseline. In case of an odd number of categories, the stacks are centred on the middle category (generally representing a “neutral” response).

Figure 4a and b display, respectively, the frequencies of the animacy types and the emotions attributed to the target in the free reports. For each context, type and direction of the target motion, the length (horizontally) of the colored bars represent the percentage of reports in which the target animacy or emotional valence were described using the respective categories. For example, 61% of participants described the target moving linearly left-to-right in the static context as inanimate, while the remaining 39% described it as mainly either as “animate, automatic” or “animate, mental”. Moreover, 96% of participants described the valence of the emotion of the target in this condition as “neutral/none”, and the remaining 4% ascribed to the target either a positive (2%) or a negative (2%) emotion. As can be seen, for both the linear target and the caterpillar one the proportion of descriptions of the target as being somehow animated was lowest in the control conditions, and highest in the dynamic context conditions. In the control condition, the proportions of description of the target as being animate were higher when it moved like a caterpillar (70%) than when its movement was linear (39–44%). This was also true in the static context, although the proportions of animacy attributions were higher than in the control condition, and substantially above 50% for both types of targets. In the dynamic context, only a small percentage of participants judged the target to be inanimate (7–12%); moreover, the majority of subjects described the target’s animacy as either instinctive or mental. Interestingly, the proportions of descriptions of the target as being driven by mental states was higher when it moved toward the context square than when it moved away from it.

Fig. 4
figure 4

Diverging stacked bar plots of the frequency of (a) the types of animacy attributed to the target, and (b) the emotions ascribed to its behavior in the free descriptions of the stimuli, as function of the type of movement of the target (Linear vs Caterpillar). For each context, type, and direction of the target motion, the length (horizontally) of the colored bars represent the frequency (percentage) of occurrence of the various categories for the target animacy (sand/left inanimate, green/right animate, shades of green types of animacy) and the emotional valence (red/left negative, grey/middle no emotion or neutral, blue positive) in the free reports. Each stack of bars has a total length of 100%. The side toward which the stack is displaced with respect to the chosen baseline or the central category reflect the prevalence (i.e., higher frequency) of categories of a given polarity over the other. The frequency of each polarity is presented in the margins, as annotations at the ends of the scale. See text for further details

Looking at the plots of the emotions attributed to the target behaviour, we can notice first of all that, in the control condition (no context), almost all participants did not attribute an emotional state to the target. Secondly, in the other contexts participants tended to attribute a negative emotion to the target when it was moving away from the context, and a positive one when it was approaching it. The percentage of participants describing the target with reference to an emotional state, finally, was highest in the dynamic context when the target was moving toward the context square.

Discussion and conclusions

The main results show that the attribution of animacy is significantly influenced by the features of the context where the target object is moving. Even in presence of clear cues of biological motion, that is when the target object moves in a "caterpillar-like" manner, there do still exist different levels of perceived animacy on the basis of different contexts, instead of an even distribution of the levels of perceived animacy, as we would expect if animacy were an all-or-none property. To vary the context we adopted a simple stimulus identical to the target, located at the beginning or at the end of its trajectory, in static or dynamic conditions (physical or animate).

Our main results indicate that, in both the forced-choice and the Likert-scale tasks, there is a significant difference between static and dynamic contexts, where the dynamic contexts prompt a quite clearer impression of animacy, while for the physical and the animate contexts there are similar results in almost all conditions. We also found a significant difference in the impression of animacy between the control situation without context and all the other conditions.

As a possible explanation of this first finding, we suggest that, independently from the specificity of the relationship between the target object and the context object, a dynamic context produces a spatiotemporal contingency comparable to Michotte's paradigm of perceptual causality. Like in the launching paradigm, in fact, the target object starts to move as a reaction to the behaviour of another object, either moving closer/escaping or bouncing nearby. On the other side, the static context object could be inferred as being the cause of the target object’s motion, but without temporal constraints this causal impression is much less vivid and compelling.

A second meaningful new finding emerged from the paired-comparisons data, concerning the role of the direction of the target's motion with respect to the context, while in the control and in the static context condition, there were no differences in the perceived animacy of the target as a function of its direction of motion, in the dynamic contexts we consistently found that the impression of animacy was higher when the target was moving away from the context element than when it was approaching it.

This effect was not found in the ratings data. Possibly this discrepancy is due to the lower sensitivity of the method relative to paired-comparisons, which, forcing the viewer to make a choice, allows the identification of even small differences that would otherwise pass unnoticed. Thus, it supports the effectiveness of adopting the forced-choice method, which avoids enquiring about an explicit quantitative judgement from the participants, in measuring more qualitative and subtle impressions of motion. Besides corresponding to the other measures based on the forced-choice task, the Likert ratings show that the displays chosen as animate in the forced-choice really seem animate, by themselves and not only in relation to each other. We must, however, note that neither the paired comparison data nor the ratings allow us to pinpoint the level of the processes behind our results, and thus to tell whether they are more due to automatic low level perceptual processes or to higher-level conscious inferential cognitive processes. However, at least the finding that perceived animacy was equally low in absence of context and in a static one, does not seem to be easily explained by inferential processes, as the static context displaying a goal-directed motion would seem to support inferences about intentionality and animacy that would not be possible without context.

For a better understanding of the results, we referred also to the phenomenal reports of the participants. First, spontaneous descriptions show that the increased animacy in the dynamic contexts corresponds to a parallel increasing of the use of terms generally related to animate behaviors. Within this category, moreover, participants freely described the moving target in very different ways, specifically related to the different contexts. This revealed a range of animate impressions that, together with the aforementioned results that animacy is perceived more or less clearly in different conditions, according to us confirms the hypothesis that there can be different facets or “levels” inside the animacy concept—for instance, an automatic animacy, an instinctive one and a mental/emotional one—and that the context plays a crucial role in making them evident.

Animacy, according to us, can be conceived as an expressive quality of motion, which is articulated in more general characteristics, like simple aliveness, and more specific ones, like intentional or emotional aspects. The overall results corroborate the theoretical assumption according to which intentionality and other emotions are "related properties" of animacy (Visch & Tan 2009; Gao & Scholl, 2011).

Secondly, the relative direction of the target motion strongly influenced the free descriptions of the participants. The "moving away" condition is related to negative emotions, while the "approaching" condition generally evokes positive emotions, and, in general, the avoiding behavior is more often described using emotional terms. In addition, comparing the descriptions of the animate motion, it is more "instinctive", i.e., reflexive an animal-like behavior, in the "moving away" condition, and more mentally driven in the "approaching" condition.

As we can see, for some aspects the qualitative reports can be enlightening in suggesting some possible explanations of the data. The "moving away" from something or someone may look more animated for evolutionary reasons, because of the more adaptive meaning of escaping, or for a general predisposition to note emotionally negative events. The negativity bias, known in different areas of cognition and attention, in fact, predicts a higher sensitivity to negative events. Therefore, both “real” and schematic-geometrical events with a negative content or outcome, like, e.g., chasing, according to Heberlein (2008) are easily perceived as more intentional. Also, with Johansson's dynamic of point-light displays, anger and fear are easier to identify than other emotions (Walk, 1984). Nevertheless, further studies and the development of more objective paradigms in this field of research will be necessary to corroborate this hypothesis, and explore how these findings can be generalized and applied in different contexts.

Phenomenal descriptions support also a relationship between the attribution of intentionality (i.e., the presence of mental states) and the direction toward something/someone, that is goal-directedness (Gao & Scholl, 2011; Blythe, Miller & Todd, 1996; Tremoulet & Feldman, 2000), and an independency between animacy and intentionality, as some authors have claimed (Premack, 1990; Gao & Scholl, 2013). Also, neural recordings recently gave strong support to the independent constructs hypothesis, showing that the areas involved in animacy/goal-directedness perception on one side and intentionality on the other [i.e., temporoparietal cortex, including areas in and near the posterior superior temporal sulcus (pSTS) and angular gyrus, particularly in the right hemisphere] are topically the same but manifest different degrees of activation with different patterns (Gao, Scholl & McCarthy, 2012), leading to a functional distinction among these concepts, even if they must be strictly interconnected (see Heberlein, 2008).

In conclusion, we suggest that at least two main different factors seem to be effective in the perception of animacy and intentionality. On one side, at the same conditions, as the reviewed literature showed, kinematic variables such as sudden speed accelerations and high values of speed, play a crucial role in evoking and intensifying the impression of animacy (see Tremoulet & Feldman, 2000, 2006). On the other side, it seems very fruitful to deeply investigate the role of spatiotemporal context in eliciting animacy and other related properties, like intentionality and emotional or social behaviours. As happens with other displays of phenomenal causality like triggering and intentional reaction (Michotte, 1946/1963; Kanizsa & Vicario, 1968; Bozzi, 1969; Schlottmann et al., 2006; Thinès et al, 1991; Parovel & Casco, 2006; Parovel, 2012; Hubbard, 2013a, 2013b; Parovel & Guidi, 2015; Wagemans et al., 2006), the perceptual belongingness to a unitary spatiotemporal structure specifies the expressive meaning of the trajectory and the direction of the target motion, and can plausibly influence in this way the perceptual saliency of animacy, also on the basis of the adaptive value of the behavior.

According to an evolutionary-biological account, in fact, all emotions derive from fundamental actions present in the most primitive organisms, which can have two main types of causes: approach (to partner or food) and avoidance (of predators) (Kandel, 2011). Because an effortless and automatic discrimination between animate and inanimate is crucial to successfully avoid a dangerous agent or to approach prey, it should necessarily be an adaptive advantage not only to recognize life, but also to discriminate the specific intentions of the agent (Scholl & Gao, 2013; Neri, Luu, & Levi, 2006).

And it is the relationship occurring between two or more moving objects, to conclude with Michotte's words, that specifies the meaning of an object's motion, mainly on the basis of spatial and temporal contingencies (Michotte, 1950; Thinès, Costall, & Butterworth, 2015).