Interest in the application of behavioural economics (BE) to consumer policy has grown in recent years. At the risk of oversimplification, BE looks at rational choice theory and related assumptions about the homo economicus (e.g., unbounded rationality, clearly defined preferences, utility maximisation) from a broader perspective to provide a more realistic understanding of human economic behaviour. BE explores cognitive, motivational and contextual factors (e.g., status quo bias, heuristics, social norms) that affect people’s preferences, decision-making processes and resulting choices (Kahneman, 2003; Thaler, 2015). With a strong focus on consumers (Reisch & Zhao, 2017), it aims to generate policy-relevant information by analysing interventions that address or overcome those factors.

In the domain of energy and climate change, and following the seminal contribution from Pichert and Katsikopoulos (2008), BE applications have focused on the use of green energy defaults (GED) to promote the market uptake of renewable energy electricity (REe) among consumers. In simple terms, GED are understood as a type of ‘nudge’ or choice setting that seeks to determine the outcome if energy users do not actively choose their electricity mix supply (Schubert, 2017; Sunstein & Reisch, 2013). In simple terms, GED experiments (lab, field or natural) aim to analyse the impact of default rules via an opt-out green energy tariff or contract (i.e., intervention group), compared to an opt-in option (i.e., control group). Participants are randomly allocated to one of the two groups; the former is often labelled as the ‘green energy condition’, and the latter the ‘standard/grey energy condition’ (Ebeling & Lotz, 2015; Hedlin & Sunstein, 2016 ; Pichert & Katsikopoulos, 2008). Sometimes, a third condition called ‘active choice’ (or ‘neutral condition’) (Hedlin & Sunstein, 2016; Pichert & Katsikopoulos, 2008) is also analysed. It is claimed that GED can have large effects because of the presence of procrastination, inertia (or ‘status quo bias’), loss aversion and also due to implied endorsement. For detailed conceptual and theoretical aspects, see Schubert (2017) and Sunstein and Reisch (2013).

So far, most GED experimental lab studies have produced results in developed countries, where contextual aspects are comparatively favourable for the adoption of REe, notably in Germany (Ebeling & Lotz, 2015; Momsen & Stoerk, 2014; Pichert & Katsikopoulos, 2008; Vetter & Kutzner, 2016). The results are often positive and significant. For example, Pichert and Katsikopoulos (2008) found a 68% adoption rate when participants were exposed to a green default. This compared favourably with a 41% rate under a standard or conventional ‘grey energy’ default (p = 0.001). Rates were similarly high when participants were exposed to an active choice setting (67%). Comparable results are reported by Ebeling and Lotz (2015): The acceptance rate for the green opt-out option was 69.1%, compared to 7.2% for the opt-in treatment (p < 0.001). In a second (online) experiment, the authors found that 93.8% of participants chose green energy in the opt-out option, compared to 34.1% for the opt-in alternative (p = 0.001). Similarly, Vetter and Kutzner (2016) found that a green default increased the odds of choosing green electricity by a factor of 4.05 (p < 0.001). In a comparison of the effectiveness of different choice settings, Momsen and Stoerk (2014) estimate that a GED generated the highest adoption rate (69.7%), which compared favourably with an active choice scenario (48.2%) (p = 0.028). For further details, see also Kaiser et al. (2020).

Within this experimental context, it is argued, for example, that GED ‘can be used to promote pro-environmental behaviour’ (Pichert & Katsikopoulos, 2008, p. 63), ‘overrule motivational aspects of green energy purchases’ (Ebeling & Lotz, 2015, p. 868) and that studies ‘support confidence in defaults as tools to influence environmental-decision making’ (Vetter & Kutzner, 2016, p. 27). In addition to the contributing factors affecting the effectiveness of GED mentioned above (e.g., procrastination, loss aversion), the literature on consumer policy addressing the adoption of REe also highlights important motivational and contextual aspects that can also contribute to the success of GED. They include, for example, high public acceptance of renewable energy (Bertsch et al., 2016; Zoellner et al., 2008), pro-environmental behaviour (Gerpott & Mahmudova, 2010; Gifford & Nilsson, 2014), a long tradition of certainty about renewable energy policies (Bechberger & Reiche, 2004; Laird & Stefes, 2009; Wüstenhagen & Bilharz, 2006), high willingness to pay for green electricity (Sundt & Rehdanz, 2015) and positive consumer attitudes to renewable energy when supplied by public firms and/ or (local) cooperatives (Kalkbrenner & Roosen, 2016; Mundaca et al., 2018; Rommel et al., 2016). Now, let us now imagine a country in which, for instance, there is a perceived negative corporate reputation (Walsh et al., 2006), lack of trust in or confusion about green (energy) product attributes (Joshi & Rahman, 2015) and ongoing reviews of, and changes to, renewable energy policy (Simpson & Clifton, 2014). What can it be the response from consumers to a GED? What drives their choices? What can the comparison of stated and revealed preferences expose? Which directions can emerge for future consumer policy studies?

This paper seeks to answer these research questions, aiming to provide new perspectives about GED for consumer policy. Our goal is to complement existing studies and gain a better understanding of GED, particularly when they are less, or unexpectedly, effective. We expand the common analytical framework focusing on effectiveness and identify key determinants. In addition, we also compare stated versus revealed preferences (methodological details in the next section). For consistency and comparative reasons, we replicate the experimental lab research design of previous work (Ghesla, 2017; Hedlin & Sunstein, 2016; Pichert & Katsikopoulos, 2008; Vetter & Kutzner, 2016), notably the one from Pichert and Katsikopoulos (N = 225). Using the UK as a case study (details below), we thus run a GED lab experiment and randomly assigned participants to three treatment scenarios: opt-out, opt-in and an active choice for a REe tariff. Building upon existing theoretical and empirical evidence, we aim to identify various motivational and contextual issues (potentially) framing the effectiveness and policy adequacy of GED. These aspects can be relevant for countries in which market and consumer policy conditions for REe are less advanced or certain.

We choose the UK as a case for a variety of institutional and contextual reasons. First, some elements of the UK electricity market deserve attention. For example, the electricity supply industry was privatised in 1989 and full competition was introduced to Britain’s electricity retail market in 1999, including to all residential customers. In the following decade, the market was dominated by six incumbent firms, accounting for almost 100% of the retail market. However, since 2011, the combined electricity market share of the large legacy suppliers has steadily reduced from nearly 100% in 2011 to 70% by the end of 2020 (OFGEM, 2021a), as more new entrants enter the market (by December 2020, there were 54 suppliers active in the domestic electricity market (OFGEM, 2021b)). The increase in suppliers and available tariffs has been coupled with an increase in switching behaviour by consumers, with a record high of 20.4% in April 2019 (OFGEM, 2019). However, despite this rise around 50% of households remain on default electricity tariffs (OFGEM, 2020a), which are often more expensive (OFGEM, 2021c).

Secondly, the share of UK electricity generation from RE sources reached a record high of 37.1% in 2019, increasing from 33.1% in 2018 (BEIS, 2020a). In 2019, REe generation totalled 121 TWh, a 9.5% increase from 2018, a tenfold increase in the share of electricity generation since 2004.Footnote 1 The UK has also experimented with all forms of renewable energy policy mechanisms (e.g., feed-in-tariffs) (Newbery, 2016) and according to BEIS (2020b), the UK’s climate and energy policies have been an important driver for the REe capacity growth, which has increased by more than three times since 2010. The IEA (2019) identifies that the most important policies behind this growth are the Renewable Obligation (introduced in 2002), Feed-in-Tariffs (introduced in 2010) and the Contract for Difference (introduced in 2013).Footnote 2 Whereas these policies are largely devoted to address the production side, much less is known about policies addressing consumers.

Thirdly, in terms of green energy tariffs per se, these were first available to UK domestic customers over 20 years ago (Graham, 2006). Such tariffs remained available until 2014, when in an attempt to ease consumer confusion over complex tariffs, the government regulator for electricity and gas markets, OFGEM, introduced a limit to four simple tariffs per supplier in 2014 (OFGEM, 2014a). This resulted in all the major suppliers withdrawing the green electricity tariffs they offered, which were less popular and made less profit (Littlechild, 2019). Yet, in recent years, there has been a resurgence in the provision of green tariffs from both incumbent firms and new entrants. However, during the periods that green electricity tariffs have been available in the UK domestic market, issues have existed surrounding confusion of the product and lack of trust. For long periods of time, there was no official definition of what was meant by a ‘green’ tariff, which led to many different forms of including carbon offsets and green funds. This resulted in confusion among consumers over what a green tariff was, or that different types of green tariffs even existed (OFGEM, 2014b). In addition, concerns over suppliers double counting has been found to be more prominent in the UK than in other countries (Hast et al., 2015), where suppliers assign the renewable energy they are obligated to source as a green energy tariff. Such concerns over the risk of double counting and confusion of the product have weakened consumer trust in the green tariffs in the UK.

Within the context described above, there are also further, specific reasons to choose the UK as a case study. For example, studies show that consumer trust in the energy sector is low (CarbonBrief, 2014; Which?, 2013). It is claimed that this is the least-trusted industry in the UK, and one of the least-trusted sectors in Europe (Citizens Advice, 2015; Edelmenan Trust, 2014). There is also a loss of political trust among consumers due to Brexit (Marshall & Drieschová, 2018; Newton et al., 2018) that may influence the level of endorsement that a GED implies. Third, the UK has experimented with all forms of renewable energy policy mechanisms, including tradable green certificates, traditional feed-in-tariffs and contract for differences (Newbery, 2016) and the UK market allows customers to switch suppliers. In addition, many of the major UK energy suppliers offered a green tariff in the past (Graham, 2006) are customers are familiar with the concept of green tariffs. However, all of the major companies abandoned the practice by 2015 (Bawden, 2015). This led to confusion among consumers, as few customers understood what a green tariff was, or that different types of green tariffs existed (OFGEM, 2014a). The above has generated a complex renewable energy governance (Kern et al., 2014), creating risks and uncertainties among consumers and producers (Bolton et al., 2016; Cowell et al., 2017; Fudge et al., 2016). Our general hypothesis is that this set of country-specific factors, to be found potentially in many countries around the world, may undermine―at least to some extent―the effectiveness of GED compared to what has been reported in previous online lab experiments.

The paper is organised as follows. The ‘Methods’ section provides methodological details. It describes the experiment, procedure and data collection. The ‘Results’ section summarises our key findings. In ‘Discussion: Avenues for Future Consumer-Oriented GED Studies’ section, and building upon existing knowledge and our findings, we propose and discuss various motivational and contextual that should deserve more attention in future GED studies. Finally, we present concluding remarks in the ‘Conclusion’ section.

Methods

Experiment, Variables and Participants

Following the replication of previous GED studies, our experiment analysed the effectiveness of GED and randomly assigned participants to three treatment scenarios: opt-out, opt-in and an active choice for a REe tariff. In line with the literature (Hedlin & Sunstein, 2016; Horton et al., 2011), the study can be categorised as an online laboratory experiment.

First, a power analysis was conducted to determine the sample size capable of robustly detecting an experimental effect, considering type I and type II errors. Because very few GED studies report an effect sizeFootnote 3, we assumed a relatively conservative, medium effect size (0.5 Cohen’s d), and more strict levels of statistical significance (1%) and power (90%) than previous studies. This resulted in a minimum sample per group of 121 participants who were recruited from an online panel. We enlisted the support of a market research firm (ResearchNow) to collect the data. Taking into account that we were able to recruit 518 participants (150+ under each treatment scenario), our study is capable of detecting even a small effect size (d = 0.2) with 38% power in a two-sided hypothesis test. Note that our sample is more powered than previous GED experiments (Pichert and Katsikopoulos (2008) (N = 225), Ebeling & Lotz, 2015—Experiment 2 (N = 290), Momsen and Stoerk (2014) (N = 118) and Ghesla (2017) (N = 161)). However, it is smaller than two studies with larger samples (Ebeling and Lotz (2015)—Experiment 1 (N = 41,952) and Hedlin and Sunstein (2016) (N = 1,245)). A fundamental consideration for this difference is that we, like all the previous GED studies, focus on the difference between experimental groups (i.e., effect size) but refrain from analysing the prevalence of certain attributes in a larger population (e.g., political preferences, as in Ebeling and Lotz (2015)).

The online lab experiment (see Appendix) was hosted by an open-source survey provider (LimeSurvey).Footnote 4 It began with an initial welcome page that stated the title of the experiment, provided information regarding the anonymization of responses, and thanked participants for taking part. This page was deliberately brief to avoid framing responses in any way. An initial screening question eliminated those who were not responsible for electricity supplier/ tariff decisions in the household. These initial participants were not included in the experiment, as willingness to pay for renewable energy is related to whether the respondent is a bill payer or not (Hite et al., 2008; Zarnikau, 2003). The next part of the survey elicited socio-economic and demographic data (see Table 1).Footnote 5 Variables were chosen based on previous studies of willingness to pay (WTP) for renewable electricity (e.g., Aldy et al., 2012; Bollino, 2009; Hanemann et al., 2011; Kim et al., 2012; Kosenius & Ollikainen, 2013; Longo et al., 2008).

Table 1 Summary statistics for socio-economic and demographic variables of the sample and comparison with UK average values

Next, participants were randomly allocated to one of three treatment scenarios using a ‘recruit and deny’ strategy (Gandhi et al., 2016). A hidden, random number generator drew an integer number from 1 to 3, which allocated participants to one of three treatments: (1) a green default scenario (opt-out) (n = 190), (2) a standard default scenario (opt-in) (n = 170) and (3) an active choice scenario (n = 158). Kruskal-Wallis tests showed no evidence of any significant unequal distribution of variable parameters across groups (see Table 2). Importantly, this meant that the randomization of participants was successful, which cancelled out potentially confounding effects of these variables on the likelihood to choose (or not) a green energy tariff.

Table 2 Descriptive statistics for respective groups and Kruskal-Wallis (K-W) test statistics

At the beginning of each treatment, participants were asked to imagine they had just moved to a new town and were informed by the local energy supplier of the available electricity tariffs. The preparation of this text was guided by OFGEM proposals about default renewal notices; notably, that the supplier should not encourage the consumer to choose one particular tariff, and should clearly explain what happens if the consumer does nothing (the default treatment) (OFGEM, 2014a, b, 2021c) (Table 3).

Table 3 Definition of treatments

An important element of our study was to replicate a real-world pricing premium for green electricity. However, at the time the experiment was conducted, none of the larger suppliers offered specific green tariffs. Therefore, information was sourced from historical tariff data, personal communications with energy companies and relevant, published studies. This resulted in a price premium of 5%, or around £2.50/ month (€3 approx.) based on average electricity bills in the UK in 2016. This value is significantly lower than those used in most GED studies (Table 4).Footnote 6

Table 4 Studies of premiums for REe tariffs in the UK

The final part of the survey elicited information about current electricity contracts. This was critical to be able to compare stated and revealed choices.

To compare groups, various statistical techniques were used to tests for significance across treatment scenarios. First, a Pearson’s χ2 test was applied to test differences in enrolment rates between the three treatments. We also ran a series of χ2 goodness-of-fit tests to compare our results with the average values we derived from the literature. Next, a series of Chi-square (χ2) goodness-of-fit tests took place to compare the results of our experiment with the ones previously reported in the GED literature. Bonferroni adjustment (α = .017) was applied to control for type error I whenever post-hoc tests were carried out. We also compared whether there were significant differences in the time it took participants to answer the different treatment questions using a one-way analysis of variance (ANOVA) based on an F distribution. Extreme outliers that lied without the intervals were removed (28 in total). We also used a Wilcoxon signed-rank test to assess for the differences between stated and revealed choices for each group across treatment scenarios. Finally, effect size statistics were computed. Phi (Φ) and Pearson’s correlation (r) coefficients were calculated for χ2 and Wilcoxon signed-rank tests respectively; where 0.5 was considered a large, 0.3 a medium and 0.1 a small effect size (Cohen, 1988, 1992).

Model Specification

Unlike most previous studies, we quantitatively explored the extent to which different variables explained respondents’ choices. Logistic regressions assessed the impact of predictors on the probability that respondents would choose either the green energy tariff or the standard energy tariff. The logistic regression model is presented below. Independent and dependent variables are specified in Table 2 and Table 3, respectively.

$$ P\left({C}_i\right)=\frac{1}{1+{e}^{-\left({\beta}_0+\beta {X}_{ij}+\varepsilon \right)}} $$

in which P(Ci) is the probability of C occurring, which is the binary dependent variable indicating the willingness to choose (1) or not (0) the green electricity tariff under i treatment scenario by respondent j. e is the base of natural logarithms, βX represents our vector of explanatory variables and ε is an error term. Explanatory variables include income (Inc), location (Loc), household tenure type (Ten), household size (HHsize) and the current electricity contract (Reality). We also include the time (Time) undertaken by participants to make a choice under each scenario. This was used as a proxy for the cognitive effort (Lenzner et al., 2010) that participants deployed during the experiment. In a second step, we introduced the following control variables to check for the robustness of our results: age (Age), gender (Gend) and education (Educ). Both explanatory and control variables are based on the reviewed literature, notably the socio-economic and demographic factors affecting WTP for renewable electricity and findings from GED studies. Income or financial constraints (and related educational level) are often cited as the most important factors that prevent or promote the adoption of renewable energy (cf. Hobman & Frederiks, 2014; Pichert & Katsikopoulos, 2008; Zarnikau, 2003). Gender and age have also been put forward, although the results are inconclusive (cf. Longo et al., 2008; Sardianou & Genoudi, 2013). Electricity contracts and, thus, price and the corresponding use of renewable energy have been identified as having an impact on an individual’s decision to choose a green or standard energy carrier (Bird et al., 2002; Mozumder et al., 2011), and this observation has been confirmed in experimental studies (cf. Kaenzig et al., 2013; Momsen & Stoerk, 2014; Pichert & Katsikopoulos, 2008). Furthermore, household size and tenure have been identified as having a positive and significant impact on a consumer’s preference for renewable energy (Ameli & Brandt, 2015; Beckman & Xiarchos, 2013; Mozumder et al., 2011). Time was adopted as a proxy of cognitive effort. The time it takes to answer a survey question is generally accepted to be a reflection of the cognitive effort required to arrive at an answer (Lenzner et al., 2010). Such measures are often used in online surveys, including other energy default studies (Dinner et al., 2011). However, it is important to highlight that the use of response time as a measure of cognitive ability is influenced by other factors (Kyllonen & Zu, 2016), notably slow processing or carefulness by the individual concerned.

Each model (with and without control variables; bootstrapped based on 1000 samples) was checked for goodness-of-fit using a Chi-square (χ2) test to give an overall, initial indication of how well models performed. Then, Cox and Snell’s R2 and Nagelkerke’s R2 were used as pseudo R2 measures to indicate the amount of variation in the dependent variable odds ratios (i.e., the constant effect of a predictor X on the likelihood that an outcome will occur) were estimated for all independent variables to assess their contributions to the model. For categorical variables with more than two categories (e.g., income, location), each category was compared with the category last coded being used as a baseline. Surveys with incomplete answers were excluded from the logistic regression analysis (14 in treatment 1, 11 in treatment 2 and 9 in treatment 3). To ensure the robustness of the results, numerous diagnostic tests were performed.Footnote 7

Results

First-Order Impacts of the GED

As hypothesised, findings show a relatively lower level of effectiveness for the green default and active choice treatments, and a relatively higher level under the standard default (see Fig. 1 and Table 5). In the green default scenario, and contrary to indications from the literature, 81 of the 190 participants (42.6%) chose to stay with the green tariff. In the standard default scenario, 81 of 170 participants (47.6%) chose to opt-in and switch to the green tariff. In the scenario requiring an active choice, 37 of 158 participants (23.4%) chose the green tariff. Results show that a large number of individuals were unfavourably inclined to REe, challenging the existing experimental GED evidence.

Fig. 1
figure 1

Green and standard energy tariff choices across all treatment scenarios

Table 5 Choice of REe across different treatments in various GED studies

A 3 × 2 χ2 test found a significant overall effect for all treatment groups (χ2(2, 518) = 22.57, p < .001). We assessed which treatments were statistically significant in three, separate 2×2 post-hoc χ2 tests (α = .017). We found no significant differences between the green and the standard default (χ2(1, 360) = 0.912, p = .340). Interestingly, given that more participants chose the REe tariff in the standard default treatment, tests revealed a significant difference between the standard default and the active choice scenario (χ2(1, 328) = 20.82, p < .001), with a medium effect size (Φ = 0.25). Likewise, the comparison of the green default and the active choice treatment found that significantly more participants chose the REe tariff in the green default condition (χ2(1, 348) = 14.21, p < .001, Φ = 0.20). This is in contrast to Hedlin and Sunstein (2016), who found that active choosing may be more effective than green defaults. Our findings reveal relatively lower levels of enrolment even in an active choice treatment.

A series of χ2 goodness-of-fit tests comparing the choice of REe in our experiment with the ones reported in the literature (see Table 5) revealed the following: we found a significant difference (χ2(1, 190) = 119.5, p = .000) in the proportion of participants that chose REe in our green default scenario (42.6%) as compared with the average value of 76.3% we derived from previous studies. Likewise, we also identified a significant difference (χ2(1, 158) = 163.2, p = .000) when comparing the proportion of individuals that chose REe in our active choice scenario (23.4%) and the average value from the literature (63%). However, we found that the choice of REe in the standard default treatment (47.6%) is consistent with the results reported in the literature (43.2%), as no significant difference was found (χ2(1, 170) = 1.37, p = .242).

Following Pichert and Katsikopoulos (2008), we also examined the motivation of participants who were unwilling to choose the REe tariff across all treatment scenarios (see Table 6). Most participants, 294 out of 319 (92%), provided a reason for their grey tariff choice and price concerns dominated (69%). Our finding supports other studies that state that electricity demand and competition among utilities in the UK is still very price-driven, with cost by far the most important factor in switching choices (Diaz-Rainey & Ashton, 2008; Hast et al., 2015). However, of those unwilling to choose REe, around half also gave at least one other reason. The second most common reason was a lack of information (27%); this percentage is high compared to other studies and could be due to the controversy surrounding green tariffs in the UK. Very few of those who did want a green tariff considered that climate change mitigation was not their responsibility (~ 3%), suggesting a small effect of the GED among people with low environmental concerns.

Table 6 Reasons for not adopting a REe tariff (n = 294)

Cognitive effort was proxied by the latent time it took each participant to answer the treatment choice question (Lenzner et al., 2010). The analysis indicated that there was a statistically significant difference between treatment groups, determined by one-way ANOVA (F(2, 493) = 30.757, p < .001). The data was also analysed to examine whether there was a difference between the time taken by participants who chose the REe tariff and those who chose the standard tariff under each treatment scenario. We found that participants who stuck with the REe tariff took the same time to make in the green and standard default treatments (Table 7). However, in the active choice treatment, those who chose the REe tariff took almost twice as long to answer the question than those who chose the standard tariff.

Table 7 Time taken to choose a green or standard energy tariff under different treatment scenarios

Key Determinants Behind Choices

Logistic regression was performed to identify and evaluate the impact of different factors on the likelihood to choose (or not) a green tariff. Two models were estimated (without and with control variables). Overall, results consistently indicate that current (green) electricity contracts, treatments and time were significant predictors. Once control variables were introduced, results remained robust and gender (female) also became significant (see Table 8 for details).

Table 8 Logistic regression outputs (bootstrapped, based on 1000 samples) predicting the likelihood of choosing a green energy tariff

The first model (i.e., without control variables) was statistically significant (χ2 (24, 484) = 105.22, p = .000). The model explained between 19.5% (Cox & Snell R2) and 26.6% (Nagelkerke R2) of the variance of the REe choice, and was able to correctly classified 71.9% of cases. As shown in Table 8, only four variables made a unique statistically significant contribution to the model. The strongest predictor was ‘reality’ (i.e. green electricity tariff contracts). In particular, ceteris paribus, we observe that respondents with a green electricity contract are 6.8 times more likely to choose a green energy tariff than those who do not know which electricity tariff they have. The second strongest predictor is the actual treatment. Consistent with the results shown in the ‘First-Order Impacts of the GED’section, odds ratios show that it is the standard default treatment (odds ratio = 3.65) that has a relatively higher effect than the green default treatment (odds ratio = 2.66) on the likelihood to choose a green tariff. Note that we use ‘active choice’ as a baseline for comparison. This means that participants under the ‘standard’ and ‘green default’ treatments were 3.65 and 2.66 times, respectively, more likely to choose a green energy tariff than those who were under the active choice treatment. The fourth strongest predictor turned out to be time. The odds ratio is positive, indicating that for every additional second devoted under the treatment, respondents were 1.01 times more likely to choose a green tariff; controlling for other factors in the model.

After controlling for age, gender and education, the model remained significant (χ2 (27, 484) = 118.60, p = .000) and its explanatory capability increased (21.7% Cox & Snell R2 and 29.6% Nagelkerke R2). The same determinants indicated above made a statistically significant contribution to the model, namely: the green default treatment, the standard default, a ‘green tariff’ in reality and time. We observe that the effect of current green electricity tariffs was stronger (odds ratio = 8.02) compared to the previous model (odds ratio = 6.81). On the contrary, the effects of treatments remained almost the same. Finally, one can also see that gender made a significant contribution. The odds ratio was positive, indicating that women were 1.85 times more likely than men in choosing a green electricity tariff.

Stated Versus Revealed Preferences

We also explore the effectiveness of each experimental treatment compared to real-life preferences. This analysis focused on cases where information about the actual choices of respondents was possible to obtain (n = 450) (i.e., it excluded all responses for which no information about current electricity contracts was identified). As shown in Fig. 2, revealed preferences were relatively homogenous across all groups: around 10% for the REe tariff, and 90% for a standard ‘grey’ or ‘standard’ electricity tariff. No significant difference was found across all groups (χ2 (2, 405) = .52, p = .771).

Fig. 2
figure 2

Stated versus revealed choices under a scenario 1: green energy default, b scenario 2: standard energy default and c scenario 3: active choice. Error bars represent 95% confidence intervals

Under treatment 1 (n = 166), the REe tariff was chosen by 70 participants (42%). In reality, however, only 18 (11%) had a REe contract; a differential effect of 31%. A Wilcoxon signed rank test revealed a statistically significant impact of the treatment (z = 6.5, p = .000) with a medium effect size (r = 0.36). Under treatment 2 (n = 150), only 15 people (10%) had a REe contract; however, 73 participants (49%) chose a REe tariff with a nearly large effect (z = 7.3, p = .000, r = 0.43). Finally, in treatment 3 (n = 134), although the differential effect was lower (11%) than in the other treatments, it remained significant (z = 2.8, p = .004), with a small effect size (r = 0.18). Overall, and while our initial results show a low impact of a GED, the direction and significance of these effects also suggest that the provision of an explicit, simple decision framework can still have the potential to trigger a greater adoption of REe, even in an opt-in or active choice treatment scenario.

Discussion: Avenues for Future Consumer-Oriented GED Studies

Despite growing attention to GED, research beyond their potential effectiveness is still scarce and our analysis shows a different perspective. From a humble point of view, here we propose and briefly discuss various avenues that have the potential to increase the understanding of context-specific moderators and mediating mechanisms affecting consumer behaviour and thus the effectiveness and suitability of GED from a consumer perspective. By pursuing these avenues, we believe they can strengthen the value of future GED studies and resulting policy recommendations. This may be particularly relevant in countries where market and consumer policy conditions for REe may be less advanced or uncertain.

Endorsement and Policymaking

Validation by, and trust in, the choice architect are important factors that can determine the effectiveness of GED (Halpern, 2015; Sunstein & Reisch, 2013). The main effects reported here suggest some level of scepticism among consumers when the GED originates from a (potentially) untrustworthy choice architect. One can argue that given unfavourable policy and/ or market conditions, the choice architect (e.g., policy maker, energy company) is likely to fail in providing a minimum level of trust for a GED. Note that the identity of the choice architect was not given in our experiment. In addition to research design replication, the main reason for this was to avoid any pre-framing and deception. At the same time, the absence of an explicit choice architect may have led participants to think that a ‘hidden’ company or policymaker was behind the study, leading to an actual increase in REe tariffs, for example. If a participant did make such inferences, this could have motivated him/her to override their cognitive factors and biases (e.g., inertia) that would have otherwise caused him/her to stick with the default. Different levels of trust or endorsement by policymakers among consumers do deserve particular attention in future studies. Furthermore, previous research has assumed that GED interventions can be effective acting alone, neglecting interaction effects and corresponding endorsements effects with other policy instruments (e.g., carbon tax) (Hagmann et al., 2019). This is another area that needs attention. Turbulent political times (such as Brexit) can also affect policymaking and related levels of (implicit) endorsement among consumers. Sensitivity to explicit endorsements may indeed vary in time. Likewise, heterogeneity among consumers and numerous determinants suggest a variety of ‘asymmetries’ among individuals. Thus, additional research can address specific, asymmetric paternalistic approaches (Camerer et al., 2003) to REe enrolment programmes. This would mean, for instance, to identify, target and benefit consumers who are more likely to make ‘irrational’ choices about the purchase of REe contracts, while imposing marginal or no constraints on those who are already making more thoughtful decisions.

Marketplace Metacognition and Experience

One can also speculate that the ‘marketplace metacognition’ (MM) is likely to develop among consumers to cope with perceived attempts at persuasion. According to Wright (2002, p. 667), MM involves ‘everyday individuals’ thinking about market-related thinking. This includes people’s beliefs about their own and others’ mental states and processes and their beliefs about other people’s beliefs on those topics as these beliefs pertain to the specific domain of marketplace cooperation and manipulation.’ In a GED setting, a key implication is that when MM is invoked, consumers may become sceptical and their cognitive effort increases (Brown & Krishna, 2004). When consumers perceive an attempt to persuade them, cognitive effort is enhanced and they are more likely to resist such efforts; particularly if GED are perceived as an imposition. Sunstein (2015) argues that individuals reject nudges if they are perceived to have illicit motivations (e.g., profiteering), even if the nudge supports their preferences. Discrepancies between our study and previous GED research may also be the result of the market experience of participants. The UK has one of the highest switching rates for electricity in Europe (ACER/CEER, 2014), with multiple campaigns and regulations encouraging such behaviour (such as ‘The Big Switch’ or ‘Energy Best Deal’) As mentioned before, we found an increase in switching behaviour by consumers (20.4% in April 2019) (OFGEM, 2019) and studies have shown that the effects of defaults are reduced by individual experience (List, 2003; Löfgren et al., 2012). This observation is supported by Brown and Krishna (2004), who found that previous experience can influence whether individuals stick with a default. Further research should address elements that directly or indirectly cause (dis)comfort among consumers when being targeted with a GED, and the extent to which they relate to switching rates market (in)experience.

Psychological Reactance

The above observations lead us to the issue of psychological reactance, which refers to people’s negative reactions when their freedom is threatened or eliminated (Clee & Wicklund, 1980). While the literature has acknowledged that reactance can make GED less effective (Hedlin & Sunstein, 2016), there is limited empirical evidence about its actual effects. Although our default rule did not restrict individual freedom, as in any GED experiment (i.e., each individual could opt in or out), a perception of constraint could exist. If consumers perceive that a GED goal is illegitimate and do not fit their values or interests, they are unlikely to support it (Reisch & Sunstein, 2016). In our case, note for example that the standard default had a higher REe enrolment rate and one could argue that this was, at least partly, because individuals felt less sceptical or ‘reactant’ about the choice format and the hidden choice architect. In turn, the standard default may have appeared less paternalistic than the other two treatments. Combined with issues concerning endorsement, trust and MM, our results are an indication that reactance is a mediating mechanism that deserves far more attention in upcoming studies. After all, consumers ‘do not decide in a vacuum, but always in a choice context’ (Reisch & Zhao, 2017, p. 201) and psychological reactance may always affect or be part of the consumer choice context.

Pricing and Loss Aversion

The literature argues that when a good or service with social welfare features is offered, extra costs can create a boomerang effect due to psychological reactance (Clee & Wicklund, 1980; Hedlin & Sunstein, 2016). The experimental evidence with GED consistently shows the application of using a premium tariff (in the range of €0.75–€23 per month) and our study is not the exception. However, our premium (~ €3/month) is at the lower end of the spectrum. As current electricity contracts were a consistent predictor across all treatment scenarios, one can infer that consumers are very likely to be price-sensitive. In addition, price concerns were the top reason for the non-adoption of REe contract (see R1 = 69% in Table 2). Our results support Hedlin and Sunstein (2016), that speculate about greater reactance about GED when REe is more expensive than the grey or standard energy. This is also supported by Ghesla (2017), who shows much lower REe enrolment rates (2% in active choice, 4% in standard default and 20% in green default) when high REe prices are used. Thus, participants may not perceive that a GED triggers a loss of autonomy; they may object to it simply because REe costs more. In other words, loss aversion (Kahneman et al., 1991; Tversky & Kahneman, 1991) could initially arise as individuals place a higher value on (financial) losses than (potential environmental) gains relative to their current tariff agreement (i.e., reference point). Furthermore, and contrary to what the literature indicates (Schubert, 2017), respondents may not stick to the GED because of loss aversion, they may opt-out because the kind of ‘endowment’ established by the GED is already perceived as undesirable. In our case, consumers may have also felt that they were already paying the full social cost of electricity. Considering our results, one can argue that the GED approval rate estimated for the UK (64.9%) (Reisch & Sunstein, 2016, p. 318) may be much lower and more consistent with our findings, if the extra costs are made explicit to energy users in such studies. Combined with the issues of trust, price sensitivity via the use of different premiums and saliency levels can shed more light on economic concerns and the extent (or not) to which extra costs trigger, for example, reactance and/or loss aversion among consumers. Again, interactions with other policy instruments matter (e.g., energy pricing).

Institutional Aspects

The above-mentioned aspects relate to some or to a large extent with various institutional consumer issues that can help explaining or contextualising our results. For example, as mentioned early on, a lack of trust in energy suppliers is relatively high and can probably drive or increase high levels of inertia among consumers (cf. OFGEM, 2020b). Consumer confidence to engage in the UK’s electricity market has remained unchanged since 2017 (OFGEM, 2020b). Thus, institutional efforts should be devoted to increasing trust in the electricity supply industry. In addition, as the UK has introduced but also ended various REe policy support schemes, one can also speculate that the uncertainty about the future of RE growth that project developers have experienced (IEA, 2019) has also been perceived by (some) consumers in terms of REe supply, and thus prompted some unwillingness to adopt a RE tariff in our experiment. The regulatory framework in the UK also allows consumers to switch energy suppliers; however, negative perceptions of the switching process remain relatively high, including perceived risks (e.g., ‘REe supplier might go bust’) (OFGEM, 2020b). Consumer research shows that those who tend to switch are prompted by (i) end of fixed-term notices, (ii) get an electricity bill and (iii) move to a new house (OFGEM, 2020b). In our case, we hypothesise that none or most of our participants felt or experienced some of these events at the time we ran the experiment. Certainly, we also need to acknowledge that our messaging on ‘assume you have moved to a new town’ did not work properly. Whatever the case might be, the three aspects indicated above that prompt engagement to switch tariffs among consumers should be closely considered by choice architects. Pricing (and related cost savings) is still the dominant reason among consumers to switch supplier (OFGEM, 2020b). This is very consistent with both the design of our experiment and the given reasons to be unwilling to choose a REe tariff. Importantly, this means that the cost-effectiveness of REe policy measures remains critical to increase and/or maintain consumer engagement in REe tariffs in the future. According to Newbery (2016), capacity auctions combined with power purchasing agreements can provide the most cost-effective form of support to keep REe competitive in the long-term.

The Stated vs. Revealed Preference Gap

The comparison of stated and revealed choices suggests numerous aspects. For example, one could initially claim that this reveals a paradox or that differential effects simply imply a ‘value-action’ gap among energy users. The latter means that individuals simply overstated their behavioural reaction or willingness to adopt REe; a critical aspect already found in stated preference (lab) experiments (Loewenstein et al., 2014), but not yet fully understood in GED experimental lab studies. This highlights that much further analytical attention needs to be paid to what people say and actually do. This, in turn, underscores (even more) the value of revealed preferences and field and natural experiments to support consumer policy in the future. We also take note that 27% of the sample mentioned ‘not enough information on green tariffs’ as a reason for non-adoption. At the same time, differential effects suggest that the provision of an explicit, simple decision framework (or better information presentation) may trigger greater adoption of REe, even in an opt-in or active choice scenario. After all, interest in REe, as shown by stated preferences, is higher (by a factor of 2 to 5) than revealed rates. With due limitations, this suggests that a GED may still be an effective instrument, provided an explicit, simple ‘salient’ decision framework is implemented. This poses the question of whether a GED, even if less effective than previous research shows, is still capable of supporting a simple heuristic model to effectively promote the adoption of REe. Clear preferences also matter. Johnson and Goldstein (2003, p. 1339) state that ‘if preferences [….] are strong, we would expect defaults to have little or no effect’. Sunstein and Reisch (2014) argue that default effects might be limited by entrenched, pre-existing attitudes. Our first order effects seem to support this; however, differential effects also suggest less established or inconsistent preferences. In line with Vetter and Kutzner (2016), the difference between stated and revealed preferences show that potentially strong, pre-existing attitudes (e.g., towards the choice architect) might not necessarily limit the effectiveness of a GED. Carlsson (2010) highlights that unstable preferences throughout an experiment do not imply irrational behaviour; instead, a learning process could be unfolding. Thus, some respondents may not have fully developed their preferences during our experiment and they formed or even changed their preferences (e.g., preference reversal) (Tversky et al., 1990). In future GED studies, values, beliefs, attitudes and risk attributes (e.g., financial loss, credibility of electricity supplier) could be varied independently from one another and allow the estimation of their effect on the choice of REe under different treatments. The identification of specific motivational and psychological factors (e.g., self-deception) in explaining unconstructed or dynamic preferences regarding REe tariffs can also provide a better understanding of differences between stated and revealed choices.

Conclusion

This paper has been a humble attempt to provide new analytical and empirical perspectives about green energy defaults (GED) for consumer policy. Above all, it has aimed to complement the existing scientific and policy knowledge that can be derived from randomized controlled lab experiments. Compare to previous research, we have expanded the analytical framework going beyond first-order effects. On the one hand, our study has shown a different view on the potential effectiveness of GED. First-order results showed a relative lack of effectiveness of GED and lower market uptake of REe across all treatment groups. Results also showed that current green electricity contracts are a significant driver for REe choices. However, on the other hand, the comparison between stated and revealed preferences also revealed a positive and significant differential effect of GED. Our results suggest that GED can still frame decision-making in the desired policy direction, and we hypothesise that the provision of a simple and salient decision framework via a GED can trigger greater adoption of REe; even in an opt-in or active choice default rule. By no means have we claimed to provide a definitive answer. Modestly, and by replicating research design from previous lab experiments, our results underscore the need to carefully consider the heterogeneity of energy users, markets and policy environments. Importantly, our results suggest that extrapolating experimental results from one context/country to another needs to be taken with caution. Building upon existing knowledge, we postulate various motivational and contextual factors that underline the complexities to predict consumer behaviour in a GED setting. A larger critical mass of country-specific studies and representative samples are needed to continue unpacking the context-dependence of consumer preferences and provide better evidence-based policy advice. Removing barriers for REe is critical, and this is often a function of the existing consumer behaviours, policy portfolios and context. In sum, our argument is that a better understanding of context-specific moderators and mediating mechanisms will more effectively support consumer policy addressing the adequacy and effectiveness of GED in the future.