Introduction

The pollution-income relationship (PIR) is an important topic that is increasingly investigated by researchers. Since the early work by Grossman and Krueger (1991) and Shafik and Bandyopadhyay (1992), many researchers have attempted to estimate a PIR empirically, based on the idea of an environmental Kuznets curve (EKC) (see, e.g. surveys by Dinda 2004, 2005; Stern 2004; Lieb 2002; and Uchiyama 2016 inter alia). Such studies have attempted to estimate the impact of economic growth on various indicators of environmental degradation, such as carbon dioxide (CO2) and sulphur dioxide (SO2) emissions, particulate matter, and water pollution.

Kuznets (1955) originally suggested that, as an economy develops, inequality first rises and then decreases—giving rise to the inverted U-shaped relationship known as the Kuznets curve. This concept has since been used in several areas, such as the PIR, where it is hypothesised that as an economy develops, environmental pollution first rises and then decreases—giving rise to what has become known as the inverted U-shaped EKC, first referred to by Panayotou (1993).Footnote 1 Arguably, following the original Kuznets curve, the EKC refers only to a quadratic relationship that produces an inverted U-shaped curve. However, in the environmental economics literature that investigates the PIR to discover whether the EKC hypothesis holds for a given country or group of countries, the general framework tends to now be referred to as the EKC. In other words, although the EKC is regarded as only one special case of the PIR, it is now used in much wider terms to represent any non-linear PIR, whether it be a quadratic specification, a cubic specification, or a quartic specification—or theoretically any nth-order polynomial. Therefore, for consistency with the literature, we use the term EKC to represent any non-linear PIR (potentially of any order).

Researchers in this area initially tried to determine whether environmental degradation indicators have an inverted U-shaped relationship with income since such a quadratic relationship would suggest that environmental degradation increases in the early stages of economic growth, before eventually peaking and then decreasing after income reaches a certain level—similar to the original Kuznets (1955) idea for inequality and economic growth, as indicated above. However, as also mentioned above, this area of research has now been extended, with researchers also attempting to determine whether environmental degradation indicators have an N-shaped relationship with income by estimating a cubic relationship or even an M-shaped relationship, with income by estimating a quartic relationship. However, as Destek et al. (2020) demonstrate, a quadratic specification could give a U-shaped EKC, a cubic specification could give an inverted N-shaped EKC, and a quartic specification could give an inverted M-shaped (or W-shaped) EKC. Empirically, these cases, discussed by Destek et al. (2020), might be found with the initial part of the curve being downward sloping. However, a downward-sloping initial part of the curve is inconsistent with the a priori theoretical expectation that the initial stage of any estimated EKC would be upward sloping, which would reflect the initial development stage of the economy or economies being considered and the increasing environmental degradation that occurs during this stage. Therefore, an estimated EKC curve that deviates from an initial upward sloping part warrants further investigation.

As is well known in the environmental economics literature, these non-linear EKCs can be estimated using variables in levels or, as is more often the case, in logarithms, which is this paper’s focus. In particular, we consider the properties of such non-linear EKCs when the variables are in logarithms, showing that the estimated coefficient signs and the statistical significance of the non-leading terms in such specifications are arbitrary, being dependent upon the units of measurement chosen for the independent variables (i.e. being dependent upon different rescaling). We therefore propose a methodology for choosing the appropriate higher-order polynomials for a logarithmic EKC. However, before considering this critical issue, we detail several outstanding issues surrounding the estimation of PIRs/EKCs.

The next section therefore details the outstanding issues in the EKC environmental economics literature followed by Section 3 that initially outlines the functional forms for the quadratic, cubic, and quartic EKC specifications, before focusing on the quadratic logarithmic EKC specification. It provides mathematical proofs of the unit dependence of the coefficients of non-leading terms and the invariance of the coefficients of the leading terms, elasticities, and their t-values.Footnote 2 It also shows that the estimated turning points are effectively non-unit dependent. Section 4 follows with an empirical illustration and a discussion of the findings for the estimation of quadratic, cubic, and quartic EKCs. Section 5 summarises and concludes the study. Additionally, for completeness, in an on-line Annex, Appendix 1 details all the current issues around the EKC literature summarised above, and Appendix 2 details the mathematical proofs for the cubic and quartic EKC functional forms.

Outstanding issues in the EKC environmental economics literature

Although it is almost 30 years since the publication of the Grossman and Krueger (1991) and Shafik and Bandyopadhyay (1992) papers, there are still many unresolved issues in the EKC environmental economics literature. These issues can be broadly grouped into ‘theoretical’, ‘empirical’, and ‘mathematical/statistical’ nuances and are discussed in detail in Appendix 1. In summary, these issues include the following:

  • The real-life representativeness of the EKC specification (see Beckerman 1992; Panayotou 1993; Rothman 1998; Mills Busa 2013; Choumert et al. 2013, inter alia)

  • The limited capacity of the instruments of environmental indicators (see Schindler 1996, inter alia)

  • The use of production- versus consumption-based environmental indicators (see Rothman 1998; Gawande et al. 2001; Bagliani et al. 2008, inter alia)

  • Contradicting empirical results (see Roca et al. 2001; Khanna and Plassmann 2004; Auci and Becchetti 2006; Fosten et al. 2012; Yang et al. 2015a; Liddle and Messinis 2016; Mikayilov et al. 2018; Arshad et al. 2020, inter alia)

  • The use of per capita versus total income (see Selden and Song 1994; Friedl and Getzner 2003, inter alia)

  • The integration-cointegration properties of the used variables (see Stern et al. 1996; Stern 2004; Romero-Avila 2008, inter alia)

  • The use of powers of non-stationary variables (see Fürstenberger and Wagner 2007; Wagner 2008, 2012, 2015; Hong and Wagner 2008, inter alia)

  • The type of functional specifications used and the econometric techniques employed (see Galeotti et al. 2006; Liddle and Messinis 2016; Apergis 2016; Moosa 2017; Mikayilov et al. 2018, inter alia)

  • The use of a trend in the specification, level versus logarithmic variables, and issues related to the turning point (see Lieb 2003; Dinda 2004, inter alia)

  • The different cases of the potential relationship between income and environmental degradation, such as monotonic, quadratic (inverted U-shaped), cubic (N-shaped), and quartic (M-shaped) (see Shafik and Bandyopadhyay 1992; Grossman 1995; Lieb 2003, Hasanov et al. 2019, inter alia)

The above summary indicates (and Appendix 1 details) that there are many issues around the modelling of the EKCs, with some way to go before a clear understanding of the environmental quality to economic growth relationship is definitively established. The issues that we focus on in this paper, explained in detail below, are connected in various ways to the issues discussed in the final three bullets of the list above, and, as far as we know, these have not been discussed in the environmental literature before.

Despite all the issues above, the PIR is, in all probability, likely to be a non-linear EKC relationship, which can be rationalised from both theoretical and mathematical/statistical viewpoints. From a theoretical perspective, as mentioned by Lieb (2003) and Dinda (2004) inter alia, there is a demand for environmental quality, regardless of whether it is a normal or a luxury good. At the early stage of a country’s/society’s development, meeting the first items at the top of the demand pyramid is preferred (Maslow 1943), and controlling environmental degradation is not seen as the major concern. Later, when the early necessities are met and the economy is wealthier, environmental awareness and cleaner nature become an increasing concern, likely resulting in a change in the PIR. However, it could also be argued that, due to technological progress and sectoral shifts, after the first turning point, as income continues to rise, a second turning point might emerge giving an N-shaped PIR, or even a second or a third turning point giving an M-shaped PIR (Yang et al. 2015b; Terrell 2020). As mentioned above, some researchers such as Destek et al. (2020) suggest that such non-linear specifications might result in an inverted N-shaped PIR or inverted M-shaped (W-shaped) PIR, depending on the estimates obtained. However, we argue that this is likely due to the initial development stage being missed in the estimation. Either way, given the growing interest in the estimation of quadratic, cubic, and quartic PIRs/EKCs, it would appear prudent to fully understand the properties of such relationships, which is the fundamental issue considered in this paper.

Accepting the argument that the PIR is likely to be non-linear, the issue becomes what is the best specification and what is the appropriate order of the polynomial to capture the non-linearities when attempting to model the EKC. Researchers therefore require a clear understanding of the properties of such specifications and of the interpretation of the results obtained. These are important issues that arguably should be added to the summary list above and ones that, as far as we know, have not been adequately explored in the PIR/EKC environmental literature. Hence, this critical issue is addressed in detail in this paper.

The issue is that the coefficient estimates and their associated t-values (and hence their significance levels) in multiplicative-logarithmic functions vary according to the different units of measurement used for expressing the variables, as noted by Hunt and Lynk (1993). Thus, in estimating a logarithmic EKC, the inclusion of a squared term renders the coefficient estimates of the other level term arbitrary and hence meaningless, depending upon the units of measurement chosen for the variables (or, in other words, when the variables are rescaled). Similarly, the inclusion of a cubed term when estimating a logarithmic EKC renders the coefficient estimates of the squared term and the level term arbitrary and hence meaningless when the variables are rescaled. In addition, the inclusion of a term to the power of 4 when estimating a logarithmic EKC renders the coefficient estimates of the cubed term, the squared term, and the level term arbitrary and hence meaningless when the variables are rescaled. As stated, as far as we are aware, this issue has not been considered in the environmental economics literature. As highlighted later in the paper, many previous EKC studies have chosen the preferred specification based on these arbitrary and meaningless estimates—hence this is a cautionary tale given that many previous studies might have potentially accepted inappropriate results. Moreover, given this problem, we offer a way forward, suggesting criteria that researchers should adopt when determining the existence and shape of an estimated logarithmic EKC.

Properties of the logarithmic EKC

Background of the logarithmic EKC specification

The many attempts to estimate an EKC relationship have used a variety of specifications with both time series and panel data. Later in this section, for expositional reasons, we focus on the quadratic version, but first we outline the three relatively popular versions of the EKC. Letting Et = environmental degradation (pollution) per capitaFootnote 3 in time t, Yt = income or GDP per capita in time t,Footnote 4ln = natural logarithm, and the α’s and β’s parameters to be estimated, the base specifications of the non-linear EKC, namely the quadratic, cubic, and quartic forms, are as follows:Footnote 5

Linear (not in logarithmic form)

$$ {E}_t={\alpha}_0+{\alpha}_1{Y}_t+{\alpha}_2{Y}_t^2 $$
(1a)
$$ {E}_t={\alpha}_0+{\alpha}_1{Y}_t+{\alpha}_2{Y}_t^2+{\alpha}_3{Y}_t^3 $$
(1b)
$$ {E}_t={\alpha}_0+{\alpha}_1{Y}_t+{\alpha}_2{Y}_t^2+{\alpha}_3{Y}_t^3+{\alpha}_4{Y}_t^4 $$
(1c)

Logarithmic Footnote 6

$$ \mathit{\ln}{E}_t={\beta}_0+{\beta}_1\mathit{\ln}{Y}_t+{\beta}_2{\mathit{\ln}}^2{Y}_t $$
(2a)
$$ \mathit{\ln}{E}_t={\beta}_0+{\beta}_1\mathit{\ln}{Y}_t+{\beta}_2{\mathit{\ln}}^2{Y}_t+{\beta}_3{\mathit{\ln}}^3{Y}_t $$
(2b)
$$ \mathit{\ln}{E}_t={\beta}_0+{\beta}_1\mathit{\ln}{Y}_t+{\beta}_2{\mathit{\ln}}^2{Y}_t+{\beta}_3{\mathit{\ln}}^3{Y}_t+{\beta}_4{\mathit{\ln}}^4{Y}_t $$
(2c)

Attempts have been made to estimate an EKC using all specifications shown in Eqs. (1) and (2).Footnote 7 For example, Shafik and Bandyopadhyay (1992) estimated linear, quadratic, and cubic models in logs; Grossman and Krueger (1995) estimated cubic model in levels (without logs); Harbaugh et al. (2002) estimated cubic specification in logs as well as in levels; Lieb (2002) estimated cubic specification in logs; Ang (2007), Shahbaz et al. (2014), Onafowora and Owoye (2014), Alshehry and Belloumi (2016), and many others estimated quadratic models in logs; Yang et al. (2015a) used all three functional forms in logs (linear, quadratic, and cubic); Li et al. (2019) estimated quadratic and cubic models in logs; Yang et al. (2015b) estimated all polynomial functions (in logs) from the first to the fifth order; and Terrell (2020) estimated quartic specification in both levels and logs.

However, as far as we are aware, there has been no discussion in the PIR/EKC literature about the effect of using different units of measurement for the main explanatory variable, income (such as moving from $bn to $m), and how to determine the appropriate preferred polynomial specification. For the functional specifications given by Eqs. (1a), (1b), and (1c) in levels, it is known that for α1 , α2α3, and α4, the sizes (which do not affect any decision about the shape) are unit dependent, while the signs and significances are unit independent. This means that the coefficients can be used to decide the curvature/shape of the relationship. However, Hunt and Lynk (1993) showed that in multiplicative-logarithmic functions other than for the coefficient of the highest power in the polynomial function (hereafter referred to as the leading term), the estimated coefficients and their significances of the lower power terms are sensitive to rescaling (i.e., unit dependent), whereas the elasticities are not sensitive to rescaling (i.e., unit invariant).Footnote 8 Given this, Hunt and Lynk (1993) suggest that the focus of reporting and interpretation should be on the leading terms and the estimated elasticities, rather than the lower power coefficients. This issue is therefore explored here, since in conventional PIR/EKC studies, the preferred models have often been chosen based upon both the individual estimated coefficient signs and significance. This has been the case in many papers published on estimating an EKC that have been included in several leading environmental economics journals. For example, these include, for the common quadratic specification like Eq. (2a), the following:

  • In Energy Policy, Ang (2007; p. 4774) states ‘Under the EKC hypothesis, [β1] is expected to be positive whereas a negative sign is expected for [β2]’.

  • In Sustainable Development, Atici (2008; p. 158) states that according ‘to the EKC, we may expect the sign of β1 to be positive and that of β2 to be negative’.

  • In Energy, Tang and Tan (2015; p. 449) state ‘According to the EKC hypothesis, the sign of β1 is expected to be positive, while the sign of β2 is expected to be negative’.

  • In Economic Modelling, Kasman and Duman (2015; p. 98) state ‘Under the EKC hypothesis, it is expected that β1 > 0 and if β2 < 0. Hence, there is an inverted U-shaped pattern’.

  • In Renewable and Sustainable Energy Reviews, Jebli and Youssef (2015; p. 178) state ‘Under the EKC hypothesis, the sign of [β1] is expected to be positive, whereas [β2] is expected to be negative’.

  • In Ecological Indicators, Al-Mulali et al. (2016; p. 274) state ‘The existence of the EKC hypothesis, which indicates the inverted U-shaped relationship between income and CO2 emission, can be confirmed if β1 > 0 … and β2 < 0’.

  • In Ecological Economics, Bimonte and Stabile (2017; pp. 39-40) state ‘Contrary to expectations, the main stylized fact … is that the relationship between per capita income and conservation of environmental resources … follows a U-shaped path (β1 < 0; β2 > 0’).Footnote 9

  • In Energy Economics, Balaguer and Cantavella (2018; p. 290) state “Under the assumption of a conventional EKC we expect that [β1 > 0] and [β2 < 0];

  • In Renewable Energy, Sharif et al. (2019; p. 689) state “It can be seen from the results … that the economic growth has a positive value whereas the square rate of economic growth shows the negative value. This confirms the existence of the Kuznets curve hypothesis i.e., the inverted U-shape association between the economic growth and CO2 emission”;

  • In Structural Change and Economic Dynamics, Jiang et al. (2019; p. 248) state if “ … [β1 > 0] and [β2 < 0], it reveals an inverted U-shaped curve” but if “… [β1 < 0] and [β2 > 0], it indicates a U-shaped curve”; and

  • In Environmental Science and Pollution Research, Gormus and Aydin (2020; p. 27908) state “… the coefficient of [lnYt] should be positive and the coefficient of [ln2Yt] should be negative…”

This list represents just a few examples of where researchers have made such statements when estimating a quadratic EKC using logarithms. In addition, some authors have made similar statements when attempting to estimate a cubic EKC like Eq. (2b), such as the following:

  • In Energy Economics, Baek (2015; p. 14) state ‘… known as an N-shape curve, it is expected that β1 >0, β2 < 0 and β3 >0’.

  • In Economics of Energy & Environmental Policy, Sorge and Neumann (2020; p. 174) state ‘β1, β2, and β3 are statistically significant and β1 >0, β2 < 0 and β3 >0, which … suggests a N-shaped pattern’.

Other examples include Yang et al. (2015a, b), Jaforullah and King (2017); Terrell (2020), and Li et al. (2019). However, as we show below, such statements are misleading for the logarithmic versions of the EKC shown in Eq. (2), since the size, sign, and significance of the non-leading terms are irrelevant to the decision of whether the inverted U-shaped, N-shaped, or M-shaped EKC exists or not. Therefore, as stated, the remainder of this section focuses on the quadratic specification, Eq. (2a), for ease of exposition and because it is the most popular specification in the environmental economics literature. Nonetheless, the issue is equally important for researchers attempting to estimate cubic or quartic PIRs/EKCs (which are increasing in the literature), but the theoretical results for these can easily be generalised for the cubic and quartic cases (and are detailed in Appendix 2 for completeness). Moreover, in Section 4 we present examples of empirical estimates for all cases and develop a structured methodology for choosing the preferred order of polynomial for the estimated logarithmic EKC.

Some algebra for the quadratic logarithmic EKC

As discussed above, the quadratic version of the EKC is more popular in the literature, whether after ‘testing down’ or by assumption; hence, as indicated above, we focus here in the algebra section on Eq. (2a). Using the definitions above, the ‘raw’ data for Yt could be re-based by multiplying by an arbitrary constant (e.g. to convert the ‘raw’ data to millions or thousands or to index to a certain base year). Therefore, letting a be the rescaling factor, the new independent variable becomes:

$$ {Y}_t^{\ast }=a{Y}_t $$
(3)

so that Eq. (2a) becomes:

$$ \mathit{\ln}{E}_t={\beta}_0^{\ast }+{\beta}_1^{\ast}\mathit{\ln}{Y}_t^{\ast }+{\beta}_2^{\ast }{\mathit{\ln}}^2{Y}_t^{\ast } $$
(4)

Estimated parameters and statistical significance

After substituting Eq. (3) into Eq. (4) and re-arranging, it becomes:

$$ \mathit{\ln}{E}_t=\left[{\beta}_0^{\ast }+{\beta}_1^{\ast } lna+{\beta}_2^{\ast }{\mathit{\ln}}^2a\right]+\left[{\beta}_1^{\ast }+2{\beta}_2^{\ast } lna\right]\mathit{\ln}{Y}_t+{\beta}_2^{\ast }{\mathit{\ln}}^2{Y}_t $$
(5)

Given that the left-hand side of Eq. (2a) and Eq. (5) are the same, we can equate the right-hand sides to relate the previous and new coefficients so that:

$$ {\beta}_0^{\ast }={\beta}_0-{\beta}_1 lna+{\beta}_2{\mathit{\ln}}^2a $$
(6a)
$$ {\beta}_1^{\ast }={\beta}_1-2{\beta}_2 lna $$
(6b)
$$ {\beta}_2^{\ast }={\beta}_2 $$
(6c)

In this case, Eq. (6c) shows that the coefficient on the quadratic (leading) term is invariant to the units of measurement, while the other coefficients are unit dependentFootnote 10—shown in Eqs. (6a) and (6b).

The natural question to follow this is: Does the rescaling of the variables also affect the statistical significance of the coefficients? To investigate this, we need to consider the effect of the rescaling on the t-values of β1 and β2, which are defined as follows:

$$ {t}_{\beta_1}=\frac{\beta_1}{\sqrt{\mathit{\operatorname{var}}\left({\beta}_1\right)}} $$
(7a)
$$ {t}_{\beta_2}=\frac{\beta_2}{\sqrt{\mathit{\operatorname{var}}\left({\beta}_2\right)}} $$
(7b)
$$ {t}_{\beta_1^{\ast }}=\frac{\beta_1^{\ast }}{\sqrt{\mathit{\operatorname{var}}\left({\beta}_1^{\ast}\right)}} $$
(7c)
$$ {t}_{\beta_2^{\ast }}=\frac{\beta_2^{\ast }}{\sqrt{\mathit{\operatorname{var}}\left({\beta}_2^{\ast}\right)}} $$
(7d)

Substituting Eqs. (6b) and (6c) into Eqs. (7c) and (7d) gives:

$$ {t}_{\beta_1^{\ast }}=\frac{\beta_1-2{\beta}_2 lna}{\sqrt{\mathit{\operatorname{var}}\left({\beta}_1-2{\beta}_2 lna\right)}}=\frac{\beta_1}{\sqrt{\mathit{\operatorname{var}}\left({\beta}_1\right)}}\sqrt{\frac{\ \mathit{\operatorname{var}}\left({\beta}_1\right)}{\ \mathit{\operatorname{var}}\left({\beta}_1-2{\beta}_2 lna\right)}}-2 lna\frac{\beta_2}{\sqrt{\mathit{\operatorname{var}}\left({\beta}_2\right)}}\sqrt{\frac{\ \mathit{\operatorname{var}}\left({\beta}_2\right)}{\ \mathit{\operatorname{var}}\left({\beta}_1-2{\beta}_2 lna\right)}} $$
(8a)

and

$$ {t}_{\beta_2^{\ast }}=\frac{\beta_2}{\sqrt{\mathit{\operatorname{var}}\left({\beta}_2\right)}} $$
(8b)

Therefore, comparing Eq. (7a) with Eq. (8a) and Eq. (7b) with Eq. (8b) shows that:

$$ {t}_{\beta_1}\ne {t}_{\beta_1^{\ast }} $$
(9a)

whereas

$$ {t}_{\beta_2}={t}_{\beta_2^{\ast }} $$
(9b)

Thus, the statistical significance of the coefficient of the non-leading term (like its sign and size) is unit dependent given that the t-values will vary according to different scaling, whereas the statistical significance of the coefficient of the leading term (like its sign and size) is unit invariant, given the t-value does not vary due to different scaling. This implies that the only necessary condition for a quadratic (inverted U-shaped) EKC is that the leading term β2 is negative and statistically significant.Footnote 11 The necessary conditions for the cubic (N-shaped) and quartic (M-shaped) EKCs are detailed in Appendix 2.

Estimated turning point

The turning point for the quadratic logarithmic EKC specification, Eq. (2a), is found by differentiating with respect to Y and setting equal to zero. This is given byFootnote 12:

$$ {Y}^{TP}=\exp \left(\frac{-{\beta}_1}{2{\beta}_2}\right) $$
(10a)

but for Eq. (4) with the re-based units, this gives:

$$ {Y}^{\ast TP}=\exp \left(\frac{-{\beta}_1^{\ast }}{2{\beta}_2^{\ast }}\right) $$
(10b)

However, substituting Eqs. (6b) to (6c) into Eq. (10b) gives:

$$ {\displaystyle \begin{array}{c}{Y}^{\ast TP}=\exp \left(-\frac{\beta_1-2{\beta}_2 lna}{2{\beta}_2}\right)=\exp \left(-\frac{\beta_1}{2{\beta}_2}+ lna\right)\\ {}=\exp \left(-\frac{\beta_1}{2{\beta}_2}\right)\exp (lna)=a\exp \left(-\frac{\beta_1}{2{\beta}_2}\right)=a{Y}^{TP}\end{array}} $$
(10c)

Therefore, this shows that the rescaling of the income variable results in the turning point of the estimated EKC being rescaled by the same factor.Footnote 13 In other words, as would be expected intuitively, the estimated turning point is effectively the same.

Estimated elasticity and statistical significance

Given that we have shown above that the sign, size, and statistical significance of the non-leading term of a quadratic EKC in logarithmic form are unit dependent, we next consider the elasticity of E with respect to Y for Eq. (2a), which is given by:

$$ \eta =\frac{\partial {E}_t}{\partial {Y}_t}\frac{Y_t}{E_t}=\frac{\partial \mathit{\ln}{E}_t}{\partial \mathit{\ln}{Y}_t}={\beta}_1+2{\beta}_2\mathit{\ln}{Y}_t\kern0.5em $$
(11a)

And the elasticity for the rescaled version, Eq. (4), is given by:

$$ {\eta}^{\ast }=\frac{\partial {E}_t}{\partial {Y}_t^{\ast }}\frac{Y_t^{\ast }}{E_t}=\frac{\partial \mathit{\ln}{E}_t}{\partial \mathit{\ln}{Y}_t^{\ast }}={\beta}_1^{\ast }+2{\beta}_2^{\ast}\mathit{\ln}{Y}_t^{\ast } $$
(11b)

But substituting Eqs. (3), (6b), and (6c) into Eq. (11b) and re-arranging gives:

$$ {\eta}^{\ast }={\beta}_1-2{\beta}_2l\mathrm{n}a+2{\beta}_2\mathit{\ln}{Y}_t+2{\beta}_2 lna={\beta}_1+2{\beta}_2\mathit{\ln}{Y}_t $$
(11c)

so that the elasticity of E with respect to Y is unit invariant, since as Eqs. (11a) and (11c) show:

$$ {\eta}^{\ast }=\eta $$
(11d)

Furthermore, the statistical significance of the elasticity is also unit independent, since the t-values of η and η are identical. To show this, the variance of η can be expressed as follows:

$$ \mathit{\operatorname{var}}\left(\eta \right)=\mathit{\operatorname{var}}\left({\beta}_1+2{\beta}_2\mathit{\ln}{Y}_t\right)=\mathit{\operatorname{var}}\left({\beta}_1\right)+4\mathit{\ln}{Y}_t\ast \mathit{\operatorname{cov}}\left({\beta}_1,{\beta}_2\right)+4{\mathit{\ln}}^2{Y}_t\ast \mathit{\operatorname{var}}\left({\beta}_2\right) $$
(12b)

While for η, the variance can be expressed as follows:

$$ \mathit{\operatorname{var}}\left({\eta}^{\ast }\ \right)=\mathit{\operatorname{var}}\left({\beta}_1^{\ast }+2{\beta}_2^{\ast}\mathit{\ln}{Y}_t^{\ast}\right) $$
(12b)

Introducing the rescaling in Eq. (3), and utilising the standard properties of variance and covariance,Footnote 14 Eq. (12b) can be expressed as follows:

$$ \mathit{\operatorname{var}}\left({\eta}^{\ast }\ \right)=\mathit{\operatorname{var}}\left({\beta}_1^{\ast }+2{\beta}_2^{\ast}\mathit{\ln}{Y}_t^{\ast}\right)=\mathit{\operatorname{var}}\Big({\beta}_1^{\ast }+2{\beta}_2^{\ast}\ln \left(a{Y}_t\right)=\mathit{\operatorname{var}}\left({\beta}_1^{\ast}\right)+4\ln \left(a{Y}_t\right)\ast \mathit{\operatorname{cov}}\left({\beta}_1^{\ast },{\beta}_2^{\ast}\right)+4{\mathit{\ln}}^2\left({aY}_t\right)\ast \mathit{\operatorname{var}}\left({\beta}_2^{\ast}\right)=\mathit{\operatorname{var}}\left({\beta}_1-2{\beta}_2 lna\right)+4\mathit{\operatorname{cov}}\left({\beta}_1-2{\beta}_2 lna,{\beta}_2\right)\left( lna+\mathit{\ln}{Y}_t\right)+4\ast \mathit{\operatorname{var}}\left({\beta}_2\right){\left( lna+\mathit{\ln}{Y}_t\right)}^2=\mathit{\operatorname{var}}\left({\beta}_1\right)-4 lna\ast \mathit{\operatorname{cov}}\left({\beta}_1,{\beta}_2\right)+4{\mathit{\ln}}^2a\ast \mathit{\operatorname{var}}\left({\beta}_2\right)+4 lna\ast \mathit{\operatorname{cov}}\left({\beta}_1,{\beta}_2\right)-8{\mathit{\ln}}^2a\ast \mathit{\operatorname{var}}\left({\beta}_2\right)+4\mathit{\ln}{Y}_t\ast \mathit{\operatorname{cov}}\left({\beta}_1,{\beta}_2\right)-8 lna\ast \mathit{\ln}{Y}_t\ast \mathit{\operatorname{var}}\left({\beta}_2\right)+4{\mathit{\ln}}^2a\ast \mathit{\operatorname{var}}\left({\beta}_2\right)+8 lna\ast \mathit{\ln}{Y}_t\ast \mathit{\operatorname{var}}\left({\beta}_2\right)+4{\mathit{\ln}}^2{Y}_t\ast \mathit{\operatorname{var}}\left({\beta}_2\right)=\mathit{\operatorname{var}}\left({\beta}_1\right)+4\mathit{\ln}{Y}_t\ast \mathit{\operatorname{cov}}\left({\beta}_1,{\beta}_2\right)+4{\mathit{\ln}}^2{Y}_t\ast \mathit{\operatorname{var}}\left({\beta}_2\right)=\mathit{\operatorname{var}}\left(\eta \right) $$
(12c)

Eq. (11d) and Eq. (12c) show that the elasticity estimates and their variances are not affected by the rescalingFootnote 15. Hence, their standard errors, t-values, and significance levels are all not unit dependent.

Sufficient condition

Mathematically, conditions are formalised as being ‘necessary’ and ‘sufficient’. As discussed above, the necessary condition for an estimated quadratic (inverted U-shaped) EKC is that the estimated leading term, β2, is negative (and statistically significant). Mathematically, the sufficient condition for a quadratic (inverted U-shaped) EKC is that \( -\frac{\beta_1}{2{\beta}_2} \)Footnote 16 is a real number, which holds when β2 < 0 (and statistically significant). Hence, the sufficient and necessary conditions for a quadratic (inverted U-shaped) EKC are effectively the same. Furthermore, the turning point needs to be within the data range of the sample. In other words, the sufficient condition for an estimated quadratic (inverted U-shaped) EKC, as in Eq. (2a), is that the turning point is within the sample range and the estimated pairwise elasticities are positive and significant for the initial upward sloping part of the estimated curve but they approach zero and become insignificant at the first turning point, thereafter becoming negative and significant on the downward sloping part (similar sufficient conditions for the cubic and quartic EKCs are detailed in Appendix 2).

Summary

In this section, we have shown mathematically that for the quadratic logarithmic EKC, the signs, sizes, and significances of the non-leading coefficients are unit dependent, whereas the signs, sizes, and significances of the leading squared term and the estimated elasticity are unit independent. Furthermore, we have also shown mathematically that the estimated turning point, although rescaled, is effectively the same and thus not actually dependent upon the units of measurement of the variables. Although this illustration is for the quadratic EKC specification, the results are easily generalised for the cubic and quartic specifications (as shown in Appendix 2). The next section highlights this by presenting some empirical illustrations for all three specifications.

Empirical Illustration

This section empirically illustrates the findings from the previous section by employing the Baek (2015) data used by Jaforullah and King (2017) to estimate a CO2 emissions EKC for Denmark and Sweden, respectively.Footnote 17 For simplicity, we use ordinary least squares (OLS) to estimate the different EKC specifications. Nonetheless, we realise that more sophisticated methodologies have been, and in all likelihood would be, employed, but the different estimation methods do not change the key messages from this paper.Footnote 18 The next three sub-sections therefore give illustrations for the quadratic, cubic, and quartic specifications, respectively, highlighting the mathematical findings from the previous section and developing an estimation strategy for developing logarithmic EKCs.

Empirical illustration of the quadratic logarithmic EKC

The results of the empirical illustrations are given in Table 1, showing the estimation results with different scaling factors given in different columns. This shows that the empirical results confirm the finding of the mathematical derivations above. For Denmark, \( \hat{\beta_2}=-1.42 \)with an associated t-value at −9.62; both clearly do not change by rescaling the income variable. Similarly, for Sweden, \( \hat{\beta_2}=-1.16 \) and the associated t-value is −4.83. In other words, the sign, size, and statistical significance of this estimated coefficient \( \left(\hat{\beta_2}\right) \) are unit invariant.

Table 1 Quadratic specification estimation results

However, this is not the case for \( \hat{\beta_1} \). Table 1 clearly shows that both the size and the sign for both Denmark and Sweden vary according to the different rescaling, which is further shown in Fig. 1, which gives examples of scaling factors to illustrate the possibility of estimated coefficients with opposite signs and with significant and insignificant values (according to t-values). Figure 1 therefore shows that there is a range of scaling factors whereby the estimates of \( \hat{\beta_1} \) would be negative and another range that would suggest that \( \hat{\beta_1} \)is not significantly different from zero. As can be seen from Table 1, for Denmark, when the scaling factor is 0.00003, \( \hat{\beta_1} \) is negative, and for the last five scaling factors, it is positive. Furthermore, when the scaling factor is 0.000032, \( \hat{\beta_1} \) is also statistically insignificant. For Sweden, \( \hat{\beta_1} \) is negative when the scaling factor is 0.00003 and statistically insignificant when it is 0.000049. In short, Table 1 and Fig. 1 illustrate that, for both countries, \( \hat{\beta_1} \) can take negative or positive values as well as zero (e.g. the first column in Table 1). It also shows that there is a range of scaling factors (illustrated by the horizontal 10% significance line in Fig. 1) for which the estimated coefficients become statistically insignificant (e.g. the second column in Table 1). This clearly shows that the sign, size, and statistical significance of the coefficient \( \hat{\beta_1} \) are unit dependent.

Fig. 1
figure 1

Various estimates for \( \hat{\beta_1} \) and associated significance levels for quadratic EKC

Figure 2 illustrates the estimated shapes of the EKCs and the turning points for three different scaling factors: a—labelled as ‘Act’ for the actual original data (where a =1), ‘Min’ for the smallest scaling factor used in the chart, and ‘Max’ for the largest scaling factor used in the chart. Note, in Fig. 2 the three different, relatively extreme, values for the income scaling factor a shown for the income axis are chosen for illustrative purposes only and highlight that the same results would apply for CO2 emissions whatever scaling factor were used. Figure 2 shows, as detailed above, that the estimated EKC turning point for both Denmark and Sweden is scaled by the same scaling factor used to scale the independent variable in functional form Eq. (2a). Thus, the shape of the EKCs for each country is identical, and the turning points in terms of emissions are the same, but with the income just scaled accordingly. In other words, \( \hat{\beta_1} \) does not have an impact on the actual shape of the estimated EKC and where the important turning point is—whether \( \hat{\beta_1} \) is positive or negative and/or statistically significant or insignificant. Therefore, decisions by researchers on the acceptance or otherwise the existence and shape of an estimated EKC should not be based on \( \hat{\beta_1} \).

Fig. 2
figure 2

Estimated turning points with scaled per capita income. (Note: Act, a =1; Min, a =0.00003; Max, a =33333. These represent the examples of the scaling factors shown on the income axes.)

Table 1 also shows that, for both countries, the estimated summary elasticities (calculated at mean values as discussed in Gujarati and Porter 2009), as well as their t-values (and thus their statistical significance) are invariant to rescaling. Thus, when deciding on the acceptance or otherwise the existence and shape of an estimated logarithmic EKC, there should be greater focus on the estimated elasticity and not the estimate of β1. In other words, since the estimate of β1 is unit variant in terms of sign and significance, it does not provide any useful information about the shape of the relationship.Footnote 19 Instead, for an inverted U-shaped EKC to hold, one of the necessary conditions is having a statistically significant elasticity that is positive before the turning point and becomes negative after the turning point (or at least for some sample values). Figure 3 illustrates the estimated pointwise elasticities for both countries, both against income and time, using the examples of three different scaling factors a. These clearly show that the elasticity estimates are not unit dependent (both when plotted against income and time). Figure 3 also shows that, for the quadratic case, the turning points for both countries are within the sample size, which is important when deciding upon the preferred specification—an issue we will return to later as we explore the cubic and quartic cases.

Fig. 3
figure 3

Estimated pointwise elasticities with 95% confidence intervals. (Note: Act, a =1; Min, a =0.00003; Max, a =33333. These represent the examples of the scaling factors shown on the income axes.)

The charts plotted against income in Fig. 3 also show that for both countries, the estimated pointwise elasticities are positive and significant at low levels of income but gradually fall, becoming insignificant the closer to the estimated turning point, where the elasticity would be zero. Thereafter, as income increases, the estimated elasticities continue to fall, gradually becoming more negative and significant. The charts against time in Fig. 3 show that for both countries, the estimated pointwise elasticities are initially positive and fall over time and then become increasingly negative.

Empirical illustration of the cubic logarithmic EKC

The results for the empirical estimation for the cubic case are presented in Table 2 and illustrated in Fig. 4. These show that the cubic specification results display the same issue as the quadratic case. Although the size, sign, and significance of the estimated coefficients of the leading term (\( \hat{\beta_3} \)) are unit independent, this is not the case for \( \hat{\beta_1} \) and \( \hat{\beta_2} \) since Table 2 and Fig. 4 clearly show that, for all the illustrative examples, their sign, size, and significance are unit dependent. Moreover, there is a range of scaling factors (illustrated by the horizontal 10% significance line in the second column of Fig. 4) for which the coefficients \( \hat{\beta_1} \) and \( \hat{\beta_2} \) become statistically insignificant. Furthermore, like the quadratic case, the estimated summary elasticities and their t-values (and thus their statistical significance) are invariant to rescaling.

Table 2 Cubic specification estimation results
Fig. 4
figure 4

Various estimates for \( \hat{\beta_1} \) and \( \hat{\beta_2} \) and associated significance levels for cubic EKC

Like Fig. 2 for the quadratic specification, Fig. 5 shows for the cubic specification the estimated shapes of the EKCs and the turning points for three illustrative scaling factors: a—again labelled ‘Act’, ‘Min’, and ‘Max’ shown on the income axis. Again, this demonstrates that the estimated EKC turning points for both countries are just scaled by that used to scale the independent variable in functional form Eq. (2b). Thus, the shapes of the EKCs for each country are identical, and the turning points are at the same level of emissions, with the income just scaled accordingly. So, for the cubic case, both \( \hat{\beta_1} \) and \( \hat{\beta_2} \) have no impact on the actual shape of the estimated EKC and where the important turning point is—whether \( \hat{\beta_1} \) and \( \hat{\beta_2} \) are positive or negative and/or statistically significant/insignificant. Therefore, decisions by researchers on the acceptance or otherwise the existence and shape of an estimated EKC should not be based on \( \hat{\beta_1} \)and/or \( \hat{\beta_2} \). Furthermore, it is interesting to note that, for this specification, the second turning point for Denmark does not fall within the sample range (unlike Sweden), which suggests that despite \( \hat{\beta_3} \) being positive and significant, the cubic specification is not appropriate for Denmark (although it might be for Sweden).

Fig. 5
figure 5

Estimated turning points with scaled per capita income. (Note: For Denmark, Act, a =1; Min, a =0.000017; Max, a =1000; for Sweden, Act, a =1; Min, a =0.000035; Max, a =1000. These represent the examples of the scaling factors shown on the Denmark and Sweden income axes, respectively.)

Figure 6 displays the estimated pointwise elasticities for Denmark and Sweden against income and time using the different scaling factors. Like Fig. 3 for the quadratic case, Fig. 6 clearly shows that for the cubic case, the elasticity estimates are not unit dependent (both when plotted against income and time). For both Denmark and Sweden, the estimated pointwise elasticities are positive and significant at low levels of income but gradually fall, becoming insignificant the closer to the estimated first turning point, where the elasticity would be zero. Thereafter, as income increases, the estimated elasticities continue to fall, gradually becoming more negative and significant. However, Denmark then follows a different path to that of Sweden. For Denmark, the estimated elasticities flatten and do not reach the second turning point, given it is outside of the data range, whereas for Sweden, the estimated elasticities start to rise again, becoming around zero at the estimated second turning point and then continue to rise. As for the charts against time in the second column of Fig. 6, these show slightly different patterns but still reflect the situation in the first column. Again, it is interesting to focus on the illustrative estimates for Denmark presented in Fig. 6, since the plots against both income and time show that the cubic or N-shaped specification is not appropriate. For there to be an N-shaped pattern, the elasticity should have at least some positive values after the second turning point when plotted against income (and close to the end of the sample when plotted against time). However, this is clearly not the case for Denmark in Fig. 6, which is in line with the finding of the second turning point to be outside of the sample range for Denmark. Nonetheless, this is the case for Sweden in Fig. 6, which would be expected given both estimated turning points are within the data sample range, and Fig. 6, therefore, suggests that an estimated N-shaped EKC might be appropriate for Sweden.

Fig. 6
figure 6

Estimated pointwise elasticities with 95% confidence intervals. (Note: For Denmark, Act, a =1; Min, a =0.000017; Max, a =1000; for Sweden, Act, a =1, Min, a =0.000035; Max, a =1000. These represent the examples of the scaling factors shown on the Denmark and Sweden income axes, respectively.)

Empirical illustration of the quartic logarithmic EKC

The results for the empirical estimation for the quartic case are presented in Table 3 and illustrated in Fig. 7a and 7b for a range of illustrative scaling factors. These results show that the quartic specification results are consistent with the quadratic and cubic cases. Although the size, sign, and significance of the estimated coefficients of the leading term (\( \hat{\beta_4} \)) are unit independent, this is not the case for \( \hat{\beta_1} \), \( \hat{\beta_2} \), and \( \hat{\beta_3} \), since Table 3 and Fig. 7a and 7b clearly show that their sign, size, and significance are generally unit dependent. Moreover, there is a range of scaling factors (illustrated by the horizontal 10% significance line in the second column of Fig. 7a and 7b) for which the coefficients \( \hat{\beta_1} \), \( \hat{\beta_2} \), and \( \hat{\beta_3} \) become statistically insignificant. Furthermore, like the quadratic and cubic cases, the elasticities and their t-values (and thus their statistical significance) are invariant to rescaling. It is interesting to note that, for the quartic case, the leading term (\( \hat{\beta_4} \)) and the summary elasticity (\( \hat{\eta} \)) are insignificant for both Denmark and Sweden. This suggests that the quartic specification is not the appropriate representation of the EKC for both countries.

Table 3 Quartic specification estimation results
Fig. 7
figure 7figure 7

Various estimates for \( \hat{\beta_1} \), \( \hat{\beta_2} \), \( \hat{\beta_3} \), and associated significance levels for quartic EKC—a Denmark, b Sweden

Figure 8 illustrates the estimated shapes of the EKCs and the turning points for three illustrative scaling factors for the quartic case—once again labelled as ‘Act’, ‘Min’, and ‘Max’ shown on the income axis. Like before, this shows that the estimated EKC turning points are identical, and the turning points are at the same level of emissions but with income just scaled accordingly. So in this case, \( \hat{\beta_1} \), \( \hat{\beta_2} \), and \( \hat{\beta_3} \) have no impact on the actual shape of the estimated EKC and where the important turning points are. Whether \( \hat{\beta_1} \), \( \hat{\beta_2} \), and/or \( \hat{\beta_3} \) are positive/negative and/or statistically significant/insignificant has no impact on the estimated EKC whatsoever. Therefore, decisions by researchers on the acceptance or otherwise the existence and shape of an estimated quartic EKC should not be based on \( \hat{\beta_1} \), \( \hat{\beta_2} \), and/or \( \hat{\beta_3} \).

Fig. 8
figure 8

Estimated turning points with scaled per capita income. (Notes: Act, a =1; Min, a =0.00003; Max, a =100. These represent the examples of the scaling factors shown on the income axes. For Denmark, only one turning point is real, and other two turning points (roots) are complex.)

In terms of the actual shapes, it is interesting to note that only one turning point is established for Denmark since the other two have complex roots. This is consistent with the cubic case where the second turning point was outside the data sample range—all pointing to the EKC for Denmark being a simple quadratic function—whereas for Sweden, all three turning points are determined, but the first one is not within the data sample range. This coupled with the insignificant \( \hat{\beta_4} \) suggests that for Sweden also, the quartic specification is not the appropriate one, but the cubic specification is.

Finally, for completeness, Fig. 9 displays the estimated pointwise elasticities for both countries against income and time using the different scaling factors and again shows that the elasticity estimates are not unit dependent. Since the pointwise elasticity is insignificant for both country cases, it confirms what has been suggested above in terms of the insignificance of the leading term and the turning points: the quartic specification is not the appropriate representation of the EKC relationship for either country.

Fig. 9
figure 9

Estimated pointwise elasticities with 95% confidence intervals. (Notes: Act, a =1; Min, a =0.00003; Max, a =100. These represent the examples of the scaling factors shown on the income axes. For Denmark, only one turning point is real, and other two turning points (roots) are complex.)

Summary

The empirical illustrations presented in this section clearly highlight the potential problem when estimating non-linear logarithmic EKCs. We have shown that the signs, sizes, t-values, and statistical significances of the estimated coefficients of the variables other than the leading term (the variables with the highest power) are unit dependent, whereas the sign, size, t-value, and statistical significance of the estimated coefficient of the leading term (the variable with the highest power of the polynomial functional form) are unit of measurement invariant. Furthermore, we have also shown that the estimated elasticities and their t-values (and consequentially their statistical significances) are also invariant to any rescaling (they are unit of measurement invariant), while the turning points, although rescaled by the same rescaling factor, are effectively the same.

We have also highlighted the factors that should be considered when choosing the preferred specification for an estimated logarithmic EKC. The focus should clearly be on the sign and size of the leading term, the elasticity estimates, and where the turning point(s) are relative to the sample data range. Researchers should not just choose a certain polynomial and estimate it blindly without a detailed scrutiny of all the factors discussed above. The next section summarises and discusses the consequences of these issues and details a suggested way forward for future researchers when estimating logarithmic EKCs.

Summary and Conclusion

Understanding the relationship between environmental pollution and its different drivers is a major issue in the environmental economics literature, with much research focussed on attempting to find the existence or otherwise of EKCs. The use of an appropriate functional form for the EKC and the proper interpretation of findings are at the heart of this research. As Jaforullah and King (2017) and Mikayilov et al. (2018), inter alia, discuss, using inappropriate functional forms can result in a relationship that does not properly demonstrate the response of pollution to its drivers. Moreover, when using different polynomials with the variables in logarithmic form (as shown here with the quadratic, cubic, and quartic specifications), it is important to understand the effect of measuring the variables in different units of measurement and how to choose an appropriate preferred estimated EKC specification. In this regard, this paper investigates the effect on the estimated coefficients, their t-values and statistical significance, the turning points of the function, and the estimated elasticities. It shows that:

  1. I.

    The signs, sizes, t-values, and statistical significance of the estimated coefficient of the non-leading terms are unit dependent, rendering the estimates arbitrary. Thus, they should not contribute to any decision about the conclusion of the existence of an EKC nor its shape.

  2. II.

    The sign, size, t-value, and statistical significance of the estimated coefficient of the leading term are unit independent. Thus, this should contribute to the decision about the conclusion of the existence of an EKC and its shape.

  3. III.

    Rescaling the independent income variable causes the turning point to also be rescaled, but its location with respect to the minimum and maximum value of emissions stays unchanged. Thus, this should contribute to the decision about the conclusion of the existence of an EKC and its shape.

  4. IV.

    The signs, sizes, t-values, and significance of estimated elasticities, like the estimate of the coefficient of the leading term, are also unit independent. Thus, these should contribute to the decision about the conclusion of the existence of an EKC and its shape.

This clearly demonstrates that the practice followed in many previously published environmental economics papers, whereby the estimates of the non-leading terms are used in the decision to choose the preferred specification of an estimated logarithmic EKC, is incorrect and should not be followed since it could result in misleading conclusions. Instead, we suggest the following strategy should be adopted by researchers for determining the existence and shape of an estimated logarithmic EKC.

  1. a)

    Given the potential problems with the logarithmic specification, researchers should initially investigate whether it might be appropriate to estimate the EKC in levels, like in Eq. (1), or in logarithms, like in Eq. (2). For this, following Moosa (2017), we suggest that non-nested tests (such as those detailed in Pesaran and Pesaran 2009) are conducted to try to determine which would be the most appropriate, since some of the issues outlined in this paper might be avoided if it is clear that a level version of the EKC like those in Eq. (1) is preferred. If, however, the logarithmic version of the EKC, like those in Eq. (2), is preferred (or, as sometimes happens with non-nested tests, a clear distinction between the specifications is not possible), then the criteria (set out in b) below should be followed.

  2. b)

    Estimate a polynomial logarithmic EKC like those presented in Eq. (2), with the initial order of the polynomial chosen by the researcher. However, given the growing interest in the literature in attempting to estimate higher-order specifications, this choice should be ‘reasonably high’ to be able to test down in the spirit of the ‘general-to-specific’ methodology, based on the following criteria.

    1. i.

      Check the statistical significance of the leading term; i.e. check whether β4 is significant if estimating a quartic specification as in Eq. (2c), or whether β3 is significant if estimating a cubic specification as in Eq. (2b), or whether β2 is significant if estimating a quadratic specification as in Eq. (2a).

      If the leading term is not statistically significant, try re-estimating with a lower-order polynomial.

      If the leading term is statistically significant, go to ii.

    2. ii.

      Check the sign of the leading term to ensure that the estimates conform with the a priori expectations about the shape of the EKC; i.e. for a quartic specification as in Eq. (2c), if β4 < 0, it suggests an M-shaped relationship; for a cubic specification as in Eq. (2b), if β3 > 0, it suggests an N-shaped relationship; and for a quadratic specification as in Eq. (2a), if β2 < 0, it suggests an inverted U-shaped relationship.Footnote 20 Furthermore, if the estimates do not conform with a priori expectations, then a judgement is needed as to whether the estimated shape suggested is acceptable or not.

      If it is decided that the estimated shape is not acceptable, then alternative specifications would need to be explored.Footnote 21

      If it is decided that the estimated shape is acceptable, then go to iii to further test the acceptability of the estimated model.

    3. iii.

      Check that all the acceptable estimated turning points for the specification estimated, be it quartic, cubic, or quadratic, are within a reasonable range—i.e. greater than the sample minimum value and smaller than the sample maximum value.

      If the turning points are not within the reasonable range, then alternative specifications would need to be explored. For example, if for an estimated M-shaped quartic EKC the income level for the highest turning point is found to be higher than the sample maximum income value, or the income level for the highest turning point is found to be higher than the sample maximum income value for an estimated N-shaped cubic EKC, then it would suggest that lower polynomials should be explored. Ultimately, if the income level for the highest turning point is found to be higher than the sample maximum income value for an estimated inverted U-shaped quadratic EKC, in all probability, it would suggest that the PIR is monotonically increasing. In short, we suggest a ‘general-to-specific’ type approach until a statistically acceptable model consistent with a priori expectations is obtained.Footnote 22

      If, however, the turning points are within the reasonable range, then go to iv to further test the acceptability of the estimated model.

    4. iv.

      Check that the estimated summary elasticity is significant and that the pointwise elasticities follow the pattern in terms of the sign, size, and significance, as discussed and shown in Fig. 3, Fig. 6, and Fig. 9 for the appropriate EKC polynomial being estimated. In other words, for any estimated EKC, be it inverted U-shaped, N-shaped, or M-shaped, the estimated pairwise elasticities should be positive and significant for the initial upward sloping part of the estimated curve, but they approach zero and become insignificant at the first turning point, thereafter becoming negative and significant on the downward sloping part. If, however, an N- or M-shaped EKC is being considered, after the first turning point, the estimated pairwise elasticities should continue to be negative and significant but approach zero and become insignificant at the second turning point, thereafter becoming positive again and significant on the next upward sloping part of the estimated curve. If, however, an M-shaped EKC is being considered, the estimated pairwise elasticities should continue to be positive and significant after the second turning point, but they approach zero and become insignificant at the third turning point, thereafter becoming negative again and significant on the next downward sloping part of the estimated curve.Footnote 23

      If this condition is not satisfied, then again alternative specifications would need to be explored.Footnote 24

      However, if this final condition is accepted following the acceptance of all the previous conditions, then the estimated relationship could be considered as the preferred estimate for appropriate economic and policy analysis.

This paper has explored important issues around estimating logarithmic EKCs using various orders of polynomial specifications in detail. It highlights some of the common pitfalls but also develops a modelling strategy that should ensure a consistent approach for researchers investigating logarithmic EKCs. Footnote 25 That said, this paper focuses very much on the one type of specification, albeit an extremely popular type, and when undertaking such modelling, it is vital that the correct and appropriate econometric estimation and testing techniques are employed to the highest standard and considered alongside the modelling strategy suggested above. Moreover, future research should consider how the non-linear polynomial specifications considered in this paper could be used, or compared, with econometric non-linear modelling methods as discussed in Lieb (2003), Dinda (2004), and Kijima et al. (2010), inter alia. These include the structural time series method (Harvey 1989); time-varying coefficient cointegration (Park and Hahn 1999); quantile cointegration regression (Xiao 2009); and the multiplicative indicator saturation approach (Ericsson 2012; Castle and Hendry 2019).