Empirical likelihood for spatial dynamic panel data models

Li, Yinghua; Qin, Yongsong

doi:10.1007/s42952-021-00150-4

Empirical likelihood for spatial dynamic panel data models

Research Article
Published: 29 September 2021

Volume 51, pages 500–525, (2022)
Cite this article

Download PDF

Journal of the Korean Statistical Society Aims and scope Submit manuscript

Empirical likelihood for spatial dynamic panel data models

Download PDF

Yinghua Li¹ &
Yongsong Qin¹

1966 Accesses
Explore all metrics

Abstract

Spatial dynamic panel data (SDPD) models have received great attention in economics in recent 10 years. Existing approaches for the estimation and test of SDPD models are quasi-maximum likelihood (QML) approach and generalized method of moments (GMM). In this article, we introduce the empirical likelihood (EL) method to the statistical inference for SDPD models. The EL ratio statistics are constructed for the parameters of spatial dynamic panel data models. It is shown that the limiting distributions of the empirical likelihood ratio statistics are chi-squared distributions, which are used to construct confidence regions for the parameters of the models. Simulation results show that the EL based confidence regions outperform the normal approximation based confidence regions.

Robust Two-Stage Estimation in General Spatial Dynamic Panel Data Models

Article 12 December 2023

Multiple Testing for Different Structures of Spatial Dynamic Panel Data Models

Specification tests for spatial panel data models

Article 30 July 2020

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Real data are often observed at different locations and times, which are called as spatial panel data (SPD). Examples are economic growth rates of major cities in China over last 40 years, monthly unemployment rates of states in USA in the last decade and daily infection rates of COVID-19 in major cites in Hubei province in China over 3 months since December 31, 2019. These data may be modelled by SPD models. The research to various SPD models can be found in Anselin (1988), Elhorst (2003), Baltagi et al. (2003), Baltagi and Li (2006), Chen and Conley (2001), Pesaran (2004), Kapoor et al. (2007), Baltagi et al. (2007), Lee and Yu (2010a), Mutl and Pfaffermayr (2011), Parent and LeSage (2011) and Baltagi et al. (2013), among others. By adding a dynamic element into a SPD model Anselin (2001) proposes a spatial dynamic panel data (SDPD) model, which increases the flexibility of a SPD model. Obviously, SPD models belong to SDPD models. There has been a growing interest in the statistical inferences for SDPD models since then. For an overview on the SPDP models, refer to Yu et al. (2008) and Lee and Yu (2010b), Su and Yang (2015), Lee and Yu (2015a, b), Yu and Lee (2010), Elhorst (2010), Elhorst (2005), Yang et al. (2006), Mutl (2006), Su and Yang (2007), and Lee and Yu (2010c), among others. There are two popular methods for the estimation and test of SPD and SDPD models: quasi-maximum likelihood (QML) approach and generalized method of moments (GMM), which can be seen in above references.

In this article, we use the empirical likelihood (EL) method, proposed by Owen (1988, 1990), to construct the confidence regions for the parameters in a SDPD model. It is observed, in the case of independent observations, that the EL method to construct confidence intervals/regions has many advantages over its counterparts like the normal-approximation-based method and the bootstrap method (e.g., Hall and La Scala 1990; Hall 1992). A excellent review on EL for regressions can be found in Chen and Keilegom (2009). There are a lot of references on EL methods for independent samples or in the context of sample surveys. To save space, we list a few of them such as Owen (2001), Qin and Lawless (1994), Chen and Qin (1993), Zhong and Rao (2000) and Wu (2004). The date are dependent and follow certain structures in SDPD models. The study of the EL method for some SPDP models enjoys a certain progress. For example, there are a few articles studying the EL method for the pure spatial data (PSD, the special case of SPD with a fixed time). For instance, Nordman (2008a, b) and Bandyopadhyay et al. (2015) use the blockwise EL (BEL) proposed by Kitamura (1997) to PSD. Recently, by exploring inherent martingale structures, Qin (2021) and Jin and Lee (2019) use the EL method to construct confidence intervals/regions in PSD models. Further, Li and Qin (2020) extends the EL method proposed by Qin (2021) and Jin and Lee (2019) to SPD models. We note that there is no research work on the EL method for SDPD models.

There are many kinds of SDPD models. Lee and Yu (2010c) gives a review on the classification and research development of some SDPD models. Su and Yang (2015) introduces the QML method to SDPD models with spatial errors, where different types of space-specific effects and different ways that initial observations being generated (exogenously or endogenously) are investigated. Since SDPD models are quite complicated, as a starting point, in this article, we study the EL method for the SDPD models in Su and Yang (2015) with the restriction that there is no space-specific effects (or named as zero drift) and initial observations are generated exogenously. The study of the EL method for general SDPD models without above restriction is left for our future study. Our research results show that the EL based confidence regions generally outperform the normal approximation (NA) based confidence regions when the space units are large enough.

The rest of the article is organized as follows. Section 2 presents the main results. Results from a simulation study are reported in Sect. 3. Section 4 gives the analysis of real data. All technical details are presented in Sect. 5.

2 Main results

In this article, we suppose that there are n individual units and T time periods and the sampling data satisfy the following SDPD model with spatial error:

$$\begin{aligned} y_t= & {} \rho y_{t-1}+x_t\beta +z\gamma +\epsilon _t, \end{aligned}$$

(1)

$$\begin{aligned} \epsilon _t= & {} \lambda W_n\epsilon _t+\nu _t, t=1, 2, \ldots , T, \end{aligned}$$

(2)

where $y_t=(y_{1t},\ldots , y_{nt})'$ is an n-dimensional column vector of observed dependent variables, $\rho (|\rho |< 1)$ characterizes the dynamic effect, $x_t=(x_{1t},\ldots , x_{nt})'$ is an $n\times p$ matrix of time-varying exogenous variables, $z=(z_1,\ldots , z_n)'$ is an $n\times q$ matrix of time-invariant exogenous variables, and $\beta $ and $\gamma $ are $p\times 1$ and $q\times 1$ regression coefficients, respectively. The disturbance vector $\epsilon _t=(\epsilon _{1t},\ldots , \epsilon _{nt})'$ is an $n\times 1$ vector of errors. The parameter $\lambda $ is a spatial autoregressive coefficient and $W_n$ is an $n\times n$ spatial weighting matrix of constants, $\nu _t=(\nu _{1t},\ldots , \nu _{nt})'$ is an $n\times 1$ column vector, and $\{\nu _{it}\}$ are i.i.d. across t and i with zero mean and variance $\sigma ^2_\nu $. The spatial weighting matrix is also called contiguity matrix, which is determined by the spatial dependence of n spatial units. There are many ways to define $W_n$ (e.g. pages 17–19 in Anselin 1988). Let $W_{ij}$ be the (i, j) element of $W_n$. Commonly used $W_n$ includes Rook contiguity, Bishop contiguity and Queen contiguity as follows. Rook contiguity: define $W_{ij} = 1$ if the units i and j share a common side and $W_{ij} = 0$, otherwise. Bishop contiguity: define $W_{ij} = 1$ if the units i and j share a common vertex and $W_{ij} = 0$, otherwise. Queen contiguity: define $W_{ij} = 1$ if the units i and j share a common side or vertex and $W_{ij} = 0$, otherwise. The choice of $W_n$ is important. Our results hold true for all these commonly used $W_n$.

The models (2.1)–(2.3) in Su and Yang (2015) are as follows:

$$\begin{aligned} y_t= & {} \rho y_{t-1}+x_t\beta +z\gamma +\mu +\epsilon _t, \\ \epsilon _t= & {} \lambda W_n\epsilon _t+\nu _t, t=1, 2, \ldots , T, \end{aligned}$$

where $\mu =(\mu _1, \ldots , \mu _n)'$ represent the unobservable individual or space-specific effects and other notations are the same as in model (1) and (2). Compared to models (2.1)–(2.3) in Su and Yang (2015), the only difference is that there is no space-specific effects $\mu $ in model (1) and (2). Our initial investigation shows that the EL method for SDPD models with space-specific effects may need an adjusted EL method. Further research is needed and left for our future work.

We develop the EL method for the SDPD model when $y_0$ is exogenous. In this case, we can treat $y_0$ as a fixed constant vector as it contains no information about the model parameters. For convenience, we use $\mathbf{1} _k$ to denote a $k\times 1$ vector of ones, $\mathbf{0} _k$ to denote a $k\times 1$ vector of zeros, and $J_k=\mathbf{1} _k \mathbf{1} '_k$, where $\otimes $ is the Kronecker product.

Let $Y=(y'_1, y'_2, \ldots , y'_T)'$, $Y_{-1}=(y'_0, y'_1, \ldots , y'_{T-1})'$, $X=(x'_1, x'_2, \ldots , x'_T)'$, $\nu =(\nu '_1, \nu '_2, \ldots , \nu '_T)'$, $Z=\mathbf{1} _T\otimes z$, $B=B(\lambda )=I_n-\lambda W_n$ and $\epsilon =(\epsilon '_1, \epsilon '_2, \ldots , \epsilon '_T)'$. Then model (1) and (2) can be written in a matrix form as:

$$\begin{aligned} \left( \begin{array}{l} y_1\\ y_2\\ \vdots \\ y_T \end{array} \right) = \rho \left( \begin{array}{c} y_0\\ y_1\\ \vdots \\ y_{T-1}\end{array} \right) +\left( \begin{array}{c} x_1\\ x_2\\ \vdots \\ x_T \end{array} \right) \beta +\left( \begin{array}{l} z\\ z\\ \vdots \\ z \end{array} \right) \gamma + \left( \begin{array}{c} \epsilon _1\\ \epsilon _2\\ \vdots \\ \epsilon _T \end{array} \right) , \end{aligned}$$

with

$$\begin{aligned} \left( \begin{array}{l} \epsilon _1\\ \epsilon _2\\ \vdots \\ \epsilon _T \end{array} \right) =\left( \begin{array}{llllll}B^{-1} &{} 0 &{} 0 &{} \cdots &{} 0 &{} 0\\ 0 &{} B^{-1}&{} 0 &{} \cdots &{} 0 &{} 0\\ \vdots \\ 0&{} 0&{} 0&{} \cdots &{} 0 &{} B^{-1}\end{array} \right) \left( \begin{array}{l} \nu _1\\ \nu _2\\ \vdots \\ \nu _T \end{array} \right) , \end{aligned}$$

or

$$\begin{aligned} Y=\rho Y_{-1}+X\beta +Z\gamma +\epsilon , \end{aligned}$$

(3)

with

$$\begin{aligned} \epsilon =(I_T\otimes B^{-1})\nu , \end{aligned}$$

(4)

where $\epsilon \sim (0, \sigma ^2_\nu \Omega )$, with

$$\begin{aligned} \Omega =\Omega (\lambda )=I_T\otimes (B'B)^{-1}. \end{aligned}$$

(5)

Let $\theta =(\beta ', \gamma ', \rho )'$ and $\psi =(\theta ', \sigma ^2_\nu , \lambda )'$. We adopt the QML method to derive the estimating equations for the EL method. Under the assumption of normality (which is only used at this moment), based on (3) and (4), the log-likelihood function (ignoring constants) is

$$\begin{aligned} \widetilde{L}(\psi )=-{nT\over 2}\log \sigma _\nu ^2 -\frac{1}{2}\log |\Omega |-{1\over 2\sigma _\nu ^2}\epsilon '\Omega ^{-1}\epsilon , \end{aligned}$$

(6)

where $\epsilon =Y-\rho Y_{-1}-X\beta -Z\gamma $. It can be shown that

$$\begin{aligned}&\partial \widetilde{ L}(\psi )/\partial \theta = \sigma _\nu ^{-2}\widetilde{X}'\Omega ^{-1}\epsilon ,\\&\partial \widetilde{L}(\psi )/\partial \sigma _\nu ^2 =-\frac{nT}{2\sigma _\nu ^2}+\frac{1}{2\sigma _\nu ^4}\epsilon '\Omega ^{-1}\epsilon , \\&\partial \widetilde{L}(\psi )/\partial \lambda = -\frac{1}{2}tr(\Omega ^{-1}(I_T\otimes A))+\frac{1}{2\sigma ^{2}_\nu }\epsilon '\Omega ^{-1}(I_T\otimes A)\Omega ^{-1}\epsilon , \end{aligned}$$

where $\widetilde{X}=(X, Z, Y_{-1})$, $A=(B' B)^{-1}(W'_nB+B' W_n)(B' B)^{-1}$. Letting above derivatives be 0, we obtain the following estimating equations of the QML method:

$$\begin{aligned}&\widetilde{X}'\Omega ^{-1}\epsilon =0, \end{aligned}$$

(7)

$$\begin{aligned}&-nT\sigma _\nu ^2+\epsilon '\Omega ^{-1}\epsilon =0, \end{aligned}$$

(8)

$$\begin{aligned}&-\sigma ^{2}_\nu tr(\Omega ^{-1}(I_T\otimes A))+ \epsilon '\Omega ^{-1}(I_T\otimes A)\Omega ^{-1}\epsilon =0, \end{aligned}$$

(9)

Substituting (4) into (7)–(9), we have

$$\begin{aligned}&\widetilde{X}'(I_T\otimes B')\nu =0, \end{aligned}$$

(10)

$$\begin{aligned}&-nT\sigma _\nu ^2+\nu ' \nu =0, \end{aligned}$$

(11)

$$\begin{aligned}&-\sigma ^{2}_\nu tr(I_T\otimes (BAB'))+\nu '(I_T\otimes (BAB'))\nu =0. \end{aligned}$$

(12)

Noting that $\widetilde{X}=(X, Z, Y_{-1})$ and $Y_{-1}$ contains X, z and $\nu $, we need to separate out $\nu $ from $\widetilde{X}$. To this end, denote $l_{\rho }=(0, c_{\rho , 1}, \ldots , c_{\rho ,T-1})'$, $c_{\rho , t}=(1-\rho ^t)/(1-\rho )$, $Y_0=(Y'_{0, 0}, Y'_{0, 1}, \ldots , Y'_{0, T-1})'$, $Y_{0, t}=\rho ^ty_0$,

$$\begin{aligned} F_{\rho }=\left( \begin{array}{lllll}0 &{} 1 &{} \rho &{}\cdots &{} \rho ^{T-2} \\ 0&{} 0 &{} 1&{}\cdots &{}\rho ^{T-3}\\ \vdots &{} \vdots &{} \vdots &{}\ddots &{}\vdots \\ 0 &{}0 &{}0&{}\cdots &{}1\\ 0 &{}0 &{}0&{}\cdots &{}0 \end{array} \right) , \end{aligned}$$

$A_x=F_{\rho }' \otimes I_n $ and $A_\nu =F_{\rho }' \otimes B^{-1} $. We use (B.3) in Su and Yang (2015) to obtain that

$$\begin{aligned} Y_{-1}=A_xX\beta +(l_{\rho }\otimes I_n )z\gamma +A_\nu \nu +Y_0, \end{aligned}$$

Let $\widetilde{X}_1=\left( X,\ Z \right) $ and $\widetilde{X}_2= A_xX\beta +(l_{\rho }\otimes I_n )z\gamma +Y_0$. Then (10) can be decomposed into

$$\begin{aligned}&\widetilde{X}'_1(I_T\otimes B')\nu =0, \end{aligned}$$

(13)

$$\begin{aligned}&\widetilde{X}'_2(I_T\otimes B')\nu +\nu 'A'_\nu (I_T\otimes B')\nu =0. \end{aligned}$$

(14)

For convenience, let $e=\nu $, i.e.

$$\begin{aligned} e_{(nT)\times 1}=\left( \begin{array}{l} e_1\\ e_2\\ \vdots \\ e_{nT}\\ \end{array} \right) =\left( \begin{array}{l} \nu _1\\ \nu _2\\ \vdots \\ \nu _{T}\\ \end{array} \right) .\end{aligned}$$

(15)

Then (11)–(14) can be rewritten as

$$\begin{aligned}&\widetilde{X}'_1(I_T\otimes B')e=0, \end{aligned}$$

(16)

$$\begin{aligned}&\widetilde{X}'_2(I_T\otimes B')e+e'A'_\nu (I_T\otimes B')e=0, \end{aligned}$$

(17)

$$\begin{aligned}&-nT\sigma _\nu ^2+e'e=0, \end{aligned}$$

(18)

$$\begin{aligned}&-\sigma ^{2}_\nu tr(I_T\otimes (BAB'))+e'(I_T\otimes (BAB'))e=0. \end{aligned}$$

(19)

Observing that the above estimating equations include the quadratic forms of e, to use the EL method, we need to change the quadratic forms into the linear forms of a well behaved random variables. To this end, we let $H_1={1\over 2}\left( A'_\nu (I_T\otimes B')+(I_T\otimes B)A_\nu \right) $ and $H_2=I_T\otimes (BAB')$. Use ${h}_{ij,k}$, $a_{i,1}$ and $a_{i,2}$ to denote the (i, j) element of the matrix $H_k$ ($k=1, 2$), the i-th column of the matrix $\widetilde{X}_1'(I_T\otimes B')$ and the i-th element of the vector $\widetilde{X}_2'(I_T\otimes B')$, respectively, and adapt the convention that any sum with an upper index of less than one is zero. To deal with the quadratic form in (17) and (19), we follow Kelejian and Prucha (2001) to introduce a martingale difference array. Define the $\sigma $-fields: ${\mathcal {F}}_{0}=\{ {\emptyset }, \Omega \}, {\mathcal {F}}_{i}=\sigma (e_1, e_2, \ldots , e_i), 1\le i\le nT$. Let

$$\begin{aligned} M_{ik}=h_{ii,k}(e^2_i-\sigma _\nu ^2)+2e_i\sum ^{i-1}_{j=1}h_{ij,k}e_j, k=1,2. \end{aligned}$$

(20)

Then $ {\mathcal {F}}_{i-1} \subseteq {\mathcal {F}}_{i}, M_{ik}$ is ${\mathcal {F}}_{i}$-measurable and $E(M_{ik}|{\mathcal {F}}_{i-1})=0$. Thus $\{M_{ik}, {\mathcal {F}}_{i}, 1\le i\le nT\}$ form a martingale difference array and

$$\begin{aligned} e'H_k e-\sigma ^{2}_\nu tr(H_k)=\sum ^{nT}_{i=1}M_{ik}, k=1,2. \end{aligned}$$

(21)

Based on (16)–(21), we propose the following EL ratio statistic for $\psi \in R^{p+q+3}$:

$$\begin{aligned} L(\psi )=\sup _{p_i, 1\le i\le nT}\prod ^{nT}_{i=1}((nT)p_i), \end{aligned}$$

where $\{p_i\}$ satisfy

$$\begin{aligned}&p_i\ge 0, 1\le i\le nT, \sum ^{nT}_{i=1}p_i=1, \\&\sum ^{nT}_{i=1}p_i a_{i,1}e_i=0, \\&\sum ^{nT}_{i=1}p_i \left\{ a_{i,2}e_i+ h_{ii,1}(e^2_i-\sigma _\nu ^2)+2e_i\sum ^{i-1}_{j=1}h_{ij,1}e_j \right\} =0, \\&\sum ^{nT}_{i=1}p_i(e^2_i-\sigma ^2_\nu )=0,\\&\sum ^{nT}_{i=1}p_i \left\{ h_{ii,2}(e^2_i-\sigma _\nu ^2)+2e_i\sum ^{i-1}_{j=1}h_{ij,2}e_j \right\} =0. \end{aligned}$$

Let

$$\begin{aligned} \omega _{i}(\psi )=\left( \begin{array}{l} a_{i,1}e_i\\ e^2_i-\sigma ^2_\nu \\ a_{i,2}e_i+h_{ii,1}(e^2_i-\sigma _\nu ^2)+2e_i\sum ^{i-1}_{j=1}h_{ij,1}e_j\\ h_{ii,2}(e^2_i-\sigma _\nu ^2)+2e_i\sum ^{i-1}_{j=1}h_{ij,2}e_j\\ \end{array} \right) _{(p+q+3)\times 1}, \end{aligned}$$

where $e_i$ is the ith component of $(I_T\otimes B)(Y-\rho Y_{-1}-X\beta -Z\gamma )$. Following Owen (1990), one can show that

$$\begin{aligned} \ell (\psi )\hat{=}-2\log L(\psi )=2\sum ^{nT}_{i=1}\log \{1+\tilde{\lambda }'(\psi )\omega _{i}(\psi )\}, \end{aligned}$$

(22)

where $\tilde{\lambda }(\psi )\in R^{p+q+3}$ is the solution of the following equation:

$$\begin{aligned} {1\over nT}\sum ^{nT}_{i=1}{\omega _{i}(\psi )\over 1+\tilde{\lambda }'(\psi )\omega _{i}(\psi )}=0. \end{aligned}$$

(23)

Let $\vartheta _j=E\nu _{11}^j, j=3, 4$. Use Vec(diagA) to denote the vector formed by the diagonal elements of a matrix A, ||a|| to denote the $L_2$-norm of a vector a, and $\lambda _{min}(H)$ and $\lambda _{max}(H)$ to denote the minimum and maximum eigenvalues of a matrix H, respectively. To obtain the asymptotic distribution of $\ell (\psi )$, we need following assumptions.

A1. (1) $\nu _{jt}$ are mutually independent, and they are independent of $x_{ks}$ and $z_k$ for all j, k, t, s;
(2) All elements in $(x_{it}, z_i)$ have $4+\eta _1$ moments for some $\eta _1>0$.
A2. (1) $\{\nu _{it}, t=1,\ldots , T, i=1,\ldots , n \}$ are independent and identically distributed for all i and t with mean 0, variance $\sigma ^2_\nu >0$ and $E|\nu _{it}|^{4+\eta _1}<\infty $ for some $\eta _1>0$.
(2) $\{x_{it}, t=\ldots , -1, 0, 1, \ldots \}$ and $\{z_i\}$ are strictly exogenous and independent across i.
(3) $|\rho |<1$.
A3. Let $W_n$ and $\{B^{-1} \}$ be as described above. They satisfy the following conditions:
(1) The row and column sums of $W_n$ are uniformly bounded in absolute value;
(2) $\{B^{-1}\}$ are uniformly bounded in either row or column sums, uniformly in $\lambda $ in a compact parameter space $\Lambda $, and $\underline{c}_\lambda \le \inf _{\lambda \in \Lambda }\lambda _{\max }(B' B)\le \sup _{\lambda \in \Lambda }\lambda _{\max }(B' B)\le \overline{c}_\lambda <\infty $.
A4. There are constants $c_j>0, j=1, 2$, such that
$$\begin{aligned} 0<c_1\le \lambda _{min}\left( (nT)^{-1}\Sigma _{p+q+3} \right) \le \lambda _{max}\left( (nT)^{-1}\Sigma _{p+q+3} \right) \le c_2<\infty , \end{aligned}$$
where
$$\begin{aligned}&\Sigma _{p+q+3}=\Sigma '_{p+q+3} =Cov\left\{ \sum ^{nT}_{i=1}\omega _{i}(\psi ) \right\} \nonumber \\&\quad =\left( \begin{array}{llll} \Sigma _{11}&{} \Sigma _{12} &{} \Sigma _{13} &{} \Sigma _{14}\\ *&{} \Sigma _{22} &{} \Sigma _{23}&{} \Sigma _{24}\\ *&{} *&{} \Sigma _{33} &{}\Sigma _{34}\\ *&{} *&{} *&{} \ \Sigma _{44}\\ \end{array} \right) _{(p+q+3)\times (p+q+3)}, \end{aligned}$$
(24)
where
$$\begin{aligned} \Sigma _{11}= & {} \sigma ^2_\nu E\left( \widetilde{X}_1'\Omega ^{-1}\widetilde{X}_1\right) , \Sigma _{12}=\vartheta _3 E(\widetilde{X}_1')(I_T\otimes B')\mathbf{1} _{nT}, \\ \Sigma _{13}= & {} \sigma ^2_\nu E\left( \widetilde{X}_1'\Omega ^{-1}\widetilde{X}_2\right) , \Sigma _{14}=\vartheta _3E(\widetilde{X}_1')(I_T\otimes B')vec_D(H_2) \\ \Sigma _{22}= & {} nT(\vartheta _4-\sigma _\nu ^4),\ \ \Sigma _{23}=(\vartheta _4-3\sigma _\nu ^4)\mathbf{1} '_{nT}vec_D(H_1)+2\sigma _\nu ^4tr(H_1)+\vartheta _3E(\widetilde{X}_2')(I_T\otimes B')\mathbf{1} _{nT}, \\ \Sigma _{24}= & {} (\vartheta _4-3\sigma _\nu ^4)\mathbf{1} '_{nT}vec_D(H_2)+2\sigma _\nu ^4tr(H_2),\\ \Sigma _{33}= & {} (\vartheta _4-3\sigma _\nu ^4)||vec_D(H_1)||^2+2\sigma _\nu ^4tr(H_1^2)+\sigma ^2_\nu E\left( \widetilde{X}_2'\Omega ^{-1}\widetilde{X}_2\right) +2\vartheta _3 E(\widetilde{X}_2')(I_T\otimes B')vec_D(H_1), \\ \Sigma _{34}= & {} (\vartheta _4-3\sigma _\nu ^4)vec'_D(H_1)vec_D(H_2)+2\sigma _\nu ^4tr(H_1H_2)+\vartheta _3 E(\widetilde{X}_2')(I_T\otimes B')vec_D(H_2), \\ \Sigma _{44}= & {} (\vartheta _4-3\sigma _\nu ^4)||vec_D(H_2)||^2+2\sigma _\nu ^4tr(H_2^2). \end{aligned}$$
A5. $n\rightarrow \infty $ but T is fixed.

Remark 1

Conditions A1–A3 are common assumptions for spatial models, which are used in Su and Yang (2015), and the analog of $0<c_1\le \lambda _{min}\left( (nT)^{-1}\Sigma _{p+q+3} \right) $ is employed in the assumption of Theorem 1 in Kelejian and Prucha (2001).

We now state the main results.

Theorem 1

Suppose that Assumptions A1–A5 are satisfied. Then under model (1)–(2), as $n\rightarrow \infty,$

$$\begin{aligned} \ell ({\psi } ){\mathop {\longrightarrow }\limits ^{d}}\chi ^2_{p+q+3}, \end{aligned}$$

where $\chi ^2_{p+q+3}$ is a chi-squared distributed random variable with $p+q+3$ degrees of freedom.

Let $z_{\alpha }(p+q+3)$ satisfy $P(\chi ^2_{p+q+3}\le z_{\alpha }(p+q+3))=\alpha $ for $0<\alpha <1$. It follows from Theorem 1 that an EL based confidence region for $\psi $ with asymptotically correct coverage probability $\alpha $ can be constructed as

$$\begin{aligned} \{ \psi : \ell (\psi )\le z_{\alpha }(p+q+3) \}. \end{aligned}$$

3 Simulations

Recall that $\theta =(\beta ', \gamma ', \rho )'$ and $\psi =(\theta ', \sigma ^2_\nu , \lambda )'$. Denote $P_\lambda =\Omega ^{-1}\Omega _\lambda \Omega ^{-1}$, $\Omega _\lambda =I_T\otimes A$ and $\Omega _{\lambda \lambda }=I_T\otimes \{2(B'B)^{-1}[(W'_nB+B' W_n)A-W_n'W_n(B'B)^{-1}]\}$. It can be shown that

$$\begin{aligned}&\frac{\partial ^2\widetilde{L}(\psi )}{\partial \theta \partial \theta '}=-\frac{1}{\sigma ^2_\nu }\widetilde{X}'\Omega ^{-1}\widetilde{X},\ \\&\frac{\partial ^2\widetilde{L}(\psi )}{\partial \theta \partial \sigma _\nu ^2}=-\frac{1}{\sigma ^4_\nu }\widetilde{X}'\Omega ^{-1}\epsilon , \\&\frac{\partial ^2\widetilde{L}(\psi )}{\partial \theta \partial \lambda }=-\frac{1}{\sigma ^2_\nu }\widetilde{X}'P_\lambda \epsilon ,\ \\&\frac{\partial ^2\widetilde{L}(\psi )}{\partial \sigma _\nu ^2\partial \sigma _\nu ^2}=-\frac{1}{\sigma ^6_\nu }\epsilon '\Omega ^{-1}\epsilon +\frac{nT}{2\sigma _\nu ^4}, \\&\frac{\partial ^2\widetilde{L}(\psi )}{\partial \sigma _\nu ^2\partial \lambda }=-\frac{1}{2\sigma ^4_\nu }\epsilon 'P_\lambda \epsilon , \\&\frac{\partial ^2\widetilde{L}(\psi )}{\partial \lambda ^2}=\frac{1}{2}tr(P_\lambda \Omega _\lambda -\Omega ^{-1}\Omega _{\lambda \lambda })\\&- \frac{1}{2\sigma _\nu ^2}\epsilon '(2P_\lambda \Omega _\lambda -\Omega ^{-1}\Omega _{\lambda \lambda })\Omega ^{-1}\epsilon . \end{aligned}$$

According to Su and Yang (2015), the QMLE $\widehat{\psi }$ of $\psi $ satisfies:

$$\begin{aligned} \sqrt{nT}(\widehat{\psi }-\psi ){\mathop {\longrightarrow }\limits ^{d}}N(0,-\Sigma ^{-1}), \end{aligned}$$

where $\Sigma =\lim _{n\rightarrow \infty }\frac{1}{nT}E[\Sigma _n(\psi )]$ and $\Sigma _n(\psi )=\frac{\partial ^2}{\partial \psi \partial \psi '}\widetilde{L}(\psi )$.

Based on the above asymptotic result, we can obtain the NA based confidence region for $\psi $. However, we note that the NA method depends on the availability of a consistent estimator of the asymptotic covariance matrix in practical applications, while the EL method does not. This can save the implementation time for the EL method and the EL method outperforms the NA method.

We conducted a small simulation study to compare the finite sample performances of the confidence regions based on EL and NA methods with confidence level $\alpha =0.95$, and report the proportion of $\ell (\psi ) \le z_{0.95}(p+q+3)$ and $(\widehat{\psi }-\psi )'(-\Sigma )(\widehat{\psi }-\psi )\le z_{0.95}(p+q+3)$ respectively in 1000 replications.

In the simulations, we used the following two models:

(1)
Model 1:

$y_t=\rho y_{t-1}+x_t\beta +z\gamma +\epsilon _t, \epsilon _t=\lambda W_n\epsilon _t+\nu _t,\ t=1, 2,3$, where $x_t$ were generated from N(0, 4), alternatively, $x_t$ can be randomly generated in a similar fashion as in Hsiao et al. (2002), and the elements of z were randomly generated from Bernoulli(0.5). We selected $\beta =1$, $\gamma =1$, $\sigma _\nu ^2=1$ and $(\rho , \lambda )$ were taken as $(-0.8, -0.7)$, $(-0.2, -0.1)$, (0.2, 0.1), (0.8, 0.7), $(-0.8, 0.7)$ and $(0.2, -0.1)$ respectively, and $\nu _{it}'s$ were i.i.d. from N(0, 1), t(5) and $\chi ^2_{4}-4$, respectively;
(2)
Model 2:

$y_t=\rho y_{t-1}+x_t\beta +z\gamma +\epsilon _t, \epsilon _t=\lambda W_n\epsilon _t+\nu _t,\ t=1, 2,3$, where $x_t=\left( x_t^{(1)}, x_t^{(2)}\right) $ is an $n\times 2$ matrix, where $x_t^{(1)}$ were randomly generated from N(0, 1) and $x_t^{(2)}$ were randomly generated from N(0, 4). Moreover, $z=\left( z^{(1)}, z^{(2)}\right) $ is an $n\times 2$ matrix, the elements of $z^{(1)}$ were randomly generated from Bernoulli(0.3) and the elements of $z^{(2)}$ were randomly generated from Bernoulli(0.6). We selected $\beta =(1.5, 1.0)'$, $\gamma =(2, 1.2)'$, $(\rho , \lambda )$ were taken as $(-0.8, -0.7)$, $(-0.2, -0.1)$, (0.2, 0.1), (0.8, 0.7), $(-0.8, 0.7)$ and $(0.2, -0.1)$ respectively, and $\nu _{it}'s$ were i.i.d. from N(0, 1), $t(5), \chi ^2_{4}-4, 0.1N(0, 4)+0.9N(0, 1)$ and $0.1t(3)+0.9t(5)$, respectively.

The results of simulations under model 1 are reported in Tables 1, 2 and 3, and the results of simulations under model 2 are reported in Tables 4, 5, 6, 7 and 8.

Table 1 Coverage probabilities of the NA and EL confidence regions with $\nu _{it}\sim N(0,1)$ under model 1

Full size table

Table 2 Coverage probabilities of the NA and EL confidence regions with $\nu _{it}\sim t(5)$ under model 1

Full size table

Table 3 Coverage probabilities of the NA and EL confidence regions with $\nu _{it}+4\sim \chi ^2_4$ under model 1

Full size table

Table 4 Coverage probabilities of the NA and EL confidence regions with $\nu _{it}\sim N(0,1)$ under model 2

Full size table

Table 5 Coverage probabilities of the NA and EL confidence regions with $\nu _{it}\sim t(5)$ under model 2

Full size table

Table 6 Coverage probabilities of the NA and EL confidence regions with $\nu _{it}+4\sim \chi ^2_4$ under model 2

Full size table

Table 7 Coverage probabilities of the NA and EL confidence regions with $\nu _{it}\sim 0.1N(0,4)+0.9N(0,1)$ under model 2

Full size table

Table 8 Coverage probabilities of the NA and EL confidence regions with $\nu _{it}\sim 0.1t(3)+0.9t(5)$ under model 2

Full size table

For the contiguity weight matrix $W_n=(W_{ij})$, we took $W_{ij}=1$ if spatial units i and j are neighbours by queen contiguity rule (namely, they share common border or vertex), $W_{ij}=0$ otherwise (Anselin 1988, P.18). We considered five ideal cases of spatial units: $n=m\times m$ regular grid with $m=7, 10, 13,16, 20$, denoting $W_n$ as $grid_{49}, grid_{100}, grid_{169}, grid_{256} $ and $ grid_{400}$, respectively. A transformation is often used in applications to convert the matrix $W_n$ to the unity of row-sums. We used the standardized version of $W_n$ in our simulations, namely $W_{ij}$ was replaced by $W_{ij}/\sum _{j=1}^nW_{ij}$.

Simulation results under model 1 show that the confidence regions based on NA behave well with coverage probabilities being very close to the nominal level 0.95 when the error term $\epsilon _i$ is normally distributed and n is large, but not well in other cases. The coverage probabilities of the confidence regions based on NA fall to the range [0.812, 0.862] for the t distribution and [0.809, 0.868] for the $\chi ^2$ distribution, which are far from the nominal level 0.95. Simulation results under model 2 are similar to those under model 1.

We can see, from Tables 1, 2, 3, 4, 5, 6, 7 and 8, that the coverage probabilities of confidence regions based on EL method converge to the nominal level 0.95 as the number of spatial units n is large enough, whether the error term $\epsilon _i$ is normally distributed or not. These results show that the EL based confidence regions generally outperform the NA based confidence regions when n is large enough.

4 A real data example

In order to illustrate the proposed method in Sect. 2, we conducted a real data analysis. The data come from 288 prefecture-level cities in China, collected from National Bureau of Statistics of China and Anjuke. There were three variables: the logarithm of housing price per square meter ($y_t$), the logarithm of income per household ($x_t$) and the urbanization rate (z) from the years of 2010 to 2017. In order to ensure the stability and eliminate the influence of dimension, we first did difference and standardization on the above data, and then considered fitting the data via the following model: $y_t=\rho y_{t-1}+x_t\beta +z\gamma +\epsilon _t, \epsilon _t=\lambda W_n\epsilon _t+\nu _t,\ t=1, 2, \ldots , 8$, where $n=288$ and the spatial weighting matrix $W_n$ was selected by the method in Sect. 3.

We separately employed the EL method in Sect. 2 and the NA method in Sect. 3 to obtain the confidence intervals for parameters $\beta , \gamma , \rho , \lambda $ and $\sigma ^2_\nu $ with confidence level 0.95, which were shown in Table 9.

Table 9 Analysis results for the average price of commercial housing data (with ALs shown in brackets)

Full size table

Table 9 shows that the estimator of the spatial parameter is $\lambda = 0.3743$, and 0 is not in its confidence interval, which implies that there exists a spatial relationship among the disturbances. The results also show that the lengths of the EL based intervals are uniformly shorter than those of the NA based intervals, which implies that the EL based method performs better than the NA based method for the real data.

References

Anselin, L. (1988). Spatial Econometrics: Methods and Models. Kluwer Academic Press.
Book Google Scholar
Anselin, L. (2001). Spatial econometrics. In B. H. Baltagi (Ed.), A Companion to Theoretical Econometrics (pp. 310–330). Blackwell Publishers Ltd.
Google Scholar
Baltagi, B., Egger, P., & Pfaffermayr, M. (2013). A generalized spatial panel model with random effects. Econometric Reviews, 32, 650–685.
Article MathSciNet Google Scholar
Baltagi, B., & Li, D. (2006). Prediction in the panel data model with spatial correlation: The case of liquor. Spatial Economic Analysis, 1, 175–185.
Article Google Scholar
Baltagi, B., Song, S. K., Jung, B. C., & Koh, W. (2007). Testing for serial correlation, spatial autocorrelation and random effects using panel data. Journal of Econometrics, 140, 5–51.
Article MathSciNet Google Scholar
Baltagi, B., Song, S. H., & Koh, W. (2003). Testing panel data regression models with spatial error correlation. Journal of Econometrics, 117, 123–150.
Article MathSciNet Google Scholar
Bandyopadhyay, S., Lahiri, S. N., & Nordman, D. J. (2015). A frequency domain empirical likelihood method for irregularly spaced spatial data. The Annals of Statistics, 43(2), 519–545.
Article MathSciNet Google Scholar
Chen, X., & Conley, T. G. (2001). A new semiparametric spatial model for panel time series. Journal of Econometrics, 105, 59–83.
Article MathSciNet Google Scholar
Chen, S. X., & Keilegom, I. V. (2009). A review on empirical likelihood for regressions (with discussions). Test, 3, 415–447.
Article Google Scholar
Chen, J., & Qin, J. (1993). Empirical likelihood estimation for finite populations and the effective usage of auxiliary information. Biometrika, 80, 107–116.
Article MathSciNet Google Scholar
Elhorst, J. P. (2003). Specification and estimation of spatial panel data models. International Regional Science Review, 26, 244–268.
Article Google Scholar
Elhorst, J. P. (2005). Unconditional maximum likelihood estimation of linear and loglinear dynamic models for spatial panels. Geographical Analysis, 37, 85–106.
Article Google Scholar
Elhorst, J. P. (2010). Dynamic panels with endogenous interaction effects when T is small. Regional Science and Urban Economics, 40, 272–282.
Article Google Scholar
Hall, P. (1992). The Bootstrap and Edgeworth Expansion. Springer-Verlag.
Book Google Scholar
Hall, P., & La Scala, B. (1990). Methodology and algorithms of empirical likelihood. International Statistical Review, 58, 109–127.
Article Google Scholar
Hsiao, C., Pesaran, M. H., & Tahmiscioglu, A. K. (2002). Maximum likelihood estimation of fixed effects dynamic panel data models covering short time periods. Journal of Econometrics, 109, 107–150.
Article MathSciNet Google Scholar
Jin, F., & Lee, L. F. (2019). GEL estimation and tests of spatial autoregressive models. Journal of Econometrics, 208, 585–612.
Article MathSciNet Google Scholar
Kapoor, M., Kelejian, H. H., & Prucha, I. R. (2007). Panel data models with spatially correlated error components. Journal of Econometrics, 140, 97–130.
Article MathSciNet Google Scholar
Kelejian, H. H., & Prucha, I. R. (2001). On the asymptotic distribution of the Moran $I$ test statistic with applications. Journal of Econometrics, 104, 219–257.
Article MathSciNet Google Scholar
Kitamura, Y. (1997). Empirical likelihood methods with weakly dependent processes. The Annals of Statistics, 25, 2084–2102.
Article MathSciNet Google Scholar
Lee, L. F., & Yu, J. (2010a). A spatial dynamic panel data model with both time and individual fixed effects. Econometric Theory, 26, 564–597.
Article MathSciNet Google Scholar
Lee, L. F., & Yu, J. (2010b). Estimation of spatial autoregressive panel data models with fixed effects. Journal of Econometrics, 154(2), 165–185.
Article MathSciNet Google Scholar
Lee, L. F., & Yu, J. (2010c). Some recent developments in spatial panel data models. Regional Science and Urban Economics, 40, 255–271.
Article Google Scholar
Lee, L. F., Yu, J., (2015a). Spatial Panel Data Models. In: Baltagi B (ed) The Oxford Handbooks: Panel Data. Oxford University Press, Oxford, England.
Lee, L. F., & Yu, J. (2015b). Estimation of fixed effects panel regression models with separable and nonseparable space-time filters. Journal of Econometrics, 184, 174–192.
Article MathSciNet Google Scholar
Li, Y., & Qin, Y. (2020). Empirical likelihood for panel data models with spatial errors. Communications in Statistics-Theory and Methods. https://doi.org/10.1080/03610926.2020.1780449.
Article Google Scholar
Mutl, J. (2006). Dynamic Panel Data Models with Spatially Correlated Disturbances. College Park: University of Maryland. (Ph.D. thesis).
Mutl, J., & Pfaffermayr, M. (2011). The Hausman test in a Cliff and Ord panel model. The Economic Journal, 14, 48–76.
MathSciNet MATH Google Scholar
Nordman, D. J. (2008a). A blockwise empirical likelihood for spatial lattice data, Statist. Sinica, 18, 1111–1129.
MathSciNet MATH Google Scholar
Nordman, D. J. (2008b). An empirical likelihood method for spatial regression. Metrika, 68, 351–363.
Article MathSciNet Google Scholar
Owen, A. B. (1988). Empirical likelihood ratio confidence intervals for a single functional. Biometrika, 75, 237–249.
Article MathSciNet Google Scholar
Owen, A. B. (1990). Empirical likelihood ratio confidence regions. The Annals of Statistics, 18, 90–120.
Article MathSciNet Google Scholar
Owen, A. B. (2001). Empirical Likelihood. Chapman & Hall.
MATH Google Scholar
Parent, O., & LeSage, J. P. (2011). A spaceCtime filter for panel data models containing random effects. Computational Statistics & Data Analysis, 55, 475–490.
Article MathSciNet Google Scholar
Pesaran, M. H. (2004). General Diagnostic Tests for Cross Section Dependence in Panels, Working Paper No. 1229, University of Cambridge.
Qin, Y. (2021). Empirical likelihood for spatial autoregressive models with spatial autoregressive disturbances. Sankhyā A: The Indian Journal of Statistics, 83, 1–25.
Article MathSciNet Google Scholar
Qin, J., & Lawless, J. (1994). Empirical likelihood and general estimating equations. The Annals of Statistics, 22, 300–325.
Article MathSciNet Google Scholar
Su, L., & Yang, Z. (2007). QML Estimation of Dynamic Panel Data Models with Spatial Errors, Working Paper. Singapore Management University.
Su, L., & Yang, Z. (2015). QML estimation of dynamic panel data models with spatial errors. Journal of Econometrics, 185, 230–258.
Article MathSciNet Google Scholar
Wu, C. B. (2004). Weighted empirical likelihood inference. Statistics & Probability Letters, 66, 67–79.
Article MathSciNet Google Scholar
Yang, Z., Li, C., & Tse, Y. K. (2006). Functional form and spatial dependence in dynamic panels. Economics Letters, 91, 138–145.
Article MathSciNet Google Scholar
Yu, J., de Jong, R., & Lee, L. F. (2008). Quasi-maximum likelihood estimators for spatial dynamic panel data with fixed effects when both n and T are large. Journal of Econometrics, 146, 118–134.
Article MathSciNet Google Scholar
Yu, J., & Lee, L. F. (2010). Estimation of unit root spatial dynamic panel data models. Econometric Theory, 26, 1332–1362.
Article MathSciNet Google Scholar
Zhong, B., & Rao, J. N. K. (2000). Empirical likelihood inference under stratified random sampling using auxiliary population information. Biometrika, 87, 929–938.
Article MathSciNet Google Scholar

Download references

Acknowledgements

This work was partially supported by the National Natural Science Foundation of China (12061017, 12161009). The authors are thankful to the referees for constructive suggestions.

Author information

Authors and Affiliations

College of Mathematics and Statistics, Guangxi Normal University, Guilin, 541004, Guangxi, China
Yinghua Li & Yongsong Qin

Authors

Yinghua Li
View author publications
You can also search for this author in PubMed Google Scholar
Yongsong Qin
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yongsong Qin.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix

In the proof of the main results, we need to use Theorem 1 in Kelejian and Prucha (2001). We now state this result. Let

$$\begin{aligned} \widetilde{Q}_n=\sum ^n_{i=1}\sum ^n_{j=1}a_{nij}\epsilon _{ni}\epsilon _{nj}+\sum ^n_{i=1}b_{ni}\epsilon _{ni}, \end{aligned}$$

where $\epsilon _{ni}$ are real valued random variables, and the $a_{nij}$ and $b_{ni}$ denote the real valued coefficients of the linear-quadratic form. We need the following assumptions in Lemma 1.

(C1)
$\{\epsilon _{ni}, 1\le i\le n\}$ are independent random variables with mean 0 and $\sup _{1\le i\le n, n\ge 1}E|\epsilon _{ni}|^{4+\eta _1}<\infty $ for some $\eta _1>0$;
(C2)
For all $1\le i, j\le n, n\ge 1, a_{nij}=a_{nji}$, $\sup _{1\le j\le n, n\ge 1} \sum ^n_{i=1}|a_{nij}|<\infty $, and $\sup _{n\ge 1}n^{-1}\sum ^n_{i=1}|b_{ni}|^{2+\eta _2}<\infty $ for some $\eta _2>0$.

Given above assumptions (C1) and (C2), the mean and variance of $\tilde{Q}_n$ are given as (e.g. Kelejian & Prucha, 2001)

$$\begin{aligned} \nu _{\widetilde{Q}}=\sum ^n_{i=1}a_{nii}\sigma ^2_{ni}, \end{aligned}$$

$$\begin{aligned} \sigma ^2_{\widetilde{Q}}= & {} 2\sum ^n_{i=1}\sum ^{n}_{j=1}a^2_{nij}\sigma ^2_{ni}\sigma ^2_{nj}+\sum ^n_{i=1}b^2_{ni}\sigma ^2_{ni}\nonumber \\&+\sum ^n_{i=1}\{ a^2_{nii}(\mu ^{(4)}_{ni}-3\sigma ^4_{ni})+2b_{ni}a_{nii}\mu ^{(3)}_{ni} \}, \end{aligned}$$

(25)

with $\sigma ^2_{ni}=E(\epsilon _{ni}^2)$ and $\mu ^{(s)}_{ni}=E(\epsilon _{ni}^s)$ for $s=3, 4$.

Lemma 1

Suppose that Assumptions C1 and C2 hold true and $n^{-1}\sigma ^2_{\widetilde{Q}}\ge c$ for some constant $c>0$ . Then

$$\begin{aligned} {\widetilde{Q}_n-\nu _{\widetilde{Q}}\over \sigma _{\widetilde{Q}} } {\mathop {\longrightarrow }\limits ^{d}}N(0, 1). \end{aligned}$$

Proof

See Theorem 1 in Kelejian and Prucha (2001). $\square $

Lemma 2

Let $\xi _1, \xi _2,\ldots , \xi _n$ be a sequence of stationary random variables, with $E|\xi _1|^s<\infty $ for some constants $s>0$ . Then

$$\begin{aligned} \max _{1\le i \le n }|\xi _i|=o(n^{1/s}), \ \ a.s. \end{aligned}$$

Proof

Using Borel–Cantelli lemma and following the proof of (2.3) in Owen (1990), one can prove Lemma 2, where there is no need to assume that $\xi _1, \xi _2,\ldots , \xi _n$ are in dependent in using Borel–Cantelli lemma. $\square $

Lemma 3

Suppose that Assumptions A1–A5 are satisfied. Then as $n\rightarrow \infty,$

$$\begin{aligned}&Z_n=\max _{1\le i \le nT}||\omega _{i}(\psi )||=o_p((nT)^{2/(4+\eta _1)})\ \ a.s., \end{aligned}$$

(26)

$$\begin{aligned}&\Sigma _{p+q+3}^{-1/2}{\sum _{i=1}^{nT} \omega _{i}(\psi )}{\mathop {\longrightarrow }\limits ^{d}}N(0, I_{p+q+3}), \end{aligned}$$

(27)

$$\begin{aligned}&(nT)^{-1}\sum _{i=1}^{nT} \omega _{i}(\psi )\omega _{i}'(\psi )=(nT)^{-1}\Sigma _{p+q+3}+o_p(1), \end{aligned}$$

(28)

$$\begin{aligned}&\sum _{i=1}^{nT} ||\omega _{i}(\psi )||^3=O_p(nT). \end{aligned}$$

(29)

Proof

Note that

$$\begin{aligned} Z_n\le & {} \max _{1\le i \le nT}\bigg \{\max _{1\le i \le nT}||a_{i,1}e_i||, \max _{1\le i \le nT}|e^2_i-\sigma ^2_\nu |, \max _{1\le i \le nT} \bigg | a_{i,2}e_i+h_{ii,1}(e^2_i-\sigma ^2_\nu )\\&+2e_i\sum ^{i-1}_{j=1}h_{ij,1}e_j\bigg |, \max _{1\le i \le nT} \bigg | h_{ii,2}(e^2_i-\sigma ^2_\nu )+2e_i\sum ^{i-1}_{j=1}h_{ij,2}e_j\bigg | \bigg \}. \end{aligned}$$

By Conditions A1–A3 and Lemma 2, we have

$$\begin{aligned} \max _{1\le i \le nT}||a_{i,k}e_i||=o_p((nT)^{1/(4+\eta _1)}),\ \ \max _{1\le i \le nT}|e^2_i-\sigma ^2_\nu |=o_p((nT)^{2/(4+\eta _1)}). \end{aligned}$$

In addition, by Lemma B.2. in Su and Yang (2015), $A'_\nu (I_T\otimes B')$ and $(I_T\otimes (BAB'))$ are uniformly bounded in both row and column sums, it follows that

$$\begin{aligned}&\max _{1\le i \le nT}|h_{ii,k}(e^2_i-\sigma ^2_\nu )|=\max _{1\le i \le nT}|h_{ii,k}|o_p((nT)^{2/(4+\eta _1)})=o_p((nT)^{2/(4+\eta _1)}). \\&\max _{1\le i \le nT}\left| e_i\sum ^{i-1}_{j=1}h_{ij,k}e_j \right| \le (\max _{1\le i \le nT}|e_i|)^2\cdot \max _{1\le i \le nT}\left( \sum ^{i-1}_{j=1}|h_{ij,k}| \right) =o_p((nT)^{2/(4+\eta _1)}), k=1, 2.\ \ \end{aligned}$$

Thus $Z_n=o_p((nT)^{2/(4+\eta _1)})$. (26) is proved. $\square $

We now prove (27). For any given ${l}=(l'_1,l_2, l_3, l_4)'\in R^{p+q+3}$ with $||{l}||=1$, where $l_1\in R^{p+q}$, $l_2, l_3, l_4\in R$, it is clear that

$$\begin{aligned} l'\omega _{i}(\psi )= & {} l_1' a_{i,1}e_i+l_2(e^2_i-\sigma ^2_\nu )+l_3\{a_{i,2}e_i+h_{ii,1}(e^2_i-\sigma ^2_\nu )+2e_i \sum ^{i-1}_{j=1}h_{ij,1}e_j\}\\&+l_4\{h_{ii,2}(e^2_i-\sigma ^2_\nu )+2e_i \sum ^{i-1}_{j=1}h_{ij,2}e_j\}\\= & {} (l_2+l_3h_{ii,1}+l_4h_{ii,2})(e^2_i-\sigma ^2_\nu )+2e_i\sum ^{i-1}_{j=1}(l_3h_{ij,1}+l_4h_{ij,2})e_j\\&+(l_1' a_{i,1}+l_3a_{i,2})e_i. \end{aligned}$$

Denote

$$\begin{aligned} Q_n=\sum _{i=1}^{nT} l'\omega _{i}(\psi )= \sum ^{nT}_{i=1}\sum ^{{nT}}_{j=1}u_{ij}e_ie_j+\sum ^{nT}_{i=1}b_{i}e_i=e'U_{1n} e+U_{2n}e , \end{aligned}$$

where

$$\begin{aligned} U_{1n}= & {} (u_{ij})_{(nT)\times (nT)},\ \ U_{2n}=(b_{i})_{1\times (nT)},\\ u_{ii}= & {} l_2+l_3h_{ii,1}+l_4h_{ii,2}, \ u_{ij}=l_3h_{ij,1}+l_4h_{ij,2} (i\ne j), \ b_i=l_1' a_{i,1}+l_3a_{i,2}. \end{aligned}$$

Note that

$$\begin{aligned} U_{1n}=l_2I_{nT}+l_3H_1+l_4H_2,\ \ U_{2n}=l_1'\widetilde{X}_1'(I_T\otimes B')+l_3\widetilde{X}_2'(I_T\otimes B'). \end{aligned}$$

The conditional expectation and variance given X, Z are denoted as $E^*$ and $Var^*$, respectively. Then from (15) and note that $E(\nu )=0$, we know that the variance of $Q_n$ is

$$\begin{aligned} \sigma ^2_{Q_n}= & {} Var\left( \sum _{i=1}^{nT} l'\omega _{i}(\psi )\right) \nonumber \\= & {} Var(\nu ' U_{1n}\nu )+Var(U_{2n}\nu ) +2Cov(\nu ' U_{1n}\nu , U_{2n}\nu ), \end{aligned}$$

(30)

and

$$\begin{aligned} Var^*(Q_n)= & {} (\vartheta _4-3\sigma ^4_\nu )||vec_D( U_{1n})||^2+\sigma ^4_\nu [tr( U_{1n}U'_{1n})+tr(U_{1n}^2)],\nonumber \\&+\sigma ^2_\nu U_{2n}U'_{2n} +2\vartheta _3U_{2n}vec_D( U_{1n}). \end{aligned}$$

(31)

Further,

$$\begin{aligned}&||vec_D(U_{1n})||^2=||vec_D(l_2I_{nT}+l_3H_1+l_4H_2)||^2\nonumber \\&\quad = ||l_2vec_D(I_{nT})+l_3vec_D(H_1)+l_4vec_D(H_2)||^2\nonumber \\&\quad = l_2^2nT+l_3^2||vec_D(H_1)||^2+l_4^2||vec_D(H_2)||^2+2l_2l_3\mathbf{1} '_{nT}vec_D(H_1)\nonumber \\&\qquad +2l_2l_4\mathbf{1} '_{nT}vec_D(H_2)+2l_3l_4vec'_D(H_1)vec_D(H_2)\nonumber \\&\quad =\tilde{l}' G_1 \tilde{l}, \end{aligned}$$

(32)

where $\tilde{l}=(l_2, l_3, l_4)'$, $G_{1}=\left( \begin{array}{lll}nT &{} \mathbf{1} '_{nT}vec_D(H_1) &{} \mathbf{1} '_{nT}vec_D(H_2) \\ *&{} ||vec_D(H_1)||^2 &{} vec'_D(H_1)vec_D(H_2) \\ *&{}*&{} ||vec_D(H_2)||^2 \end{array} \right) $. And

$$\begin{aligned} tr(U_{1n}U'_{1n})= & {} l_2^2nT+2l_2l_3tr(H_1)+2l_2l_4tr(H_2)+l_3^2tr(H_1^2)\nonumber \\&+2l_3l_4tr(H_1H_2) +l_4^2tr(H_2^2)\nonumber \\= & {} \widetilde{l}' G_2 \widetilde{l}, \end{aligned}$$

(33)

where $G_{2}=\left( \begin{array}{lll}nT &{} tr(H_1) &{} tr(H_2) \\ *&{} tr(H_1^2) &{} tr(H_1H_2) \\ *&{}*&{} tr(H_2^2) \end{array} \right) $. Moreover,

$$\begin{aligned} U_{2n}U'_{2n}= & {} l_1' \widetilde{X}_1'\Omega ^{-1}\widetilde{X}_1l_1+l_3^2\widetilde{X}_2'\Omega ^{-1}\widetilde{X}_2+2l_1' l_3 \widetilde{X}_1'\Omega ^{-1}\widetilde{X}_2. \end{aligned}$$

(34)

It is easy to show (e.g. Su & Yang, 2015) that, $n^{-1}\widetilde{X}_1'\Omega ^{-1}\widetilde{X}_1$, $n^{-1}\widetilde{X}_2'\Omega ^{-1}\widetilde{X}_2$, and $n^{-1}\widetilde{X}_1'\Omega ^{-1}\widetilde{X}_2$ converge in probability to their expectations. We have

$$\begin{aligned} U_{2n}U'_{2n}= & {} l_1' E\left( \widetilde{X}_1'\Omega ^{-1}\widetilde{X}_1\right) l_1+l_3^2E\left( \widetilde{X}_2'\Omega ^{-1}\widetilde{X}_2\right) \nonumber \\&+2l_1' l_3 E\left( \widetilde{X}_1'\Omega ^{-1}\widetilde{X}_2\right) +o_p(n), \end{aligned}$$

(35)

and

$$\begin{aligned} U_{2n}vec_D( U_{1n})= & {} l_1'l_2 \widetilde{X}_1'(I_T\otimes B')\mathbf{1} _{nT}+l_1'l_3 \widetilde{X}_1'(I_T\otimes B')vec_D(H_1)\nonumber \\&+l_1'l_4 \widetilde{X}_1'(I_T\otimes B')vec_D(H_2)+l_3l_2\widetilde{X}_2'(I_T\otimes B')\mathbf{1} _{nT}\nonumber \\&+l_3^2\widetilde{X}_2'(I_T\otimes B')vec_D(H_1) +l_3l_4 \widetilde{X}_2'(I_T\otimes B')vec_D(H_2)\nonumber \\= & {} \widetilde{l}' \left( \mathbf{1} _{nT}, vec_D(H_1),vec_D(H_2)\right) '(I_T\otimes B)E(\widetilde{X}_1)l_1\nonumber \\&+\widetilde{l}' l_3\left( \mathbf{1} _{nT}, vec_D(H_1),vec_D(H_2)\right) '(I_T\otimes B)E(\widetilde{X}_2)+o_p(n). \end{aligned}$$

(36)

Combine (31)–(36), we have

$$\begin{aligned} Var^*(Q_n)= & {} (\vartheta _4-3\sigma ^4_\nu )||vec_D( U_{1n})||^2+\sigma ^4_\nu [tr( U_{1n}U'_{1n})+tr(U_{1n}^2)]\nonumber \\&+\sigma ^2_\nu U_{2n}U'_{2n} +2\vartheta _3U_{2n}vec_D( U_{1n})\nonumber \\= & {} (\vartheta _4-3\sigma ^4_\nu )\tilde{l}' G_1 \tilde{l}+2\sigma ^4_\nu \widetilde{l}' G_2 \widetilde{l}+l_1' \sigma ^2_\nu E\left( \widetilde{X}_1'\Omega ^{-1}\widetilde{X}_1\right) l_1\nonumber \\&+l_3^2\sigma ^2_\nu E\left( \widetilde{X}_2'\Omega ^{-1}\widetilde{X}_2\right) +2l_1' l_3\sigma ^2_\nu E\left( \widetilde{X}_1'\Omega ^{-1}\widetilde{X}_2\right) \nonumber \\&+2\widetilde{l}'\vartheta _3 \left( \mathbf{1} _{nT}, vec_D(H_1),vec_D(H_2)\right) '(I_T\otimes B)E(\widetilde{X}_1)l_1\nonumber \\&+2\widetilde{l}' l_3\vartheta _3\left( \mathbf{1} _{nT}, vec_D(H_1),vec_D(H_2)\right) '(I_T\otimes B)E(\widetilde{X}_2)+o_p(n)\nonumber \\= & {} l'\Sigma _{p+q+3}l+o_p(n), \end{aligned}$$

(37)

where $\Sigma _{p+q+3}$ is given in (24). From Condition A4, one can see that $(nT)^{-1}Var^*(Q_n)\ge c_1>0$. From Lemma 1, we have

$$\begin{aligned} {Q_n-E^*(Q_n)\over \sqrt{Var^*(Q_n)} }{\mathop {\longrightarrow }\limits ^{d^*}} N(0, 1), \end{aligned}$$

(38)

where $d^{*}$ stands for convergence in distribution given X, Z. Noting that $(nT)^{-1}Var^*(Q_n)\ge c_1>0$ and

$$\begin{aligned} Var^*(Q_n)=\sigma ^2_{Q_n}+o_p(n), \end{aligned}$$

one can show that

$$\begin{aligned} \frac{\sigma ^2_{Q_n}}{Var^*(Q_n)}{\mathop {\longrightarrow }\limits ^{P}} 1. \end{aligned}$$

(39)

Combing $E^*(Q_n)=0$, (38) and (39), we thus have

$$\begin{aligned} {Q_n\over \sigma _{Q_n}} {\mathop {\longrightarrow }\limits ^{d}} N(0, 1). \end{aligned}$$

Then (27) holds true.

Next we will prove (28), i. e.

$$\begin{aligned} (nT)^{-1}\sum _{i=1}^{nT} (l'\omega _{i}(\psi ))^2=(nT)^{-1}\sigma _{Q_n}^2+o_p(1). \end{aligned}$$

(40)

Let

$$\begin{aligned} N_{in}= & {} l'\omega _{i}(\psi )\nonumber \\= & {} u_{ii}(e^2_i-\sigma ^2_\nu )+2\sum ^{i-1}_{j=1}u_{ij}e_ie_j+b_{i}e_i\nonumber \\= & {} u_{ii}(e^2_i-\sigma ^2_\nu )+R_ie_i, \end{aligned}$$

(41)

where $R_i=2\sum ^{i-1}_{j=1}u_{ij}e_j+b_{i}$. Let ${\mathcal {F}}_{0}=\{ {\emptyset }, \Omega \}, {\mathcal {F}}_{i}=\sigma (e_1, e_2, \ldots , e_i), 1\le i\le nT$. Then $\{N_{in}, {\mathcal {F}}_{i}, 1\le i\le nT\}$ form a martingale difference array given X, Z. From (30) and (37), one can see that

$$\begin{aligned} \sigma _{Q_n}^2=\sum _{i=1}^{nT}E^*(N_{in}^2) + o_p(n). \end{aligned}$$

It follows that

$$\begin{aligned}&(nT)^{-1}\sum _{i=1}^{nT} \{ l'\omega _i(\psi ) \}^2-(nT)^{-1}\sigma _{Q_n}^2\nonumber \\&\quad = (nT)^{-1}\sum _{i=1}^{nT}\left( N_{in}^2-E^*(N_{in}^2)\right) + o_p(1)\nonumber \\&\quad = (nT)^{-1}\sum _{i=1}^{nT}\left\{ N_{in}^2-E^*(N_{in}^2|{\mathcal {F}}_{i-1})+E^*(N_{in}^2|{\mathcal {F}}_{i-1})-E^*(N_{in}^2) \right\} +o_p(1)\nonumber \\&\quad = (nT)^{-1}S_{n1}+(nT)^{-1}S_{n2}+o_p(1), \end{aligned}$$

(42)

where $S_{n1}=\sum _{i=1}^{nT}\{N_{in}^2-E^*(N_{in}^2|{\mathcal {F}}_{i-1})\}$, $S_{n2}=\sum _{i=1}^{nT}\{E^*(N_{in}^2|{\mathcal {F}}_{i-1})-E^*(N_{in}^2)\}$. Next we will show that: (1)

$$\begin{aligned} (nT)^{-1}S_{n1}=o_p(1), \end{aligned}$$

(43)

and (2)

$$\begin{aligned} (nT)^{-1}S_{n2}=o_p(1). \end{aligned}$$

(44)

To show (1) and (2), it is sufficient to show that $(nT)^{-2}E(S^2_{n1})\rightarrow 0$ and $(nT)^{-2}E(S^2_{n2})\rightarrow 0$, respectively. Obviously,

$$\begin{aligned} N_{in}^2=u_{ii}^2(e^2_i-\sigma ^2_\nu )^2+R_i^2e_i^2+2u_{ii}R_i(e^2_i-\sigma ^2_\nu )e_i. \end{aligned}$$

Thus

$$\begin{aligned} E^*(N_{in}^2|{\mathcal {F}}_{i-1})=u_{ii}^2E(e^2_i-\sigma ^2_\nu )^2+R_i^2\sigma ^2_\nu +2u_{ii}R_i\vartheta _3. \end{aligned}$$

It follows that

$$\begin{aligned} (nT)^{-2}E(S^2_{n1})= & {} (nT)^{-2}\sum _{i=1}^{nT}E\{N_{in}^2-E^*(N_{in}^2|{\mathcal {F}}_{i-1})\}^2\nonumber \\= & {} (nT)^{-2}\sum _{i=1}^{nT} E[u_{ii}^2\{(e^2_i-\sigma ^2_\nu )^2-E(e^2_i-\sigma ^2_\nu )^2\}+R_i^2(e_i^2-\sigma ^2_\nu )\nonumber \\&+2u_{ii}R_i (e^3_i-\sigma ^2_\nu e_i-\vartheta _3)]^2\nonumber \\\le & {} C(nT)^{-2}\sum _{i=1}^{nT} E[u_{ii}^4\{(e^2_i-\sigma ^2_\nu )^2-E(e^2_i-\sigma ^2_\nu )^2\}^2]\nonumber \\&+C(nT)^{-2}\sum _{i=1}^{nT} E\{R_i^4(e_i^2-\sigma ^2_\nu )^2\}\nonumber \\&+C(nT)^{-2}\sum _{i=1}^{nT} E\{u_{ii}^2R_i^2(e^3_i-\sigma ^2_\nu e_i-\vartheta _3)^2\}. \end{aligned}$$

(45)

By Conditions A1–A3, we have

$$\begin{aligned}&(nT)^{-2}\sum _{i=1}^{nT} E[u_{ii}^4\{(e^2_i-\sigma ^2_\nu )^2-E(e^2_i-\sigma ^2_\nu )^2\}^2]\nonumber \\&\quad \le C(nT)^{-2}\sum _{i=1}^{nT}u_{ii}^4 \le C(nT)^{-2}\sum _{i=1}^{nT}|l_2+l_3h_{ii,1}+l_4h_{ii,2}|^4\nonumber \\&\quad \le C(nT)^{-2}\sum _{i=1}^{nT} |l_2+l_3h_{ii,1}+l_4h_{ii,2}|^4 \le C n^{-1}\rightarrow 0, \end{aligned}$$

(46)

and

$$\begin{aligned}&(nT)^{-2}\sum _{i=1}^{nT} E\{R_i^4(e_i^2-\sigma ^2_\nu )^2\}=(nT)^{-2}\sum _{i=1}^{nT} E[E^*\{R_i^4(e_i^2-\sigma ^2_\nu )^2\}]\nonumber \\&\quad \le C (nT)^{-2}\sum _{i=1}^{nT} E(\sum ^{i-1}_{j=1}u_{ij}e_j+b_{i})^4\nonumber \\&\quad \le C (nT)^{-2}\sum _{i=1}^{nT} E(\sum ^{i-1}_{j=1}u_{ij}e_j)^4+ C (nT)^{-2}\sum _{i=1}^{nT}Eb_{i}^4\nonumber \\&\quad \le C (nT)^{-2}\sum _{i=1}^{nT} \sum ^{i-1}_{j=1}u_{ij}^4\vartheta _4+C (nT)^{-2}\sum _{i=1}^{nT} (\sum ^{i-1}_{j=1}u_{ij}^2\sigma ^2_\nu )^2\nonumber \\&\qquad +C (nT)^{-2}\sum _{i=1}^{nT} E(l_1' a_{i,1}+l_3a_{i,2})^4\nonumber \\&\quad \le C (nT)^{-2}\sum _{i=1}^{nT} \sum ^{i-1}_{j=1} |l_3h_{ij,1}+l_4h_{ij,2}|^4\nonumber \\&\qquad +C (nT)^{-2}\sum _{i=1}^{nT} \left( \sum ^{i-1}_{j=1} |l_3h_{ij,1}+l_4h_{ij,2}|^2\right) ^2\nonumber \\&\qquad +C (nT)^{-2}\sum _{i=1}^{nT} E(l_1' a_{i,1}+l_3a_{i,2})^4 \le C n^{-1}\rightarrow 0. \end{aligned}$$

(47)

Similarly, we can prove that

$$\begin{aligned} (nT)^{-2}\sum _{i=1}^{nT} E\{u_{ii}^2R_i^2(e^3_i-\sigma ^2e_i-\vartheta _3)^2\} \rightarrow 0. \end{aligned}$$

(48)

From (45)–(48), we have $(nT)^{-2}E(S_{n1}^2)\rightarrow 0$. Furthermore,

$$\begin{aligned} E^*(N_{in}^2)= & {} E^*\{E^*(N_{in}^2|{\mathcal {F}}_{i-1})\}=u_{ii}^2E(e^2_i-\sigma ^2_\nu )^2+\sigma ^2_\nu E^*(R_i^2)+2u_{ii}\vartheta _3E^*(R_i)\\= & {} u_{ii}^2E(e^2_i-\sigma ^2_\nu )^2+\sigma ^2_\nu \left( 4\sum ^{i-1}_{j=1}u_{ij}^2\sigma ^2_\nu +b_i^2 \right) +2u_{ii}\vartheta _3b_i. \end{aligned}$$

Thus,

$$\begin{aligned} (nT)^{-2}E(S_{n2}^2)= & {} (nT)^{-2}E \left[ \sum _{i=1}^{nT}\{E^*(N_{in}^2|{\mathcal {F}}_{i-1})-E^*(N_{in}^2)\} \right] ^2\nonumber \\= & {} (nT)^{-2}E \left[ \sum _{i=1}^{nT} \left\{ R_i^2\sigma ^2_\nu -\sigma ^2_\nu \left( 4\sum ^{i-1}_{j=1}u_{ij}^2\sigma ^2_\nu +b_i^2 \right) +2u_{ii}\vartheta _3(R_i-b_i) \right\} \right] ^2\nonumber \\= & {} (nT)^{-2}\sum _{i=1}^{nT} E \left[ \sigma ^2_\nu \left\{ \left( 2\sum ^{i-1}_{j=1}u_{ij}e_j\right) ^2-4\sum ^{i-1}_{j=1}u_{ij}^2\sigma ^2_\nu \right\} +4 \left( \sum ^{i-1}_{j=1}u_{ij}e_j \right) b_i\sigma ^2_\nu \right. \nonumber \\&\left. +2u_{ii}\vartheta _3 \left( 2\sum ^{i-1}_{j=1}u_{ij}e_j \right) \right] ^2\nonumber \\\le & {} C(nT)^{-2}\sum _{i=1}^{nT}E \left[ \sigma ^2_\nu \left\{ \left( \sum ^{i-1}_{j=1}u_{ij}e_j \right) ^2-\sum ^{i-1}_{j=1}u_{ij}^2\sigma ^2_\nu \right\} \right] ^2\nonumber \\&+C(nT)^{-2}\sum _{i=1}^{nT}E \left\{ \left( \sum ^{i-1}_{j=1}u_{ij}e_j \right) b_i\sigma ^2_\nu \right\} ^2\nonumber \\&+C(nT)^{-2}\sum _{i=1}^{nT}E \left\{ 2u_{ii}\vartheta _3 \left( \sum ^{i-1}_{j=1}u_{ij}e_j \right) \right\} ^2. \end{aligned}$$

(49)

Note that

$$\begin{aligned}&(nT)^{-2}\sum _{i=1}^{nT}E \left[ \sigma ^2_\nu \left\{ \left( \sum ^{i-1}_{j=1}u_{ij}e_j \right) ^2-\sum ^{i-1}_{j=1}u_{ij}^2\sigma ^2_\nu \right\} \right] ^2\nonumber \\&\quad \le (nT)^{-2}\sigma ^4_\nu \sum _{i=1}^{nT} E \left( \sum ^{i-1}_{j=1}u_{ij}e_j \right) ^4\nonumber \\&\quad \le C(nT)^{-2}\sum _{i=1}^{nT}\sum ^{i-1}_{j=1}u_{ij}^4\vartheta _4+C (nT)^{-2}\sum _{i=1}^{nT} \left( \sum ^{i-1}_{j=1}u_{ij}^2\sigma ^2_\nu \right) ^2\nonumber \\&\quad \le C n^{-1}\rightarrow 0, \end{aligned}$$

(50)

$$\begin{aligned}&(nT)^{-2}\sum _{i=1}^{nT}E \left\{ \left( \sum ^{i-1}_{j=1}u_{ij}e_j \right) b_i\sigma ^2_\nu \right\} ^2=(nT)^{-2}\sigma ^6_\nu \sum _{i=1}^{nT}E(b_i^2)\sum ^{i-1}_{j=1} u_{ij}^2\nonumber \\&\quad \le C (nT)^{-2}\rightarrow 0, \end{aligned}$$

(51)

and

$$\begin{aligned}&(nT)^{-2}\sum _{i=1}^{nT}E \left\{ 2u_{ii}\vartheta _3 \left( \sum ^{i-1}_{j=1}u_{ij}e_j \right) \right\} ^2=4\vartheta _3^2\sigma ^2_\nu (nT)^{-2} \sum _{i=1}^{nT}u_{ii}^2\sum ^{i-1}_{j=1}u_{ij}^2\nonumber \\&\quad \le C (nT)^{-1}\rightarrow 0, \end{aligned}$$

(52)

where we have used Conditions A2 and A3. From (49)–(52), we have $(nT)^{-2}ES_{n2}^2\rightarrow 0$. The proof of (28) is thus complete.

Finally, we will prove (29). Note that

$$\begin{aligned}&\sum _{i=1}^{nT} E||\omega _i(\psi )||^3 \le \sum _{i=1}^{nT}E||a_{i,1}e_i||^3+\sum _{i=1}^{nT}E|e^2_i-\sigma ^2_\nu |^3\nonumber \\&\quad +\sum _{i=1}^{nT}E|a_{i,2}e_i+h_{ii,1}(e^2_i-\sigma _\nu ^2)+2e_i\sum ^{i-1}_{j=1}h_{ij,1}e_j|^3\nonumber \\&\quad +\sum _{i=1}^{nT}E|h_{ii,2}(e^2_i-\sigma _\nu ^2)+2e_i\sum ^{i-1}_{j=1}h_{ij,2}e_j|^3. \end{aligned}$$

(53)

By Conditions A2 and A3, we have

$$\begin{aligned}&\sum _{i=1}^{nT}E||a_{i,1}e_i||^3 \le CnT \left( \max _{1\le i \le nT}E||a_{i,1}||^3 \right) E|e_1|^3=O(nT), \end{aligned}$$

(54)

$$\begin{aligned}&\sum _{i=1}^{nT}E|e^2_i-\sigma ^2_\nu |^3=O(nT), \end{aligned}$$

(55)

$$\begin{aligned}&\sum _{i=1}^{nT}E \left| a_{i,2}e_i+h_{ii,1}(e^2_i-\sigma _\nu ^2)+2e_i\sum ^{i-1}_{j=1}h_{ij,1}e_j \right| ^3 \nonumber \\&\quad \le C\sum _{i=1}^{nT}E|a_{i,2}e_i|^3+C\sum _{i=1}^{nT}E|h_{ii,1}(e^2_i-\sigma _\nu ^2)|^3+ C\sum _{i=1}^{nT}E \left| 2e_i\sum ^{i-1}_{j=1}h_{ij,1}e_j \right| ^3 \nonumber \\&\quad \le C\sum _{i=1}^{nT}E|a_{i,2}|^3E|e_i|^3+C\sum _{i=1}^{nT}E|h_{ii,1}(e^2_i-\sigma ^2_\nu )|^3\nonumber \\&\qquad + C\sum _{i=1}^{nT}E|e_i|^3\sum ^{i-1}_{j=1}E|h_{ij,1}e_j|^3 + C\sum _{i=1}^{nT}E|e_i|^3 \left\{ \sum ^{i-1}_{j=1}E(h_{ij,1}e_j)^2 \right\} ^{3/2}\nonumber \\&\quad = O(nT). \end{aligned}$$

(56)

Similarly,

$$\begin{aligned} \sum _{i=1}^{nT}E \left| h_{ii,2}(e^2_i-\sigma _\nu ^2)+2e_i\sum ^{i-1}_{j=1}h_{ij,2}e_j \right| ^3=O(nT). \end{aligned}$$

(57)

From (53)–(57), we have

$$\begin{aligned} \sum _{i=1}^{nT} E||\omega _i(\psi )||^3=O(nT). \end{aligned}$$

(58)

Further, using (58) and Markov inequality, we obtain $\sum _{i=1}^{nT} ||\omega _i(\psi )||^3=O_p(nT^2)$. Thus (29) is proved.

Proof of Theorem 1

Using Lemma 3 and following the proof of Theorem 1 in Qin (2021), one can easily show that Theorem 1 holds true. $\square $

Rights and permissions

Reprints and permissions

About this article

Cite this article

Li, Y., Qin, Y. Empirical likelihood for spatial dynamic panel data models. J. Korean Stat. Soc. 51, 500–525 (2022). https://doi.org/10.1007/s42952-021-00150-4

Download citation

Received: 29 September 2020
Accepted: 13 September 2021
Published: 29 September 2021
Issue Date: June 2022
DOI: https://doi.org/10.1007/s42952-021-00150-4

Keywords

Mathematics subject classification

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Empirical likelihood for spatial dynamic panel data models

Abstract

Similar content being viewed by others

Robust Two-Stage Estimation in General Spatial Dynamic Panel Data Models

Multiple Testing for Different Structures of Spatial Dynamic Panel Data Models

Specification tests for spatial panel data models

1 Introduction

2 Main results

Remark 1

Theorem 1

3 Simulations

4 A real data example

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Appendix

Lemma 1

Proof

Lemma 2

Proof

Lemma 3

Proof

Proof of Theorem 1

Rights and permissions

About this article

Cite this article

Keywords

Mathematics subject classification

Navigation

Empirical likelihood for spatial dynamic panel data models

Abstract

Similar content being viewed by others

Robust Two-Stage Estimation in General Spatial Dynamic Panel Data Models

Multiple Testing for Different Structures of Spatial Dynamic Panel Data Models

Specification tests for spatial panel data models

1 Introduction

2 Main results

Remark 1

Theorem 1

3 Simulations

4 A real data example

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Appendix

Appendix

Lemma 1

Proof

Lemma 2

Proof

Lemma 3

Proof

Proof of Theorem 1

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Mathematics subject classification

Search

Navigation