1 Introduction

Over the past 40 years since China's reform and opening up, its economic development has achieved remarkable results that have caught the world's attention. A period of high economic growth has emerged under the impetus of the globalized economy, especially since China became a member of the World Trade Organization (WTO) in 2001. However, behind this unprecedented growth trend is the factor-input type of ‘sloppy growth’. This process has brought about many hidden dangers, including the depletion of local resources, economic inefficiency, increased environmental pollution, economic structural imbalance, and unbalanced economic development, among others (Chen and Chen 2018; Zhang 2021). As China’s economy moves into a ‘new normal’, the past high-input, high-consumption resource-input-based approach to economic growth is no longer sustainable. The internal and external conditions that supported past economic growth have changed, and China’s economic development requires a change to its approach to growth (Zhao and Ge 2019; Zhang and Kong 2022).

High-quality development has become the main feature of contemporary China’s society and economy. It can be said that the relationship and focus between the quantity of economic growth and the quality of economic growth have begun to change, and the quality of economic growth has become a critical factor in determining economic development (Zhang 2022). At the same time, in a stage during which the quantity of economic growth in China is rapidly increasing, differences in the level of economic development between different regions are subsequently expanding due to differences in natural endowments, policy tilts, etc. There is obvious heterogeneity in both the quality of economic development and the growth rate, with both factors being higher in the eastern regions and lower in the western regions (Su and Guo 2021; Kong et al. 2021b; Wong et al. 2021a). Therefore, reducing regional disparity cannot be ignored when encouraging high-quality economic growth. The spatial correlation of regional economic growth indicates that the economic growth process of a region is closely related to the regions surrounding it. Spatial proximity, spatial accessibility, and knowledge spillover between regions are important mechanisms for convergence in economic growth clubs (Bin et al. 2016; Zhang et al. 2020).

The relationship between the urbanization process and the quality of economic growth is extremely close, and urban economic growth is a highly important component of China’s economic growth (Kong et al. 2021a; Zhang et al. 2021). China has also entered into a phase of rapid urbanization. According to statistics, China’s urbanization rate has increased by 50 percentage points from 1949 to the present and has exceeded 60% in 2019.Footnote 1 As the core space and important carrier of regional economic development, the economic growth of China's cities is closely linked to the nation’s overall socioeconomic growth. It can be said that cities bear the burden of economic development, and urban economic growth is an important component of China’s economic growth (Jiang and Yang 2020; Kong et al. 2021c; Wong et al. 2021b). Improvements in the quality of China’s economic growth must therefore be specifically reflected in the quality of economic growth in cities. This begs the question, how can the quality of urban economic growth be understood? How can it be measured? Furthermore, how has the quality of China’s urban economic growth evolved over time? What kind of spatial dependence and heterogeneity do territorial units have at the microscopic scale? What development trends are likely to emerge in the future? And considering that the agglomeration economy is the source and driver of urban expansion and economic growth (Sun et al. 2015), what will the trend of urban economic growth influenced by the agglomeration economy be?

This paper intends to answer these questions by taking the following three approaches. First, this paper takes 284 Chinese cities of prefecture level and above as research objects and measures the quality of urban economic growth in these cities in four dimensions: economic growth dynamics, structure, mode, and outcome. On this basis, we attempt to generalize their spatial and temporal evolution characteristics to determine the spatial convergence trend of economic growth quality in Chinese cities. The results of this study can help to clarify the future trend of urban economic development quality and to objectively understand the source of the gap in China’s urban economic development quality; this study can also help to deepen understanding of the current situation of regional economies and provide a reference for future urban economic development quality.

Second, this study will identify the key factors affecting the spatial convergence of urban economic growth quality and their degrees of influence. Since this paper takes cities as the object of investigation, and considering that urban economic growth has distinctive spatial aggregation characteristics compared with traditional economic growth, it is necessary to focus on distinguishing urban economic growth from traditional economic growth in the study of urban economic growth quality. Since 1978, spatial shift and agglomeration of industries have occurred in China, and widening regional economic disparity in China is closely related to this phenomenon. Especially after 1995, industrial agglomeration has become the dominant factor affecting regional disparity and industrial structure (Geo 2020). Therefore, the second problem that this paper aims to solve is the identification of whether the effect of agglomeration economy on the spatial convergence of urban economic growth quality is a result of specialized agglomeration or diversified agglomeration.

Third, the balanced development of regional economic growth is a key concern of government macro-control, and fiscal policy plays an active role in coordinating the balanced development of regional economies. The reasonable selection and application of fiscal policy tools can effectively optimize the supply structure, transform growth dynamics, and promote the synergistic development of heterogeneous regional spaces (Bian et al. 2019; Zhang and Yang 2021). Therefore, this paper further introduces fiscal policy tools, which can be used to analyze the factors influencing the spatial convergence of urban economic growth quality, and examines the changes in impact brought about by changes in the scale of fiscal expenditure. The study’s results are used to analyze the relationship and path of influence between the regional economic growth gap, the spatial convergence trend, and fiscal policy tools in China in order to provide reliable empirical evidence for policy makers.

2 Literature review

The study of spatial convergence of economic growth quality can be based on the relationship between economic growth and convergence (Lee and Yu 2012; Song et al. 2020). Early mainstream studies of economic convergence theory are mainly situated within the fields of neoclassical growth theory and endogenous growth theory. Early neoclassical economics assumed technology invariance as a premise, but neoclassical growth theory achieved a theoretical breakthrough by putting forward a theory of technological progress, arguing that total factor productivity has a role in economic growth and revealing the process by which economic growth converges to a long-term steady state (Solow 1956). However, due to the assumption of exogenous technology in neoclassical theory, it is difficult for it to explain differences in economic growth quality in the long run. Endogenous growth theory thus proposes the core concept of endogenous technology. It assumes that the factors involved have diminishing marginal returns and that the endogenous driving force of continuous technological progress lies in the exogenous accumulation of knowledge; this formulation better explains the external fact of differences in growth rates in the long run. In addition to these two mainstream theories, most scholars have combined new economic geography with the spatial convergence of economic growth quality. New economic geography assumes that individual economies operate with the same initial conditions and economic structure, and explores endogenous evolutionary divergence in the spatial dimension. Research in new economic geography finds that the systemic endogenous forces of two homogeneous individual economies, independent of other external influences, can contribute to regional evolutionary divergence, industrial agglomeration, and even the formation of core–edge structures. This finding has important implications for studying the spatial attributes of regional economic growth, which affect club convergence (Borsi and Metiu 2015).

The existence of convergence in neoclassical growth theory has been supported empirically by domestic and international scholars using country and regional samples. Initially, Baumol (1986), basing their work on Maddison’s (1982) data analysis, obtained a more pronounced productivity convergence in sample countries during the period 1870–1979, and Summers and Heston (1984) used output per capita data to find similar results. De Long (1988) argues against this. His study obtained \(\sigma\) convergence, indicating that the type of convergence is controversial, but convergence is unquestionable. Following these initial studies, studies on \(\sigma\) convergence have increased in number (Chen and Fleisher 1996; Jian et al. 1996). Sala-i-Martin (2002) has reconfirmed the relevance of neoclassical growth theory by using different country samples over different examination periods. Based on the availability of country-specific data and the consideration of excessive sample variation, scholars began to focus on the convergence of regional economic differences within countries. For example, Barro et al. (1992) used data from 48 US states. Based on the absolute \(\beta\) convergence model, they concluded that there was significant absolute convergence in per capita income or per capita output in each state. In 1995, Barro applied the same method to study convergence in the US, Japanese, and European regions and again confirmed the existence of absolute convergence. However, Young et al. (2008) conducted a distribution function analysis using 1970 and 1998 US county-level cross-sectional data and found that both counties and states displayed a single-peaked distribution; that is, growth is parallel between counties and between states and there is no \(\sigma\) convergence. Caggiano and Leonida (2009) find that economic growth is conditionally convergent in OECD countries. Moreover, Caggiano and Leonida (2013) argue that the presence of clustering and/or polarization may be a common problem in regional models, but do not exclude the existence of absolute convergence.

In the 1970s, Weeks and Yao (2003) found large differences between China’s regional economies, but these differences narrowed after the agricultural reform. When industrialization became widespread in different regions of China in the 1990s, economic disparity between regions reemerged as a result of the different levels of industrialization in different regions. Due to the advantage of endowment combined with policy inclination, the economic development of the eastern region was far in advance of other regions. The inter-regional economic growth rate was clearly higher in the east and lower in the west, and this gap continues to widen (Démurger et al. 2002). However, club convergence and conditional \(\beta\) convergence are evident within the eastern region (Shen and Ma 2002).

Furthermore, to investigate whether there a trend of convergence was evident at the region-wide level in China, Lin and Liu (2003) used economic growth data from 1978 to 1999 for different Chinese provinces, and Xu and Li (2004) produced the first analysis of economic convergence in China using urban data. Both these studies found that there was indeed a trend of convergence. Jiang (2012), on the basis of the Solow growth model, found that labor productivity converges rapidly and conditionally across Chinese provinces. Wang et al. (2013) used non-stationary factor analysis to find that the conditional convergence of China's regional economies is extremely weak. Using a static spatial panel data model, Chen et al. (2018) found absolute and conditional convergence in urban economic growth in China, while Sun and Cao (2018) found that club convergence can be observed in China's urban economy through a nonlinear time-varying factor model study.

With the development of spatial economics, the spatial convergence of economic growth has received wide attention. The foreign scholars Seya et al. (2012), and Ahmad and Hall (2017) used spatial error and spatial Durbin model empirical analysis to examine spatial convergence. The Chinese scholar Ying (2000) was the first to study the spatial correlation of economic growth. Following Ying, many scholars such as Pan (2010), and Huang and Yuan (2014) explored the spatial convergence of China’s economy at the provincial level, the city level and the three major regional levels of east, central and west using spatial econometric models based on the consideration of geographical factors. Their findings all suggest the existence of conditional \(\beta\) convergence or spatial club convergence in China, but do not support the existence of absolute convergence. Following the significant impact on global economic growth of the outbreak of COVID-19 in 2020 (Nandan and Mallick 2021), scholars have initiated a new discussion on the spatial convergence of economic growth. Wang et al. (2020) established a spatial Durbin model to empirically test the spatial convergence of economic growth between provinces in China. They found that economic growth between provinces in China is congruent with the law of conditional convergence. Cartone et al. (2021) studied differences in the determinants of economic growth between 187 regions of 12 European countries and used spatial quantile regression methods to find differences in the rates of conditional convergence of investment, population growth, and human capital in these European regions.

In conclusion, research on the spatial convergence of economic growth quality can be divided into general economic growth convergence theory and spatial economic growth convergence theory. General economic growth convergence theory mainly revolves around neoclassical growth theory and endogenous growth theory. Neoclassical theory assumes that technology is exogenous, which can account for the convergence of economic growth to a steady state in the long run, to a certain extent. However, its assumption of technological exogeneity can hardly explain differences in economic growth quality in the long run. The endogenous growth model, which treats the dynamics of economic growth as resulting from technological progress, solves this difficulty precisely. With the rise of spatial economics, geographic factors have been given full attention in the study of economic growth convergence. The theory of spatial convergence of economic growth, with new economic geography theory as its core, was thus born. This theory holds that individual economies interact spatially, resulting in spillover effects in economic development.

3 Analysis of regional differences in the green economic growth of Chinese cities

3.1 Indicator system establishment

Drawing on the research of the scholar Guo et al. (2020) and combining the characteristics of urban economy and data availability, this section selects four dimensions (economic growth dynamics, structure, mode, and outcome) to construct an indicator system for urban economic growth quality. The index system includes 9 secondary indicators and 18 tertiary indicators. The specific content of the indicators is shown in Table 1.

Table 1 Index system of economic growth quality of Chinese cities

3.2 Measurement process of urban economic growth quality index

In this paper, the entropy weighting and Delphi methods are applied to establish a comprehensive urban economic growth quality index. The entropy method of assigning weights is based on the amount of information reflected by the level of data evolution, which can objectively reflect the importance of each indicator in the evaluation system as a whole. This can reduce the influence of human subjective factors on the indicator weights. The Delphi method is an expert opinion survey method. Combining the advantages of subjective assignment and objective assignment, this paper assigns equal weights to indicators of the four dimensions of economic growth dynamics, structure, mode and result: each dimension accounts for 25%. The specific measurement process is as follows.

  1. (1)

    Standardization. The indicators' units and orders of magnitude are very different, and direct calculation will cause large errors, so the original data need to be invariant.

    $$x_{ij}^{\prime } = \frac{{x_{ij} - \min \left( {x_{ij} } \right)}}{{\max \left( {x_{ij} } \right) - \min \left( {x_{ij} } \right)}}\quad {\text{Positive}}\;{\text{indicators}}$$
    (1)
    $$x_{ij}^{\prime } = \frac{{\max \left( {x_{ij} } \right) - x_{ij} }}{{\max \left( {x_{ij} } \right) - \min \left( {x_{ij} } \right)}}\quad {\text{Negative}}\;{\text{indicators}}$$
    (2)

    In Eqs. (1) and (3), i represents the city, j represents the measured index, \(x_{ij}\) represents the value of the j-th index of the i-th city, and \(x_{ij}^{^{\prime}}\) is the result after data normalization.

  2. (2)

    Calculate the share of the i-th city under the j-th indicator.

    $$S_{ij} = \frac{{x_{ij} }}{{\mathop \sum \nolimits_{i = 1}^{n} x_{ij} }}$$
    (3)
  3. (3)

    Calculation of the entropy value of the j-th indicator.

    $$e_{j} = - k\mathop \sum \limits_{i = 1}^{n} s_{ij} ln\left( {s_{ij} } \right)$$
    (4)

    where \(k = \frac{1}{ln\left( n \right)}\), \(e_{j} \ge 0\) and satisfies \(e_{j} \ge 0\).

  4. (4)

    Calculation of information entropy redundancy.

    $$d = 1 - e_{j}$$
    (5)
  5. (5)

    Calculation of the weights of each indicator.

    $$w_{j} = \frac{{d_{j} }}{{\mathop \sum \nolimits_{j = 1}^{m} d_{j} }}$$
    (6)
  6. (6)

    Measurement of the composite index of economic growth quality. This formula is calculated through a multiple linear weighting function, as follows.

    $${\text{QEG}}_{i} = \mathop \sum \limits_{j = 1}^{m} w_{j} \times s_{ij}$$
    (7)

    In Eq. 7, \({\text{QEG}}_{i}\) represents the economic growth quality of city i, \({\text{QEG}} \subseteq \left[ {0,1} \right]\). The larger the \({\text{QEG}}_{i}\) index, the higher the economic growth quality of the city. Conversely, the smaller the \({\text{QEG}}_{i}\) index, the lower the economic growth quality of the city.

3.3 Measurement results and analysis

Based on the economic growth quality measured in the previous section, this section examines overall differences in the economic growth quality of 284 cities in China in terms of mean, standard deviation, minimum, median, and maximum values. The statistical information of the basic data is shown in Table 2. On this basis, the trend of growth rate of each indicator relative to 2005 was plotted, as shown in Fig. 1.

Table 2 Descriptive statistics of economic growth quality in Chinese cities
Fig. 1
figure 1

Growth rate of descriptive statistics

In general, China’s urban economy shows a fluctuating upward trend. Specifically, average urban economic growth quality rose from 0.2609 in 2005 to 0.2993 in 2018, an increase of 14.71%, implying an overall improvement in the quality of China's urban economic growth. A more pronounced trough of fluctuation occurred in 2008, when urban economic growth quality dropped to 0.2550, a 2.26% decrease compared to 2005. The main reason for this trough is that the global systemic financial crisis affected the economic system in 2008, leading to a rapid economic decline and a consequent drop in the quality of economic growth. However, with the help of the ‘four trillion’ investment plan, China's real economy recovered rapidly. At this time, investment differed greatly between different cities, leading to another divergence in the development of each city's economy, and the differences between cities began to expand again. The growth curve of standard deviation shown in Fig. 1 always lies below the zero level and shows a fluctuating decline. It shrank from 0.0730 in 2005 to 0.0426 in 2018, an overall reduction of 41.65%, indicating a trend of narrowing differences in the quality of urban economic growth. The minimum value gradually increased from 0.1492 in 2005 to 0.1816 in 2018, while the maximum value expanded from 0.5113 in 2005 to 0.5240 in 2018. The average value across all years is higher than the median value of the maximum and minimum values, indicating that the difference in urban economic growth quality had a narrowing trend and balanced development.

Next, we used nonparametric kernel function estimation to empirically analyze urban economic growth quality distribution characteristics over the years. In this section, representative years at three-year intervals (2005, 2008, 2011, 2014, and 2017) were selected to plot the dynamic evolution of nonparametric estimations of urban economic growth quality in Fig. 2.

Fig. 2
figure 2

Evolution of the distribution of economic growth quality levels in Chinese cities

Overall, the basic shape of the size distribution curve of urban economic growth quality is consistent over the years. China shows a single-peaked state in the evolution of urban economic growth quality distribution during 2005–2018. There is a significant narrowing of the wave width from 2005 onward. The wave width in 2017 is the narrowest among the representative years, which to some extent indicates a narrowing trend in absolute differences in the quality of urban economic growth. From the evolution of the crests, urban economic growth quality moves to the right and the height of the main crest increases over time, with the crest reaching its highest point in 2017. This indicates that the quality of urban economic growth has improved, which is consistent with the findings of previous studies.

According to the distribution characteristics of the density curve, cities with low levels of development of economic growth quality occupied a large proportion of this distribution before 2011 and gradually transitioned to medium–high levels of economic growth quality after 2011. In the early period, China's economy pursued an increase in total economic volume by extensive and rough economic growth, ignoring the problems of environmental pollution, resource depletion, low economic efficiency and structural imbalance (Chen and Chen 2018). As China's economic growth enters a ‘new normal’, the problem of non-synchronization of the quality of economic growth and the quantity of economic growth is gradually being exposed. A series of key measures for high-quality development emphasize quality as the core of development, and the Chinese economy is gradually entering a new stage in which quality is given primacy. Results at the city level also show that China's urbanization level has been developing rapidly since 2011, indicating a highly agglomerated stage of economic development.

4 Research design

4.1 Model setup and data description

As a first step, a general convergence model was constructed. \(\beta\) convergence is an important means for examining economic convergence among regions, and \(\sigma\) convergence among regions will only hold if \(\beta\) convergence exists among regions. Therefore, the general convergence model used here takes \(\beta\) convergence as the initial measurement model. An absolute \(\beta\) convergence model of economic growth quality was constructed.

$$d\left( {ln{\text{QEG}}_{it} } \right) = ln{\text{QEG}}_{it} - ln{\text{QEG}}_{it - 1} = \alpha + \beta ln{\text{QEG}}_{it - 1} + \varepsilon_{it}$$
(8)

The economic growth quality condition \(\beta\) convergence model is based on absolute \(\beta\) convergence controlling for the initial characteristics of economic individuals, i.e., the absolute \(\beta\) convergence is based on the introduction of control variables.

$$d\left( {lnQEG_{it} } \right) = lnQEG_{it} - lnQEG_{it - 1} = \alpha + \beta lnQEG_{it - 1} + \gamma X_{it} + \varepsilon_{it}$$
(9)

where \(\varepsilon_{it \sim }^{iid} N\left( {0,\sigma^{2} } \right)\), \({\text{QEG}}_{it}\) denotes the economic growth quality of city i in year t, and \({\text{QEG}}_{it - 1}\) denotes the economic growth quality of city i in year t-1. \(X_{it}\) in Eq. 9 is the set of control variables. If the coefficient \(\beta\) is less than 0 and statistically significant, this means that there is absolute \(\beta\) convergence and conditional \(\beta\) convergence in the quality of urban economic growth, which eventually converges to the steady-state \(\gamma_{0}\). The convergence steady-state value \(\gamma_{0} = \frac{\alpha }{1 - \beta }\), the convergence rate \(\theta = \frac{{ - ln\left( {1 + \beta } \right)}}{t}\), and the convergence half-life cycle \(\tau = \frac{ln2}{\theta }\) can be calculated from the estimated value of the convergence coefficient \(\beta\).

In the second step, a spatial convergence model was constructed. Cities are divided into artificial administrative divisions, cities are open in spatial scope, and there are economic interactions between different cities. In addition, the spatial correlation pattern of economic growth quality and the boundaries between cities may not be uniform, generating neighborhood measurement errors. The statistics in the research process are related to the sample space, and subsequently, the quality of economic growth in different cities may be affected by spatial correlation. If the spatial factor is ignored, large errors may result, so a spatial lag model (SAR) and a spatial error model (SEM) were constructed. The specific expressions of these models are given in Eqs. (10) and (11).

$$\begin{gathered} d\left( {ln{\text{QEG}}_{it} } \right) = ln{\text{QEG}}_{it} - ln{\text{QEG}}_{it - 1} \hfill \\ \alpha S + \rho W_{n} \left( {ln{\text{QEG}}_{it} - ln{\text{QEG}}_{it - 1} } \right) + \beta ln{\text{QEG}}_{it - 1} + \gamma X_{it} + \mu_{it} \hfill \\ \end{gathered}$$
(10)
$$\begin{gathered} d\left( {ln{\text{QEG}}_{it} } \right) = ln{\text{QEG}}_{it} - ln{\text{QEG}}_{it - 1} \hfill \\ \alpha S + + \beta {\text{QEG}}_{it - 1} + \gamma X_{it} + \varphi_{it} ,\;\varphi_{it} = \lambda W\varphi_{it} + \mu_{it} ,\;\varphi_{it} = \lambda W\varphi_{it} + \mu_{it} \hfill \\ \end{gathered}$$
(11)

where \(\varepsilon_{it \sim }^{iid} N\left( {0,\sigma^{2} } \right)\), S is the spatial unit column vector, and W is the spatial autocorrelation weight matrix. Equation (10) is the spatial lag model, where \(\rho\) is the spatial autocorrelation parameter. When \(\rho\) is greater than 0, this indicates a positive spatial correlation of spatial economic growth quality among cities; when \(\rho\) is less than 0, this indicates a negative spatial correlation of spatial economic growth quality among cities. \(\beta\) is the convergence coefficient consistent with Eqs. (8) and (9). Equation (11) is the spatial error model, where \(\lambda\) is the parameter measuring the spatial correlation between the regression residuals.

In the third step, the impact of the aggregation of economic and fiscal policy instruments on the spatial convergence of the quality of urban economic growth was examined. A specific model was constructed as follows.

$$\begin{gathered} d\left( {ln{\text{QEG}}_{it} } \right) = ln{\text{QEG}}_{it} - ln{\text{QEG}}_{it - 1} = \alpha S + \rho W_{n} \left( {ln{\text{QEG}}_{it} - ln{\text{QEG}}_{it - 1} } \right) \hfill \\ + \eta_{1} SP_{it} + \eta_{2} {\text{DV}}_{it} + \eta_{3} {\text{FOS}}_{it} + \beta ln{\text{QEG}}_{it - 1} + \gamma X_{it} + \mu_{it} \hfill \\ \end{gathered}$$
(12)
$$\begin{gathered} d\left( {ln{\text{QEG}}_{it} } \right) = ln{\text{QEG}}_{it} - ln{\text{QEG}}_{it - 1} = \alpha S + + \beta {\text{QEG}}_{it - 1} \hfill \\ + \eta_{1} SP_{it} + \eta_{2} {\text{DV}}_{it} + \eta_{3} {\text{FOS}}_{it} + \gamma X_{it} + \varphi_{it} ,\,\varphi_{it} = \lambda W\varphi_{it} + \mu_{it} \hfill \\ \hfill \\ \end{gathered}$$
(13)

In Eqs. (12) and (13), \(SP\) denotes the degree of specialization and agglomeration of productive service industries, \({\text{DV}}\) denotes the level of diversification and agglomeration of productive service industries, \({\text{FOS}}\) denotes the scale of government fiscal expenditure, and \(X_{it}\) is the set of control variables.

4.2 Setting of spatial weight matrix

In spatial econometric analysis, the spatial weight matrix is a powerful tool for the conceptualization of spatial relationships, which reflects the structure and intensity of spatial effects and determines the degree of contribution of spatial units to neighboring units. In addition to geographical factors, studies have shown that spatial correlation among cities is also related to the level of economic development (Gu and Pang 2008). This paper therefore set the spatial weight matrix according to two sets of factors: geographical factors and economic factors.

4.2.1 Distance matrix of neighboring weights

If two cities are adjacent, then the weight is 1, and if they are not adjacent, the weight is 0.

$$W_{ij}^{n} = \left\{ {\begin{array}{*{20}c} {1,} & {i = j} \\ {0,} & {i \ne j} \\ \end{array} ,\quad W_{ij}^{^{\prime}n} = \frac{{W_{ij}^{n} }}{{\mathop \sum \nolimits_{j} W_{ij}^{n} }},\quad i \ne j} \right.$$
(14)

In Eq. 17, i and j denote the i-th and j-th cities. To simplify the model and to easily interpret the empirical results, the spatial weight matrix is row normalized so that the sum of the elements in each row is 1, yielding \(W_{ij}^{^{\prime}n}\).

4.2.2 Geographical distance weight matrix

According to the general rule of spatial correlation between regions, the shorter the distance interval between regions, the stronger the correlation between regions. As the distance interval expands, the correlation between regions will gradually weaken. Therefore, this paper assigns weights according to the inverse of the geographical distance between different cities, as shown in Eq. (15).

$$W_{ij}^{d} = \left\{ {\begin{array}{*{20}c} {1/d_{ij} ,} & {i \ne j} \\ {0,} & {i = j} \\ \end{array} } \right.,\quad W_{ij}^{^{\prime}d} = \frac{{W_{ij}^{d} }}{{\mathop \sum \nolimits_{j} W_{ij}^{d} }},\quad i \ne j$$
(15)

where \(d_{ij}\) is the geographic distance between two places, the weight is assigned to 0 when the inverse of the geographic distance is infinite, and \(W_{ij}^{^{\prime}d}\) is the normalized matrix.

4.2.3 Economic distance weight matrix

Given that competition and spillover effects are more likely to be triggered between cities with similar levels of economic development, an economic distance weight matrix was constructed. This matrix combines both spatial and economic factors which lead to differences, as shown in Eq. (16).

$$W_{ij}^{e} = W_{ij}^{d} diag\left( {\overline{Y}_{1} /\overline{Y},\overline{Y}_{2} /\overline{Y}, \ldots \overline{Y}_{n} /\overline{Y}} \right)\quad W_{ij}^{^{\prime}e} = \frac{{W_{ij}^{e} }}{{\mathop \sum \nolimits_{j} W_{ij}^{e} }}\quad i \ne j$$
(16)

where \(\overline{Y}_{i}\) is the economic indicator of the i-th city, \(\overline{Y}_{i} = \frac{1}{n}\), \(\overline{Y} = \frac{1}{n}\), is the mean value of economic indicators for all cities in the total observation period, and the normalized weight is \(W_{ij}^{^{\prime}e}\).

4.3 Variable descriptions and data sources

In this paper, the quality of economic growth in Chinese cities was selected as the research target. The sample data were obtained from the China City Statistical Yearbook. The data of the number of invention indicators were obtained from the CNRDS database. Data processing was performed in three main steps, as follows.

  1. (1)

    Different processing methods were selected to process cases of missing values for individual indicators. Indicators with low levels of missing data were filled with the average value of the two periods before and after the missing value, or with the value of the previous and later periods. We chose the interpolation method for indicators with missing data in individual years for all cities. For indicators with high levels of missing data, we considered replacing the variables.

  2. (2)

    Some indicators required simple calculation. The relevant proportion indicators were measured, and the import and export data and foreign direct investment data expressed in US dollars were converted according to the exchange rate of US dollars to RMB in the relevant year.

  3. (3)

    Because there were serious missing data problems for Tibet, Bijie, Tongren and several other cities, these cities were excluded from the sample. After this processing, panel data for 284 cities from 2005 to 2018 were ultimately obtained.

The core variables included the concentration of productive service industries and the scale of fiscal spending. Production service industry agglomeration was divided into the degree of specialized agglomeration and the level of diversified agglomeration. Referring to Combes (2000), this paper expresses the production service industry specialization agglomeration (SP) indicator as: \({\text{SP}}_{i} = \left( {\overline{E}_{is} /E_{i} } \right)/\left( {\overline{E}_{s} /E} \right)\), where \({\text{SP}}_{i}\) is the degree of specialization and agglomeration of productive service industry in city i; \(\overline{E}_{is}\) is the number of productive service employment in city i; \(E_{i}\) is the total employment in city i; \(\overline{E}_{s}\) is the number of people employed in productive services nationwide; and \(E\) is the overall number of people employed nationwide. The level of of the diversification agglomeration (DV) indicator is tabulated as \(DV_{i} = \mathop \sum \limits_{s} \left( {E_{is} /E_{i} } \right) \times \left[ {{{\left[ {{\raise0.7ex\hbox{$1$} \!\mathord{\left/ {\vphantom {1 {\mathop \sum \limits_{{s^{\prime} = 1,\,s^{\prime} \ne s}}^{n} \left[ {E_{{is^{\prime}}} /\left( {E_{i} - E_{is} } \right)} \right]^{2} }}}\right.\kern-\nulldelimiterspace} \!\lower0.7ex\hbox{${\mathop \sum \limits_{{s^{\prime} = 1,\,s^{\prime} \ne s}}^{n} \left[ {E_{{is^{\prime}}} /\left( {E_{i} - E_{is} } \right)} \right]^{2} }$}}} \right]} \mathord{\left/ {\vphantom {{\left[ {{\raise0.7ex\hbox{$1$} \!\mathord{\left/ {\vphantom {1 {\mathop \sum \limits_{{s^{\prime} = 1,\,s^{\prime} \ne s}}^{n} \left[ {E_{{is^{\prime}}} /\left( {E_{i} - E_{is} } \right)} \right]^{2} }}}\right.\kern-\nulldelimiterspace} \!\lower0.7ex\hbox{${\mathop \sum \limits_{{s^{\prime} = 1,\,s^{\prime} \ne s}}^{n} \left[ {E_{{is^{\prime}}} /\left( {E_{i} - E_{is} } \right)} \right]^{2} }$}}} \right]} {{1 \mathord{\left/ {\vphantom {1 {\mathop \sum \limits_{{s^{\prime} = 1,\;s^{\prime} \ne s}}^{n} \left[ {E_{{s^{\prime } }} /\left( {E - E_{s} } \right)} \right]^{2} }}} \right. \kern-\nulldelimiterspace} {\mathop \sum \limits_{{s^{\prime} = 1,\;s^{\prime} \ne s}}^{n} \left[ {E_{{s^{\prime } }} /\left( {E - E_{s} } \right)} \right]^{2} }}}}} \right. \kern-\nulldelimiterspace} {{1 \mathord{\left/ {\vphantom {1 {\mathop \sum \limits_{{s^{\prime} = 1,\;s^{\prime} \ne s}}^{n} \left[ {E_{{s^{\prime } }} /\left( {E - E_{s} } \right)} \right]^{2} }}} \right. \kern-\nulldelimiterspace} {\mathop \sum \limits_{{s^{\prime} = 1,\;s^{\prime} \ne s}}^{n} \left[ {E_{{s^{\prime } }} /\left( {E - E_{s} } \right)} \right]^{2} }}}}} \right]\), where \(E_{{is^{^{\prime}} }}\) denotes the number of people employed in a particular productive service industry \(s^{\prime}\) (in city i and industry s); \(E_{{s^{\prime}}}\) is the number of people employed in a productive service industry \(s^{\prime}\) nationwide in addition to industry s; and \(E_{s}\) represents the number of people employed in productive service industry s nationwide. The fiscal expenditure size (FOS) is expressed using the ratio of local fiscal general budgetary expenditure to GDP, drawing on the methodology of Wu (2021).

In this paper, we selected the level of informatization (INF), city size (POP), physical capital (K), transportation accessibility (TRAV), degree of government intervention (GOV), economic policy uncertainty index (EPU), and industrial SO2 emissions (SO2) as control variables. The level of informatization was measured using postal service revenue per capita. City size was measured using the total year-end population of the municipality. Physical capital was measured by capital stock calculated by the perpetual inventory method, and the capital stock depreciation rate was borrowed from Zhang et al. (2004) (set at 9.6%). Transportation accessibility was measured by total freight transportation per capita. The degree of government intervention was expressed using the city's fiscal revenue as a percentage of GDP. The Economic Policy Uncertainty Index was based on Baker's et al. (2016) methodology, which first calculates the proportion of daily news articles in the South China Morning Post (published in Hong Kong, China) that contained the keywords “China”, “economy”, “policy”, and “uncertainty” to the total number of articles published by the South China Morning Post in that month. Baker’s methodology then uses the index of global economic policy uncertainty published on an economic policy uncertainty websiteFootnote 2 to measure the index. This paper construct enterprises' pollution emission intensity index using industrial SO2 emissions, which are typically used to represent air pollution.

5 Empirical results and discussion

5.1 General convergence results

Table 3 shows the results of the estimation of the absolute \(\beta\) convergence model of economic growth quality. To consider the possibility of club convergence in China, this section empirically analyzes absolute \(\beta\) convergence at the national level and the three regional levels of east, central and west. The results show that the \(\beta\) coefficient is significantly smaller than 0 in the national sample, indicating the existence of absolute \(\beta\) convergence in China. Similarly, the eastern, central, and western samples show a trend of club convergence. The western region converges the fastest, followed by the central and eastern regions, while the central and western regions converge faster than the national average. However, the absolute convergence rate is still relatively slow for each sample.

Table 3 Test of absolute \(\beta\) convergence of urban economic growth quality

5.2 Spatial convergence analysis

Table 4 shows the spatial conditional \(\beta\) convergence results obtained using the maximum likelihood estimation method (ML). Columns (1)–(3) are the results of spatial lagged dynamic panel model estimation, and columns (4)–(6) are the results of spatial error dynamic panel model estimation. This paper uses a combination of Hausman and Lagrange multiplier (LM) tests to make judgments with regard to the choice of spatial model. The results of the Hausman statistical test tended to favor the empirical results of the fixed-effects model rather than the random-effects model. The LM test results indicated that the surface spatial lag model was more suitable for the empirical analysis in this paper. Therefore, this section mainly focuses on the results in columns (1)–(3). Although the spatial error model is slightly less reasonable than the spatial lag model, it still reflects the robustness of the results of this study.

Table 4 Quality conditions for China's economic growth \(\beta\) spatial convergence test

The \(ln{\text{QEG}}_{it - 1}\) coefficient is significantly negative, indicating that urban economic growth quality still eventually converges to a steady state after the inclusion of spatial factors; that is, there is spatial conditional \(\beta\) convergence. For different spatial weight matrices, the coefficients of \(ln{\text{QEG}}_{it - 1}\) differ. The absolute value of the convergence coefficient corresponding to the economic distance weight matrix is the largest, followed by the neighboring weight matrix, and the geographical distance weight matrix. This indicates that convergence is more significant between neighboring cities with similar economic development levels. In addition, the spatial effect coefficient \(\rho\) is significantly positive for the neighboring weights, geographic distance weights, and economic distance weights. This indicates that, due to the positive spatial spillover effect, the convergence of economic growth quality in the sampled cities is accelerated and the convergence period is shortened.

5.3 Explanations for the spatial convergence of the urban economic growth quality

A scatter plot was drawn with the specialization agglomeration index and diversification agglomeration index as the x-axis and the quality of economic growth as the y-axis, respectively. As shown in Fig. 3, there is a linear relationship between specialization and diversification agglomeration and the quality of urban economic growth. Specialization agglomeration is positively related to the quality of urban economic growth, while diversification agglomeration is inversely related to the quality of urban economic growth.

Fig. 3
figure 3

Scatter plot of productive service industry agglomeration and urban economic growth quality

Table 5 shows the estimation results of the spatially conditional \(\beta\) convergence model, which reflect the effects of productive service industry agglomeration and fiscal expenditure.

Table 5 Results of considering productive services agglomeration and fiscal expenditures

The specialized agglomeration of productive service industries plays a positive role in the spatial convergence of economic growth quality of cities. However, the higher the diversification agglomeration is, the more diffuse the economic growth quality is. After the agglomeration economy is factored in, the convergence speed of urban economic growth quality increases and the convergence period is shortened. The spatial lag model’s convergence speeds based on the neighborhood weight, geographic distance weight, and economic distance weight are 5.828%, 4.397%, and 9.702%, respectively. In the spatial condition β convergence model without considering industrial agglomeration, the convergence speeds corresponding to the three weight matrices are 3.543%, 3.479%, and 4.081%, respectively. The convergence speeds are increased by 64.49%, 26.39%, and 137.74%, respectively.

In terms of policy effects, the FOS coefficient is significantly positive, indicating that increasing the scale of fiscal expenditure can significantly promote the spatial convergence of urban economic growth quality. Therefore, fiscal policy instruments are important factors influencing the spatial convergence of urban economic growth quality. Finally, there is a significant positive spatial spillover effect among cities. This indicates that cities should consider the local industrial development level and the industrial development patterns of the surrounding areas when planning and designing industrial development.

5.4 Robustness test

The spatial spillover effects obtained from the spatial lag model and spatial error model include local effects and indirect effects from other cities. In order to ensure the robustness of the above conclusions, and considering that in practice cases of spatial lag and spatial error may exist simultaneously, this section uses the spatial Durbin model for re-testing. In addition, considering the possibility of spatial correlation jumps due to the length of the examined period, the examined period is further divided into the sub-periods 2005–2012 and 2013–2018. The results are presented in Table 6. In Table 6, W1, W2, and W3 correspond to the adjacency matrix, geographic distance matrix, and economic distance matrix, respectively.

Table 6 Results of spatial Durbin model

The results in Table 6 show that the specialized agglomeration of productive services is positive for the spatial convergence of the quality of urban economic growth, while diversified agglomeration increases the gap in urban economic growth quality. This is consistent with the results given in Table 5. Furthermore, from 2005 to 2012, specialization agglomeration is shown to be key to promoting the convergence of the urban economic growth quality gap; from 2013 to 2018, diversification agglomeration plays a major role in promoting economic growth quality. This result again proves that agglomeration economy is a key factor in the convergence of economic growth quality in cities. In addition, W*SP is significantly positive, while W*DV is significantly negative, indicating that specialized agglomeration in some cities can significantly contribute to the convergence of economic growth quality in other cities. Still, the specialized agglomeration of industries in neighboring cities contributes to the widening of regional differences. This situation is always present in both the short and long runs. Finally, we look at the impact of fiscal policy instruments. The coefficient of FOS is significantly positive, indicating that an increase in the size of fiscal spending can significantly contribute to the convergence of the quality of economic growth in cities. The estimated coefficient of W*FOS also has a positive value, indicating that fiscal spending in some cities also affects the convergence of the quality of economic growth in other cities.

6 Conclusions and policy recommendations

This paper takes 284 prefecture level and above cities in China from 2005 to 2018 as the research objects, constructs comprehensive indicators of urban economic growth quality using the entropy value method, and tries to summarize their spatiotemporal evolution characteristics to initially determine the convergence of urban economic growth quality in China. Given the possible spatial correlation of different cities, this paper examines the spatial convergence of urban economic growth quality by using a spatial econometric model starting from a general convergence trend. Furthermore, this paper also tries to explain the spatial convergence of urban economic growth quality fin terms of both agglomeration economy and fiscal policy instruments.

The study draws the following conclusions. First, there is a general convergence trend in China across the whole territory. The western region converges the fastest, followed by the central and eastern regions, while the central and western regions converge faster than the national average. However, the absolute convergence rate is still relatively slow, regardless of the sample. Second, in terms of spatial convergence, the convergence between neighboring cities with similar economic development levels is more significant. Due to the existence of a positive spatial spillover effect, the speed of economic growth quality convergence among cities in China is accelerated and the convergence period is shortened. Third, an examination of the factors influencing the spatial convergence of urban economic growth quality reveals that the specialized agglomeration of productive service industries has a positive influence on the spatial convergence of urban economic growth quality, while the influence of diversified agglomeration is negative. After factoring in agglomeration economy, the convergence speed of urban economic growth quality increases and the convergence period is shortened accordingly. The empirical results of this paper also indicate that fiscal policy instruments are important factors affecting the spatial convergence of urban economic growth quality. The increase of fiscal expenditure scale can significantly promote the convergence of urban economic growth quality. The fiscal expenditure of other cities also significantly affects the spatial convergence trend of local urban economic growth quality.

The empirical analysis of this paper shows that the quality of China’s urban economic growth is currently converging spatially. However, there is still a problem in that there is a low level of convergence and a decreasing rate of convergence. Therefore, measures need to be taken to break this bottleneck. This paper recommends the following policies.

First, the spatial distribution of cities should be coordinated and the horizontal and vertical interactions between regions should be strengthened. This paper demonstrates a significant spillover effect on the convergence of economic growth quality in China’s cities and an obvious center-periphery pattern. The level of economic growth quality is unevenly distributed; this is related to the spatial distribution of cities to a certain extent. Therefore, the government should improve regional transportation structure, promote the construction of an inter-city rapid railroad network, and build a more rational urban spatial structure system to improve spatial accessibility and the degree of inter-regional knowledge spillover.

Second, inter-regional industrial sectors should be promoted; this will encourage regional industries to drive each other and encourage factor circulation. The government should focus on lowering technical barriers, promoting the flow of outstanding talents, capital and technology, and encourage in-depth cooperation between regions with similar technical structures. The advantage of doing so is that these policies can improve the regional linkage effect and avoid inter-regional policy fragmentation in order to adjust the disadvantages of regional economies and stimulate their growth.

Third, a city network system should be built and the city network structure should be optimized. A scientific and rational urban network system will be beneficial for the optimal allocation of resources by market players and will help to realize urban economies of scale and the benefits of agglomeration. Given the systemic characteristics of different regional areas, the government should implement active fiscal policies and accurately grasp the different functional characteristics and service levels of cities in multiple networks to realize the complementary development and overall optimization of regional urban functions. The government should enhance network organization efficiency by improving the network resource domination ability of regional central cities and strengthening the support provided by node cities to the network system.