1 Introduction

In the midst of economic downturns, a healthy labour marketFootnote 1 indicated by rising employment is usually a pointer to output growth recovery (Course 2014). In this paper, we explore the predictive value of the United States Nonfarm Payroll (USNFP)—a measure of number of workers who actively participate in the production processFootnote 2—for output growth of the country in question. The United States being a highly developed market economy, is the world’s largest national economy in terms of GDP. Besides, it has the highest average household and employee income among OECD member states, which are pointers to a virile labour market. Therefore, any shock to this market is capable of impacting its production activities, and by extension, the economies of other countries of the world that are closely linked to it. This was particularly evident during the 2008 financial crisis, which began with the collapse of the U.S. housing market and quickly spread to the rest of the world. Thus, we hypothesize a positive nexus between the two variables (i.e. USNFP and output growth) following the modern theory of employment (see Wilson 1960; Basu and Foley 2013). Accordingly, an increase in the number of active workers (USNFP) would bring about a rise in aggregate demand via the income channel, which consequently improves output growth. In other words, higher (lower) incomes via additional (reduction in) jobs would lead to a rise (fall) in aggregate demand and by extension, an increase (a decrease) in investment in physical goods, hence growth (fall) in output level. The reason for a special focus on unemployment statistics is underscored by the signal it provides on the efficient utilization of resources which is necessary for growth in output (Blanchard and Johnson 2003). In addition, our underlying interest in exploring the out-of-sample predictability of USNFP for output growth lies in the evidence of moderate growth recorded in the U.S. economy whenever the former started to peak up (see Alguire 1996; Course 2014). Thus, we advance the literature to see if the information contained in the USNFP can be explored to improve the out-of-sample forecasts of output growth as limiting the analysis to impact assessment (that is, in-sample predictability) may not truly reflect the potential of realizing forecast gains when the USNFP is captured in the output growth projections (see Rapach and Zhou 2013). In implementing this objective and for the purpose of robustness checks, we opt for two different output measures (with Gross Domestic Product growth being used for the main analysis and growth in Industrial Production Index for robustness check) while we also account for relevant control variables such as oil (see, Cologni and Manera 2009; Ratti and Vespignani 2016; Koirala and Ma 2020, among others) and economic policy uncertainty index (see, Lensink et al. 1999; Kiesen 2013; Jovanovic and Ma 2021, among others) given their potential impact on the level of real economic activities. Finally, we employ the predictability approach of Westerlund and Narayan (2012, 2015) which accommodates salient features of both the predicted and predictor series in the estimation process, thus, improving the forecast outcomes.Footnote 3

Lending credence to other studies on employment and growth nexus (see for example, Clancy 2009; Goulas and Zervoyianni 2018), our results show that USNFP predicts the output growth pretty well given the positive and statistical significance of the predictor parameter with or without controlling for oil price return and rate of change in the U.S. Economic Policy Uncertainty. This shows that the growth in output in the U.S. is a reflection of favourable conditions in the labour market. Similarly, our out-of-sample forecast evaluation results attest to the predictive value of a USNFP-based model in the output growth forecasts. Following this brief introduction, the remainder of this paper is structured as follows. Section II offers information about data and methodology while Section III describes the results. Section IV concludes.

2 Data and methodology

2.1 Data

Our datasets consist of quarterly Gross Domestic Product (GDP) and the U.S. Nonfarm Payroll—USNFP over the period of 1947 and 2021 whose scope is governed by the available start date for GDP. The former is a measure of output growth as it covers all the sectors in an economy while the latter is a measure of active workers in the United States which excludes proprietors, private household employees, unpaid volunteers, farm employees, and the unincorporated self-employed). For the purpose of robustness, we also utilize data on U.S. Industrial Production Index (IPI) and since this variable is available at monthly frequency, we use the same for the empirical analysis. This additional analysis is crucial for two reasons: (i) we are able to test if the outcome will be sensitive to alternative measures of output growth (note that in standard growth literature involving a higher frequency than what is available for GDP growth, IPI growth is often used as a measure of real economic activity)Footnote 4; and (ii) we are also able to use a higher frequency data than that of GDP growth since monthly IPI data is available. We also account for two control variables namely oil price return and rate of change in U.S. economic policy uncertainty index (U.S. EPU).Footnote 5 The latter only starts from 1985 and therefore, the estimation period is resized accordingly. All the variables are obtained from fred.stlouisfed.org which is freely accessible.

We render some preliminary statistics to understand the salient features of the series and for brevity, we limit this analysis to the variables of interest (GDP growth—GDP_g, IPI growth—IPI_g and the percentage change in USNFP − ΔUSNFP). We focus on basic descriptive statistics such as the mean, standard deviation, skewness, kurtosis as well as a measure of unit root test of for relevant variables. From the Table 1, all the variables have a positive growth/return, with GDP_g and ΔUSNFP having the highest and lowest growth averages, respectively. Similarly, all the variables are negatively skewed while the kurtosis statistics suggests that all the variables in their growth/return forms are leptokurtic, denoting that they are heavily tailed. Furthermore, the statistical significance at one percent, of our ADF unit root test indicates that all our variables are stationary at level.

Table 1 Summary statistics and unit root tests

Finally, on the co-movement between the measures of output growth and growth in USNFP, Figs. 1 and 2 below show a positive relationship between output growth (whether GDP_g or IPI_g) and ΔUSNFP. This suggests that an increase in USNFP, a measure of employment, brings about an increase in GDP_g and IPI_g, both of which are a measure of output growth. This connection between employment and output growth is well entrenched in the modern theory of employment albeit without evidence for its out-of-sample predictive value (see Wilson 1960; Basu and Foley 2013). We further employ a formal test to establish the in-sample cointegration between the two variables using the familiar Bounds cointegration test and the results are presented in the appendix (see Tables 5 and 6). We are able to offer a formal evidence for the co-movement between USNFP and output growth as the null of hypothesis of no cointegration is rejected at all the conventional levels of statistical significance. In the next section, we extend the analysis to the out-of-sample predictability of a USNFP-based output growth model in order to offer a more insightful policy implications of our findings.

Fig. 1
figure 1

Co-movement of percentage changes in U.S. Nonfarm Payroll (\(\Delta\) USNP) and GDP growth (GDP_g)

Fig. 2
figure 2

Co-movement of percentage changes in U.S. Nonfarm Payroll (\(\Delta\) USNP) and IPI growth (IPI_g)

Table 2 Predictability results for output growth

3 Methodology

Relying on the modern theory of employment, where the argument of increased utilization of any resource leads to rise in production of physical goods is presented (see Wilson 1960), we construct a predictive model in order to establish the predictive value of USNFP in the out-of-sample predictability of output growth:

$$Growth_{t} = \alpha + \beta \Delta USNFP_{t - 1} + Control_{t - 1} + \varepsilon_{t}$$
(1)

where \(Growth_{t}\) is the output growth, \(\Delta USNFP\) is the percentage change in US Nonfarm Payroll, a measure of active workers in employment, all computed as 100*\(\Delta\) log \(({y}_{t})\) where \({y}_{t}\) is the level series; \(\alpha\) being the intercept; \(Control\) represents the control variables namely oil return computed as log return of oil price using West Texas Intermediate crude oil price, and rate of change in U.S. Economic Policy Uncertainty while \(\varepsilon_{t}\) is the zero mean idiosyncratic error term. Similarly, \(\beta\) shows the response of growth in output to employment growth.Footnote 6 Furthermore, to resolve the issue of conditional heteroscedasticity effect in the error term, we pre-weight all the data with the inverse of the standard deviation \({1 \mathord{\left/ {\vphantom {1 {\hat{\sigma }_{t}^{2} }}} \right. \kern-\nulldelimiterspace} {\hat{\sigma }_{t}^{2} }}\) obtained from a typical GARCH-type model and thereafter estimate the resulting equation with the OLS (see Westerlund and Narayan 2012, 2015).

Finally, we extend our predictive model as presented in Eq. (1) to include the out-of-sample period and, thereafter, we compare the forecast performance of the USNFP-based model (for both with and without control) with a benchmark model (historical average), using the relative root mean square error (RRMSE) and the Clark and West (2007) [CW] test. The RRMSE is computed as the ratio of RMSE of the USNFP-based model in Eq. (1) to that of the benchmark model, such that a value less (greater) than one is considered to indicate superior (inferior) performance of the former over the latter. The CW test suitable for nested models is a one-sided test and it is used to determine the difference in the forecast errors of the two competing models where the null hypothesis of a zero coefficient is rejected if the t-statistic is greater than + 1.282 (for a one sided 0.10 test), + 1.645 (for a one sided 0.05 test), and + 2.00 for 0.01 test (for a one sided 0.01 test). The forecast analysis is rendered for multiple out-of-sample (i.e., 1-month, 6-month, 12-month and 24-month; and 1- quarter, 4-quarter, 8-quarter and 12-quarter) ahead forecast horizons, while the 75:25 data split is respectively used for the in-sample and out-of-sample forecasts.

4 Results and discussion

4.1 Main findings

We begin the discussion of results by presenting the in-sample predictability of U.S. Nonfarm Payroll (USNFP) for GDP growth where we are able to determine the direction of relationship between the two variables (see Table 2 for the results with and without the control variables). Expectedly, we find evidence of a positive and statistically significant relationship between USNFP and GDP growth indicating that an increase in the non-farm labour market improves the productivity of the U.S. economy. In other words, when the USNFP, a measure of U.S. labour market condition, rises, the GDP growth in the U.S. follows suit. This supports our hypothesis which hinges on the modern theory of employment—postulating a positive relationship between the employment and growth. We also offer additional results by extending the predictive model of GDP growth to capture other important determinants of growth such as oil and policy uncertainty indices based on the U.S. Economic Policy Uncertainty. These additional predictors are distinctly captured in the regression models and the results are also embedded in Table 2 with those that do not account for the control variables for easy comparison. We observe that the sign and significance of the USNFP remain the same even after controlling for other predictors. In other words, the results are robust to alternative specifications for the growth model.

Consequently, we extend our analysis to include the out-of-sample forecast performance of our proposed USNFP-based growth model since the presence of in-sample predictability between the two variables does not necessarily guarantee out-of-sample forecast gains. As noted in the methodology and for the purpose of emphasis, we state again how the out-of-sample forecast is carried out. We conduct this analysis by comparing the out-of-sample forecast performance of the USNFP-based growth model with that of the benchmark (historical average) model using the relative root mean square error and Clark and West (2007) test. The results of the relative root mean square error presented in Table 3 whose values must be less than unity for the USNFP-based model to outperform the benchmark (historical average) model consistently favour the former. The difference in the forecast errors between these two competing models is evaluated using the Clark and West (2007) test and the results presented in Table 4 affirm the superiority of the USNFP-based growth model over the model that ignores the predictive value of USNFP in the out-of-sample forecast of U.S. GDP growth. As previously noted, we also consider the IPI growth for robustness purpose. We further note in the previous section that this consideration is important for two reasons. First, we are able to test if the outcome will be sensitive to alternative measures of output growth (note that in standard growth literature involving a higher frequency than what is available for GDP growth, IPI growth is often used as a measure of real economic activity). Second, we are also able to use a higher frequency data than that of GDP growth since monthly IPI data is available. The in-sample predictability results obtained for the IPI growth are also embedded in Table 2 and interestingly, the magnitudes and signficance of the predictor series (USNFP) for the alternative specification are similar to that of GDP growth, thus attesting to the robustness of the predictability outcomes. This conclusion about the similarity in the results of both GDP growth and IPI growth is also observed for the out-of-sample predictability analysis (se Tables 3 and 4). Thus, regardless of the choice of output growth governed by data frequency, the predictive value of U.S. Non-Farm Payroll in output growth forecast is significant both for the in-sample and out-of-sample periods. Our results also align well with the findings of Course (2014) and Goulas and Zervoyianni (2018), although these studies are limited to in-sample predictability (that is, impact analysis).

Table 3 Forecast evaluation using relative RMSE
Table 4 Forecast evaluation using Clark and West test

Several policy implications can be discerned from the study outcomes and we highlight a number of them as follows. First, the evidence here supports a strong connection between the labour market and the real sector of the economy (since industrial production index as used in this study is a good measure of real sector productivity) and it particularly suggests that an improvement in the labour market is an indication of a vibrant real sector which equally reverberates across the larger economy, thus translating into higher economic growth (judging by the significantly positive connection between USNFP and GDP growth—a measure of aggregate productivity growth which is larger in scope than the IPI growth). Second and flowing from the first implication, a more accurate forecast of U.S. output growth can be achieved with the inclusion of USNFP among other relevant predictors. This outcome is useful in fiscal budgeting where output growth forecast serves as one of the benchmarks for its implementation and the more realistic is the forecast, the higher the development outcomes associated with such a budget. Finally, given the strategic position of the U.S. in the global economy, information about its growth forecast is not only used by the domestic economy but also often required by various national and regional governments, multilateral agencies and investors whose economic and financial conditions are directly or indirectly linked with the U.S. economy. In sum, the need to constantly revise the U.S. growth models to improve their forecasts is important from both investment and policy perspectives and we hope our study offers a useful guide in this regard.

5 Conclusion

Exploring the predictive prowess of USNFP—a measure of the number of active United States workers—for output growth cannot have a better timing than now with the increasing concern to revive the global economy from the grip of coronavirus pandemic. Unlike other studies that have explored the effect of output growth on employment, we show that employment equally possesses some predictive contents for output growth. Exploiting the information contained in the USNFP, we find that the latter does not only predict output growth for the U.S., but also have a positive effect on it, confirming the a priori as espoused in the modern theory of employment. In other words, accounting for the predictive value of labour market dynamics in the U.S. may improve the accuracy of output growth projections. This shows that the growth in output in the U.S. mirrors a favorable condition in the labour market. Thus, policy directions geared towards strengthening the labour market have ripple effects on the macro economy and more importantly, the labour market dynamics have predictive value that can serve as a barometer for measuring economic progress and by extension output growth. It would be nice to see how the U.S. labour market connects with the financial markets subject to data availability. This should further provide information as regards whether the USNFP can serve as a good measure of economic conditions in the U.S. such that job gains (losses) can be seen to imply improvement (deterioration) in the financial markets.