Elsevier

Journal of Econometrics

Volume 105, Issue 1, November 2001, Pages 185-223
Journal of Econometrics

Yield curve estimation by kernel smoothing methods

https://doi.org/10.1016/S0304-4076(01)00075-6Get rights and content

Abstract

We introduce a new method for the estimation of discount functions, yield curves and forward curves from government issued coupon bonds. Our approach is nonparametric and does not assume a particular functional form for the discount function although we do show how to impose various restrictions in the estimation. Our method is based on kernel smoothing and is defined as the minimum of some localized population moment condition. The solution to the sample problem is not explicit and our estimation procedure is iterative, rather like the backfitting method of estimating additive nonparametric models. We establish the asymptotic normality of our methods using the asymptotic representation of our estimator as an infinite series with declining coefficients. The rate of convergence is standard for one dimensional nonparametric regression. We investigate the finite sample performance of our method, in comparison with other well-established methods, in a small simulation experiment.

Introduction

The term structure of interest rates is a central concept in monetary and financial economics. Prices of fixed income securities like bonds, swaps, and mortgage backed bonds (MBB's) are functions of the yield curve, and pricing of derivatives also depends on the yield curve. The spread between long and short term interest rates carries information about the level of future interest rates, see for example Campbell and Shiller (1991) and Engsted and Tanggaard (1995). The slope of the yield curve has frequently been used in empirical studies as a predictor of future inflation and national incomes, see Frankel and Lown (1994) and Estrella and Mishkin (1998) for example. Therefore, estimation of yield curves has had a long tradition among financial researchers and practitioners. See Campbell et al. (1997) for further discussion.

A fundamental problem is that the yields to maturity on coupon bonds are not directly comparable between bonds with different maturities or coupons. Thus, there is a need for a standardized way of measuring the term structure of interest rates. One such standard is the yield curve of zero-coupon bonds issued by sovereign lenders.

The construction of this yield curve poses several problems for applied research. First, many governments do not issue longer term (i.e., greater than 1–2 years) zero-coupon bonds. Hence the yield curve must be inferred from other instruments. A simple solution can be derived from the law of one price by assuming the absence of arbitrage. Arbitrage in the bond market will cause the price p of any bond (coupon or zero) with payments b(τj) at time τj to be equal to the discounted value of the future cash flow π=∑j=1mb(τj)d(τj), where the discount factor is d(τj) at time τj. The future income stream, b(τ1),…,b(τm), is assumed known and non-random. The second problem is that, in practice, small pricing errors perhaps due to non-synchronous trading, taxation, illiquidity, and bid–ask spreads necessitate adding an error term to π. The error term should be sufficiently small to ensure that they do not represent (gross) violations of the law of one price (no-arbitrage condition).1

The statistical problem we address is to estimate the function d(·) from a sample of coupon paying bonds. Note that, based on a continuous time approximation, we have d(t)=exp(−ty(t)), where y(t) is the yield curve, and y(t)=exp(−0tf(s)ds), where f(t) is the forward curve, see Anderson et al. (1996, pp. 12–13). Both of these relationships are invertible, so that knowing d is equivalent to knowing y or f.

Following the seminal work of McCulloch (1971), the standard approach to estimation here is to assume a parametric specification for d(t) or y(t) or f(t) and to use linear or nonlinear least squares to estimate the unknown parameters. For example, McCulloch 1971, McCulloch 1975, Shea (1984) use regression splines for d(t), Chambers et al. (1984) use polynomials for y(t), Vasicek and Fong (1982) use exponential splines for d(t), while Nelson and Siegel (1988) propose powers of exponentials for y(t). An approach based on linear programming methods has been suggested by Schaefer (1981). If the specification is considered parametric, i.e., to be a complete and correct representation of the functions of interest, i.e., the mean, then standard asymptotic theory can be used to derive the limiting distribution of the estimator and to justify confidence intervals obtained from this. However, a number of these authors are arguing against adherence to any fixed model and really are viewing the problem as being nonparametric. Some recent studies by Fisher et al. (1995) and Tanggaard (1997) have taken this line. When we view the estimation problem as nonparametric, there is little existing theory regarding the distribution of the estimators; for example, no one has established the asymptotic distribution of the spline estimates discussed above.

We adopt a nonparametric approach in which we do not a priori specify the functional form of the discount function or forward curve. We shall suppose that the discount function d(·) is a continuous and indeed smooth function of time to maturity. Although this is not guaranteed by purely arbitrage arguments, it does seem plausible. We propose a new class of methods for estimating d(·) based on perhaps the most central of all smoothing methods, the kernel method. The flexibility of our method is very important in practical applications because parametric estimates are often flawed by specification biases. It is not immediately obvious how to estimate the function d(·) by kernel methods, since this function affects the mean function indirectly through a convolution with the payment function. We first interpret the function d as the solution of some population mean squared error criterion. We then smooth the sample version of this to obtain an empirical criterion function, which is regular enough to provide consistent estimates. Several versions of the localization are possible including local constant and local linear [which has some well-known advantages in other contexts, see Tsybakov (1986) and Fan (1992)]. It turns out that our methods do not generally have explicit solutions, i.e., our estimator is defined as the solution of a linear integral equation. In practice, our solutions are defined through the method of successive approximations.2 We also give a ‘backfitting’ interpretation to our procedure, as in Opsomer and Ruppert (1997) and Mammen et al. (1999). We establish the convergence of our iterative scheme and establish the asymptotic properties of the estimator. We obtain a representation of our estimator as an infinite series with declining coefficients, which thereby provides its asymptotic distribution—it is asymptotically normal at the standard rate of convergence for one-dimensional kernel regression. The asymptotic distribution of the implied estimators of y(t) and f(t) can be easily obtained by the delta method. Our regularity conditions are ‘high level’, but we show how they are satisfied in some leading cases. We also exploit the relationships with the yield curve and forward curve to suggest alternative methods, thus we write d(·)=ψ(θ(·)) for some known function ψ , making θ now the object of estimation. The purpose of this is to give some added flexibility and/or to enforce consistency with theory. For example, by taking d(t)=exp(−ty(t)) we can directly impose the restrictions that d(0)=1 and d(t)>0 for all t, at the same time we are directly estimating the yield curve itself.

We point out that the estimation problem is similar to that considered in Engle et al. (1986) in which electricity demand over a billing period is modelled as a sum of individual daily demands each determined by temperature on the day concerned. They used splines, which effectively parameterize the function d and make the estimation problem standard nonlinear regression. They did not provide any asymptotic theory to justify their approach, at least not for the pointwise distribution of the nonparametric part. A similar estimation problem occurs quite widely with grouped data. For example, Chesher (1997) estimates the individual nutrient intake–age relationship from household level intake and individual characteristics like age. Again, he used splines but did not provide any justification for the validity of his method. A related problem arises in nonparametric simultaneous equations (Newey and Powell, 1988) and in estimating solutions of integral equations (Wahba, 1990; Nychka et al., 1984; O'Sullivan, 1986). See also Hausman and Newey (1995) for a related problem involving differential equations. With minor modifications we can provide new estimators in all these situations and find their asymptotic distribution.

In Section 2 we discuss (for reasons of completeness) smoothing of pure discount bonds. In Section 3 we present our new methods for smoothing the yield curve. In Section 3.1 we present the local constant version of our estimate, i.e., the object of interest is the discount function, while Section 3.2 gives the local linear extension. In Section 3.3 we describe the local constant exponential version and its one-step approximation. We present the asymptotic properties of our methods in Section 4. In Section 5 we provide a small simulation study that compares our method with some alternatives. Proofs are given in the appendix. In the sequel all integrals are Lebesgue integrals, and run from −∞ to +∞ unless otherwise stated.

Section snippets

Smoothing of pure discount bond yields

In an efficient bond market, $1 delivered at some future date τ has one, and only one, price, d. The function d(τ), giving the discount factor as a function of τ, is called the discount function. In practice small pricing errors due to rounding-off, tax effects, and inefficiencies distort the pricing mechanism. Therefore, the discount function must be derived from a sample of noisy zero coupon bond prices, p1,…,pn with times to maturity τ1⩽⋯⩽τn. We shall not require that the times be equally

Smoothing of coupon bonds

As discussed in Section 2, most bond markets do not posses zero coupon bonds for useful spans of maturities. The discount function must therefore be extracted from quotations of coupon bond data instead. Coupon bonds generate several payments at future dates, and in an efficient bond market, the present value of these future payments should, apart from a small error, be equal to the trading price, pi. As in Section 2, the sample consists of n bonds with quoted prices p1,…,pn. Furthermore, bi(τij

Asymptotic properties

Our main results are for the local constant and local linear discount function smoothing methods. We also provide a result for the one-step local exponential methods. As we show, the asymptotic variances of these procedures described above are identical (when estimating the same quantity), although the bias is certainly influenced by the chosen functional form.10

Simulation study

In this section we investigate the properties of the kernel yield curve estimator in a numerical example. We study the performance of the local constant exponential (LCE) and local linear exponential (LLE) estimators, and compare them with several well-known competitors.

We generated the data using the Nelson–Siegel yield curve model (Nelson and Siegel, 1987). This model has been in several empirical studies of term structure estimation methods (Bliss, 1996; Dahlquist and Svensson, 1996). We

Concluding comment

One nice feature of our projection approach to estimation is that it has a sensible intepretation under misspecification; we are finding the function d that is closest to the price vector in the chosen metric.

Acknowledgements

O. Linton was supported by the National Science Foundation, the Danish Social Science Research Council, and the North Atlantic Treaty Organization. He would like to thank Frank Diebold and two referees, Olaf Bunke, Greg Connor, Stewart Hodges, Andrew Jeffrey, Thong Nguyen, and Peter Phillips for helpful comments. E. Mammen was supported by the Deutsche Forschungsgemeinschaft, Sonderforschungsbereich 373 “Quantifikation und Simulation Ökonomischer Prozesse”, Humboldt-Universität zu Berlin and

References (55)

  • Y. Amihud et al.

    Liquidity, maturity, and the yields on U.S. Treasury securities

    The Journal of Finance

    (1991)
  • N. Anderson et al.

    Estimating and Interpreting the Yield Curve.

    (1996)
  • P.J. Bickel et al.

    Efficient and Adaptive Estimation for Semiparametric Models.

    (1993)
  • J.Y. Campbell et al.

    Yield spreads and interest rate movements: a birds eye view

    Review of Economic Studies

    (1991)
  • J.Y. Campbell et al.

    The Econometrics of Financial Markets.

    (1997)
  • D.R. Chambers et al.

    A new approach to estimation of the term structure of interest rates

    Journal of Financial and Quantitative Analysis

    (1984)
  • A. Chesher

    Diet revealed: semiparametric estimation of nutrient intake–age relationships

    Journal of the Royal Statistical Society A

    (1997)
  • T.G. Conley

    GMM estimation with cross-sectional dependence

    Journal of Econometrics

    (1999)
  • M. Dahlquist et al.

    Estimating the term structure of interest rates for monetary policy analysis

    Scandinavian Journal of Economics

    (1996)
  • R.F. Engle et al.

    Semiparametric estimates of the relation between weather and electricity sales

    Journal of the American Statistical Association

    (1986)
  • T. Engsted et al.

    The predictive power of yield spreads for future interest rates: evidence from the Danish term structure

    Scandinavian Journal of Economics

    (1995)
  • Y.H. Eom et al.

    Coupon effects and the pricing of Japanese government bonds: an empirical analysis

    Journal of Fixed Income

    (1998)
  • A. Estrella et al.

    Predicting U.S. recessions: financial variables as leading indicators

    Review of Economics and Statistics

    (1998)
  • J. Fan

    Design-adaptive nonparametric regression

    Journal of the American Statistical Association

    (1992)
  • Fisher, M.E., Nychka, D., Zervos, D., 1995. Fitting the term structure of interest rates with smoothing splines....
  • R.A. Fisher

    Theory of statistical estimation

    Proceedings of the Cambridge Philosophical Society

    (1925)
  • J.A. Frankel et al.

    An indicator of future inflation extracted from the steepness of the interest rate yield curve along its entire length

    Quarterly Journal of Economics

    (1994)
  • Gozalo, P.L., Linton, O.B., 1998. Local nonlinear least squares estimation: using parametric information...
  • W. Härdle

    Applied Nonparametric Regression.

    (1990)
  • W. Härdle et al.

    Comparing nonparametric versus parametric regression fits

    Annals of Statistics

    (1993)
  • T. Hastie et al.

    Generalized Additive Models.

    (1990)
  • J.A. Hausman et al.

    Nonparametric estimation of exact consumers surplus and deadweight loss

    Econometrica

    (1995)
  • D. Heath et al.

    Bond pricing and the term structure of interest rates: a new methodology for contingent claims valuation

    Econometrica

    (1992)
  • J. Hull et al.

    Pricing interest rate derivative securities

    Review of Financial Studies

    (1990)
  • J.V. Jordan

    Tax effects in term structure estimation

    Journal of Finance

    (1984)
  • L.V. Kantorovich et al.

    Functional Analysis in Normed Spaces.

    (1964)
  • R.W. Klein et al.

    An efficient semiparametric estimator for binary choice models

    Econometrica

    (1994)
  • Cited by (38)

    • Reconstructing the yield curve

      2021, Journal of Financial Economics
      Citation Excerpt :

      By contrast, parametric methods including the ones in the literature struggle to capture both features and need to make compromises. Our framework builds on the work of Linton et al. (2001), who introduced a non-parametric kernel-smoothing approach in estimating the yield curve. In particular, the authors focus on the asymptotic distribution of the yield curve estimate when it is assumed to be locally linear.

    • Estimation of a nonparametric model for bond prices from cross-section and time series information

      2021, Journal of Econometrics
      Citation Excerpt :

      Our approach has an advantage in comparison with other methods in terms of residual analysis and model diagnostics; we illustrate this aspect in Section 6.4. We use kernel-based nonparametric methodology to estimate the unknown discount function, generalizing Linton et al. (2001). We establish the properties of our procedure when both the cross-section and time series dimension are large.

    • Calendar effect and in-sample forecasting

      2021, Insurance: Mathematics and Economics
    • Statistical Modeling Using Local Gaussian Approximation

      2021, Statistical Modeling Using Local Gaussian Approximation
    • Estimating Chinese Treasury yield curves with Bayesian smoothing splines

      2018, Econometrics and Statistics
      Citation Excerpt :

      Other methods are used to estimate yield curves as well. Linton et al. (2001) and Jeffrey et al. (2006) use the kernel smoothing method (LMNT kernel smoothing, hereafter) to fit the discount curve and the zero-coupon yield curve, respectively. Fama and Bliss (1987) propose the Famma-Bliss method by using an iterative procedure called bootstrapping to extract a piecewise constant forward rate curve through exactly pricing selected bonds.

    View all citing articles on Scopus
    View full text