Skip to main content
Log in

Area-to-point parameter estimation with geographically weighted regression

  • Original Article
  • Published:
Journal of Geographical Systems Aims and scope Submit manuscript

Abstract

The modifiable areal unit problem (MAUP) is a problem by which aggregated units of data influence the results of spatial data analysis. Standard GWR, which ignores aggregation mechanisms, cannot be considered to serve as an efficient countermeasure of MAUP. Accordingly, this study proposes a type of GWR with aggregation mechanisms, termed area-to-point (ATP) GWR herein. ATP GWR, which is closely related to geostatistical approaches, estimates the disaggregate-level local trend parameters by using aggregated variables. We examine the effectiveness of ATP GWR for mitigating MAUP through a simulation study and an empirical study. The simulation study indicates that the method proposed herein is robust to the MAUP when the spatial scales of aggregation are not too global compared with the scale of the underlying spatial variations. The empirical studies demonstrate that the method provides intuitively consistent estimates.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

Notes

  1. As an exception, Vichiensan et al. (2006) discuss a COSP using GWR. However, different from our study, they do not consider the volume-preserving property (Lam 1983) that the aggregation of disaggregated variables must be equal to the actual aggregate variable values (see Sect. 3.1).

  2. The effective range of the Gaussian function is defined by \( r\sqrt 3 \).

References

  • Berrocal VJ, Gelfand AE, Holland DM (2010) A spatio-temporal downscaler for outputs from numerical models. J Agric Biol Environ Stat 15(2):176–197

    Article  Google Scholar 

  • Berrocal VJ, Gelfand AE, Holland DM (2012) Space–time data fusion under error in computer model output: an application to modeling air quality. Biometrics 68(3):837–848

    Article  Google Scholar 

  • Butkiewicz T, Ross K (2010) Alleviating the modifiable areal unit problem within probe-based geospatial analyses. Comput Graph Forum 29(3):923–932

    Article  Google Scholar 

  • Cressie N (1993) Statistics for spatial data. Wiley, New York

  • Cressie N (1996) Change of support and the modifiable areal unit problem. Geogr Syst 3(2–3):159–180

    Google Scholar 

  • Farber S, Páez A, Volz E (2009) Topology and dependency tests in spatial and network autoregressive models. Geogr Anal 41(2):158–180

    Article  Google Scholar 

  • Finley AO (2011) Comparing spatially-varying coefficients models for analysis of ecological data with non-stationary and anisotropic residual dependence. Methods Ecol Evol 2(2):143–154

    Article  Google Scholar 

  • Fisher PF, Langford M (1996) Modeling sensitivity to accuracy in classified imagery: a study of areal interpolation. Prof Geogr 48(3):299–309

    Article  Google Scholar 

  • Fotheringham S, Brunsdon C, Charlton M (2002) Geographically weighted regression: the analysis of spatially varying relationships. Wiley, New York

    Google Scholar 

  • Fuentes M, Raftery AE (2005) Model evaluation and spatial interpolation by Bayesian combination of observations with outputs from numerical models. Biometrics 61(1):36–45

    Article  Google Scholar 

  • Gelfand AE (2010) Misaligned spatial data: the change of support problem. In: Gelfand AE, Diggle PJ, Guttorp P, Fuentes M (eds) Handbook of spatial statistics. CRC Press, Boca Raton, pp 517–539

    Chapter  Google Scholar 

  • Gelfand AE (2012) Hierarchical modeling for spatial data problems. Spat Stat 1:30–39

    Article  Google Scholar 

  • Goovaerts P (2008) Kriging and semivariogram deconvolution in the presence of irregular geographical units. Math Geosci 40(1):101–128

    Article  Google Scholar 

  • Gotway CA, Young LJ (2002) Combining incompatible spatial data. J Am Stat Assoc 97(458):632–648

    Article  Google Scholar 

  • Gotway CA, Young LJ (2007) A geostatistical approach to linking geographically aggregated data from different sources. J Comput Graph Stat 16(1):115–135

    Article  Google Scholar 

  • Haining RP, Kerry R, Oliver MA (2010) Geography, spatial data analysis, and geostatistics: an overview. Geogr Anal 42(1):1–31

    Article  Google Scholar 

  • Harris P, Brunsdon C, Fotheringham AS (2011) Links, comparisons and extensions of the geographically weighted regression model when used as a spatial predictor. Stoch Environ Res Risk Assess 25(2):123–138

    Article  Google Scholar 

  • Kyriakidis PC (2004) A geostatistical framework for area-to-point spatial interpolation. Geogr Anal 36(3):259–289

    Article  Google Scholar 

  • Kyriakidis PC, Yoo E-H (2005) Geostatistical prediction and simulation of point values from areal data. Geogr Anal 37(2):124–151

    Article  Google Scholar 

  • Lam N-S (1983) Spatial interpolation methods: a review. Am Cartogr 10(2):129–149

    Article  Google Scholar 

  • Lee SJ, Yeatts KB, Serre ML (2009) A Bayesian maximum entropy approach to address the change of support problem in the spatial analysis of childhood asthma prevalence across North Carolina. Spat Spatiotemporal Epidemiol 1(1):49–60

    Article  Google Scholar 

  • LeSage JP, Pace RK (2009) Introduction to spatial econometrics. Chapman and Hall/CRC, Boca Raton

    Book  Google Scholar 

  • Müller WG (1998) Collecting spatial data: optimum design of experiments for random fields. Physica-Verlag, Heidelberg

  • Nagle NN, Sweeney SH, Kyriakidis PC (2011) A geostatistical linear regression model for small area data. Geogr Anal 43(1):38–60

    Article  Google Scholar 

  • Odoi A, Martin W, Michel P, Holt J, Middleton D, Wilson J (2003) Geographical and temporal distribution of human giardiasis in Ontario Canada. Int J Health Geogr 2:5

    Article  Google Scholar 

  • Openshaw S (1984) The modifiable areal unit problem. Geo Books, Norwich

    Google Scholar 

  • Openshaw S, Rao L (1994) Algorithms for reengineering 1991 Census geography. Environ Plan A 27(3):425–446

    Article  Google Scholar 

  • Openshaw S, Taylor P (1979) A Million or so correlation coefficients: three experiments on the modifiable areal unit problem. In: Wrigley N (ed) Statistical methods in the spatial sciences. Pion, London, pp 127–144

    Google Scholar 

  • Páez A, Uchida T, Miyamoto K (2002) A general framework for estimation and inference of geographically weighted regression models: 1. Location-specific kernel bandwidths and a test for locational heterogeneity. Environ Plan A 34(4):733–754

    Article  Google Scholar 

  • Páez A, Long F, Farber S (2008) Moving window approaches for hedonic price estimation: an empirical comparison of modeling techniques. Urban Stud 45(8):1565–1581

    Article  Google Scholar 

  • Páez A, Farber S, Wheeler D (2011) A simulation-based study of geographically weighted regression as a method for investigating spatially varying relationships. Environ Plan A 43(12):2992–3010

    Article  Google Scholar 

  • Reynolds HD (1998) The modifiable areal unit problem: empirical analysis by statistical simulation. Thesis, University of Toronto

  • Sahu SK, Gelfand AE, Holland DM (2010) Fusing point and areal level space–time data with application to wet deposition. J R Stat Soc C Appl Stat 59(1):77–103

    Article  Google Scholar 

  • Schabenberger O, Gotway CA (2005) Statistical methods for spatial data analysis. Chapman and Hall/CRC, Boca Raton

    Google Scholar 

  • Siffel C, Strickl MC, Gardner BR, Kirby RS, Correa A (2006) Role of geographic information systems in birth defects surveillance and research. Birth Defects Res A Clin Mol Teratol 76(11):825–833

    Article  Google Scholar 

  • Swift A, Liu L, Uber J (2008) Reducing MAUP bias of correlation statistics between water quality and GI illness. Comput Environ Urban Syst 32(2):134–148

    Article  Google Scholar 

  • Tagashira N, Okabe A (2002) The modifiable areal unit problem in a regression model whose independent variable is a distance from a predetermined point. Geogr Anal 34(1):1–19

    Google Scholar 

  • Tranmer M, Steel D (1998) Using census data to investigate the causes of the ecological fallacy. Environ Plan A 30(5):817–831

    Article  Google Scholar 

  • Vichiensan V, Páez A, Kawai K, Miyamoto K (2006) Nonstationary spatial interpolation method for urban model development. Transp Res Rec 1977:103–111

    Article  Google Scholar 

  • Ware R, Lad F (2003) Approximating the distribution for sums of products of normal variables. University of Canterbury, Christchurch

    Google Scholar 

  • Wheeler D (2007) Diagnostic tools and a remedial method for collinearity in geographically weighted regression. Environ Plan A 39(10):2464–2481

    Article  Google Scholar 

  • Wheeler D (2009) Simultaneous coefficient penalization and model selection in geographically weighted regression: the geographically weighted lasso. Environ Plan A 41(3):722–742

    Article  Google Scholar 

  • Wheeler D, Páez A (2010) Geographically weighted regression. In: Fischer MM, Getis A (eds) Handbook of applied spatial analysis. Springer, Berlin, pp 461–486

    Chapter  Google Scholar 

  • Wheeler D, Tiefelsdorf M (2005) Multicollinearity and correlation among local regression coefficients in geographically weighted regression. J Geogr Syst 7(2):161–187

    Article  Google Scholar 

  • Wong D (2009) The modifiable areal unit problem (MAUP). In: Fotheringham AS, Rogerson PA (eds) The SAGE handbook of spatial analysis. SAGE, London, pp 105–124

    Google Scholar 

  • Young LJ, Gotway CA (2007) Linking spatial data from different sources: the effects of change of support. Stoch Environ Res Risk Assess 21(5):589–600

    Article  Google Scholar 

  • Young LJ, Gotway CA, Kearney G, Duclos C (2009a) Assessing uncertainty in support-adjusted spatial misalignment problems. Commun Stat Theory Methods 38(16–17):3249–3264

    Article  Google Scholar 

  • Young LJ, Gotway CA, Yang J, Kearney G, Duclos C (2009b) Linking health and environmental data in geographical analysis: it’s so much more than centroids. Spat Spatiotemporal Epidemiol 1(1):73–84

    Article  Google Scholar 

Download references

Acknowledgments

We thank Dr. Haoying Wang (Pennsylvania State University, United States) for his helpful comments at the VII World Conference of the Spatial Econometrics Association. This work was supported by two Grants-in-Aid for Scientific Research from JSPS (Research Project Numbers: 23360219 and 24241053). Further, this paper fulfills the criteria of the Global Climate Risk Management Strategies (S10) project.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Daisuke Murakami.

Appendix 1: ATP GWR for the extensive variables

Appendix 1: ATP GWR for the extensive variables

A must be determined by considering the volume-preserving property for the extensive variables. In other words, \( {\bar{\mathbf{y}}} \) = Ay must hold under the condition that the elements in y are extensive variables. For example, suppose that the population in an aggregated unit a is \( \bar{y}_{a} \) and unit a comprises two disaggregated units d and d′; then, \( {\bar{\mathbf{y}}} \) = Ay is expressed as

$$ \begin{aligned} & \bar{y}_{a} = \left[ {\begin{array}{*{20}l} {A_{a,d} } \hfill & {A_{{a,d^{\prime}}} } \hfill \\ \end{array} } \right]\left[ {\begin{array}{*{20}l} {Y_{d} } \hfill \\ {Y_{{d^{\prime}}} } \hfill \\ \end{array} } \right], \\ & Y_{d} + Y_{{d^{{\prime }} }} = A_{a,d} Y_{d} + A_{{a,d^{{\prime }} }} Y_{{d^{{\prime }} }} . \\ \end{aligned} $$
(18)

Equation (6) merely states that \( \bar{y}_{a} = Y_{d} + Y_{{d^{{\prime }} }} \) must equal A a,d Y d  + A a,d Y d. This is satisfied by defining A d and A d as ones. In general, \( {\bar{\mathbf{y}}} \) = Ay is fulfilled by defining A a,d as follows:

$$ A_{a,d} = \left\{ {\begin{array}{*{20}l} 1 \hfill & {{\text{if}}\;d \subseteq a} \hfill \\ 0 \hfill & {\text{otherwise}} \hfill \\ \end{array} } \right.. $$
(19)

ATP GWR for the extensive variables is thus defined by defining A a,d , which was defined in Eq. (7), as shown in Eq. (19). In this case, the variance of the error term is scaled by the number of disaggregated units in each aggregated unit.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Murakami, D., Tsutsumi, M. Area-to-point parameter estimation with geographically weighted regression. J Geogr Syst 17, 207–225 (2015). https://doi.org/10.1007/s10109-015-0212-8

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10109-015-0212-8

Keywords

JEL Classification

Navigation