Area-to-point parameter estimation with geographically weighted regression

Murakami, Daisuke; Tsutsumi, Morito

doi:10.1007/s10109-015-0212-8

Area-to-point parameter estimation with geographically weighted regression

Original Article
Published: 31 March 2015

Volume 17, pages 207–225, (2015)
Cite this article

Journal of Geographical Systems Aims and scope Submit manuscript

Daisuke Murakami¹ &
Morito Tsutsumi²

735 Accesses
11 Citations
Explore all metrics

Abstract

The modifiable areal unit problem (MAUP) is a problem by which aggregated units of data influence the results of spatial data analysis. Standard GWR, which ignores aggregation mechanisms, cannot be considered to serve as an efficient countermeasure of MAUP. Accordingly, this study proposes a type of GWR with aggregation mechanisms, termed area-to-point (ATP) GWR herein. ATP GWR, which is closely related to geostatistical approaches, estimates the disaggregate-level local trend parameters by using aggregated variables. We examine the effectiveness of ATP GWR for mitigating MAUP through a simulation study and an empirical study. The simulation study indicates that the method proposed herein is robust to the MAUP when the spatial scales of aggregation are not too global compared with the scale of the underlying spatial variations. The empirical studies demonstrate that the method provides intuitively consistent estimates.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A modification to geographically weighted regression

Article Open access 31 March 2017

Yin-Yee Leong & Jack C. Yue

Incorporating geography into a new generalized theoretical and statistical framework addressing the modifiable areal unit problem

Article Open access 27 March 2019

M. Tuson, M. Yap, … D. Whyatt

Scale, Aggregation, and the Modifiable Areal Unit Problem

Notes

As an exception, Vichiensan et al. (2006) discuss a COSP using GWR. However, different from our study, they do not consider the volume-preserving property (Lam 1983) that the aggregation of disaggregated variables must be equal to the actual aggregate variable values (see Sect. 3.1).
The effective range of the Gaussian function is defined by $ r\sqrt 3 $.

References

Berrocal VJ, Gelfand AE, Holland DM (2010) A spatio-temporal downscaler for outputs from numerical models. J Agric Biol Environ Stat 15(2):176–197
Article Google Scholar
Berrocal VJ, Gelfand AE, Holland DM (2012) Space–time data fusion under error in computer model output: an application to modeling air quality. Biometrics 68(3):837–848
Article Google Scholar
Butkiewicz T, Ross K (2010) Alleviating the modifiable areal unit problem within probe-based geospatial analyses. Comput Graph Forum 29(3):923–932
Article Google Scholar
Cressie N (1993) Statistics for spatial data. Wiley, New York
Cressie N (1996) Change of support and the modifiable areal unit problem. Geogr Syst 3(2–3):159–180
Google Scholar
Farber S, Páez A, Volz E (2009) Topology and dependency tests in spatial and network autoregressive models. Geogr Anal 41(2):158–180
Article Google Scholar
Finley AO (2011) Comparing spatially-varying coefficients models for analysis of ecological data with non-stationary and anisotropic residual dependence. Methods Ecol Evol 2(2):143–154
Article Google Scholar
Fisher PF, Langford M (1996) Modeling sensitivity to accuracy in classified imagery: a study of areal interpolation. Prof Geogr 48(3):299–309
Article Google Scholar
Fotheringham S, Brunsdon C, Charlton M (2002) Geographically weighted regression: the analysis of spatially varying relationships. Wiley, New York
Google Scholar
Fuentes M, Raftery AE (2005) Model evaluation and spatial interpolation by Bayesian combination of observations with outputs from numerical models. Biometrics 61(1):36–45
Article Google Scholar
Gelfand AE (2010) Misaligned spatial data: the change of support problem. In: Gelfand AE, Diggle PJ, Guttorp P, Fuentes M (eds) Handbook of spatial statistics. CRC Press, Boca Raton, pp 517–539
Chapter Google Scholar
Gelfand AE (2012) Hierarchical modeling for spatial data problems. Spat Stat 1:30–39
Article Google Scholar
Goovaerts P (2008) Kriging and semivariogram deconvolution in the presence of irregular geographical units. Math Geosci 40(1):101–128
Article Google Scholar
Gotway CA, Young LJ (2002) Combining incompatible spatial data. J Am Stat Assoc 97(458):632–648
Article Google Scholar
Gotway CA, Young LJ (2007) A geostatistical approach to linking geographically aggregated data from different sources. J Comput Graph Stat 16(1):115–135
Article Google Scholar
Haining RP, Kerry R, Oliver MA (2010) Geography, spatial data analysis, and geostatistics: an overview. Geogr Anal 42(1):1–31
Article Google Scholar
Harris P, Brunsdon C, Fotheringham AS (2011) Links, comparisons and extensions of the geographically weighted regression model when used as a spatial predictor. Stoch Environ Res Risk Assess 25(2):123–138
Article Google Scholar
Kyriakidis PC (2004) A geostatistical framework for area-to-point spatial interpolation. Geogr Anal 36(3):259–289
Article Google Scholar
Kyriakidis PC, Yoo E-H (2005) Geostatistical prediction and simulation of point values from areal data. Geogr Anal 37(2):124–151
Article Google Scholar
Lam N-S (1983) Spatial interpolation methods: a review. Am Cartogr 10(2):129–149
Article Google Scholar
Lee SJ, Yeatts KB, Serre ML (2009) A Bayesian maximum entropy approach to address the change of support problem in the spatial analysis of childhood asthma prevalence across North Carolina. Spat Spatiotemporal Epidemiol 1(1):49–60
Article Google Scholar
LeSage JP, Pace RK (2009) Introduction to spatial econometrics. Chapman and Hall/CRC, Boca Raton
Book Google Scholar
Müller WG (1998) Collecting spatial data: optimum design of experiments for random fields. Physica-Verlag, Heidelberg
Nagle NN, Sweeney SH, Kyriakidis PC (2011) A geostatistical linear regression model for small area data. Geogr Anal 43(1):38–60
Article Google Scholar
Odoi A, Martin W, Michel P, Holt J, Middleton D, Wilson J (2003) Geographical and temporal distribution of human giardiasis in Ontario Canada. Int J Health Geogr 2:5
Article Google Scholar
Openshaw S (1984) The modifiable areal unit problem. Geo Books, Norwich
Google Scholar
Openshaw S, Rao L (1994) Algorithms for reengineering 1991 Census geography. Environ Plan A 27(3):425–446
Article Google Scholar
Openshaw S, Taylor P (1979) A Million or so correlation coefficients: three experiments on the modifiable areal unit problem. In: Wrigley N (ed) Statistical methods in the spatial sciences. Pion, London, pp 127–144
Google Scholar
Páez A, Uchida T, Miyamoto K (2002) A general framework for estimation and inference of geographically weighted regression models: 1. Location-specific kernel bandwidths and a test for locational heterogeneity. Environ Plan A 34(4):733–754
Article Google Scholar
Páez A, Long F, Farber S (2008) Moving window approaches for hedonic price estimation: an empirical comparison of modeling techniques. Urban Stud 45(8):1565–1581
Article Google Scholar
Páez A, Farber S, Wheeler D (2011) A simulation-based study of geographically weighted regression as a method for investigating spatially varying relationships. Environ Plan A 43(12):2992–3010
Article Google Scholar
Reynolds HD (1998) The modifiable areal unit problem: empirical analysis by statistical simulation. Thesis, University of Toronto
Sahu SK, Gelfand AE, Holland DM (2010) Fusing point and areal level space–time data with application to wet deposition. J R Stat Soc C Appl Stat 59(1):77–103
Article Google Scholar
Schabenberger O, Gotway CA (2005) Statistical methods for spatial data analysis. Chapman and Hall/CRC, Boca Raton
Google Scholar
Siffel C, Strickl MC, Gardner BR, Kirby RS, Correa A (2006) Role of geographic information systems in birth defects surveillance and research. Birth Defects Res A Clin Mol Teratol 76(11):825–833
Article Google Scholar
Swift A, Liu L, Uber J (2008) Reducing MAUP bias of correlation statistics between water quality and GI illness. Comput Environ Urban Syst 32(2):134–148
Article Google Scholar
Tagashira N, Okabe A (2002) The modifiable areal unit problem in a regression model whose independent variable is a distance from a predetermined point. Geogr Anal 34(1):1–19
Google Scholar
Tranmer M, Steel D (1998) Using census data to investigate the causes of the ecological fallacy. Environ Plan A 30(5):817–831
Article Google Scholar
Vichiensan V, Páez A, Kawai K, Miyamoto K (2006) Nonstationary spatial interpolation method for urban model development. Transp Res Rec 1977:103–111
Article Google Scholar
Ware R, Lad F (2003) Approximating the distribution for sums of products of normal variables. University of Canterbury, Christchurch
Google Scholar
Wheeler D (2007) Diagnostic tools and a remedial method for collinearity in geographically weighted regression. Environ Plan A 39(10):2464–2481
Article Google Scholar
Wheeler D (2009) Simultaneous coefficient penalization and model selection in geographically weighted regression: the geographically weighted lasso. Environ Plan A 41(3):722–742
Article Google Scholar
Wheeler D, Páez A (2010) Geographically weighted regression. In: Fischer MM, Getis A (eds) Handbook of applied spatial analysis. Springer, Berlin, pp 461–486
Chapter Google Scholar
Wheeler D, Tiefelsdorf M (2005) Multicollinearity and correlation among local regression coefficients in geographically weighted regression. J Geogr Syst 7(2):161–187
Article Google Scholar
Wong D (2009) The modifiable areal unit problem (MAUP). In: Fotheringham AS, Rogerson PA (eds) The SAGE handbook of spatial analysis. SAGE, London, pp 105–124
Google Scholar
Young LJ, Gotway CA (2007) Linking spatial data from different sources: the effects of change of support. Stoch Environ Res Risk Assess 21(5):589–600
Article Google Scholar
Young LJ, Gotway CA, Kearney G, Duclos C (2009a) Assessing uncertainty in support-adjusted spatial misalignment problems. Commun Stat Theory Methods 38(16–17):3249–3264
Article Google Scholar
Young LJ, Gotway CA, Yang J, Kearney G, Duclos C (2009b) Linking health and environmental data in geographical analysis: it’s so much more than centroids. Spat Spatiotemporal Epidemiol 1(1):73–84
Article Google Scholar

Download references

Acknowledgments

We thank Dr. Haoying Wang (Pennsylvania State University, United States) for his helpful comments at the VII World Conference of the Spatial Econometrics Association. This work was supported by two Grants-in-Aid for Scientific Research from JSPS (Research Project Numbers: 23360219 and 24241053). Further, this paper fulfills the criteria of the Global Climate Risk Management Strategies (S10) project.

Author information

Authors and Affiliations

Center for Global Environmental Research, National Institute for Environmental Studies, 16-2 Onogawa, Tsukuba-shi, Ibaraki, 305-8506, Japan
Daisuke Murakami
Faculty of Engineering, Information and Systems, University of Tsukuba, 1-1-1 Tennodai, Tsukuba-shi, Ibaraki, 305-8573, Japan
Morito Tsutsumi

Authors

Daisuke Murakami
View author publications
You can also search for this author in PubMed Google Scholar
Morito Tsutsumi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Daisuke Murakami.

Appendix 1: ATP GWR for the extensive variables

A must be determined by considering the volume-preserving property for the extensive variables. In other words, $ {\bar{\mathbf{y}}} $ = Ay must hold under the condition that the elements in y are extensive variables. For example, suppose that the population in an aggregated unit a is $ \bar{y}_{a} $ and unit a comprises two disaggregated units d and d′; then, $ {\bar{\mathbf{y}}} $ = Ay is expressed as

$$ \begin{aligned} & \bar{y}_{a} = \left[ {\begin{array}{*{20}l} {A_{a,d} } \hfill & {A_{{a,d^{\prime}}} } \hfill \\ \end{array} } \right]\left[ {\begin{array}{*{20}l} {Y_{d} } \hfill \\ {Y_{{d^{\prime}}} } \hfill \\ \end{array} } \right], \\ & Y_{d} + Y_{{d^{{\prime }} }} = A_{a,d} Y_{d} + A_{{a,d^{{\prime }} }} Y_{{d^{{\prime }} }} . \\ \end{aligned} $$

(18)

Equation (6) merely states that $ \bar{y}_{a} = Y_{d} + Y_{{d^{{\prime }} }} $ must equal A _a,d Y _d + A _a,d′ Y _d′. This is satisfied by defining A _d and A _d′ as ones. In general, $ {\bar{\mathbf{y}}} $ = Ay is fulfilled by defining A _a,d as follows:

$$ A_{a,d} = \left\{ {\begin{array}{*{20}l} 1 \hfill & {{\text{if}}\;d \subseteq a} \hfill \\ 0 \hfill & {\text{otherwise}} \hfill \\ \end{array} } \right.. $$

(19)

ATP GWR for the extensive variables is thus defined by defining A _a,d, which was defined in Eq. (7), as shown in Eq. (19). In this case, the variance of the error term is scaled by the number of disaggregated units in each aggregated unit.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Murakami, D., Tsutsumi, M. Area-to-point parameter estimation with geographically weighted regression. J Geogr Syst 17, 207–225 (2015). https://doi.org/10.1007/s10109-015-0212-8

Download citation

Received: 03 October 2013
Accepted: 12 March 2015
Published: 31 March 2015
Issue Date: July 2015
DOI: https://doi.org/10.1007/s10109-015-0212-8

Keywords

JEL Classification

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Area-to-point parameter estimation with geographically weighted regression

Abstract

Access this article

Similar content being viewed by others

A modification to geographically weighted regression

Incorporating geography into a new generalized theoretical and statistical framework addressing the modifiable areal unit problem

Scale, Aggregation, and the Modifiable Areal Unit Problem

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Appendix 1: ATP GWR for the extensive variables

Rights and permissions

About this article

Cite this article

Keywords

JEL Classification

Navigation

Area-to-point parameter estimation with geographically weighted regression

Abstract

Access this article

Similar content being viewed by others

A modification to geographically weighted regression

Incorporating geography into a new generalized theoretical and statistical framework addressing the modifiable areal unit problem

Scale, Aggregation, and the Modifiable Areal Unit Problem

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Appendix 1: ATP GWR for the extensive variables

Appendix 1: ATP GWR for the extensive variables

Rights and permissions

About this article

Cite this article

Share this article

Keywords

JEL Classification

Search

Navigation