ABSTRACT
The development of areas such as remote and airborne sensing, location based services, and geosensor networks enables the collection of large volumes of spatial data. These datasets necessitate the wide application of spatial databases. Queries on these geo-referenced data often require the aggregation of isolated data points to form spatial clusters and obtain properties of the clusters. However, current SQL standard does not provide an effective way to form and query spatial clusters. In this paper, we aim at introducing Cluster By into spatial databases to allow a broad range of interesting queries to be posted on spatial clusters. We also provide a language construct to specify spatial clustering algorithms. The extension is demonstrated with several motivating examples.
- P. Bonnet, J. Gehrke, and P. Seshadri. Towards sensor database systems. In MDM '01, pages 3--14, 2001. Google ScholarDigital Library
- M. Ester, H.-P. Kriegel, and J. Sander. Spatial data mining: A database approach. In SSD '97, pages 47--66, 1997. Google ScholarDigital Library
- M. Ester, H.-P. Kriegel, J. Sander, M. Wimmer, and X. Xu. Incremental clustering for mining in a data warehousing environment. In VLDB '98, pages 323--333, 1998. Google ScholarDigital Library
- M. Ester, H.-P. Kriegel, J. Sander, and X. Xu. A density-based algorithm for discovering clusters in large spatial databases with noise. In KDD '96, pages 226--231, 1996.Google Scholar
- S. Guha, R. Rastogi, and K. Shim. Cure: an efficient clustering algorithm for large databases. In SIGMOD '98, pages 73--84, 1998. Google ScholarDigital Library
- ISO/IEC 9075. Database language sql, international standard., 1992.Google Scholar
- C. Li, M. Wang, L. Lim, H. Wang, and K. C.-C. Chang. Supporting ranking and clustering as generalized order-by and group-by. In SIGMOD '07, pages 127--138, 2007. Google ScholarDigital Library
- J. B. MacQueen. Some methods for classification and analysis of multivariate observations. In Proc. of the fifth Berkeley Symposium on Mathematical Statistics and Probability, pages 281--297, 1967.Google Scholar
- S. Madden, M. J. Franklin, J. M. Hellerstein, and W. Hong. The design of an acquisitional query processor for sensor networks. In SIGMOD '03, pages 491--502, 2003. Google ScholarDigital Library
- R. T. Ng and J. Han. Efficient and effective clustering methods for spatial data mining. In VLDB '94, pages 144--155, 1994. Google ScholarDigital Library
- Open GIS Consortium. Open GIS Simple Features Specification for SQL Revision. http://www.opengis.org/public/sfri/sfsqlrev10.pdf, 1998.Google Scholar
- S. Prasher and X. Zhou. Multiresolution amalgamation: dynamic spatial data cube generation. In ADC '04, pages 103--111, 2004. Google ScholarDigital Library
- P. Rigaux, M. Scholl, and A. Voisard. Spatial databases with application to GIS. Morgan Kaufmann Publishers Inc., 2002. Google ScholarDigital Library
- M. Sharifzadeh and C. Shahabi. Supporting spatial aggregation in sensor network databases. In GIS '04, pages 166--175, 2004. Google ScholarDigital Library
- S. Shekhar and S. Chawla. Spatial Databases: A Tour. Prentice Hall, 2003.Google Scholar
- W. Wang, J. Yang, and R. R. Muntz. Sting: A statistical information grid approach to spatial data mining. In VLDB '97, pages 186--195, 1997. Google ScholarDigital Library
- M. Worboys and M. Duckham. Geographic Information Systems: A Computing Perspective (2nd Edition). CRC Press, 2004. Google ScholarDigital Library
- Y. Yao and J. Gehrke. The cougar approach to in-network query processing in sensor networks. SIGMOD Rec., 31(3):9--18, 2002. Google ScholarDigital Library
- T. Zhang, R. Ramakrishnan, and M. Livny. Birch: A new data clustering algorithm and its applications. Data Min. Knowl. Discov., 1(2):141--182, 1997. Google ScholarDigital Library
- X. Zhou, D. Truffet, and J. Han. Efficient polygon amalgamation methods for spatial olap and spatial data mining. In SSD '99, pages 167--187, 1999. Google ScholarDigital Library
Index Terms
- Cluster By: a new sql extension for spatial data aggregation
Recommendations
Getis-Ord's hot- and cold-spot statistics as a basis for multivariate spatial clustering of orchard tree data
We propose a multivariate spatial clustering approach for partitioning orchard data.Data is spatially scaled by Getis-Ord Gi statistic, followed by k-means clustering.Trees are discriminated into spatially homogeneous groups.Feasibility and performance ...
Density-based cluster detection at multiple spatial scales via kullback-leibler divergence of reachability profiles
GeoAI '22: Proceedings of the 5th ACM SIGSPATIAL International Workshop on AI for Geographic Knowledge DiscoveryDensity-based clustering methods are frequently used to define spatial clusters and outliers (noise) for location-only data. Different algorithms for solving this problem emerged over the past few decades, with their main difference being the numerical ...
Spatial clustering for the regionalization of maize cultivation in China and its outlier analysis
Regionalization of crop cultivation is the foundation of large-scale planting and a reasonable layout scheme, but current approaches mainly rely on qualitative analysis and heuristic methods, which cannot meet the progressively challenging demands. In ...
Comments