Abstract
This chapter discusses challenges and opportunities in remote sensing big data. Three challenges are discussed. They are data complexity, data quality, and infrastructure change. The growth of remote sensing big data also introduces several new opportunities. The discussed changes are single scale to multiscale, on-premise servers to distributed services, data-focused Sensor Web to modeled-simulation Digital Twins, “meteorology” snapshots to “climate” series, isolated case to intertwined “teleconnections,” and one-level system to hierarchical knowledge graph.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Abedin MZ, Guotai C, Hajek P, Zhang T (2022) Combining weighted SMOTE with ensemble learning for the class-imbalanced prediction of small business credit risk. Complex Intell Syst. https://doi.org/10.1007/s40747-021-00614-4
Agustianto K, Destarianto P (2019) Imbalance data handling using neighborhood cleaning rule (NCL) sampling method for precision student modeling. In: 2019 International conference on computer science, information technology, and electrical engineering (ICOMITEE). IEEE, Jember, pp 86–89
Ahmad M, Shabbir S, Raza RA et al (2021) Artifacts of different dimension reduction methods on hybrid CNN feature hierarchy for Hyperspectral Image Classification. Optik 246:167757. https://doi.org/10.1016/j.ijleo.2021.167757
Amani M, Ghorbanian A, Ahmadi SA et al (2020) Google earth engine cloud computing platform for remote sensing big data applications: a comprehensive review. IEEE J Sel Top Appl Earth Obs Remote Sens 13:5326–5350. https://doi.org/10.1109/JSTARS.2020.3021052
Banerjee A, Chen R, Meadows ME et al (2020) An analysis of long-term rainfall trends and variability in the Uttarakhand Himalaya using Google earth engine. Remote Sens 12:709. https://doi.org/10.3390/rs12040709
Benediktsson JA, Wu Z (2021) Distributed computing for remotely sensed data processing [scanning the section]. Proc IEEE 109:1278–1281. https://doi.org/10.1109/JPROC.2021.3094335
Bin Alias MSA, Ibrahim NB, Zin ZBM (2021) Improved sampling data workflow using Smtmk to increase the classification accuracy of imbalanced dataset. Eur J Mol Clin Med 8:2021
Chao G, Luo Y, Ding W (2019) Recent advances in supervised dimension reduction: a survey. Mach Learn Knowl Extr 1:341–358. https://doi.org/10.3390/make1010020
Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP (2002) SMOTE: synthetic minority over-sampling technique. J Artif Intell Res 16:321–357. https://doi.org/10.1613/jair.953
Chawla NV, Lazarevic A, Hall LO, Bowyer KW (2003) SMOTEBoost: improving prediction of the minority class in boosting. In: Lavrač N, Gamberger D, Todorovski L, Blockeel H (eds) Knowledge discovery in databases: PKDD 2003. Springer Berlin Heidelberg, Berlin, Heidelberg, pp 107–119
Chen Z, Luo J, Chen N et al (2019) RFim: a real-time inundation extent model for large floodplains based on remote sensing big data and water level observations. Remote Sens 11:1585. https://doi.org/10.3390/rs11131585
Cheng Q, Oberhänsli R, Zhao M (2020) A new international initiative for facilitating data-driven earth science transformation. Geol Soc Lond Spec Publ 499:225–240. https://doi.org/10.1144/SP499-2019-158
Cravero A, Sepúlveda S (2021) Use and adaptations of machine learning in big data—applications in real cases in agriculture. Electronics 10:552. https://doi.org/10.3390/electronics10050552
Dang B, Li Y (2021) MSResNet: multiscale residual network via self-supervised learning for water-body detection in remote sensing imagery. Remote Sens 13:3122. https://doi.org/10.3390/rs13163122
Deng Z, Sun H, Zhou S et al (2018) Multi-scale object detection in remote sensing imagery with convolutional neural networks. ISPRS J Photogramm Remote Sens 145:3–22. https://doi.org/10.1016/j.isprsjprs.2018.04.003
Deren L, Wenbo Y, Zhenfeng S (2021) Smart city based on digital twins. Comput Urban Sci 1:4. https://doi.org/10.1007/s43762-021-00005-y
Devi D, Biswas S k, Purkayastha B (2017) Redundancy-driven modified Tomek-link based undersampling: a solution to class imbalance. Pattern Recogn Lett 93:3–12. https://doi.org/10.1016/j.patrec.2016.10.006
Ding Y, Yang X, Jin H et al (2021) Monitoring coastline changes of the Malay Islands based on Google earth engine and dense time-series remote sensing images. Remote Sens 13:3842. https://doi.org/10.3390/rs13193842
European Commission. Joint Research Centre (2020a) Destination earth: survey on “Digital Twins” technologies and activities, in the Green Deal area. Publications Office, LU
European Commission. Joint Research Centre (2020b) Destination earth: use cases analysis. Publications Office, LU
European Commission. Joint Research Centre (2021) Destination earth: ecosystem architecture description. Publications Office, LU
Fan J, Yan J, Ma Y, Wang L (2017) Big data integration in remote sensing across a distributed metadata-based spatial infrastructure. Remote Sens 10:7. https://doi.org/10.3390/rs10010007
Fan C, Zhang C, Yahja A, Mostafavi A (2021) Disaster city digital twin: a vision for integrating artificial and human intelligence for disaster management. Int J Inf Manag 56:102049. https://doi.org/10.1016/j.ijinfomgt.2019.102049
Farhangkhah N, Samadi S, Khosravi MR, Mohseni R (2021) Overcomplete pre-learned dictionary for incomplete data SAR imaging towards pervasive aerial and satellite vision. Wirel Netw. https://doi.org/10.1007/s11276-021-02821-w
Futia G, Melandri A, Vetrò A et al (2017) Removing barriers to transparency: a case study on the use of semantic technologies to tackle procurement data inconsistency. In: Blomqvist E, Maynard D, Gangemi A et al (eds) The semantic web. Springer International Publishing, Cham, pp 623–637
Galar M, Fernandez A, Barrenechea E et al (2012) A review on ensembles for the class imbalance problem: bagging-, boosting-, and hybrid-based approaches. IEEE Trans Syst Man Cybern Part C Appl Rev 42:463–484. https://doi.org/10.1109/TSMCC.2011.2161285
Gao H, Wu H, Chen Z et al (2022) Multiscale spectral-spatial cross-extraction network for hyperspectral image classification. IET Image Process 16:755–771. https://doi.org/10.1049/ipr2.12382
Ghazikhani A, Yazdi HS, Monsefi R (2012) Class imbalance handling using wrapper-based random oversampling. In: 20th Iranian conference on electrical engineering (ICEE2012). IEEE, Tehran, pp 611–616
Ghazouani F, Farah IR, Solaiman B (2019) A multi-level semantic scene interpretation strategy for change interpretation in remote sensing imagery. IEEE Trans Geosci Remote Sens 57:8775–8795. https://doi.org/10.1109/TGRS.2019.2922908
Goswami T, Roy UB (2021) Classification accuracy comparison for imbalanced datasets with its balanced counterparts obtained by different sampling techniques. In: Kumar A, Mozar S (eds) ICCCE 2020. Springer Singapore, Singapore, pp 45–54
Guan H, Zhang Y, Xian M et al (2021) SMOTE-WENN: solving class imbalance and small sample problems by oversampling and distance scaling. Appl Intell 51:1394–1409. https://doi.org/10.1007/s10489-020-01852-8
Gumma MK, Thenkabail PS, Teluguntla PG et al (2020) Agricultural cropland extent and areas of South Asia derived using Landsat satellite 30-m time-series big-data using random forest machine learning algorithms on the Google Earth Engine cloud. GIScience Remote Sens 57:302–322. https://doi.org/10.1080/15481603.2019.1690780
Guo J, Huang C, Hou J (2022) A scalable computing resources system for remote sensing big data processing using GeoPySpark based on Spark on K8s. Remote Sens 14:521. https://doi.org/10.3390/rs14030521
Gutiérrez R, Rampérez V, Paggi H et al (2022) On the use of information fusion techniques to improve information quality: taxonomy, opportunities and challenges. Inf Fusion 78:102–137. https://doi.org/10.1016/j.inffus.2021.09.017
Han H, Wang W-Y, Mao B-H (2005) Borderline-SMOTE: a new over-sampling method in imbalanced data sets learning. In: Huang D-S, Zhang X-P, Huang G-B (eds) Advances in intelligent computing. Springer Berlin Heidelberg, Berlin, Heidelberg, pp 878–887
Hao X, Ji Z, Li X et al (2021) Construction and application of a knowledge graph. Remote Sens 13:2511. https://doi.org/10.3390/rs13132511
Hart P (1968) The condensed nearest neighbor rule (corresp.). IEEE Trans Inf Theory 14:515–516
He H, Bai Y, Garcia EA, Li S (2008) ADASYN: adaptive synthetic sampling approach for imbalanced learning. In: 2008 IEEE international joint conference on neural networks (IEEE world congress on computational intelligence). IEEE, Hong Kong, pp 1322–1328
Hu F, Xia G-S, Yang W, Zhang L (2020) Mining deep semantic representations for scene classification of high-resolution remote sensing imagery. IEEE Trans Big Data 6:522–536. https://doi.org/10.1109/TBDATA.2019.2916880
Hua Y, Mou L, Jin P, Zhu XX (2022) MultiScene: a large-scale dataset and benchmark for multiscene recognition in single aerial images. IEEE Trans Geosci Remote Sens 60:1–13. https://doi.org/10.1109/TGRS.2021.3110314
Huang Y, Chen Z, Yu T et al (2018) Agricultural remote sensing big data: management and applications. J Integr Agric 17:1915–1931. https://doi.org/10.1016/S2095-3119(17)61859-8
Iosifescu Enescu I, de Espona L, Haas-Artho D et al (2021) Cloud optimized raster encoding (CORE): a web-native streamable format for large environmental time series. Geomatics 1:369–382. https://doi.org/10.3390/geomatics1030021
Jadhav JK, Singh RP (2018) Automatic semantic segmentation and classification of remote sensing data for agriculture. Math Models Eng 4:112–137. https://doi.org/10.21595/mme.2018.19840
Jiang B, An X, Xu S, Chen Z (2022) Intelligent image semantic segmentation: a review through deep learning techniques for remote sensing image analysis. J Indian Soc Remote Sens. https://doi.org/10.1007/s12524-022-01496-w
Jiao C, Yu G, Yi X et al (2021) The impact of teleconnections on the temporal dynamics in aboveground net primary productivity of the Mongolian Plateau grasslands. Int J Climatol 41:6541–6555. https://doi.org/10.1002/joc.7211
Juez-Gil M, Arnaiz-González Á, Rodríguez JJ, García-Osorio C (2021) Experimental evaluation of ensemble classifiers for imbalance in Big Data. Appl Soft Comput 108:107447. https://doi.org/10.1016/j.asoc.2021.107447
Kang L, Zhang X, Zhang K, Yuan B (2021) Comparative study on different dimension reduction methods in remote sensing ground object recognition. In: 2021 4th international conference on artificial intelligence and big data (ICAIBD). IEEE, Chengdu, pp 382–386
Karpatne A, Ebert-Uphoff I, Ravela S et al (2019) Machine learning for the geosciences: challenges and opportunities. IEEE Trans Knowl Data Eng 31:1544–1554. https://doi.org/10.1109/TKDE.2018.2861006
Kubat M, Matwin S, Others (1997) Addressing the curse of imbalanced training sets: one-sided selection. In: Icml. Citeseer, p 179
Li H, Zhong X, Ma Z et al (2020) Climate changes and their teleconnections with ENSO over the last 55 years, 1961–2015, in floods-dominated basin, Jiangxi Province, China. Earth Space Sci 7. https://doi.org/10.1029/2019EA001047
Li Y, Kong D, Zhang Y et al (2021a) Robust deep alignment network with remote sensing knowledge graph for zero-shot and generalized zero-shot remote sensing image scene classification. ISPRS J Photogramm Remote Sens 179:145–158. https://doi.org/10.1016/j.isprsjprs.2021.08.001
Li Y, Ma J, Zhang Y (2021b) Image retrieval from remote sensing big data: a survey. Inf Fusion 67:94–115. https://doi.org/10.1016/j.inffus.2020.10.008
Liu P (2015) A survey of remote-sensing big data. Front Environ Sci 3. https://doi.org/10.3389/fenvs.2015.00045
Liu B, Tsoumakas G (2020) Dealing with class imbalance in classifier chains via random undersampling. Knowl-Based Syst 192:105292. https://doi.org/10.1016/j.knosys.2019.105292
Lorenz S, Ghamisi P, Kirsch M et al (2021) Feature extraction for hyperspectral mineral domain mapping: a test of conventional and innovative methods. Remote Sens Environ 252:112129. https://doi.org/10.1016/j.rse.2020.112129
Lu H, Liu Q, Liu X, Zhang Y (2021) A survey of semantic construction and application of satellite remote sensing images and data. J Organ End User Comput 33:1–20. https://doi.org/10.4018/JOEUC.20211101.oa6
Ma Y, Wu H, Wang L et al (2015) Remote sensing big data computing: challenges and opportunities. Future Gener Comput Syst 51:47–60. https://doi.org/10.1016/j.future.2014.10.029
Meng T, Jing X, Yan Z, Pedrycz W (2020) A survey on machine learning for data fusion. Inf Fusion 57:115–129. https://doi.org/10.1016/j.inffus.2019.12.001
Nativi S, Mazzetti P, Craglia M (2021) Digital ecosystems for developing digital twins of the earth: the destination earth case. Remote Sens 13:2119. https://doi.org/10.3390/rs13112119
Pérez-Ortiz M, Gutiérrez PA, Hervás-Martínez C (2013) Borderline kernel based over-sampling. In: Pan J-S, Polycarpou MM, Woźniak M et al (eds) Hybrid artificial intelligent systems. Springer Berlin Heidelberg, Berlin, Heidelberg, pp 472–481
Ramentol E, Caballero Y, Bello R, Herrera F (2012) SMOTE-RSB *: a hybrid preprocessing approach based on oversampling and undersampling for high imbalanced data-sets using SMOTE and rough sets theory. Knowl Inf Syst 33:245–265. https://doi.org/10.1007/s10115-011-0465-6
Ray P, Reddy SS, Banerjee T (2021) Various dimension reduction techniques for high dimensional data analysis: a review. Artif Intell Rev 54:3473–3515. https://doi.org/10.1007/s10462-020-09928-0
Sabri Y, Bahja F, Siham A, Maizate A (2021) Cloud computing in remote sensing: big data remote sensing knowledge discovery and information analysis. Int J Adv Comput Sci Appl 12. https://doi.org/10.14569/IJACSA.2021.01205104
Seiffert C, Khoshgoftaar TM, Van Hulse J, Napolitano A (2010) RUSBoost: a hybrid approach to alleviating class imbalance. IEEE Trans Syst Man Cybern – Part Syst Hum 40:185–197. https://doi.org/10.1109/TSMCA.2009.2029559
Seydi ST, Amani M, Ghorbanian A (2022) A dual attention convolutional neural network for crop classification using time-series Sentinel-2 imagery. Remote Sens 14:498. https://doi.org/10.3390/rs14030498
Shahat E, Hyun CT, Yeom C (2021) City digital twin potentials: a review and research agenda. Sustainability 13:3386. https://doi.org/10.3390/su13063386
Suryono H, Kuswanto H, Iriawan N (2022) Rice phenology classification based on random forest algorithm for data imbalance using Google Earth engine. Procedia Comput Sci 197:668–676. https://doi.org/10.1016/j.procs.2021.12.201
Wagner T (1973) Convergence of the edited nearest neighbor (Corresp.). IEEE Trans Inf Theory 19:696–697. https://doi.org/10.1109/TIT.1973.1055059
Wang H, Skau E, Krim H, Cervone G (2018a) Fusing heterogeneous data: a case for remote sensing and social media. IEEE Trans Geosci Remote Sens 56:6956–6968. https://doi.org/10.1109/TGRS.2018.2846199
Wang L, Ma Y, Yan J et al (2018b) pipsCloud: high performance cloud computing for remote sensing big data management and processing. Future Gener Comput Syst 78:353–368. https://doi.org/10.1016/j.future.2016.06.009
Wang J, Lan Y, Zhang S et al (2020a) Knowledge graph for multi-source data fusion topics research. In: 2020 international conference on high performance big data and intelligent systems (HPBD&IS). IEEE, Shenzhen, pp 1–5
Wang L, Yan J, Ma Y (2020b) Cloud computing in remote sensing. CRC Press, Boca Raton
Wang Z, Gan S, Lv J, Yang M (2021) Lake basin based on long time series remote sensing data. In: 2021 IEEE 2nd international conference on big data, artificial intelligence and internet of things engineering (ICBAIE). IEEE, Nanchang, pp 211–215
Wang D, Cao W, Zhang F et al (2022a) A review of deep learning in multiscale agricultural sensing. Remote Sens 14:559. https://doi.org/10.3390/rs14030559
Wang J, Bretz M, Dewan MAA, Delavar MA (2022b) Machine learning in modelling land-use and land cover-change (LULCC): current status, challenges and prospects. Sci Total Environ 822:153559. https://doi.org/10.1016/j.scitotenv.2022.153559
Wu Z, Sun J, Zhang Y et al (2021) Recent developments in parallel and distributed computing for remotely sensed big data processing. Proc IEEE 109:1282–1305. https://doi.org/10.1109/JPROC.2021.3087029
Yang K, Liu Y (2021) Construction of knowledge graph in the field of grassland plants based on ontology database. In: He Y, Weng C-H (eds) International conference on environmental remote sensing and big data (ERSBD 2021). SPIE, Wuhan, p 13
Yang W, Zheng Y, Li S (2021) Application status and prospect of digital twin for on-orbit spacecraft. IEEE Access 9:106489–106500. https://doi.org/10.1109/ACCESS.2021.3100683
Yao X, Li G, Xia J et al (2019) Enabling the big earth observation data via cloud computing and DGGS: opportunities and challenges. Remote Sens 12:62. https://doi.org/10.3390/rs12010062
Yin Z, Kuang W, Bao Y et al (2021) Evaluating the dynamic changes of urban land and its fractional covers in Africa from 2000–2020 using time series of remotely sensed images on the big data platform. Remote Sens 13:4288. https://doi.org/10.3390/rs13214288
Yu D, He Z (2022) Digital twin-driven intelligence disaster prevention and mitigation for infrastructure: advances, challenges, and opportunities. Nat Hazards. https://doi.org/10.1007/s11069-021-05190-x
Yu X, Fan J, Chen J et al (2021) NestNet: a multiscale convolutional neural network for remote sensing image change detection. Int J Remote Sens 42:4898–4921. https://doi.org/10.1080/01431161.2021.1906982
Zárate M, Buckle C, Mazzanti R et al (2020) Harmonizing big data with a knowledge graph: OceanGraph KG uses case. In: Rucci E, Naiouf M, Chichizola F, De Giusti L (eds) Cloud computing, big data & emerging topics. Springer International Publishing, Cham, pp 81–92
Zaytar MA, El Amrani C (2021) Satellite imagery noising with generative adversarial networks. Int J Cogn Inform Nat Intell 15:16–25. https://doi.org/10.4018/IJCINI.2021010102
Zhao J, Yue P (2019) Spatiotemporal data cube modeling for integrated analysis of multi-source sensing data. In: IGARSS 2019–2019 IEEE international geoscience and remote sensing symposium. IEEE, Yokohama, pp 4791–4794
Zhu L, Su X, Hu Y et al (2021a) A spatio-temporal local association query algorithm for multi-source remote sensing big data. Remote Sens 13:2333. https://doi.org/10.3390/rs13122333
Zhu L, Su X, Tai X (2021b) A high-dimensional indexing model for multi-source remote sensing big data. Remote Sens 13:1314. https://doi.org/10.3390/rs13071314
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this chapter
Cite this chapter
Di, L., Yu, E. (2023). Challenges and Opportunities in the Remote Sensing Big Data. In: Remote Sensing Big Data. Springer Remote Sensing/Photogrammetry. Springer, Cham. https://doi.org/10.1007/978-3-031-33932-5_18
Download citation
DOI: https://doi.org/10.1007/978-3-031-33932-5_18
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-33931-8
Online ISBN: 978-3-031-33932-5
eBook Packages: Earth and Environmental ScienceEarth and Environmental Science (R0)