Abstract
The increasing power of computer hardware and the sophistication of computer software have brought many new possibilities to information world. On one side the possibility to analyze massive data sets has brought new insight, knowledge and information. On the other, it has enabled to massively distribute computing and has opened to a new programming paradigm called Service Oriented Computing particularly well adapted to cloud computing. Applying these new technologies to the transport industry can bring new understanding to town transport infrastructures. The objective of our work is to manage and aggregate cloud services for managing big data and assist decision making for transport systems. Thus this paper presents our approach to propose a service oriented architecture for big data analytics for transport systems based on the cloud. Proposing big data management strategies for data produced by transport infrastructures, whilst maintaining cost effective systems deployed on the cloud, is a promising approach. We present the advancement for developing the Data acquisition service and Information extraction and cleaning service as well as the analysis for choosing a sharding strategy.
Keywords
This is a preview of subscription content, log in via an institution.
References
Gulisano, V., Jiménez-Peris, R., Patiño-Mart́nez, M., Soriente, C., Valduriez, P.: StreamCloud: an elastic and scalable data streaming system. IEEE Trans. Parallel Distrib. Syst. 23, 2351–2365 (2012)
Lecue, F., Tallevi-Diotallevi, S., Hayes, J., Tucker, R., Bicer, V., Sbodio, M.L., Tommasi, P.: STAR-CITY. In: Proceedings of the 19th international conference on Intelligent User Interfaces - IUI 2014, pp. 179–188 (2014)
Demiryurek, U., Banaei-Kashani, F., Shahabi, C.: TransDec: a spatiotemporal query processing framework for transportation systems. In: Proceedings of 26th IEEE International Conference on Data Engineering, pp. 1197–1200 (2010)
Jagadish, H.V., Gehrke, J., Labrinidis, A., Papakonstantinou, Y., Patel, J.M., Ramakrishnan, R., Shahabi, C.: Big Data and Its Technical Challenges, vol. 57, no. 7 (2014)
Mell, P., Grance, T.: The NIST definition of cloud computing recommendations of the national institute of standards and technology (2008)
Artikis, A., Weidlich, M., Gal, A., Kalogeraki, V., Gunopulos, D.: Self-Adaptive Event Recognition for Intelligent Transport Management, pp. 319–325 (2013)
Thompson, D., McHale, G., Butler, R.: RITA (2014). http://www.its.dot.gov/data_capture/data_capture.htm
Jian, L., Yuanhua, J., Zhiqiang, S., Xiaodong, Z.: Improved design of communication platform of distributed traffic information systems based on SOA. In: 2008 International Symposium on Information Science and Engineering, vol. 2, pp. 124–128 (2008)
Yuan, N.J., Zheng, Y., Zhang, L., Xie, X.: T-finder: a recommender system for finding passengers and vacant taxis. IEEE Trans. Knowl. Data Eng. 25, 2390–2403 (2013)
Ge, Y., Xiong, H., Tuzhilin, A., Xiao, K., Gruteser, M., Pazzani, M.: An energy-efficient mobile recommender system. In: Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining - KDD 2010, p. 899 (2010)
Lee, D.-H., Wang, H., Cheu, R., Teo, S.: Taxi dispatch system based on current demands and real-time traffic conditions. Trans. Res. Rec. 1882, 193–200 (2004)
Talia, D.: Clouds for scalable big data analytics. Computer (Long. Beach. California), vol. 46, no. 5, pp. 98–101 (2013)
Yu, J., Jiang, F., Zhu, T.: RTIC-C: a big data system for massive traffic information mining. In: 2013 International Conference on Cloud Computing and Big Data, pp. 395–402 (2013)
Chen, X., Vo, H., Aji, A., Wang, F.: High performance integrated spatial big data analytics. In: Proceedings of the 3rd ACM SIGSPATIAL International Workshop on Analytics for Big Geospatial Data - BigSpatial 2014, pp. 11–14 (2014)
Lin, J., Ryaboy, D.: Scaling big data mining infrastructure : the twitter experience. ACM SIGKDD Explor. Newsl. 14(2), 6 (2013)
Tavakoli, S., Mousavi, A.: Adopting user interacted mobile node data to the flexible data input layer architecture. In: 2008 International Conference on Intelligent Sensors, Sensor Networks and Information Processing, pp. 533–538 (2008)
Berson, A., Smith, S., Thearling, K.: An overview of data mining techniques. Data Min. Appl. CRM, pp. 1–49 (2004)
Yan, W., Brahmakshatriya, U., Xue, Y., Gilder, M., Wise, B.: p-PIC: parallel power iteration clustering for big data. J. Parallel Distrib. Comput. 73(3), 352–359 (2013)
Das, S., Haas, P.J., Beyer, K.S.: Ricardo: integrating R and hadoop categories and subject descriptors, pp. 987–998 (2000)
Lim, S.: Scalable SQL and NoSQL data stores, Statistics (Ber) (2008)
Zheng, Z., Zhu, J., Lyu, M.R.: Service-generated big data and big data-as-a-service: an overview. In: 2013 IEEE Proceedings of the International Congress on Big Data, pp. 403–410 (2013)
Demirkan, H., Delen, D.: Leveraging the capabilities of service-oriented decision support systems: Putting analytics and big data in cloud. Decis. Support Syst. 55(1), 412–421 (2013)
Schadt, E.E., Linderman, M.D., Sorenson, J., Lee, L., Nolan, G.P.: Cloud and heterogeneous computing solutions exist today for the emerging big data problems in biology. Nat. Rev. Genet. 12(3), 224 (2011)
Li, Z., Yang, C., Jin, B., Yu, M., Liu, K., Sun, M., Zhan, M.: Enabling big geoscience data analytics with a cloud-based, MapReduce-enabled and service-oriented workflow framework. PLoS One 10(3), e0116781 (2015)
Abramova, V., Bernardino, J.: NoSQL databases: a step to database scalability in web environment. In: Proceedings of the International C* Conference on Computer Science Software Engineering - C3S2E 2013, pp. 14–22 (2013)
Hipgrave, S.: Smarter fraud investigations with big data analytics. Netw. Secur. 2013(12), 7–9 (2013)
Tannahill, B.K., Jamshidi, M.: System of Systems and Big Data analytics – Bridging the gap. Comput. Electr. Eng. 40(1), 2–15 (2014)
Sadalage, P.J., Fowler, M.: NoSQL Distilled (2012)
Open, “Openstack,” (2015). http://www.openstack.org/
Buneman, P., Fernandez, M., Suciu, D.: UnQL: a query language and algebra for semistructured data based on structural recursion. VLDB J. 9(1), 76 (2000)
Nance, C., Losser, T., Iype, R., Harmon, G.: NoSQL vs RDBMS - why there is room for both. In: Proceedings Southern Association Information System Conference, pp. 111–116 (2013)
Cattell, R.: Scalable SQL and NoSQL data stores. ACM SIGMOD Rec. 39(4), 12 (2011)
GrandLyon: Smart Data (2015). http://data.grandlyon.com/
Acknowledgement
We thank the Région Rhône-Alpes who finances the thesis work of Gavin Kemp by means of the ARC 7 programme (http://www.arc7-territoires-mobilites.rhonealpes.fr/), as well as the competitiveness cluster LUTB Transport & Mobility Systems, in particularly Mr. Pascal Nief, Mr. Timothée David and Mr. Philippe Gache for putting us in contact with local companies and projects to gather use case scenarios for our work.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Kemp, G., Vargas-Solar, G., Da Silva, C.F., Ghodous, P., Collet, C. (2015). Service Oriented Big Data Management for Transport. In: Helfert, M., Krempels, KH., Klein, C., Donellan, B., Guiskhin, O. (eds) Smart Cities, Green Technologies, and Intelligent Transport Systems. SMARTGREENS VEHITS 2015 2015. Communications in Computer and Information Science, vol 579. Springer, Cham. https://doi.org/10.1007/978-3-319-27753-0_15
Download citation
DOI: https://doi.org/10.1007/978-3-319-27753-0_15
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-27752-3
Online ISBN: 978-3-319-27753-0
eBook Packages: Computer ScienceComputer Science (R0)